2026-04-10 · 16 sources
Harness engineering is the new meta — everyone from Cole Medin to Anthropic is racing to make AI coding agents deterministic, repeatable, and cheap enough to actually ship with.
Nine videos in a week — Nate's sprint covers the adviser strategy for slashing Opus costs, Karpathy's Obsidian knowledge bases, running Claude Code free via Ollama/OpenRouter, and a $10K AI stock trading challenge that's equal parts entertaining and reckless.
Claude Just Told Us to Stop Using Their Best Model
Anthropic's adviser strategy: Opus brains, Haiku prices.
Watch this if you want to cut your Claude API costs by 12%+ while keeping near-Opus intelligence in your agents.
What it is: Anthropic's official "adviser strategy" — pair Opus as a thinking advisor with Sonnet or Haiku as the executor. The executor only escalates to Opus when it actually needs the horsepower.
How it works:
Benchmarks Nate covers:
Cost breakdown (per million tokens):
Key distinction: The adviser strategy exists in the Messages API, not natively in Claude Code. Nate demos both contexts — API implementation and how to approximate it in Claude Code.
Tools & links:
Why it matters for you: If you're building agents via the Anthropic SDK, this is a direct cost optimization. Structure your agent pipelines so only the hard reasoning steps hit Opus.
I Gave OpenClaw $10,000 to Trade Stocks
Two AI bots, $10K each, 30-day trading war.
Watch for the autonomous agent architecture — cron jobs, Discord monitoring, strategy-fed LLMs — skip if you think AI stock trading is a good retirement plan.
What it is: Nate and his co-host Salmon each gave their AI trading bots $10,000 real cash for a 30-day competition. Loser pays $100 to a subscriber.
How the bots work:
Rules:
Results teased: Nate was up $210 mid-day, then crashed on Monday. Bot recovered by scalping.
Tools & links:
Why it matters for you: The architecture is interesting — cron-triggered autonomous agents with real API integrations and inter-agent communication. The trading results are entertainment, but the agent design pattern (scheduled execution, strategy injection, monitoring via Discord) is reusable for non-financial automation.
I Tested Claude's New Managed Agents... What You Need To Know
Hands-on with Anthropic's managed agents API.
Watch this — 100K views in a day means this is the video people are sharing, and managed agents are Anthropic's play to own the agent orchestration layer.
What it is: Nate's first-look test of Claude's new Managed Agents feature — Anthropic's hosted agent orchestration.
Key details:
Tools & links:
Why it matters for you: If Anthropic is building managed agent orchestration into their API, that directly competes with what tools like Archon and custom harnesses do. Worth understanding what they're offering natively before building your own.
Claude’s New AI Just Changed the Internet Forever
Claude Mythos and Project Glass Wing breakdown.
Watch this for Nate's practical take on Mythos security implications — 183K views says the AI community is spooked.
What it is: Nate's breakdown of Claude Mythos — Anthropic's unreleased frontier model that's finding decades-old software vulnerabilities.
Key details:
Why it matters for you: Security implications for anyone building web apps. If AI can find 27-year-old vulnerabilities in OpenBSD, your SaaS app's attack surface just got a lot more interesting.
Andrej Karpathy Just 10x'd Everyone's Claude Code
Earlier upload of the same Karpathy video.
Skip — this is a duplicate/earlier version of the 249K-view video above.
What it is: Appears to be an earlier upload or variant of the Karpathy knowledge base video. The 249K-view version is the one to watch.
Why it matters for you: It doesn't — watch the other one.
Planning In Claude Code Just Got a Huge Upgrade
UltraPlan upgrades Claude Code's planning system.
Watch if you use Claude Code daily — planning is the difference between agents that drift and agents that ship.
What it is: Nate covers a significant upgrade to Claude Code's planning capabilities, referenced as "UltraPlan" in his Skool link.
Key details:
Tools & links:
Why it matters for you: Better planning in Claude Code means your mx-workflow slash commands get more reliable agent behavior. If there's a new planning API or pattern here, it could improve how `/mx:plan` and `/mx:implement` work.
Andrej Karpathy Just 10x’d Everyone’s Claude Code
Karpathy's Obsidian knowledge base method, step by step.
Watch this one — 249K views, and the technique of having Claude Code auto-build an interconnected knowledge graph from your own content is immediately useful.
What it is: Nate implements Andrej Karpathy's viral LLM knowledge base concept — using Claude Code to auto-organize content into an Obsidian vault with backlinks, tags, and relationship graphs.
How it works:
Nate's setup:
Karpathy's original process:
1. Data ingest — source documents (PDFs) into Claude Code
2. Claude Code compiles them into interconnected wiki
3. Health checks keep everything consistent
4. Uses Obsidian as the IDE for visual markdown
Tools & links:
Why it matters for you: This is a pattern you could apply to mx-brief itself — auto-building a knowledge graph of tools, trends, and techniques mentioned across all your scraped videos. Each scrape cycle adds to the graph. Over time you'd have a searchable, interconnected map of the entire AI tooling landscape.
How to Use Claude Code for 99% CHEAPER
Cost-cutting tricks for Claude Code usage.
Skip unless you're hitting token limits — the Ollama video below covers the same territory with more depth.
What it is: Companion or earlier version to the Ollama + Claude Code video. Likely covers OpenRouter free tier and local model strategies.
Key details:
Why it matters for you: The Ollama video below is the more complete version of this topic.
Ollama + Claude Code = 99% CHEAPER
Run Claude Code free with local or cloud models.
Watch this if you want to experiment with Claude Code's harness without burning API credits — two methods, both surprisingly easy.
What it is: Two methods to run Claude Code without paying Anthropic for tokens — local models via Ollama and cloud models via OpenRouter's free tier.
The car analogy Nate uses:
Method 1: Ollama (local models)
Method 2: OpenRouter (cloud, free tier)
Key distinction Nate makes:
Tools & links:
Why it matters for you: Good for experimentation and testing workflows without burning credits. Don't expect Opus-level results from a local 7B model, but for iterating on prompts and testing harness logic, this saves real money.
Chuck's honest moment: he admits he spends more time building AI tooling than actually using it, and Perplexity Computer's 19-model orchestration made him question everything — plus MePalace is a legit AI memory system worth trying.
i didn't want to like this....
Perplexity Computer humbles Chuck's custom AI setup.
Watch for Chuck's honest self-reflection on over-engineering AI tooling — and his breakdown of what Perplexity Computer actually does under the hood.
What it is: Chuck — who built 103 custom Claude Code skills, runs OpenClaw with 7+ agents, and has a deeply over-engineered personal AI stack — tests Perplexity Computer and has an existential crisis.
Chuck's current setup (for context):
What Perplexity Computer is:
What Chuck built with it:
The real insight (and it's good):
Tools & links:
Why it matters for you: This is the builder's trap in a nutshell. Chuck's realization — that he's optimizing the toolchain instead of shipping — is worth hearing from someone with his level of setup. The question for you: is mx-workflow the axe or the tree?
Milla Jovovich made an AI memory tool…..it’s pretty good
MePalace: AI memory system with structured retrieval.
Watch this — 511K views and the tool actually works. A structured memory system that compresses months of conversations into 120 tokens.
What it is: MePalace — an open-source AI memory system designed by Milla Jovovich and her collaborator Ben. Claims to be the highest-scoring AI memory system ever benchmarked.
How it works:
Chuck's install process:
Chuck's results:
Stats:
Tools & links:
Why it matters for you: Your mx-brief pipeline already builds up context over time. MePalace's structured memory approach — wings/halls/rooms — could be a pattern for how you persist and retrieve cross-session context in mx-workflow. The 120-token compression for months of conversations is wild if it actually holds up.
Gemma 4 on the iPhone (local AI, no internet required)
Google's Gemma 4 running locally on iPhone.
Watch if local on-device AI interests you — Chuck demos Japanese pill bottle translation with zero internet.
What it is: Google's Gemma 4 model running locally on an iPhone — no cloud, no internet required.
Key details:
Tools & links:
Why it matters for you: On-device AI is getting real. If Gemma 4 can translate Japanese on a phone, local inference for developer tools is right around the corner.
Anthropic says NO MORE OpenClaw!!
Anthropic cuts off third-party harness support.
Watch for the industry implications — if Anthropic is killing third-party harnesses, every OpenClaw user needs a migration plan.
What it is: Anthropic officially stops full support for third-party harnesses, calling out OpenClaw specifically. Chuck is upset.
What happened:
Chuck's reaction:
The bigger picture:
Tools & links:
Why it matters for you: You're already on Claude Code, so this doesn't hurt you directly. But it validates the bet on building within Anthropic's official ecosystem (mx-workflow as Claude Code skills) rather than on third-party harnesses that can get cut off overnight.
Archon just relaunched as an open-source harness builder for AI coding — this is directly relevant to mx-workflow and how you orchestrate Claude Code sessions.
The Next Evolution of AI Coding Is Harnesses - Here's How to Build Them
Archon relaunches as open-source harness builder.
This is the most relevant video of the week for mx-workflow — Cole built exactly what you're building, open-sourced it, and it's worth understanding his architecture.
What it is: Cole's Archon project — previously an "AI command center" — has been completely overhauled into the first open-source harness builder for AI coding. Think of it as a framework for encoding your entire development process as a repeatable workflow.
Core concept:
How Archon workflows work:
Pre-packaged workflows included:
How you use it:
The evolution Cole describes:
Tools & links:
Direct comparison to mx-workflow:
Why it matters for you: This is your direct competitor/inspiration. Cole's approach of mixing deterministic commands with agent prompts in a DAG-like workflow is a pattern worth stealing. The human approval gate and parallel session handling are features mx-workflow could benefit from.
Full Archon Guide - Build AI Coding Harnesses That Actually Ship (LIVE)
Full livestream walkthrough of Archon setup.
Watch if you want the deep tutorial after the intro video — this is the full build session, start to finish.
What it is: Cole's full livestream walkthrough of Archon — covers everything from what harness engineering is to building custom workflows.
Key details:
Tools & links:
Why it matters for you: If the intro video piques your interest, this is the full reference session.
I Built Self-Evolving Claude Code Memory w/ Karpathy's LLM Knowledge Bases
Claude Code memory that learns from your sessions.
Watch this — Cole takes Karpathy's knowledge base idea and applies it specifically to Claude Code memory evolution, which is directly relevant to how mx-workflow manages context.
What it is: Cole implements Karpathy's viral LLM knowledge base concept but focuses it on Claude Code's own memory — making it self-evolving based on your coding sessions.
Key insight from Cole:
Key details:
Tools & links:
Why it matters for you: mx-workflow already has CLAUDE.md and memory files. Cole's approach of making memory self-evolving — where each session automatically enriches the knowledge base — could make your workflow context richer over time without manual curation. This is the difference between static memory and compounding intelligence.
An 18-year-old non-coder vibe-coded a clipping SaaS to $5K/month, and Koerner walks through a missed-call AI agent you can sell to small businesses for $500-$2K/month using GoHighLevel.
He Can't Code but His AI Agents Make Him $5K/Month
18-year-old vibe-codes clipping SaaS to $5K/month.
Watch for the tools and approach — an 18-year-old non-coder built a startup that competes with a $50M-funded company using AI agents.
What it is: Interview with Vadim, 18, founder of Vugola — a clipping tool for streamers and marketers. $5K in first month, zero coding knowledge.
The business:
How he built it:
Key quotes:
Tools mentioned:
Why it matters for you: The side hustle angle: clipping/content repurposing is a growing market, the barrier to entry is effectively zero with AI agents, and the unit economics work because it's organic marketing (win-win for clipper and client). If you're looking for a side hustle idea with recurring revenue, this model is proven.
I Built an App the Government Doesn't Want You to See
AI-built tool to find government contracts on sam.gov.
Watch for the build process — Replit Agent 4's parallel agents building backend, frontend, and landing page simultaneously is a workflow worth seeing.
What it is: Chris builds a tool that simplifies finding federal government contracts from sam.gov — a site with $834 billion in annual contracts that's brutal to navigate.
The opportunity:
How he built it:
1. Wrote 3 simple sentences describing the idea to Claude
2. Asked Claude to generate a detailed prompt for Replit
3. Pasted the Claude-generated prompt into Replit Agent 4
4. Replit Agent 4 runs multiple AI agents in parallel:
5. Needed a sam.gov API key
6. Result: type in your business, pick your state, see contracts you can bid on with dollar amounts
Cost comparison:
Tools & links:
Side hustle angle:
Why it matters for you: Two things: (1) the prompt-chaining workflow — using Claude to write a better prompt for another AI tool — is a meta-technique worth adopting. (2) Government contract discovery is a legitimate SaaS niche with free data and terrible incumbents.
The Most Profitable Business Everyone Overlooks
Spray drone business: $132K in 24 days.
Skip unless you want a non-tech side hustle — this is about agricultural spray drones, not AI.
What it is: Interview with Mike from Drone Deer Recovery — ex-Amish entrepreneur who pivoted from drone deer recovery ($500/job) to agricultural spray drones.
The numbers:
How it works:
The side hustle angle:
Why it matters for you: This is a non-tech testosterone business. Interesting as diversification from software, but not directly applicable to your AI projects.
This Might Be the Easiest Way to Sell AI to Businesses
Build and sell missed-call AI agents to SMBs.
Watch if you want a repeatable AI side hustle — this is a $500-$2K/month per client service you can build in GoHighLevel in one sitting.
What it is: Step-by-step build of an AI agent that texts back missed calls, has full conversations, answers questions, and books appointments — then a framework for selling it to small businesses.
The opportunity:
What Chris builds:
Build process:
1. Go to GHL → Automation → Create Workflow → Template → "Missed Call Text"
2. Trigger: incoming call (missed)
3. AI responds with human-sounding text
4. Full conversation flow: answers questions, handles objections, books appointments
5. Chris shares the exact prompt that makes the AI sound human
The dog-fooding strategy:
Revenue model:
Tools & links:
Why it matters for you: This is the most actionable side hustle of the week. Low technical barrier, recurring revenue, proven demand, and you can use AI to both build the product AND find clients. The dog-fooding angle is clever.
Codie's shorts this week are motivation-heavy but the actionable gem is her AI writing fix: prompt 5-6 times, feed it your voice, cut 30% — stop publishing first-draft AI slop.
Resumes are dead. Prove value.
Send ideas to hiring managers, not resumes.
Skip unless you're job hunting — Codie's advice: AI pre-screens resumes, so bypass Indeed entirely and send hiring managers proof of what you can do.
What it is: 60-second take on why traditional job applications are dead.
The secret:
Why it matters for you: If you're ever hiring, flip this: look for people who send you proof of work, not PDFs.
Say it out loud. Win bigger.
Public accountability makes you hit your goals.
Skip — motivational short. The tactic: post your goals publicly so fear of embarrassment drives execution.
What it is: Codie's argument for public accountability.
The secret:
Why it matters for you: If you're building a side hustle, consider building in public. The accountability loop is real.
Speed ≠ Direction Fix
Working harder won't fix wrong strategy.
Skip — this is a promo for Codie's Growth Accelerator Workshop in Austin.
What it is: Tony Robbins quote + pitch for Codie's Growth Accelerator Workshop.
The secret:
Why it matters for you: The principle is sound — direction over speed — but this is mainly a workshop ad.
The 3 Jobs Of A CEO
CEO's three core responsibilities distilled.
Skip unless you're scaling past solo — no transcript available, but 70K views suggests Codie nailed a nerve.
What it is: Codie's framework for the three core CEO jobs. No transcript available.
Key details:
Why it matters for you: Relevant when you're scaling a side hustle past the solo phase.
Success Isn’t Linear. Ever.
Grind for years, then one day everything shifts.
Skip — motivational short. The punchline: success is a step function, not a line.
What it is: Codie's take on the nonlinear path to success.
The secret:
Why it matters for you: Save this for the day your side hustle feels pointless.
Stop Sounding Like AI
AI writes average — prompt harder, cut 30%.
Watch this 90-second short — Codie's AI writing fix is the most actionable content advice of the week.
What it is: Codie's framework for not sounding like AI slop.
The secret:
The fix:
The mindset:
Why it matters for you: If mx-brief generates any user-facing text, this prompting philosophy applies. Multi-pass prompting with voice examples is worth building into your pipeline.
Stop Working. Start Scaling.
Systems over sweat equity.
Skip — motivational short about building systems instead of doing everything yourself.
What it is: Codie's recurring theme: stop trading time for money, build systems.
Key details:
Why it matters for you: The principle applies to mx-workflow — you're literally building systems that automate your dev process. Keep going.
Short but resonant: we blew past the Turing test and nobody threw a party — Casey Muratori's point is that we're so busy replacing jobs we forgot to be amazed.
We Passed the Turing Test and Nobody Celebrated
We passed the Turing test and shrugged.
Watch this short clip — Casey Muratori's point lands: we achieved one of computing's greatest milestones and immediately asked how to cut jobs with it.
What it is: Clip featuring Casey Muratori reflecting on how the AI community blew past the Turing test without celebration.
The message:
What to prepare for:
Why it matters for you: A good perspective reset. You're building tools on top of something that would have been science fiction 5 years ago. Don't forget to enjoy it.
TurboQuant compression is making 16GB machines viable for local AI, and Intel's Arc Pro B70 with 128GB VRAM is priced near an RTX 5090 — the local AI hardware game just shifted.
After This, 16GB Feels Different
TurboQuant compression makes 16GB machines viable for local AI.
Watch if you're running local models on a Mac Mini — TurboQuant might let you punch way above your RAM class.
What it is: Alex covers TurboQuant — a new compression technique that's changing what's possible on memory-constrained machines for local AI inference.
Key insight:
Hardware context:
Mac Mini comparison:
Tools & links:
Why it matters for you: If you're running Ollama + Claude Code locally (per Nate's video), TurboQuant compression could let you run bigger, smarter models on your current hardware.
Full post →
Intel just CRUSHED Nvidia & AMD GPU pricing
Intel Arc Pro B70: 128GB VRAM at RTX 5090 prices.
Watch if you're considering a dedicated AI inference GPU — Intel just made 128GB VRAM affordable.
What it is: Alex tests Intel's Arc Pro B70 against NVIDIA RTX Pro 4000 and AMD R9700.
The headline numbers:
Why this matters for local AI:
Mac Mini comparison:
Tools & links:
Why it matters for you: If you ever want a dedicated local AI inference machine separate from your Mac, Intel just made the cost/VRAM equation much more interesting. 128GB VRAM at ~$2K is a game changer for running full-size models locally.
Full post →Claude Mythos is the headline — Anthropic built a model so good at finding software vulnerabilities they refuse to release it, and the Claude Code source leak revealed how memory actually works under the hood.
AI News: The Scariest AI Model Ever!
Claude Mythos finds 27-year-old vulnerabilities nobody else could.
Watch for the Mythos benchmark breakdown — this is the most comprehensive coverage of why Anthropic won't release their best model.
What it is: Matt's weekly AI news roundup, headlined by Claude Mythos — Anthropic's unreleased frontier model.
Claude Mythos breakdown:
Benchmarks vs Opus 4.6:
Other news in the roundup:
245-page system card highlights:
Why it matters for you: If you're building web apps, the security landscape just changed. AI models that can find decades-old vulnerabilities in hardened systems will eventually be available to bad actors. Defensive security tooling is about to become a very hot space.
This Unknown AI Model is Shockingly Good
Arcee's Trinity model rivals Opus, open-source.
Skip unless you're evaluating open-source models — Trinity-Large-Thinking benchmarks near Opus 4.6 under Apache 2.0, but Matt himself can't find a meaningful way to differentiate it.
What it is: Arcee (American company, first Matt's heard of them) drops Trinity-Large-Thinking — an open-source frontier reasoning model under Apache 2.0.
Benchmarks:
Matt's honest take:
Tools & links:
Why it matters for you: Another open-source model approaching frontier quality. If you're using Ollama for free Claude Code (per Nate's video), this could be worth testing as the local model.
OpenAI Just Made History (Holy Sh*t?!)
OpenAI raises $122B at $852B valuation.
Skip — the numbers are staggering but don't change how you build. Sora losing $1M/day is the only actionable detail.
What it is: OpenAI's record-breaking fundraise and Sora shutdown.
The numbers:
Why it matters for you: It doesn't, directly. But Sora's death at $1M/day losses shows that even OpenAI can't sustain money-losing AI products forever. Revenue models matter.
WTF Is OpenAI Doing??
OpenAI buys a tech podcast network.
Skip — OpenAI acquired TBPN (tech business podcast). Interesting for media nerds, irrelevant for builders.
What it is: OpenAI acquired TBPN (Tech Business Production Network) — a daily live tech podcast.
Key details:
Why it matters for you: It doesn't. Media consolidation is interesting but won't affect your web apps.
Claude's Source Code Got Leaked Across The Whole Internet
Claude Code leak reveals how AI memory actually works.
Watch this — the leaked code reveals how Claude Code's memory system actually works under the hood, and it's directly relevant to how you build mx-workflow.
What it is: Anthropic's Claude Code source was leaked on GitHub. The most interesting finding: how Claude Code's memory and context actually works internally.
Key revelations from the leak:
In plain terms:
Why the spread:
Why it matters for you: This is how your mx-workflow memory system should think about scaling. Index files that point to searchable content, not giant context dumps. The grep-for-identifiers pattern is exactly how you'd want to handle growing amounts of scraped video data without blowing up context windows.
claude-code
Security hardening, Vertex AI wizard, Monitor tool.
Major Bash permission security fixes — if you use auto-permissions or bypass mode, update immediately.
What changed (v2.1.97–v2.1.100):
Security fixes (critical):
Stability fixes:
Breaking changes:
Links:
Why it matters for you: The Bash permission security fixes are serious — several bypasses allowed arbitrary code execution in auto-permissions mode. Update now.
kit
Form submit validation, security fix for chunked requests.
The `BODY_SIZE_LIMIT` enforcement fix closes a potential DoS vector on chunked uploads in your SvelteKit apps.
What changed (2.56.1–2.57.1):
Breaking changes:
Links:
Why it matters for you: The `submit()` boolean return is a clean pattern for form validation in your SvelteKit apps, and the chunked request fix patches a real security gap.
Anthropic
Sonnet 4.6 launched, Opus 4.6 upgraded, ad-free commitment.
Opus 4.6 is the model powering your Claude Code right now — Sonnet 4.6 is the cost-effective option for any API-driven features you build.
What changed:
Breaking changes:
Links:
Why it matters for you: Model IDs to use in your apps: `claude-opus-4-6` (best quality), `claude-sonnet-4-6` (best value). Both are 4.6 family with major agentic improvements.
Supabase
Agent Skills for correct AI-driven Supabase builds, custom OIDC.
Agent Skills could directly improve how Claude Code builds against your shared mx-supabase instance — worth integrating into your workflow.
What changed:
Breaking changes:
Links:
Why it matters for you: The Agent Skills package is designed exactly for your use case — AI agents building Supabase apps. Could improve RLS and migration quality when Claude Code builds against mx-supabase.
@claudeai
Launched the Advisor Strategy — Opus as a brain, Sonnet/Haiku as hands, near-Opus quality at a fraction of the cost via a single API call.
@sama
Codex hit 3M weekly users; launching a $100/mo ChatGPT Pro tier and resetting usage limits to celebrate.
@AnthropicAI
Dropped Claude Mythos Preview for cybersecurity (Project Glasswing), committing $100M in credits to secure critical/open-source software, plus an engineering deep-dive on Managed Agents.
@bcherny
Claude Code shipped a Monitor tool (background scripts that wake the agent on events), a Bedrock/Vertex setup wizard, and a 3x P99 speedup on file search.