15 min read

Fable 5: AI Migrates Two Months of Code in a Day

Fable 5, an AI model, migrated two months of code in a single day, demonstrating AI's growing autonomy in software development.

Fable 5: AI Migrates Two Months of Code in a Day

They’re not just building models anymore; they’re building AI that builds itself and ships its own code.


The Intake

📊 12 episodes across 7 podcasts

⏱ 562 minutes of intelligence analyzed

🎙 Featuring: Alex Bowcut (Head of Engineering, Sphere), Sam Charrington (Host, The TWIML AI Podcast), Nathaniel Whittemore (Host, The AI Daily Brief), Daniel Faggella (Host, Emerj Artificial Intelligence Research), Ravi Marwaha (COO & Chief Technology Product Officer, Arango), Art Shectman (CEO and Founder, Elephant Ventures), Marilie Fouché (Host, The AI in Business Podcast), Donald Trump (Former US President), Bernie Sanders (US Senator), Sam Altman (CEO, OpenAI), Mustafa Suleyman (CEO, Microsoft AI), Nilay Patel (Editor-in-Chief and Host, The Verge), Chantel Prat (Cognitive Neuroscientist), Jeremy Utley (Host), Henrik Werdelin (Host, Barkbox), Chris Benson (Principal AI and Autonomy Research Engineer, Practical AI LLC), Daniel Whitenack (CEO, PredictionGuard), Rebecca Hinds (Head of the Work AI Institute, Glean), Nathan Labenz (Host, The Cognitive Revolution), Erik Torenberg (Host, The Cognitive Revolution)


Presented by

Velocity Road

AI That Moves the Needle in 90 Days

Not consulting. Not software. Orchestration. We deploy AI that actually works for middle market companies and their PE investors.

Learn More

The Big Shift

AI isn't just a tool anymore; it's becoming a collaborator that can build, reason, and take on responsibilities, fundamentally changing knowledge work and software development.

The conversation this week points to a major pivot: from prompting AI for specific tasks to delegating entire responsibilities to agentic systems. OpenAI is eyeing the automation of its own research processes, while Microsoft is pushing for independent superintelligence development, aiming to "exceed the teacher." This isn't just about faster output; it’s about AI becoming an active participant in creation and decision-making.

"Our internal belief is that By March of 2028, we may have a significant fraction of our research being done by AI systems in tandem with our own researchers."
— Nathaniel Whittemore, Host of The AI Daily Brief

This shift is exemplified by Fable 5, which has demonstrated the capability to perform codebase-wide migrations in a day that would typically take a team over two months (Nathaniel Whittemore on The AI Daily Brief). It's also pushing users to develop "task imagination," thinking less about individual prompts and more about sophisticated delegation strategies. As Felix Risberg, Lead of Claude Code at Anthropic, puts it, we're moving "from giving AI tasks to giving IT responsibilities" (Nathaniel Whittemore on The AI Daily Brief).

Why it matters: This transition means that organizations need to rethink their AI strategy from the ground up, moving beyond simple automation to integrating AI as a reasoning partner. The focus should shift from managing AI outputs to managing AI-driven workflows and even AI-powered teams. The implications for productivity, organizational structure, and the nature of work are profound; it’s not just about augmenting human efforts but fundamentally transforming how work gets done and who (or what) is doing it.

The move: Start identifying critical workflows where AI can be given defined responsibilities rather than just tasks. Pilot delegation with an advanced agent, focusing on feedback loops that refine its autonomy and decision-making over time.


The Rundown

① The "Productivity Paradox" of AI is real and costing companies 6.4 hours per week.

Despite 87% of workers using AI and reporting 13 hours/week saved, only 13% of organizations see significant improvement, primarily due to "bot sitting" – the hidden labor of feeding AI context and debugging its outputs – which consumes 6.4 hours/week, effectively nullifying half the supposed gains (Rebecca Hinds on "The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis).

Why it matters: This directly impacts ROI and employee engagement. If your teams are spending as much time "babysitting" AI as it saves, the benefit is lost, leading to burnout and a disconnect between individual and organizational productivity.

② Microsoft is prioritizing independent AI model development and superintelligence, not just leveraging OpenAI.

Microsoft AI CEO Mustafa Suleyman emphasized their long-term goal of AI self-sufficiency, stating they need the ability to "stand on our own two feet and create world class models" rather than being structurally dependent on a third party like OpenAI (Mustafa Suleyman on Decoder with Nilay Patel).

What to watch: This signals a major strategic pivot for Microsoft, suggesting a future where they compete directly on frontier models, potentially diversifying the AI landscape and accelerating the race for superintelligence.

③ Enterprise AI agents are failing in production due to a lack of "temporally aware context."

Ravi Marwaha (COO & Chief Technology Product Officer, Arango) argues that the biggest barrier to successful enterprise AI agent deployment isn't model limitations, but the absence of a live, unified context layer that allows agents to reason accurately and explain decisions in real-time. He states, "The way to flip that ratio is to treat context as infrastructure upfront, period" (Ravi Marwaha on The AI in Business Podcast).

The context: Without a coherent, up-to-date context infrastructure, AI agents remain brittle and untrustworthy, especially in regulated industries where probabilistic decisions are unacceptable and accountability is paramount.

④ Anthropic's new Fable 5 model is sparking controversy over its guardrails and data policies.

This launch generated significant backlash due to perceived silent limitations on AI development, strict data retention policies, and specific biological research guardrails, frustrating researchers and enterprises (Nathaniel Whittemore on The AI Daily Brief: Artificial Intelligence News and Analysis). According to one user, "Anthropic is now silently making Claude Dumber for certain users on purpose, and there is no way to tell when it's happening to you."

Why it matters: This highlights a growing tension between frontier AI labs' control over model usage and the scientific community's demand for open, unrestricted access for research, raising questions about who controls the future of AI development.

⑤ Shifting from static files to dynamic web apps is the next big leap for knowledge work artifacts.

Nathaniel Whittemore (Host, The AI Daily Brief: Artificial Intelligence News and Analysis) argues that traditional documents (PDFs, PowerPoints) are becoming "really brittle" compared to dynamic, interactive websites, especially as AI agents start consuming information. OpenAI's "Sites" feature in Codex exemplifies this, turning static outputs into living, shareable web experiences. He noted, "Any semi capable person can now generate a useful, fairly good looking website as easily, if not more easily, than they used to throw together a deck." (Nathaniel Whittemore on The AI Daily Brief: Artificial Intelligence News and Analysis).

What to watch: This is a fundamental change in how information is created, shared, and consumed within organizations. Embracing dynamic web-native artifacts for internal knowledge will improve version control, collaboration, and agent-driven workflows.

⑥ Curiosity, not just capability, is critical for AI adoption.

Cognitive neuroscientist Chantel Prat explains that fear and a perceived sense of already "knowing the answer" actively inhibit curiosity and learning. This applies directly to AI adoption, where framing AI as a threat stifles exploration and adaptation (Chantel Prat on Beyond The Prompt - How to use AI in your company).

The context: Leaders need to cultivate a culture that fosters experimentation and learning with AI, rather than allowing fear of job displacement or complexity to create inertia. Educating on AI's potential as an augmentation partner is key to unlocking internal innovation.


The Signals

🟢 HEATING UP

Fable 5: Touted as the first Mythos class models, outperforming GPT-5.5 and enabling delegation of "responsibilities" over tasks. (Nathaniel Whittemore on The AI Daily Brief)

Unified Context Layer: Seen as crucial infrastructure to prevent enterprise AI agent failure and ensure explainability at scale. (Ravi Marwaha on The AI in Business Podcast)

AI-as-a-Reasoning-Partner: High-impact AI users frame problems, guide thinking, and iterate with AI rather than just prompt engineering. (Nathaniel Whittemore on The AI Daily Brief)

Space Data Centers: SpaceX's GPU rental deal with Google suggests an accidental rise of the largest Neo Cloud on Earth. (Nathaniel Whittemore on The AI Daily Brief)

🟠 ON WATCH

Zero Trust for AI Agents 🆕: A framework applying zero-trust principles to autonomous AI agent deployment to combat increasing AI-driven attacks. (Chris Benson on Practical AI)

bot sitting 🆕: The hidden, untracked labor (6.4 hours/week) employees spend feeding context and cleaning up AI outputs, creating a productivity paradox. (Rebecca Hinds on "The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis)

Theory of Mind 🆕: Emerging as a learned cognitive ability relevant to human-AI interaction and understanding diverse cognitive approaches. (Chantel Prat on Beyond The Prompt - How to use AI in your company)

Fable 5's controversial guardrails 🆕: Triggered backlash from researchers and enterprises over data retention and silent limitations on development. (Nathaniel Whittemore on The AI Daily Brief)

🥶 COOLING OFF

Probabilistic AI answers in regulated industries: Probabilistic answers are acceptable, but probabilistic decisions and actions are "not okay" in regulated environments. (Ravi Marwaha on The AI in Business Podcast)

Static knowledge artifacts (PDFs, PowerPoints): Become "really brittle" compared to dynamic, web-based formats that are better designed for agent consumption. (Nathaniel Whittemore on The AI Daily Brief)


The Debate

The central question this week: Is Retrieval-Augmented Generation (RAG) still a foundational necessity for high-stakes AI applications, especially with the advancement of "reasoning models" and increasingly larger context windows, or is its role diminishing?

🐂 The bull case for persistent RAG:Alex Bowcut (Head of Engineering, Sphere) argues that for domains requiring absolute accuracy and verifiable legal citations, such as tax law, RAG remains critical even with advanced models. Sphere's TRAM (Tax Review and Assessment Model) still leverages RAG due to the "absolute necessity of verifiable legal citations and high accuracy that current agentic systems cannot yet reliably provide."

"I think for us and, or at least for this particular problem, because we are so sensitive to accuracy and we are so sensitive to the exact right citation as of today, I don't think, you know, agents are just searching over the file system, grepping over. It is at a point where we could switch over and not lose accuracy."
— Alex Bowcut, Head of Engineering at Sphere on The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

🐻 The bear case for RAG's diminishing role: While not directly arguing against RAG, the broader conversation around advanced models like Fable 5 and OpenAI's "reasoning models" suggests a future where models inherently possess more context and reasoning capabilities, potentially reducing the reliance on external retrieval for many tasks. Alex Bowcut himself acknowledged a "big jump with OpenAI's O1 that came out in December of 24. Yeah, the first reasoning model pretty much right out of the gate." (Alex Bowcut on The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)), implying these models handle more context internally.

"We saw a big jump with OpenAI's O1 that came out in December of 24. Yeah, the first reasoning model pretty much right out of the gate. Like, you know, we swapped out the model names like everyone does, and we tried out this new model."
— Alex Bowcut, Guest at Sphere on The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Our read: RAG isn’t dead, but its application is becoming more targeted. For high-stakes, domain-specific tasks requiring absolute verifiability, RAG will likely remain essential. For broader, less critical applications, advanced reasoning models with larger context windows will likely reduce the need for complex retrieval systems.


The Bottom Line

AI is shifting from task automation to strategic reasoning partner, demanding leaders redefine "work," cultivate curiosity, and prioritize verifiable context over raw model power.


📖 Want the full episode breakdowns, guest details, and listen links?

Read the Episode Guide →

Episode Guide

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) — "Is RAG Dead? Lessons from Building AI for Tax Law with Alex Bowcut - #769"

Runtime: 52 min | Host: Sam Charrington | Guest: Alex Bowcut (Head of Engineering, Sphere)

Who should listen: Anyone building AI in high-stakes domains, especially those concerned with accuracy, explainability, and regulatory compliance.

This episode dives deep into how Sphere uses AI, specifically Retrieval Augmented Generation (RAG), to tackle complex global tax compliance. It reveals that TRAM, Sphere's internal tool, allows tax experts to work nearly two orders of magnitude faster with fewer errors, emphasizing the criticality of verifiable legal citations over pure agentic search.

"What we found is that TRAM allows our internal tax experts to move almost two orders of magnitude faster through this process with less errors than the traditional just fully human focused approach."
— Alex Bowcut, Head of Engineering at Sphere

▶ Listen · Apple Podcasts

The AI Daily Brief: Artificial Intelligence News and Analysis — "OpenAI Declares the Next Phase of AI"

Runtime: 30 min | Host: Nathaniel Whittemore | Guest: Nathaniel Whittemore (Host, The AI Daily Brief: Artificial Intelligence News and Analysis)

Who should listen: Leaders tracking the strategic direction of frontier AI labs and the evolving economic implications of generative AI.

OpenAI is signaling "Phase Three" for AI, emphasizing widespread access and the automation of AI R&D, with a goal for AI systems to perform a significant fraction of their research by March 2028. This episode explores the shift from a token subsidy to a token shortage era and the potential divergence between consumer and work-related agentic AI.

"Our internal belief is that By March of 2028, we may have a significant fraction of our research being done by AI systems in tandem with our own researchers."
— Nathaniel Whittemore, Host of The AI Daily Brief

▶ Listen · Apple Podcasts

The AI in Business Podcast — "How Unified Context Turns AI Into Real Enterprise Performance - with Ravi Marwaha of Arango"

Runtime: 38 min | Host: Daniel Faggella | Guest: Ravi Marwaha (Chief Operating Officer & Chief Technology Product Officer, Arango)

Who should listen: CTOs and enterprise architects struggling with AI agent deployment and looking to build robust, explainable AI systems.

Ravi Marwaha of Arango highlights that enterprise AI agents frequently fail in production not due to model limitations, but from a deficit of a live, temporally aware context layer. He advocates for treating context as a fundamental infrastructure component, crucial for real-time reasoning, explainability, and accurate responses, especially in regulated sectors.

"The way to flip that ratio is to treat context as infrastructure upfront, period. There's no two ways about it."
— Ravi Marwaha, Guest at Arango

▶ Listen · Apple Podcasts

The AI Daily Brief: Artificial Intelligence News and Analysis — "Why Fable 5 Is the Most Controversial AI Release Ever"

Runtime: 30 min | Host: Nathaniel Whittemore | Guest: Nathaniel Whittemore (Host, The AI Daily Brief: Artificial Intelligence News and Analysis)

Who should listen: AI researchers, developers, and policy makers interested in the ethical and practical debates around frontier model access and governance.

Anthropic's Fable 5 launch has ignited controversy over its safety restrictions, data retention policies, and silent limitations on AI development. This episode unpacks the backlash from researchers and enterprises, questioning the control frontier labs exert over user development and raising broader issues of open science versus proprietary AI.

"Anthropic is now silently making Claude Dumber for certain users on purpose, and there is no way to tell when it's happening to you."
— Akashgupta, Unknown Title at Unknown Organization

▶ Listen · Apple Podcasts

The AI in Business Podcast — "From Overwhelm to Working AI in Pharma and Life Sciences - with Art Shectman of Elephant Ventures"

Runtime: 34 min | Host: Marilie Fouché | Guest: Art Shectman (CEO and Founder, Elephant Ventures)

Who should listen: Pharma and life sciences executives seeking practical strategies for AI adoption amidst complex regulatory environments.

Art Shectman of Elephant Ventures advises pharma leaders to overcome AI implementation challenges by de-scoping initiatives to manageable, production-ready projects. He emphasizes the need for an engineering mindset to identify quick wins, focusing on compartmentalized workflows and tangible ROI, especially given the rapid pace of regulatory changes.

"Descope the thing you're going to first move with so that it's controllable and can get to production. But do something."
— Art Shectman, Guest at Elephant Ventures

▶ Listen · Apple Podcasts

The AI Daily Brief: Artificial Intelligence News and Analysis — "Fable 5 Raises the Bar for AI Ambition"

Runtime: 39 min | Host: Nathaniel Whittemore | Guest: Nathaniel Whittemore (Host, The AI Daily Brief)

Who should listen: Software development leaders, product managers, and advanced AI users looking to leverage next-gen models for complex problem-solving.

This episode highlights Anthropic's Fable 5 as a "Mythos class model" that redefines AI ambition, enabling users to delegate "responsibilities" rather than just "tasks." It demonstrates Fable 5's capacity for complex code migrations and strategic ideation, pushing for a shift towards "task imagination" and strategic use case classification for advanced AI.

"I'd normally highlight the numbers, but I want to talk about something else because with Fable 5 out in the world, I think a third era quietly started. Today I lead Claude Code and cowork on the desktop, so I think a lot about how people use AI to get work done. I believe we're about to see a major shift moving from giving AI tasks to giving IT responsibilities."
— Felix Risberg, Lead of Claude Code at Anthropic

▶ Listen · Apple Podcasts

The AI Daily Brief: Artificial Intelligence News and Analysis — "How We Use AI Is Changing"

Runtime: 26 min | Host: Nathaniel Whittemore | Guest: Nathaniel Whittemore (Host, The AI Daily Brief), Donald Trump (Former US President), Bernie Sanders (US Senator), Sam Altman (CEO, OpenAI), Daryl Bossinger (Former Microsoft employee and tech commentator), David Sachs (Former AIs), Brad Gertzner (Investor), Elon Musk (CEO, SpaceX), Jim Chanos (Prominent short seller), Yu Chen Jin, Jensen Huang (CEO, NVIDIA), Che Taiwan (Chairman, SK Group), Jenny Hsiao (Partner, Leonis Capital), David Georgiegan, Yoshik, Anand, Sarah Fryer (CFO, OpenAI), Ben Holmes (OpenAI), Peter Steinberger (OpenClock creator and employee, OpenAI), Boris Czerny (Claude Code creator), Andrej Karpathy, Jake (Railways), Shawnu Matthew, Zemus R (Exponential views), Dane Ketch (CTO, Cloudflare)

Who should listen: Business leaders and strategists tracking the evolving landscape of AI usage, from basic chat to advanced agentic workflows.

This episode unpacks the evolving landscape of AI usage, identifying a growing "advantage gap" between casual users and power users leveraging sophisticated agents, coding tools, and loops. Nathaniel Whittemore suggests OpenAI's ChatGPT overhaul aims to guide users towards these advanced behaviors, fundamentally reshaping AI interaction and the "super app" concept.

"The highest impact Users aren't better prompt engineers. They treat AI like a reasoning partner. They frame problems, guide thinking, iterate, and push for better answers."
— Nathaniel Whittemore, Host of The AI Daily Brief

▶ Listen · Apple Podcasts

The AI Daily Brief: Artificial Intelligence News and Analysis — "10+ Things You Should Build With AI Instead of Sending Files"

Runtime: 22 min | Host: Nathaniel Whittemore | Guest: Nathaniel Whittemore (Host, The AI Daily Brief: Artificial Intelligence News and Analysis)

Who should listen: Knowledge workers, consultants, and teams looking to upgrade their internal and client-facing communication artifacts.

NLW argues that AI, particularly with features like OpenAI's "Sites" in Codex, is transforming static knowledge work artifacts (like documents) into dynamic, interactive websites. This shift fundamentally addresses issues like version control, distribution friction, and limited navigation, making websites a superior format for shareable, updateable information that can also be designed for "agent consumption."

"Any semi capable person can now generate a useful, fairly good looking website as easily, if not more easily, than they used to throw together a deck."
— Nathaniel Whittemore, Host of The AI Daily Brief: Artificial Intelligence News and Analysis

▶ Listen · Apple Podcasts

Decoder with Nilay Patel — "Microsoft AI chief thinks superintelligence is near, but won't take your job"

Runtime: 76 min | Host: Nilay Patel | Guest: Mustafa Suleyman (CEO, Microsoft AI)

Who should listen: Executives, strategists, and investors interested in Microsoft's long-term AI vision and its approach to superintelligence and responsible AI.

Mustafa Suleyman, CEO of Microsoft AI, clarified Microsoft's independent pursuit of superintelligence, emphasizing their need to "stand on our own two feet" for AI development. He discusses the effectiveness of current AI models for emotional support and productivity, while asserting that AI will automate tasks rather than eliminate entire jobs, and critiques Anthropic's approach to AI consciousness.

"We have to make sure that we're completely sustainable and we're not just a recipient of somebody else's IP that we then slightly modify and adapt and put into production for our products. But we actually have the ability to stand on our own two feet and create world class models."
— Mustafa Suleyman, CEO of Microsoft AI

▶ Listen · Apple Podcasts

Beyond The Prompt - How to use AI in your company — "Why Fear Kills Curiosity and What That Means for AI - with Chantel Prat, Cognitive Neuroscientist"

Runtime: 62 min | Host: Jeremy Utley | Guest: Chantel Prat (Cognitive Neuroscientist)

Who should listen: Business leaders and educators looking to foster AI adoption and innovation within their organizations by overcoming resistance and fear.

Cognitive neuroscientist Chantel Prat explains that fear and a perceived sense of "knowing the answer" actively inhibit curiosity and learning, which directly impacts AI adoption. The discussion emphasizes balancing "exploration versus exploitation" and intentionally co-adapting with AI to enhance human capabilities, rather than just offloading tasks.

"If you think you already know the answer, you will feel zero curiosity. And if you feel zero curiosity, your brain is not set up to learn."
Chantel Prat

▶ Listen · Apple Podcasts

Practical AI — "Zero Trust for AI Agents"

Runtime: 47 min | Host: Daniel Whitenack | Guest: Chris Benson (Principal AI and Autonomy Research Engineer, Practical AI LLC)

Who should listen: Cybersecurity professionals, AI architects, and compliance officers developing secure AI agent deployments in the face of escalating AI-driven threats.

Chris Benson and Daniel Whitenack discuss Anthropic's Zero Trust for AI Agents framework, highlighting the urgent need for robust security in autonomous AI agents. They cover foundational, enterprise, and advanced zero-trust concepts, including agent identity, authentication, least agency, and the challenges of integrity and recovery in evolving agentic systems, stressing that over 90% of enterprises are not currently operating this way.

"The forcing function behind this discussion is that people actually need to adopt autonomous agents because of this offensive threat to their infrastructure."
— Daniel Whitenack, CEO at PredictionGuard

▶ Listen · Apple Podcasts

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis — "Babysitting the Machine: Glean's Rebecca Hinds on the Hidden Human Labor of AI at Work"

Runtime: 106 min | Host: Nathan Labenz | Guest: Rebecca Hinds (Head of the Work AI Institute, Glean)

Who should listen: HR professionals, operations leaders, and CEOs grappling with AI adoption's impact on productivity, employee engagement, and organizational design.

Rebecca Hinds of Glean exposes the "productivity paradox" of AI, where despite individual time savings, organizational improvement is minimal due to "bot sitting" (6.4 hours/week of hidden labor) and "bot shitting" (undefendable AI-generated work). This episode highlights the unseen human effort required to make AI useful and its impact on engagement and turnover.

"Bot sitting is all the unglamorous, untracked labor required to make AI useful, feeding it context, debugging its outputs and cleaning up its messes, which the report finds consumes 6.4 hours per week, or roughly half of all the time that AI supposedly saves."
— Rebecca Hinds, Head of the Work AI Institute at Glean

▶ Listen · Apple Podcasts

PARTNER

Not sure where AI fits in your operations? Start with the data.

Velocity Road's AI Readiness Assessment maps your organization against 7 operational dimensions and shows exactly where AI creates ROI — in under 10 minutes.

Take the Assessment -> →

Avi Savar

Get AI & Technology in your inbox

How AI and Tech are reshaping business. Free.