SDS 1004: Recursive Self-Improvement

Could an AI get good enough at AI research to build its own, more capable successor and kick off a compounding loop? That’s recursive self-improvement (RSI) and it surged into the conversation after Anthropic revealed that, as of May 2026, Claude wrote more than 80% of the code merged into its production codebase. In this Five-Minute Friday, Jon Krohn separates today’s AI-assisted coding from true RSI, walks through the accelerating evidence – METR’s shrinking task “time horizon,” Google DeepMind’s AlphaEvolve, Andrej Karpathy’s overnight training-tuner, weighs Jack Clark’s 60% bet that AI builds its own successor by 2028 against the compute, data and “marketing” skeptics. As ever, Jon lands in the optimistic middle.

Thanks to our Sponsors:

Interested in sponsoring a Super Data Science Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Jon Krohn breaks down recursive self-improvement (RSI), the idea that an AI capable enough at AI research could build a better successor, which builds a better one still. The term surged after Anthropic reported that Claude wrote over 80% of the code merged into its production codebase as of May 2026, up from low single digits before its coding agent launched in early 2025, with the typical Anthropic engineer now shipping roughly eight times as much code per day. But Jon stresses the distinction: this is AI-assisted coding, where humans still set the goals and judge the results, not true RSI, where the AI runs the whole research loop itself with little human involvement.

The evidence of acceleration is mounting. METR finds the length of tasks AI can complete autonomously has been doubling, recently every four months rather than seven, while Anthropic’s success rate on the hardest open-ended problems leapt from under 20% in late 2025 to 76% by May. Google DeepMind’s AlphaEvolve improved the machinery of AI (data-center scheduling and faster matrix multiplication), and Andrej Karpathy’s tool ran roughly 700 overnight experiments to speed up already-optimized training code by 11%. Anthropic sketches three futures, giving best odds to continued human-steered acceleration, while co-founder Jack Clark puts 60% on AI creating its own successor by 2028. Skeptics cite compute and data bottlenecks and a whiff of marketing; Jon lands in the optimistic middle, urging monitoring and human checkpoints.

ITEMS MENTIONED IN THIS PODCAST:

This episode of SuperDataScience is made possible by Anthropic.
SDS 497: Maximizing the Global Impact of Your Career
Anthropic’s report on recursive self-improvement
METR (Model Evaluation & Threat Research)
METR: “Measuring AI Ability to Complete Long Tasks”
AI Digest: Time Horizons
Google DeepMind’s AlphaEvolve
Andrej Karpathy’s autoresearch tool (GitHub)
“RSI Is the New AGI” (The Batch, DeepLearning.AI)
Super Data Science Podcast Team

DID YOU ENJOY THE PODCAST?

Download The Transcript

Podcasts SDS 1004: Recursive Self-Improvement

Podcast Transcript

Share on

Related Podcasts

June 26, 2026

June 23, 2026

June 19, 2026