Tuesday, May 13, 2025
HomeAIImprovements in reasoning AI models may slow down soon, analysis finds

Improvements in reasoning AI models may slow down soon, analysis finds

Share


An analysis by Epoch AI, a nonprofit AI research institute, suggests the AI industry may not be able to eke massive performance gains out of reasoning AI models for much longer. As soon as within a year, progress from reasoning models could slow down, according to the report’s findings.

Reasoning models such as OpenAI’s o3 have led to substantial gains on AI benchmarks in recent months, particularly benchmarks measuring math and programming skills. The models can apply more computing to problems, which can improve their performance, with the downside being that they take longer than conventional models to complete tasks.

Reasoning models are developed by first training a conventional model on a massive amount of data, then applying a technique called reinforcement learning, which effectively gives the model “feedback” on its solutions to difficult problems.

So far, frontier AI labs like OpenAI haven’t applied an enormous amount of computing power to the reinforcement learning stage of reasoning model training, according to Epoch.

That’s changing. OpenAI has said that it applied around 10x more computing to train o3 than its predecessor, o1, and Epoch speculates that most of this computing was devoted to reinforcement learning. And OpenAI researcher Dan Roberts recently revealed that the company’s future plans call for prioritizing reinforcement learning to use far more computing power, even more than for the initial model training.

But there’s still an upper bound to how much computing can be applied to reinforcement learning, per Epoch.

Epoch reasoning model training
According to an Epoch AI analysis, reasoning model training scaling may slow downImage Credits:Epoch AI

Josh You, an analyst at Epoch and the author of the analysis, explains that performance gains from standard AI model training are currently quadrupling every year, while performance gains from reinforcement learning are growing tenfold every 3-5 months. The progress of reasoning training will “probably converge with the overall frontier by 2026,” he continues.

Techcrunch event

Join us at TechCrunch Sessions: AI

Secure your spot for our leading AI industry event with speakers from OpenAI, Anthropic, and Cohere. For a limited time, tickets are just $292 for an entire day of expert talks, workshops, and potent networking.

Exhibit at TechCrunch Sessions: AI

Secure your spot at TC Sessions: AI and show 1,200+ decision-makers what you’ve built — without the big spend. Available through May 9 or while tables last.

Berkeley, CA | June 5

REGISTER NOW

Epoch’s analysis makes a number of assumptions, and draws in part on public comments from AI company executives. But it also makes the case that scaling reasoning models may prove to be challenging for reasons besides computing, including high overhead costs for research.

“If there’s a persistent overhead cost required for research, reasoning models might not scale as far as expected,” writes You. “Rapid compute scaling is potentially a very important ingredient in reasoning model progress, so it’s worth tracking this closely.”

Any indication that reasoning models may reach some sort of limit in the near future is likely to worry the AI industry, which has invested enormous resources developing these types of models. Already, studies have shown that reasoning models, which can be incredibly expensive to run, have serious flaws, like a tendency to hallucinate more than certain conventional models.

Popular

Related Articles

Anthropic co-founder Jared Kaplan is coming to TechCrunch Sessions: AI

Hungry to learn more about Anthropic, directly from Anthropic? You aren’t alone if...

The Billion-Dollar Fraud War No Ones Really Fighting

Let’s talk about the most popular F-word in...

Are Light Pulses from the Nearest Habitable Exoplanet, Proxima b, Natural or Artificial in Origin?

Avi Loeb is the head of the Galileo Project, founding director of Harvard University’s — Black...

Apple brings emergency satellite features to iPhone 13 with iOS 18.5

Apple on Monday released iOS 18.5, which expands emergency satellite capabilities to iPhone...

Apple reportedly plans to hike prices of upcoming iPhones

Apple is planning to increase the prices of its iPhone lineup set to...

Egypts Nawy, the largest proptech in Africa, raises $52M to take on MENA

For decades, buying property in Egypt meant navigating a fragmented real estate market,...

Trump fires Copyright Office director after report raises questions about AI training

President Donald Trump has fired Shira Perlmutter, who leads the U.S. Copyright Office....

23andMe customers notified of bankruptcy and potential claims deadline to file is July 14

23andMe, the genetic testing giant once valued in the billions, is now navigating...
0 0 votes
Article Rating
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x