'Change in technology that will reshape medicine': Leading medical experts say new AI reasoning models are so good they outperform even the best specialist doctors when it comes to diagnosing some of the world's most mysterious cases

Doctor interacting with an AI model
(Image credit: AI Generated)

  • AI models identify rare diseases faster than many experienced clinicians
  • Systems reach correct or near diagnoses in the majority of difficult cases
  • Models analyze symptoms and test data using structured reasoning processes

A new generation of AI tools is claiming to outperform experienced clinicians in diagnosing rare and complex medical conditions.

These reasoning models can process long chains of symptoms, test results, and clinical notes, then propose or narrow down the correct diagnosis faster than many human specialists.

Some researchers argue that this represents a profound change in technology that will reshape medicine, especially for cases where the correct diagnosis is not obvious even after extensive evaluation.

Article continues below

AI models tackle tough diagnoses

“We’re witnessing a really profound change in technology that will reshape medicine,” Arjun Manrai of Harvard University said in a news conference.

Still, serious questions remain about whether these systems can handle the full weight of real-world clinical uncertainty.

In one major study, researchers tested a leading AI reasoning model on a mix of textbook-style cases and real patient data from a Boston emergency department.

The model analyzed step-by-step descriptions of symptoms, test orders, and results, just as clinicians do.

It listed possible diagnoses more often than human doctors and included the true diagnosis, or something very close to it, in about 80% of the difficult cases.

For one transplant patient with subtle signs of a life-threatening infection, the model raised appropriate suspicion roughly a day before the clinical team did.

Researchers say the technology is particularly strong at scanning broad patterns across rare diseases that individual doctors may rarely encounter.

However, the studies rely on curated patient descriptions rather than raw, chaotic emergency room environments.

The models respond to the information they are given, not the full mess of overlapping priorities and incomplete data seen in real clinics.

Why uncertainty is still a problem

Despite the capacity of these AI reasoning models, critics point out that clinical reasoning is more than just step-by-step logic on a clean text summary.

“When we say clinical reasoning, it doesn’t mean the same thing as model reasoning,” says Arya Rao of Harvard Medical School, who was not involved in the study.

“These models have been optimized to do this kind of sequential thought that we call reasoning, but it’s not at all the same thing as how we teach medical students to reason.”

Doctors often have to entertain multiple uncertain possibilities at once, then update them as new data arrive.

AI models tend to latch onto a single strong explanation and update their answers in brittle ways when new facts appear.

One team that tested 21 different AI systems found that even the best reasoning models struggled when considering several uncertain diagnoses at the same time.

The team argued large language models are not yet ready to make independent decisions in medical settings.

They are at best useful for second opinions or for surfacing rare conditions that clinicians might initially overlook.

Experts stress that human doctors are still essential for interpreting context, talking to patients, and weighing risks in real time.

The technology may help avoid missed diagnoses in some settings, but it introduces new risks if used without careful oversight and appropriate guardrails.

Via Science News


Google logo on a black background next to text reading 'Click to follow TechRadar'

Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds.


TOPICS
Efosa Udinmwen
Freelance Journalist

Efosa has been writing about technology for over 7 years, initially driven by curiosity but now fueled by a strong passion for the field. He holds both a Master's and a PhD in sciences, which provided him with a solid foundation in analytical thinking.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.