AI Tool Based on Evidence-Based Medicine Surpasses Competing AI and

In a groundbreaking advancement at the intersection of biomedical informatics and artificial intelligence, researchers from the University at Buffalo have unveiled a clinical AI tool that is setting new standards for medical reasoning and diagnostic accuracy. Dubbed Semantic Clinical Artificial Intelligence, or SCAI (pronounced “Sky”), this system has outperformed virtually every other AI model tested on the United States Medical Licensing Exam (USMLE), an exam that rigorously evaluates a physician’s ability to apply knowledge in a clinical context. Published in JAMA Network Open on April 22, 2025, the findings mark a pivotal moment in AI-assisted medicine, demonstrating that AI can transcend mere information retrieval and generate complex semantic reasoning akin to human medical decision-making.

The USMLE, comprising three essential Step exams, assesses a physician’s mastery not only of factual knowledge but also fundamental patient-centered skills and clinical judgment. SCAI’s remarkable performance—achieving an accuracy score of 95.2% on Step 3—surpasses that of GPT-4 Omni’s score of 90.5%, placing it well ahead of native large language models operating without semantic augmentation. This indicates a qualitative leap rather than an incremental gain in AI-driven diagnostics. The development team, led by Dr. Peter L. Elkin, Chair of Biomedical Informatics at the Jacobs School of Medicine and UBMD Internal Medicine, emphasizes that SCAI is designed to serve as a collaborative partner for clinicians, not a replacement.

Unlike conventional AI tools that rely heavily on statistical associations gleaned from massive digital text corpora—tools occasionally criticized for effectively “plagiarizing” internet content—SCAI employs a rigorously structured approach to knowledge representation and reasoning. The system utilizes semantic triples, a distinct form of knowledge encoding that links subjects, relations, and objects (for example, “Penicillin treats pneumococcal pneumonia”) to establish a richly interconnected knowledge network. This semantic network forms the substrate from which SCAI can draw logical inferences, enabling reasoning that resembles the cognitive processes physicians develop over years of training.

The integration of formal semantics into large language models represents a paradigm shift that directly addresses a known limitation in AI medicine: confabulation. Confabulation refers to an AI’s tendency to produce plausible but unverified or erroneous responses when faced with insufficient data. By coupling semantic knowledge graphs—structures designed to discover both explicit and hidden relationships in heterogeneous medical datasets—with retrieval-augmented generation techniques, SCAI drastically reduces this issue. Retrieval-augmented generation allows the model to consult specific, authoritative external knowledge databases before formulating responses, effectively ensuring that its outputs are anchored in validated medical facts.

This architecture leverages approximately 13 million discrete medical facts, encompassing verified data from an extensive array of domains, including genomic information, clinical guidelines, drug interactions, patient safety recommendations, and discharge protocols. Notably, the system deliberately excludes potentially biased inputs, such as raw clinical notes, to preserve objectivity and evidence-based reasoning. This breadth and depth of curated information imbue SCAI with a unique capacity for comprehensive and nuanced analysis across specialties.

SCAI’s conversational ability is another critical feature that differentiates it from traditional AI assistants. It can engage interactively with clinicians and lay users alike, enhancing decision-making through reasoned dialogue rather than rote answer generation. This dynamic interaction simulates a partnership, where the AI contributes balanced, evidence-based perspectives, helping users to explore complex clinical scenarios more thoroughly. Dr. Elkin asserts that by embedding semantic understanding into AI models, they are nurturing systems that think more like trained physicians, who integrate evidence-based medicine into patient care with contextual reasoning.

Beyond improving diagnostic accuracy, SCAI has the potential to transform healthcare delivery at multiple levels. Its ability to democratize access to specialist knowledge can empower primary care providers to manage more complex conditions confidently. This could alleviate bottlenecks in specialty referrals, reduce disparities in access to expert care, and enhance patient safety by better informing clinical decisions with a comprehensive, real-time knowledge base. Furthermore, SCAI’s scalable platform could serve as an educational tool, assisting medical students and professionals in navigating the increasingly complex landscape of modern medicine.

Despite the system’s extraordinary capabilities, the research team unequivocally positions AI as an augmentation rather than a replacement for human clinicians. As Dr. Elkin eloquently puts it, “Artificial intelligence isn’t going to replace doctors, but a doctor who uses AI may replace a doctor who does not.” This underscores a future healthcare environment where AI tools serve as cognitive amplifiers, sharpening a physician’s clinical acumen and supporting more precise and informed patient care decisions.

The origins of SCAI trace back to an existing foundation in natural language processing, which the research team significantly advanced by enriching it with authoritative, multi-dimensional medical knowledge. This fusion required expert curation and computational ingenuity to construct semantic knowledge networks capable of reasoning. The result is an AI tool that conceptualizes medical facts semantically rather than statistically, enabling it to infer “causal” relationships and logical consequences in a manner far closer to human clinical thought.

In terms of methodological innovation, SCAI exemplifies the power of combining symbolic AI—knowledge graphs and semantic triples—with modern deep learning architectures. This hybrid approach leverages the strengths of both paradigms: symbolic AI’s interpretability and reasoning, alongside the pattern recognition and generative capabilities of neural networks. Retrieval-augmented generation acts as an intelligent interface, fetching exact information when needed, grounding the AI model in authentic medical knowledge rather than probabilistic guesswork.

The research team comprises a multidisciplinary roster of experts across biomedical informatics, oncology, and veterans’ healthcare systems, highlighting the collaborative effort that underpinned SCAI’s creation. Funded by the National Institutes of Health and the Department of Veterans Affairs, their work reflects a strategic investment in AI technologies poised to revolutionize clinical practice. The open-access publication allows for widespread dissemination and invites further validation and development within the broader scientific and medical communities.

As AI continues to evolve, SCAI represents a model for future clinical decision support systems—ones that do not merely compute but reason, engage, and integrate seamlessly into the human workflows of medicine. Its debut performance on the USMLE not only sets a new bar for technical achievement but also sparks a profound conversation on how AI can best serve the healthcare ecosystem. In an era where medicine demands both precision and empathy, tools like SCAI promise to be indispensable allies in the quest to improve patient outcomes worldwide.

Subject of Research:
Not applicable

Article Title:
Semantic Clinical Artificial Intelligence vs Native Large Language Model Performance on the USMLE

News Publication Date:
22-Apr-2025

Web References:
http://jamanetwork.com/journals/jamanetworkopen/fullarticle/10.1001/jamanetworkopen.2025.6359?utm_source=For_The_Media&utm_medium=referral&utm_campaign=ftm_links&utm_term=042225

References:
10.1001/jamanetworkopen.2025.6359

Image Credits:
Credit: Sandra Kicman/University at Buffalo

Keywords:
Informatics

Tags: AI in medical diagnosticsAI surpassing human doctorsAI-assisted medical reasoningbiomedical informatics innovationsclinical judgment in AIdiagnostic accuracy in healthcareevidence-based medicine advancementsfuture of AI in medicinemedical decision-making AI toolsSemantic Clinical Artificial IntelligenceUniversity at Buffalo researchUSMLE performance comparison

AI Tool Based on Evidence-Based Medicine Surpasses Competing AI and Majority of Doctors on USMLE Exams

Related Posts

Evaluating Geriatric Assessment and Interventions for Prostate Cancer Patients on ADT

Single-Cell Multi-Omics Reveals Cancer-Associated Fibroblast Programs in Breast Cancer

Real-Time Tracking of Pathogen Spread Using Wastewater Analysis

Long-Term Air Pollution Linked to Increased Chronic Kidney Disease Risk

POPULAR NEWS

Detection of EDCs in Breast Milk and Infant Urine Up to Six Months Highlights Early Exposure Risks

New Drug Candidate Developed at McMaster Shows Potential for Treating Brain Cancer

KTU Researchers Explore Ultrasound’s Role in Enhancing Blood Flow Beyond Diagnostics

高齢者の骨粗鬆症治療の持続性比較

About

Follow us

Recent News

TP53 Mutation Triggers CD8+ T Cell Exhaustion Causing Therapy-Resistant Urothelial Cancer

Transient Simulation Advances in Bioresorbable Flexible Electronic Circuits

Evaluating Geriatric Assessment and Interventions for Prostate Cancer Patients on ADT

Subscribe to Blog via Email

Welcome Back!

Retrieve your password