Med-Gemini : A New AI Model Reaching 91.1% Accuracy in Medical Diagnostics

Nitika Sharma Last Updated : 01 May, 2024

3 min read

Researchers from Google and DeepMind have introduced Med-Gemini, a new generation of AI models specifically tailored for medical applications. Building on the strengths of the 2023 Gemini models renowned for language processing, multimodal understanding, and long-context reasoning, Med-Gemini significantly enhances these capabilities for healthcare applications.

Med-Gemini’s superiority is demonstrated through evaluation on 14 medical benchmarks, where it achieves new state-of-the-art performance on 10 benchmarks, often surpassing GPT-4 models significantly. Notably, on MedQA (USMLE), Med-Gemini achieved 91.1% accuracy, outperforming prior models by 4.6%.

Results from our first ✨Med-Gemini✨models, variants of the base Gemini multimodal models that have been tuned for the medical domain.

The multimodal and long context capabilities of the Gemini models really shine in some of the capabilities.

Joint work with @vivnat… https://t.co/KkKdLXjBx7
— Jeff Dean (@🏡) (@JeffDean) April 30, 2024

The Making of Med-Gemini

Med-Gemini opens exciting doors for AI in medicine. It can assist doctors in tackling complex diagnoses, engage in informative medical dialogue, and efficiently analyze vast amounts of data within electronic health records. The researchers achieved this specialization through innovative techniques:

Self-training with Web Search Integration: Med-Gemini can access and integrate up-to-date medical information from the web, ensuring its knowledge stays current.
Multimodal Fine-Tuning: The model can adapt to incorporate new medical data formats, making it future-proof.
Customized Encoders: Med-Gemini can process various data types, including text, images, videos, and even sensor readings from medical equipment.

Self-training with Web Search Integration

Capabilities of Med-Gemini

Med-Gemini is introduced as a family of highly capable, multimodal medical models built upon Gemini. The models’ clinical reasoning capabilities are enhanced through self-training and web search integration, while multimodal performance is improved via fine-tuning and customized encoders.

Med-Gemini models achieve state-of-the-art (SoTA) performance on 10 out of 14 medical benchmarks spanning text, multimodal, and long-context applications, surpassing the GPT-4 model family on every benchmark where a direct comparison could be made.

The bar chart below demonstrates the relative percentage gains from the models over prior SoTA across the benchmarks. Particularly on the MedQA (USMLE) benchmark, a new SoTA is achieved, surpassing the prior best (Med-PaLM 2) by a significant margin of 4.6%.

Additionally, re-annotation of the dataset with expert clinicians reveals that 7.4% of questions are deemed unfit for evaluation due to lacking key information, having incorrect answers, or supporting multiple plausible interpretations. These data quality issues are accounted for to characterize the performance of the model more precisely.

Med-Gemini models excel in multimodal and long-context capabilities, evidenced by their SoTA performance on several benchmarks including needle-in-a-haystack retrieval from long, de-identified health records, and medical video question answering benchmarks.

Beyond benchmarks, the real-world potential of Med-Gemini is demonstrated through quantitative evaluation on medical summarization, referral letter generation, and medical simplification tasks where the models outperform human experts, in addition to qualitative examples of multimodal medical dialogue.

Safety and Accuracy Remain Paramount

The paper emphasizes the importance of safety and accuracy in medical applications. The researchers acknowledge the need for specialized techniques like prompting and fine-tuning to ensure responsible AI development in this critical domain.

One such technique is the “uncertainty-guided search strategy.” This allows Med-Gemini to access and integrate relevant web search results during complex clinical reasoning tasks, leading to more nuanced and reliable outcomes.

Also Read: Top 7 AI Healthcare Solution Providers

Dialogue Example

You can find the research paper here.

Our Say

Med-Gemini’s multimodal capabilities open doors for more natural and comprehensive interactions between healthcare providers and patients. Doctors can leverage the model’s ability to analyze various data types, while the model itself can interact more conversationally, requesting additional information for a more complete picture.

This development adds to Google’s growing portfolio of healthcare-focused AI models, including Med-PaLM 2, AlphaFold, and Flan-PaLM. Med-Gemini represents a significant step forward in AI-powered healthcare, paving the way for a future with enhanced diagnostics, personalized medicine, and improved patient-provider communication.

Follow us on Google News to stay updated with the latest innovations in the world of AI, Data Science, & GenAI.

Nitika Sharma

Hello, I am Nitika, a tech-savvy Content Creator and Marketer. Creativity and learning new things come naturally to me. I have expertise in creating result-driven content strategies. I am well versed in SEO Management, Keyword Operations, Web Content Writing, Communication, Content Strategy, Editing, and Writing.

News

Free Courses

4.7

Generative AI - A Way of Life

Explore Generative AI for beginners: create text and images, use top AI tools, learn practical skills, and ethics.

4.5

Getting Started with Large Language Models

Master Large Language Models (LLMs) with this course, offering clear guidance in NLP and model training made simple.

4.6

Building LLM Applications using Prompt Engineering

This free course guides you on building LLM apps, mastering prompt engineering, and developing chatbots with enterprise data.

4.8

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Explore practical solutions, advanced retrieval strategies, and agentic RAG systems to improve context, relevance, and accuracy in AI-driven applications.

4.7

Microsoft Excel: Formulas & Functions

Master MS Excel for data analysis with key formulas, functions, and LookUp tools in this comprehensive course.

Reading list

Introduction to Generative AI

Introduction to Generative AI applications

No-code Generative AI app development

Code-focused Generative AI App Development

Introduction to Responsible AI

LLMS

Prompt Engineering

Finetuning LLMs

Training LLMs from Scratch

Langchain

RAG

LlamaIndex

Stable Diffusion

Med-Gemini : A New AI Model Reaching 91.1% Accuracy in Medical Diagnostics

The Making of Med-Gemini

Capabilities of Med-Gemini

Safety and Accuracy Remain Paramount

Dialogue Example

Our Say

Free Courses

Generative AI - A Way of Life

Getting Started with Large Language Models

Building LLM Applications using Prompt Engineering

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Microsoft Excel: Formulas & Functions

Recommended Articles

Responses From Readers

Write for us

Congratulations, You Did It!

Analytics Vidhya (4)

brahmaid

csrftoken

Identityid

sessionid

Google (1)

g_state

Microsoft (7)

MUID

_clck

_clsk

SRM_I

SM

CLID

SRM_B

Google (7)

_gid

_ga_#

_gat_#

collect

AEC

G_ENABLED_IDPS

test_cookie

Webengage (2)

_we_us

WebKlipperAuth

LinkedIn (16)

ln_or

JSESSIONID

li_rm

AnalyticsSyncHistory

lms_analytics

liap

visit

li_at

s_plt

lang

s_tp

AMCV_14215E3D5995C57C0A495C55%40AdobeOrg

s_pltp

s_tslv

li_theme

li_theme_set

Google (11)

_gcl_au

SID

SAPISID

__Secure-#

APISID

SSID