This study evaluates the performance of multimodal AI models in medical diagnostics using the NEJM Image Challenge dataset, comparing their accuracy to human collective intelligence.
1️⃣ Anthropic's Claude 3 models showed the highest accuracy, surpassing average human performance by about 10%.
2️⃣ Human collective intelligence achieved a 90.8% accuracy rate, outperforming all AI models.
3️⃣ GPT-4 Vision Preview was selective, often responding to easier questions with smaller images and longer texts.
4️⃣ OpenAI's GPT-4 Vision Preview answered only 76% of the cases, while the other models responded to all queries.
5️⃣ The study highlights the potential and current limitations of multimodal AI in clinical diagnostics.
6️⃣ Ethical and reliability concerns arise from integrating multimodal AI into medical diagnostics.
7️⃣ The EU AI Act emphasizes the need for transparency, robustness, and human oversight in high-risk AI systems, including medical AI.
✍🏻 Robert Kaczmarczyk, Theresa Isabelle Wilhelm, Dr. med. Ron Martin, B.Sc., Dr. med. Jonas Roos. Evaluating multimodal AI in medical diagnostics. npj Digital Medicine. 2024. DOI: 10.1038/s41746-024-01208-3
#Accuracy is the primary metric reported for such systems, but accuracy can be #inflated, intentionally or not: by training and testing on the same or overlapping data sets, or by evaluating on images that closely resemble the training data. What other metrics do we need to examine, and what underlying #assumptions do we need to understand, before accepting the reported #performance of these #AI models?
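One concrete way to see the comment's point: on an imbalanced diagnostic test set, raw accuracy can look strong while sensitivity is poor. The sketch below uses entirely made-up confusion-matrix counts (not figures from the study) to show why balanced accuracy and per-class recall are worth checking alongside accuracy.

```python
# Minimal sketch: why raw accuracy can mislead on an imbalanced
# diagnostic test set. All counts below are hypothetical.

def metrics(tp, fn, fp, tn):
    """Return accuracy, sensitivity, specificity, balanced accuracy."""
    acc = (tp + tn) / (tp + fn + fp + tn)
    sens = tp / (tp + fn)          # recall on the diseased class
    spec = tn / (tn + fp)          # recall on the healthy class
    return acc, sens, spec, (sens + spec) / 2

# Hypothetical imbalanced set: 950 healthy, 50 diseased. A model that
# labels almost everything "healthy" still scores high raw accuracy.
acc, sens, spec, bal = metrics(tp=5, fn=45, fp=10, tn=940)
print(f"accuracy={acc:.3f}  sensitivity={sens:.3f}  "
      f"specificity={spec:.3f}  balanced={bal:.3f}")
# accuracy is ~0.945 even though sensitivity is only 0.100
```

The same caution applies to selective answering: if a model skips hard cases (as GPT-4 Vision Preview did for 24% of them), accuracy computed only over answered cases is not comparable to accuracy over the full set.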
Human collective intelligence will be a vital benchmark as we pursue the automation of key tasks. What remains to be seen is how policymakers will view risk in clinical settings, and how malpractice insurance will evolve in this pursuit.
Thanks for sharing :)
Thank you for sharing!
Interesting AI calibration!
Very informative
Impressive study! It’s fascinating to see multimodal AI models pushing boundaries in medical diagnostics.
Generative AI Engineer and Consultant | Machine Learning Engineer | Ph.D. Biomedical Engineering
This is an exciting coincidence, I was designing this study in my head yesterday as I was working with multimodal models for a different application! The sobering observation I had while reading this is that it is already out of date. Claude 3.5 is out; Gemini 1.5 Pro and Flash are multimodal by design and were not evaluated here; and OpenAI has already released GPT-4 omni (GPT-4o), which is natively multimodal. And that's just the major players; smaller labs have released many other open- and closed-source models, like LLaVA and others. This is not to disparage the authors' work, but just to remind everyone that the field moves very quickly, so take every metric you read with a large handful of salt. By the time you read it, it's probably already incorrect.