Categories: HealthcareNews

Mendel and UMass Amherst Unveil Groundbreaking Research on AI-driven Hallucination Detection in Healthcare

Innovative framework developed to ensure the accuracy and reliability of AI-generated medical summaries.

SAN JOSE, Calif. and AMHERST, Mass., Aug. 6, 2024 /PRNewswire/ — Mendel, a leader in healthcare AI, and the University of Massachusetts Amherst (UMass Amherst) have jointly published pioneering research addressing the critical issue of faithfulness hallucinations in AI-generated medical summaries. This collaborative effort marks a significant advancement in ensuring the safety and reliability of AI applications in healthcare settings.

Research Overview

In recent years, large language models (LLMs) such as GPT-4o and Llama-3 have shown remarkable capabilities in generating medical summaries. However, the risk of hallucinations—where AI outputs include false or misleading information—remains a significant concern. This study aimed to systematically detect and categorize these hallucinations to improve the trustworthiness of AI in clinical contexts.

The research team developed a robust detection framework, categorizing hallucinations into five types. A pilot study of 100 summaries from GPT-4o and Llama-3 models revealed that GPT-4o produced longer summaries (>500 words) and often made bold, two-step reasoning statements, leading to hallucinations. Llama-3 hallucinated less by avoiding extensive inferences, but it’s summaries were of lower quality. Below is a table summarizing the inconsistencies identified in both models:

Read moreFossil launches its first smartwatch on the Qualcomm Snapdragon Wear 3100 platform Model	Medical Event Inconsistency	Incorrect Reasoning	Chronological Inconsistency
GPT-4o	Read moreConwy Care Home Multi-Professional Response wins Secure I.T. Environments Sponsored award at Betsi Cadwaladr University Health Board Achievement Awards 21	44	2
Llama-3	19	Read moreTorus and Aareon Collaborate on Housing Management Solution 26	1

“Our findings highlight the critical risks posed by hallucinations in AI-generated medical summaries,” said Andrew McCallum, Distinguished Professor of Computer Science, University of Massachusetts Amherst. “Ensuring the accuracy of these models is paramount to preventing potential misdiagnoses and inappropriate treatments in healthcare.”

The study also explored automated detection methods to mitigate the high costs and time associated with human annotations. The Hypercube system, leveraging medical knowledge bases, symbolic reasoning and NLP, played a crucial role in detecting hallucinations. It provided a comprehensive representation of patient documents, aiding in the initial detection step before human expert review.

“We are committed to continually enhancing Hypercube’s capabilities. The future of healthcare AI depends on reliable, accurate tools, and Hypercube’s evolving features, including real-time data processing and adaptive learning algorithms, will keep it at the forefront of clinical innovation,” said Dr. Wael Salloum, Chief Scientific Officer of Mendel AI.

Future Prospects

As AI continues to integrate into healthcare, addressing hallucinations in LLM outputs will be vital. Future research will focus on refining detection frameworks and exploring more advanced automated systems like Hypercube to ensure the highest levels of accuracy and reliability in AI-generated medical content. Hypercube’s real-time data processing and adaptive learning algorithms will be essential in maintaining its position at the forefront of clinical innovation.

Accepted Paper

Mendel’s work on Hypercube in detecting hallucinations is recognized by the academic community. The research paper, “Faithfulness Hallucination Detection in Healthcare AI,” is accepted for the KDD AI conference, August 2024. It details the methodologies and technologies underpinning Hypercube’s success.

For more information about the Hypercube platform try the Hypercube demo.

About Mendel

Mendel AI supercharges clinical data workflows by coupling large language models with a proprietary clinical hypergraph, delivering scalable clinical reasoning without hallucinations and ensuring 100% explainability. Headquartered in San Jose, California, Mendel is backed by blue-chip investors, including Oak HC/FT and DCM. For more information, visit Mendel or contact marketing@mendel.ai.

About UMass Amherst

UMass Amherst, the flagship campus of the University of Massachusetts system, is a nationally ranked public research university known for its excellence in teaching, research, and community engagement. The university fosters innovation and collaboration across a wide range of disciplines. For more information, visit UMass Amherst.

Press Contact: Jessica McNellis, Gale Strategies (Jessica@GaleStrategies.com)

View original content:https://www.prnewswire.com/news-releases/mendel-and-umass-amherst-unveil-groundbreaking-research-on-ai-driven-hallucination-detection-in-healthcare-302214850.html

SOURCE Mendel

Staff

Next SOPHiA GENETICS Reports Second Quarter 2024 Results »

Previous « Flagship Pioneering Announces Appointment of Raj Panjabi as Senior Partner

Published by

Staff

12 months ago

Assembly Biosciences Presents Data Highlighting Recurrent Genital Herpes Program at STI & HIV 2025 World Congress and International Herpesvirus Workshop

– Phase 1a clinical data in healthy participants presented for both ABI-5366 and ABI-1179, novel…

45 minutes ago

ISS Recommends Shareholders Vote FOR the Merger between Regional Health Properties, Inc. and SunLink Health Systems, Inc.

Atlanta, GA., July 25, 2025 (GLOBE NEWSWIRE) -- Regional Health Properties, Inc. (“Regional”) (OTCBQ: RHEP)…

45 minutes ago

News

IRADIMED CORPORATION to Hold Second Quarter of 2025 Financial Results Conference Call on August 1, 2025

ORLANDO, Fla., July 25, 2025 (GLOBE NEWSWIRE) -- IRADIMED CORPORATION (NASDAQ: IRMD) announced today that the…

45 minutes ago

Mendel and UMass Amherst Unveil Groundbreaking Research on AI-driven Hallucination Detection in Healthcare

Recent Posts

Assembly Biosciences Presents Data Highlighting Recurrent Genital Herpes Program at STI & HIV 2025 World Congress and International Herpesvirus Workshop

ISS Recommends Shareholders Vote FOR the Merger between Regional Health Properties, Inc. and SunLink Health Systems, Inc.

PharmAla Issues Q3 Financial Statements

Calidi announces Reverse Stock Split, expected to begin trading on a 1-for-12 split adjusted basis on August 5, 2025

Portage Biotech Reports Results for Fiscal Year Ended March 31, 2025

IRADIMED CORPORATION to Hold Second Quarter of 2025 Financial Results Conference Call on August 1, 2025

Mendel and UMass Amherst Unveil Groundbreaking Research on AI-driven Hallucination Detection in Healthcare

Related Post

Recent Posts

Assembly Biosciences Presents Data Highlighting Recurrent Genital Herpes Program at STI & HIV 2025 World Congress and International Herpesvirus Workshop

ISS Recommends Shareholders Vote FOR the Merger between Regional Health Properties, Inc. and SunLink Health Systems, Inc.

PharmAla Issues Q3 Financial Statements

Calidi announces Reverse Stock Split, expected to begin trading on a 1-for-12 split adjusted basis on August 5, 2025

Portage Biotech Reports Results for Fiscal Year Ended March 31, 2025

IRADIMED CORPORATION to Hold Second Quarter of 2025 Financial Results Conference Call on August 1, 2025