News | Artificial Intelligence | November 07, 2018

Artificial Intelligence May Fall Short Analyzing Data Across Multiple Health Systems

Study shows deep learning models must be carefully tested across multiple environments before being put into clinical practice

November 7, 2018 — Artificial intelligence (AI) tools trained to detect pneumonia on chest X-rays suffered significant decreases in performance when tested on data from outside health systems, according to a new study. The study, conducted at the Icahn School of Medicine at Mount Sinai, was published in a special issue of PLOS Medicine on machine learning and healthcare.1 These findings suggest that artificial intelligence in the medical space must be carefully tested for performance across a wide range of populations; otherwise, the deep learning models may not perform as accurately as expected.

As interest in the use of computer system frameworks called convolutional neural networks (CNN) to analyze medical imaging and provide a computer-aided diagnosis grows, recent studies have suggested that AI image classification may not generalize to new data as well as commonly portrayed.

Researchers at the Icahn School of Medicine at Mount Sinai assessed how AI models identified pneumonia in 158,000 chest X-rays across three medical institutions: the National Institutes of Health; The Mount Sinai Hospital; and Indiana University Hospital. Researchers chose to study the diagnosis of pneumonia on chest X-rays for its common occurrence, clinical significance and prevalence in the research community.

In three out of five comparisons, CNNs’ performance in diagnosing diseases on X-rays from hospitals outside of its own network was significantly lower than on X-rays from the original health system. However, CNNs were able to detect the hospital system where an X-ray was acquired with a high-degree of accuracy, and cheated at their predictive task based on the prevalence of pneumonia at the training institution. Researchers found that the difficulty of using deep learning models in medicine is that they use a massive number of parameters, making it challenging to identify specific variables driving predictions, such as the types of computed tomography (CT) scanners used at a hospital and the resolution quality of imaging.

“Our findings should give pause to those considering rapid deployment of artificial intelligence platforms without rigorously assessing their performance in real-world clinical settings reflective of where they are being deployed,” said senior author Eric Oermann, M.D., instructor in neurosurgery at the Icahn School of Medicine at Mount Sinai. “Deep learning models trained to perform medical diagnosis can generalize well, but this cannot be taken for granted since patient populations and imaging techniques differ significantly across institutions.”

“If CNN systems are to be used for medical diagnosis, they must be tailored to carefully consider clinical questions, tested for a variety of real-world scenarios and carefully assessed to determine how they impact accurate diagnosis,” said first author John Zech, a medical student at the Icahn School of Medicine at Mount Sinai.

This research builds on papers published earlier this year in the journals Radiology and Nature Medicine, which laid the framework for applying computer vision and deep learning techniques, including natural language processing algorithms, for identifying clinical concepts in radiology reports for CT scans.

Listen to the PODCAST: Radiologists Must Understand AI To Know If It Is Wrong

For more information: www.journals.plos.org/plosmedicine

Reference

1. Zech J.R., Badgeley M.A., Liu M., et al. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study. PLOS Medicine, Nov. 6, 2018. https://doi.org/10.1371/journal.pmed.1002683

Related Content

News | Breast Imaging

Rezolut Introduces New Analysis Tool for Breast Imaging

Aug. 28, 2024 — Rezolut, LLC recently debuted its latest offering for patients during their annual mammogram ...

August 29, 2024

News | Digital Pathology

AI Tool Simultaneously Screens 505 Genes for Comprehensive Cancer Diagnosis, Personalized Treatments

Paige has launched OmniScreen, an AI-driven biomarker module capable of evaluating over 505 genes and detecting 1,228 ...

August 27, 2024

The National Imaging Informatics Course (NIIC), a pioneering program in the radiology field, will return online September 23–27, 2024.

News | RSNA

RSNA and SIIM to Offer National Imaging Informatics Course

July 31, 2024 — The National Imaging Informatics Course (NIIC), a pioneering program in the radiology field, will return ...

July 31, 2024

Seeking to support patients and providers, national groups push for progress on the payment front, as radiation therapy providers advance products and partnerships

Feature | Radiation Oncology | By Christine Book

Progress Report on Radiation Therapy

News emerging from several leading organizations and vendors in the radiation therapy arena came in at a fast pace in ...

July 30, 2024

Middle East's first full-scale AI adoption for national screening marks Lunit's latest milestone amid rapid expansion of global cancer screening initiatives

News | Breast Imaging

Lunit's AI Solution Enhances Qatar's National Breast Cancer Screening Program

July 29, 2024 — Lunit, a leading provider of AI-powered solutions for cancer diagnostics and therapeutics, announced the ...

July 29, 2024

Windsong Radiology to Champion First Wave of ProFound AI Breast Health Suite Expansion

News | Breast Imaging

iCAD and Windsong Radiology Announce Strategic Commercial Agreement to Implement AI-Powered Mammography

July 29, 2024 — iCAD, Inc., a global leader in clinically proven AI-powered cancer detection solutions, announced a ...

July 29, 2024

GE HealthCare selects AWS as its strategic cloud provider to deliver entirely new, purpose-built foundation models designed to fast-track the development of innovative healthcare applications

News | Artificial Intelligence

GE HealthCare and AWS Announce Strategic Collaboration to Accelerate Healthcare Transformation With Generative AI

July 26, 2024 — GE HealthCare and Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company, announced a strategic ...

July 26, 2024

Immunis, Inc. has announced that a partnership with Springbok Analytics to implement their MRI-based AI muscle analysis technology in its Phase 1/2a Clinical Trial, assessing the efficacy of its novel secretome therapeutic (IMMUNA) in targeting sarcopenia.

News | Radiology Business

Immunis Partners with Springbok Analytics on MRI-based AI Muscle Analysis Technology

July 25, 2024 — Immunis, Inc., a clinical-stage biotech developing groundbreaking secretome therapeutics for age and ...

July 25, 2024

Videos | Information Technology

VIDEO: One on One with Hal Wolf, FHIMSS, HIMSS President and CEO

Industry trade shows and conferences seem to be making their comeback in 2024. And the Healthcare Information and ...

July 25, 2024

Proscia has announced an update to its AI-enabled cloud-based workflow solution, advancing digital pathology.

News | Digital Pathology

Proscia Announces Update to AI-enabled Cloud-based Workflow Solution for Pathologists

July 24, 2024 — Proscia, a developer of artificial intelligence (AI)-enabled digital pathology solutions for precision ...

July 24, 2024

If you enjoy this content, please share it with a colleague

Artificial Intelligence May Fall Short Analyzing Data Across Multiple Health Systems

If you enjoy this content, please share it with a colleague

Related Content