News | Artificial Intelligence | September 02, 2021

Scientists Create a Labor-saving Automated Method for Studying Electronic Health Records

Mount Sinai study suggests new method is as effective as manually-based “gold-standard” at classifying a diagnosis

Mount Sinai scientists created an AI-based, automated system that learns to read patient data from electronic health records. Here the system identified dementia cases (purple dots) from a database of nearly 2 million patients (blue dots). Courtesy of the Glicksberg lab, Mount Sinai, N.Y., N.Y.

Mount Sinai scientists created an AI-based, automated system that learns to read patient data from electronic health records. Here the system identified dementia cases (purple dots) from a database of nearly 2 million patients (blue dots). Courtesy of the Glicksberg lab, Mount Sinai, N.Y., N.Y.

September 2, 2021 — In an article published in the journal Patterns, scientists at the Icahn School of Medicine at Mount Sinai described the creation of a new, automated, artificial intelligence-based algorithm that can learn to read patient data from electronic health records. In a side-by-side comparison, they showed that their method, called Phe2vec (FEE-to-vek), accurately identified patients with certain diseases as well as the traditional, “gold-standard” method, which requires much more manual labor to develop and perform.

“There continues to be an explosion in the amount and types of data electronically stored in a patient’s medical record. Disentangling this complex web of data can be highly burdensome, thus slowing advancements in clinical research,” said Benjamin S. Glicksberg, Ph.D., Assistant Professor of Genetics and Genomic Sciences, a member of the Hasso Plattner Institute for Digital Health at Mount Sinai (HPIMS), and a senior author of the study. “In this study, we created a new method for mining data from electronic health records with machine learning that is faster and less labor intensive than the industry standard. We hope that this will be a valuable tool that will facilitate further, and less biased, research in clinical informatics.”

The study was led by Jessica K. De Freitas, a graduate student in Glicksberg's lab.

Currently, scientists rely on a set of established computer programs, or algorithms, to mine medical records for new information. The development and storage of these algorithms is managed by a system called the Phenotype Knowledgebase (PheKB). Although the system is highly effective at correctly identifying a patient diagnosis, the process of developing an algorithm can be very time-consuming and inflexible. To study a disease, researchers first have to comb through reams of medical records looking for pieces of data, such as certain lab tests or prescriptions, which are uniquely associated with the disease. They then program the algorithm that guides the computer to search for patients who have those disease-specific pieces of data, which constitute a “phenotype”. In turn, the list of patients identified by the computer needs to be manually double-checked by researchers. Each time researchers want to study a new disease, they have to restart the process from scratch.

In this study, the researchers tried a different approach—one in which the computer learns, on its own, how to spot disease phenotypes and thus save researchers time and effort. This new, Phe2vec method was based on studies the team had already conducted.

“Previously, we showed that unsupervised machine learning could be a highly efficient and effective strategy for mining electronic health records,” said Riccardo Miotto, Ph.D., a former Assistant Professor at the HPIMS and a senior author of the study. “The potential advantage of our approach is that it learns representations of diseases from the data itself. Therefore, the machine does much of the work experts would normally do to define the combination of data elements from health records that best describes a particular disease.”

Essentially, a computer was programmed to scour through millions of electronic health records and learn how to find connections between data and diseases. This programming relied on “embedding” algorithms that had been previously developed by other researchers, such as linguists, to study word networks in various languages. One of the algorithms, called word2vec, was particularly effective. Then, the computer was programmed to use what it learned to identify the diagnoses of nearly 2 million patients whose data was stored in the Mount Sinai Health System.

Finally, the researchers compared the effectiveness between the new and the old systems. For nine out of ten diseases tested, they found that the new Phe2vec system was as effective as, or performed slightly better than, the gold standard phenotyping process at correctly identifying a diagnoses from electronic health records. A few examples of the diseases included dementia, multiple sclerosis, and sickle cell anemia.

“Overall our results are encouraging and suggest that Phe2vec is a promising technique for large-scale phenotyping of diseases in electronic health record data,” Glicksberg said. “With further testing and refinement, we hope that it could be used to automate many of the initial steps of clinical informatics research, thus allowing scientists to focus their efforts on downstream analyses like predictive modeling.”

For more information: icahn.mssm.edu/

Related Content

News | Breast Imaging

Rezolut Introduces New Analysis Tool for Breast Imaging

Aug. 28, 2024 — Rezolut, LLC recently debuted its latest offering for patients during their annual mammogram ...

August 29, 2024

News | Digital Pathology

AI Tool Simultaneously Screens 505 Genes for Comprehensive Cancer Diagnosis, Personalized Treatments

Paige has launched OmniScreen, an AI-driven biomarker module capable of evaluating over 505 genes and detecting 1,228 ...

August 27, 2024

The National Imaging Informatics Course (NIIC), a pioneering program in the radiology field, will return online September 23–27, 2024.

News | RSNA

RSNA and SIIM to Offer National Imaging Informatics Course

July 31, 2024 — The National Imaging Informatics Course (NIIC), a pioneering program in the radiology field, will return ...

July 31, 2024

Seeking to support patients and providers, national groups push for progress on the payment front, as radiation therapy providers advance products and partnerships

Feature | Radiation Oncology | By Christine Book

Progress Report on Radiation Therapy

News emerging from several leading organizations and vendors in the radiation therapy arena came in at a fast pace in ...

July 30, 2024

Middle East's first full-scale AI adoption for national screening marks Lunit's latest milestone amid rapid expansion of global cancer screening initiatives

News | Breast Imaging

Lunit's AI Solution Enhances Qatar's National Breast Cancer Screening Program

July 29, 2024 — Lunit, a leading provider of AI-powered solutions for cancer diagnostics and therapeutics, announced the ...

July 29, 2024

Windsong Radiology to Champion First Wave of ProFound AI Breast Health Suite Expansion

News | Breast Imaging

iCAD and Windsong Radiology Announce Strategic Commercial Agreement to Implement AI-Powered Mammography

July 29, 2024 — iCAD, Inc., a global leader in clinically proven AI-powered cancer detection solutions, announced a ...

July 29, 2024

GE HealthCare selects AWS as its strategic cloud provider to deliver entirely new, purpose-built foundation models designed to fast-track the development of innovative healthcare applications

News | Artificial Intelligence

GE HealthCare and AWS Announce Strategic Collaboration to Accelerate Healthcare Transformation With Generative AI

July 26, 2024 — GE HealthCare and Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company, announced a strategic ...

July 26, 2024

Immunis, Inc. has announced that a partnership with Springbok Analytics to implement their MRI-based AI muscle analysis technology in its Phase 1/2a Clinical Trial, assessing the efficacy of its novel secretome therapeutic (IMMUNA) in targeting sarcopenia.

News | Radiology Business

Immunis Partners with Springbok Analytics on MRI-based AI Muscle Analysis Technology

July 25, 2024 — Immunis, Inc., a clinical-stage biotech developing groundbreaking secretome therapeutics for age and ...

July 25, 2024

Videos | Information Technology

VIDEO: One on One with Hal Wolf, FHIMSS, HIMSS President and CEO

Industry trade shows and conferences seem to be making their comeback in 2024. And the Healthcare Information and ...

July 25, 2024

Proscia has announced an update to its AI-enabled cloud-based workflow solution, advancing digital pathology.

News | Digital Pathology

Proscia Announces Update to AI-enabled Cloud-based Workflow Solution for Pathologists

July 24, 2024 — Proscia, a developer of artificial intelligence (AI)-enabled digital pathology solutions for precision ...

July 24, 2024

If you enjoy this content, please share it with a colleague

Scientists Create a Labor-saving Automated Method for Studying Electronic Health Records

If you enjoy this content, please share it with a colleague

Related Content