Skip to main content

Enhancing Causes of Death Prediction from Electronic Health Records through Multi-Modal Integration of Structured and Unstructured EHR Data

    Basic Details
    Date
    Type
    Presentation
    Description

    Rapid identification of death and causes of death is vital for medical product safety studies. In the United States, structured Electronic Health Record (EHR) data and unstructured notes offer valuable clinical information, but death data is often incomplete, and Cause of Death (CoD) details are typically missing. This study addresses these challenges by using a multi-modal approach that combines structured and unstructured EHR data to predict CoD more accurately. Integrating textual data with structured features improved the weighted Area Under the Curve (AUC) from 0.86 to 0.90. Significant AUC increases were observed for conditions like chronic lower respiratory disease (9%), cerebrovascular disease (8%), essential hypertension (9%), and intentional self-harm (9%). Unstructured notes were particularly beneficial for conditions with fewer samples. This study was presented at the American Medical Informatics Association (AMIA) 2024 Annual Symposium on November 13, 2024.

     

    Presenter(s)

    Mohammed A. Al-Garadi, Rishi J Desai, Kerry Ngan, Michele LeNoue-Newton, Ruth Reeves, Daniel Park, Shirley V. Wang, Judith C. Maro, Candace C. Fuller, Kueiyu Joshua Lin, José J. Hernández-Muñoz, Aida Kuzucan, Xi Wang, Haritha Pillai, Jill Whitaker, Jessica A. Deere, Michael F. McLemore, Dax Westerman, Michael E. Matheny