Skip to main content

Development and Evaluation of Electronic Health Record (EHR) Information Extraction Pipeline and Tree-Based Scan Statistic (TBSS) Methods for EHR-Based Signal Detection (DA2)

    Basic Details
    Date Posted
    Status
    In progress
    Description

    Tree-based scan statistics (TBSS) are data mining methods used to prioritize statistical alerts when screening for potential adverse events. TBSS uses a hierarchical tree to group and map relationships between thousands of correlated outcomes and account for multiple hypotheses being evaluated. Historically, TBSS has used insurance claims data - captured with International Classification of Diseases, Ninth Edition, Clinical Modification (ICD-9-CM) and (ICD-10-CM) codes - to screen for potential adverse events for associations with a drug of interest. While a valuable drug safety surveillance method for insurance claims data, TBSS methods, to date, have not been developed to leverage the rich clinical information, including laboratory test results, vital signs, and clinical notes, available in electronic health records (EHR). This project will develop approaches for abstracting and combining structured and unstructured EHR data. This project will also expand TBSS methods to allow identification of statistical alerts for outcomes available only through EHR data (e.g., natural language processing, laboratory values). Led by partner teams at Vanderbilt University Medical Center (VUMC) and Brigham and Women’s Hospital (BWH), this project will ultimately inform the Sentinel Innovation Center’s (IC) detection analytics roadmap for the integration and development of methods of signal identification from EHR data.  

    Information
    Time Period
    February 01, 2023 – July 31, 2024
    Data Source(s)
    Vanderbilt University Medical Center, Brigham and Women's Hospital
    Workgroup Leader(s)

    Joshua C. Smith, PhD; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN

    Shirley V Wang, PhD, ScM; Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA

    Judith C. Maro, PhD, MS; Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, MA

    Workgroup Member(s)

    Sharon Davis, PhD; Ruth Reeves, PhD; Daniel Park, BS; Robert Winter, BA; Jill Whittaker, MSN; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN

    Massimiliano Russo, PhD; Sushama Kattinakere Sreedhara, MBBS, MSPH; Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA

    Jose Hernandez, RPh, MPH, PhD; Yong Ma, PhD, MS; Office of Surveillance and Epidemiology, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD

    Audrey Wolfe, MPH; Sam McGown; Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, MA