Skip to main content

Incorporate a Range of Frequently Used Engineering Features from EHRs into the Sentinel Common Data Model in the Sentinel EHR and Claims Linked Data Partner Network

    Basic Details
    Date Posted
    Status
    In progress
    Description

    As a component of the Sentinel Common Data Model (SCDM) v8.2 upgrade, the Sentinel Operations Center (SOC) will develop a new Feature Engineering (FE) table to support gathering information from electronic health record (EHR) unstructured data where the source is free-text, but key information has been extracted and standardized to allow querying with the ARIA Tools. 

    The main goal of this Innovation Center (IC) project is to leverage and complement the SOC’s SCDM upgrade by applying natural language processing (NLP) pipelines to existing free text source data and populating the FE table.  This project will include multiple sites from both the EHR-claims Development Network (Duke Clinical Research Institute, Kaiser Permanente Washington Health Research Institute, Mass General Brigham, and Vanderbilt University Medical Center), and from the EHR-claims Commercial Network (TriNetX and HealthVerity).  

    The IC and FDA will identify a list of 3–5 concepts as candidates for existing NLP pipelines and inclusion in the Feature Engineering table and sites will populate the table with these concepts, using free text from the EHR-claims Commercial Network and Development Network sites’ data. This project will also develop and adopt a set of recommended pre-processing best practices designed to optimize the comparability of NLP-derived data across heterogeneous healthcare systems.

    Information
    Time Period
    September 30, 2023 – September 29, 2024
    Data Source(s)
    Mass General Brigham, Kaiser Permanente Washington Health Research Institute, Duke Clinical Research Institute, Vanderbilt University Medical Center, TriNetX, HealthVerity
    Workgroup Leader(s)

    Keith Marsolo, PhD; Department of Population Health Sciences, Duke University School of Medicine, Durham, NC

    Workgroup Member(s)

    Sarah Dutcher, PhD, MS; Jummai Apata, MBBS, DrPH; Terrence Lee, PhD., MPH; Office of Surveillance and Epidemiology, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD

    Rishi J. Desai, PhD; Jie Yang, PhD; Division of Pharmacoepidemiology and Pharmacoeconomics, Mass General Brigham, Boston, MA

    Robert Penfold, PhD; David Carrell, PhD; Kaiser Permanente Washington Health Research Institute, Seattle, WA

    Ruth Reeves, PhD; Josh C. Smith, PhD; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN

    Eliel Oliveira, MBA, MSc, FAMIA; Daniel Scarnecchia, MPIA; Tawil Diaz; Morgaine Payson; Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, MA

    Jeffrey Brown, PhD; Jeffrey Graham, PhD; John Doole, PharmD, MFA; Michael Swartzbaugh; Mike Temple; Sapna Rajupet; TriNetX, Cambridge, MA