Skip to main content

A Distributed Regression Analysis Application Package Using SAS

    Basic Details
    Date
    Type
    Publication
    Description

    Distributed regression is a privacy-preserving analytical method that performs multiple regression analysis using only summary-level information from participating data partners in multi-center studies. To our knowledge, there are no distributed regression applications in SAS, the statistical software used by several large national distributed data networks (DDNs) in the United States, including the Sentinel System. This manuscript presents a SAS software package for distributed regression analysis in DDNs. We describe a distributed regression application developed for use in Base SAS and SAS/STAT modules. This application supports distributed linear, logistic, and stratified Cox proportional hazards regression analysis within horizontally partitioned DDNs. Real data examples are used to demonstrate the utility of the software package.

    Author(s)

    Qoua L. Her, Dongdong Li, Yury Vilk, Jessica Young, Zilu Zhang, Jessica M. Malenfant, Sarah Malek, Sengwee Toh

    Corresponding Author

    Qoua L. Her and Dongdong Li have contributed equally to this work.
    Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, 401 Park Dr., Boston, 02215, MA, USA
    Emails: qoua_her@harvardpilgrim.org and dongdong_li@harvardpilgrim.org