Publications

2021

Tao, Ziye, Griffin M Weber, and Yun William Yu. (2021) 2021. “Expected 10-Anonymity of HyperLogLog Sketches for Federated Queries of Clinical Data Repositories.”. Bioinformatics (Oxford, England) 37 (Suppl_1): i151-i160. https://doi.org/10.1093/bioinformatics/btab292.

MOTIVATION: The rapid growth in of electronic medical records provide immense potential to researchers, but are often silo-ed at separate hospitals. As a result, federated networks have arisen, which allow simultaneously querying medical databases at a group of connected institutions. The most basic such query is the aggregate count-e.g. How many patients have diabetes? However, depending on the protocol used to estimate that total, there is always a tradeoff in the accuracy of the estimate against the risk of leaking confidential data. Prior work has shown that it is possible to empirically control that tradeoff by using the HyperLogLog (HLL) probabilistic sketch.

RESULTS: In this article, we prove complementary theoretical bounds on the k-anonymity privacy risk of using HLL sketches, as well as exhibit code to efficiently compute those bounds.

AVAILABILITY AND IMPLEMENTATION: https://github.com/tzyRachel/K-anonymity-Expectation.

Weber, Griffin M, Harrison G Zhang, Sehi L’Yi, Clara-Lea Bonzel, Chuan Hong, Paul Avillach, Alba Gutiérrez-Sacristán, et al. (2021) 2021. “International Changes in COVID-19 Clinical Trajectories Across 315 Hospitals and 6 Countries: Retrospective Cohort Study.”. Journal of Medical Internet Research 23 (10): e31400. https://doi.org/10.2196/31400.

BACKGROUND: Many countries have experienced 2 predominant waves of COVID-19-related hospitalizations. Comparing the clinical trajectories of patients hospitalized in separate waves of the pandemic enables further understanding of the evolving epidemiology, pathophysiology, and health care dynamics of the COVID-19 pandemic.

OBJECTIVE: In this retrospective cohort study, we analyzed electronic health record (EHR) data from patients with SARS-CoV-2 infections hospitalized in participating health care systems representing 315 hospitals across 6 countries. We compared hospitalization rates, severe COVID-19 risk, and mean laboratory values between patients hospitalized during the first and second waves of the pandemic.

METHODS: Using a federated approach, each participating health care system extracted patient-level clinical data on their first and second wave cohorts and submitted aggregated data to the central site. Data quality control steps were adopted at the central site to correct for implausible values and harmonize units. Statistical analyses were performed by computing individual health care system effect sizes and synthesizing these using random effect meta-analyses to account for heterogeneity. We focused the laboratory analysis on C-reactive protein (CRP), ferritin, fibrinogen, procalcitonin, D-dimer, and creatinine based on their reported associations with severe COVID-19.

RESULTS: Data were available for 79,613 patients, of which 32,467 were hospitalized in the first wave and 47,146 in the second wave. The prevalence of male patients and patients aged 50 to 69 years decreased significantly between the first and second waves. Patients hospitalized in the second wave had a 9.9% reduction in the risk of severe COVID-19 compared to patients hospitalized in the first wave (95% CI 8.5%-11.3%). Demographic subgroup analyses indicated that patients aged 26 to 49 years and 50 to 69 years; male and female patients; and black patients had significantly lower risk for severe disease in the second wave than in the first wave. At admission, the mean values of CRP were significantly lower in the second wave than in the first wave. On the seventh hospital day, the mean values of CRP, ferritin, fibrinogen, and procalcitonin were significantly lower in the second wave than in the first wave. In general, countries exhibited variable changes in laboratory testing rates from the first to the second wave. At admission, there was a significantly higher testing rate for D-dimer in France, Germany, and Spain.

CONCLUSIONS: Patients hospitalized in the second wave were at significantly lower risk for severe COVID-19. This corresponded to mean laboratory values in the second wave that were more likely to be in typical physiological ranges on the seventh hospital day compared to the first wave. Our federated approach demonstrated the feasibility and power of harmonizing heterogeneous EHR data from multiple international health care systems to rapidly conduct large-scale studies to characterize how COVID-19 clinical trajectories evolve.

Le, Trang T, Alba Gutiérrez-Sacristán, Jiyeon Son, Chuan Hong, Andrew M South, Brett K Beaulieu-Jones, Ne Hooi Will Loh, et al. (2021) 2021. “Multinational Characterization of Neurological Phenotypes in Patients Hospitalized With COVID-19.”. Scientific Reports 11 (1): 20238. https://doi.org/10.1038/s41598-021-99481-9.

Neurological complications worsen outcomes in COVID-19. To define the prevalence of neurological conditions among hospitalized patients with a positive SARS-CoV-2 reverse transcription polymerase chain reaction test in geographically diverse multinational populations during early pandemic, we used electronic health records (EHR) from 338 participating hospitals across 6 countries and 3 continents (January-September 2020) for a cross-sectional analysis. We assessed the frequency of International Classification of Disease code of neurological conditions by countries, healthcare systems, time before and after admission for COVID-19 and COVID-19 severity. Among 35,177 hospitalized patients with SARS-CoV-2 infection, there was an increase in the proportion with disorders of consciousness (5.8%, 95% confidence interval [CI] 3.7-7.8%, pFDR < 0.001) and unspecified disorders of the brain (8.1%, 5.7-10.5%, pFDR < 0.001) when compared to the pre-admission proportion. During hospitalization, the relative risk of disorders of consciousness (22%, 19-25%), cerebrovascular diseases (24%, 13-35%), nontraumatic intracranial hemorrhage (34%, 20-50%), encephalitis and/or myelitis (37%, 17-60%) and myopathy (72%, 67-77%) were higher for patients with severe COVID-19 when compared to those who never experienced severe COVID-19. Leveraging a multinational network to capture standardized EHR data, we highlighted the increased prevalence of central and peripheral neurological phenotypes in patients hospitalized with COVID-19, particularly among those with severe disease.

Börner, Katy, Sarah A Teichmann, Ellen M Quardokus, James C Gee, Kristen Browne, David Osumi-Sutherland, Bruce W Herr, et al. (2021) 2021. “Anatomical Structures, Cell Types and Biomarkers of the Human Reference Atlas.”. Nature Cell Biology 23 (11): 1117-28. https://doi.org/10.1038/s41556-021-00788-6.

The Human Reference Atlas (HRA) aims to map all of the cells of the human body to advance biomedical research and clinical practice. This Perspective presents collaborative work by members of 16 international consortia on two essential and interlinked parts of the HRA: (1) three-dimensional representations of anatomy that are linked to (2) tables that name and interlink major anatomical structures, cell types, plus biomarkers (ASCT+B). We discuss four examples that demonstrate the practical utility of the HRA.

Zhang, Harrison G, Boris P Hejblum, Griffin M Weber, Nathan P Palmer, Susanne E Churchill, Peter Szolovits, Shawn N Murphy, Katherine P Liao, Isaac S Kohane, and Tianxi Cai. (2021) 2021. “ATLAS: an Automated Association Test Using Probabilistically Linked Health Records With Application to Genetic Studies.”. Journal of the American Medical Informatics Association : JAMIA 28 (12): 2582-92. https://doi.org/10.1093/jamia/ocab187.

OBJECTIVE: Large amounts of health data are becoming available for biomedical research. Synthesizing information across databases may capture more comprehensive pictures of patient health and enable novel research studies. When no gold standard mappings between patient records are available, researchers may probabilistically link records from separate databases and analyze the linked data. However, previous linked data inference methods are constrained to certain linkage settings and exhibit low power. Here, we present ATLAS, an automated, flexible, and robust association testing algorithm for probabilistically linked data.

MATERIALS AND METHODS: Missing variables are imputed at various thresholds using a weighted average method that propagates uncertainty from probabilistic linkage. Next, estimated effect sizes are obtained using a generalized linear model. ATLAS then conducts the threshold combination test by optimally combining P values obtained from data imputed at varying thresholds using Fisher's method and perturbation resampling.

RESULTS: In simulations, ATLAS controls for type I error and exhibits high power compared to previous methods. In a real-world genetic association study, meta-analysis of ATLAS-enabled analyses on a linked cohort with analyses using an existing cohort yielded additional significant associations between rheumatoid arthritis genetic risk score and laboratory biomarkers.

DISCUSSION: Weighted average imputation weathers false matches and increases contribution of true matches to mitigate linkage error-induced bias. The threshold combination test avoids arbitrarily choosing a threshold to rule a match, thus automating linked data-enabled analyses and preserving power.

CONCLUSION: ATLAS promises to enable novel and powerful research studies using linked data to capitalize on all available data sources.

Klann, Jeffrey G, Hossein Estiri, Griffin M Weber, Bertrand Moal, Paul Avillach, Chuan Hong, Amelia L M Tan, et al. (2021) 2021. “Validation of an Internationally Derived Patient Severity Phenotype to Support COVID-19 Analytics from Electronic Health Record Data.”. Journal of the American Medical Informatics Association : JAMIA 28 (7): 1411-20. https://doi.org/10.1093/jamia/ocab018.

OBJECTIVE: The Consortium for Clinical Characterization of COVID-19 by EHR (4CE) is an international collaboration addressing coronavirus disease 2019 (COVID-19) with federated analyses of electronic health record (EHR) data. We sought to develop and validate a computable phenotype for COVID-19 severity.

MATERIALS AND METHODS: Twelve 4CE sites participated. First, we developed an EHR-based severity phenotype consisting of 6 code classes, and we validated it on patient hospitalization data from the 12 4CE clinical sites against the outcomes of intensive care unit (ICU) admission and/or death. We also piloted an alternative machine learning approach and compared selected predictors of severity with the 4CE phenotype at 1 site.

RESULTS: The full 4CE severity phenotype had pooled sensitivity of 0.73 and specificity 0.83 for the combined outcome of ICU admission and/or death. The sensitivity of individual code categories for acuity had high variability-up to 0.65 across sites. At one pilot site, the expert-derived phenotype had mean area under the curve of 0.903 (95% confidence interval, 0.886-0.921), compared with an area under the curve of 0.956 (95% confidence interval, 0.952-0.959) for the machine learning approach. Billing codes were poor proxies of ICU admission, with as low as 49% precision and recall compared with chart review.

DISCUSSION: We developed a severity phenotype using 6 code classes that proved resilient to coding variability across international institutions. In contrast, machine learning approaches may overfit hospital-specific orders. Manual chart review revealed discrepancies even in the gold-standard outcomes, possibly owing to heterogeneous pandemic conditions.

CONCLUSIONS: We developed an EHR-based severity phenotype for COVID-19 in hospitalized patients and validated it at 12 international sites.

2020

Beam, Andrew L, Benjamin Kompa, Allen Schmaltz, Inbar Fried, Griffin Weber, Nathan Palmer, Xu Shi, Tianxi Cai, and Isaac S Kohane. (2020) 2020. “Clinical Concept Embeddings Learned from Massive Sources of Multimodal Medical Data.”. Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing 25: 295-306.

Word embeddings are a popular approach to unsupervised learning of word relationships that are widely used in natural language processing. In this article, we present a new set of embeddings for medical concepts learned using an extremely large collection of multimodal medical data. Leaning on recent theoretical insights, we demonstrate how an insurance claims database of 60 million members, a collection of 20 million clinical notes, and 1.7 million full text biomedical journal articles can be combined to embed concepts into a common space, resulting in the largest ever set of embeddings for 108,477 medical concepts. To evaluate our approach, we present a new benchmark methodology based on statistical power specifically designed to test embeddings of medical concepts. Our approach, called cui2vec, attains state-of-the-art performance relative to previous methods in most instances. Finally, we provide a downloadable set of pre-trained embeddings for other researchers to use, as well as an online tool for interactive exploration of the cui2vec embeddings.

Barak-Corren, Yuval, Victor M Castro, Matthew K Nock, Kenneth D Mandl, Emily M Madsen, Ashley Seiger, William G Adams, et al. (2020) 2020. “Validation of an Electronic Health Record-Based Suicide Risk Prediction Modeling Approach Across Multiple Health Care Systems.”. JAMA Network Open 3 (3): e201262. https://doi.org/10.1001/jamanetworkopen.2020.1262.

IMPORTANCE: Suicide is a leading cause of mortality, with suicide-related deaths increasing in recent years. Automated methods for individualized risk prediction have great potential to address this growing public health threat. To facilitate their adoption, they must first be validated across diverse health care settings.

OBJECTIVE: To evaluate the generalizability and cross-site performance of a risk prediction method using readily available structured data from electronic health records in predicting incident suicide attempts across multiple, independent, US health care systems.

DESIGN, SETTING, AND PARTICIPANTS: For this prognostic study, data were extracted from longitudinal electronic health record data comprising International Classification of Diseases, Ninth Revision diagnoses, laboratory test results, procedures codes, and medications for more than 3.7 million patients from 5 independent health care systems participating in the Accessible Research Commons for Health network. Across sites, 6 to 17 years' worth of data were available, up to 2018. Outcomes were defined by International Classification of Diseases, Ninth Revision codes reflecting incident suicide attempts (with positive predictive value >0.70 according to expert clinician medical record review). Models were trained using naive Bayes classifiers in each of the 5 systems. Models were cross-validated in independent data sets at each site, and performance metrics were calculated. Data analysis was performed from November 2017 to August 2019.

MAIN OUTCOMES AND MEASURES: The primary outcome was suicide attempt as defined by a previously validated case definition using International Classification of Diseases, Ninth Revision codes. The accuracy and timeliness of the prediction were measured at each site.

RESULTS: Across the 5 health care systems, of the 3 714 105 patients (2 130 454 female [57.2%]) included in the analysis, 39 162 cases (1.1%) were identified. Predictive features varied by site but, as expected, the most common predictors reflected mental health conditions (eg, borderline personality disorder, with odds ratios of 8.1-12.9, and bipolar disorder, with odds ratios of 0.9-9.1) and substance use disorders (eg, drug withdrawal syndrome, with odds ratios of 7.0-12.9). Despite variation in geographical location, demographic characteristics, and population health characteristics, model performance was similar across sites, with areas under the curve ranging from 0.71 (95% CI, 0.70-0.72) to 0.76 (95% CI, 0.75-0.77). Across sites, at a specificity of 90%, the models detected a mean of 38% of cases a mean of 2.1 years in advance.

CONCLUSIONS AND RELEVANCE: Across 5 diverse health care systems, a computationally efficient approach leveraging the full spectrum of structured electronic health record data was able to detect the risk of suicidal behavior in unselected patients. This approach could facilitate the development of clinical decision support tools that inform risk reduction interventions.

Carney, Brian J, Erik J Uhlmann, Maneka Puligandla, Charlene Mantia, Griffin M Weber, Donna S Neuberg, and Jeffrey I Zwicker. (2020) 2020. “Anticoagulation After Intracranial Hemorrhage in Brain Tumors: Risk of Recurrent Hemorrhage and Venous Thromboembolism.”. Research and Practice in Thrombosis and Haemostasis 4 (5): 860-65. https://doi.org/10.1002/rth2.12377.

BACKGROUND: Intracranial hemorrhage (ICH) is a common and often devastating outcome in patients with brain tumors. Despite this, there is little evidence to guide anticoagulation management following an initial ICH event.

OBJECTIVES: To analyze the risk of recurrent hemorrhagic and thrombotic outcomes after an initial ICH event in patients with brain tumors and prior venous thromboembolism (VTE).

PATIENTS AND METHODS: A retrospective cohort study was performed. Radiographic images obtained after initial ICH were reviewed for the primary outcomes of recurrent ICH and VTE.

RESULTS AND CONCLUSIONS: A total of 79 patients with brain tumors who developed ICH on anticoagulation for VTE were analyzed. Fifty-four patients (68.4%) restarted anticoagulation following ICH. The cumulative incidence of recurrent ICH at 1 year was 6.1% (95% confidence interval [CI], 1.5-15.3) following reinitiation of anticoagulation. Following a major ICH (defined as an ICH >10 mL in size, causing symptoms, or requiring intervention), the rate of recurrent ICH upon reexposure to anticoagulation was 14.5% (95% CI, 2.1-38.35), whereas the rate of recurrent ICH following smaller ICH was 2.6% (95% CI, 0.2%-12.0%). Mortality following a recurrent ICH on anticoagulation was 67% at 30 days. The cumulative incidence of recurrent VTE was significantly lower in the restart cohort compared to patients who did not restart anticoagulation (8.1% vs 35.3%; P = .003). We conclude that resumption of anticoagulation is lowest among patients with metastatic brain tumors with small initial ICH. Following an initial major ICH, resumption of anticoagulation was associated with a high rate of recurrent ICH.