Publications by Year: 2019

2019

Hejblum, Boris P, Griffin M Weber, Katherine P Liao, Nathan P Palmer, Susanne Churchill, Nancy A Shadick, Peter Szolovits, Shawn N Murphy, Isaac S Kohane, and Tianxi Cai. (2019) 2019. “Probabilistic Record Linkage of De-Identified Research Datasets With Discrepancies Using Diagnosis Codes.”. Scientific Data 6: 180298. https://doi.org/10.1038/sdata.2018.298.

We develop an algorithm for probabilistic linkage of de-identified research datasets at the patient level, when only diagnosis codes with discrepancies and no personal health identifiers such as name or date of birth are available. It relies on Bayesian modelling of binarized diagnosis codes, and provides a posterior probability of matching for each patient pair, while considering all the data at once. Both in our simulation study (using an administrative claims dataset for data generation) and in two real use-cases linking patient electronic health records from a large tertiary care network, our method exhibits good performance and compares favourably to the standard baseline Fellegi-Sunter algorithm. We propose a scalable, fast and efficient open-source implementation in the ludic R package available on CRAN, which also includes the anonymized diagnosis code data from our real use-case. This work suggests it is possible to link de-identified research databases stripped of any personal health identifiers using only diagnosis codes, provided sufficient information is shared between the data sources.

Consortium, HuBMAP. (2019) 2019. “The Human Body at Cellular Resolution: The NIH Human Biomolecular Atlas Program.”. Nature 574 (7777): 187-92. https://doi.org/10.1038/s41586-019-1629-x.

Transformative technologies are enabling the construction of three-dimensional maps of tissues with unprecedented spatial and molecular resolution. Over the next seven years, the NIH Common Fund Human Biomolecular Atlas Program (HuBMAP) intends to develop a widely accessible framework for comprehensively mapping the human body at single-cell resolution by supporting technology development, data acquisition, and detailed spatial mapping. HuBMAP will integrate its efforts with other funding agencies, programs, consortia, and the biomedical research community at large towards the shared vision of a comprehensive, accessible three-dimensional molecular and cellular atlas of the human body, in health and under various disease conditions.