Publications by Year: 2023

2023

Tsai YT, Hrytsenko Y, Elgart M, Tahir U, Chen ZZ, Wilson JG, et al. A parametric bootstrap approach for computing confidence intervals for genetic correlations with application to genetically-determined protein-protein networks.. medRxiv : the preprint server for health sciences. 2023;.

Genetic correlation refers to the correlation between genetic determinants of a pair of traits. When using individual-level data, it is typically estimated based on a bivariate model specification where the correlation between the two variables is identifiable and can be estimated from a covariance model that incorporates the genetic relationship between individuals, e.g., using a pre-specified kinship matrix. Inference relying on asymptotic normality of the genetic correlation parameter estimates may be inaccurate when the sample size is low, when the genetic correlation is close to the boundary of the parameter space, and when the heritability of at least one of the traits is low. We address this problem by developing a parametric bootstrap procedure to construct confidence intervals for genetic correlation estimates. The procedure simulates paired traits under a range of heritability and genetic correlation parameters, and it uses the population structure encapsulated by the kinship matrix. Heritabilities and genetic correlations are estimated using the close-form, method of moment, Haseman-Elston regression estimators. The proposed parametric bootstrap procedure is especially useful when genetic correlations are computed on pairs of thousands of traits measured on the same exact set of individuals. We demonstrate the parametric bootstrap approach on a proteomics dataset from the Jackson Heart Study.

Elgart M, Zhang Y, Zhang Y, Yu B, Kim Y, Zee PC, et al. Anaerobic pathogens associated with OSA may contribute to pathophysiology via amino-acid depletion.. EBioMedicine. 2023;98:104891.

BACKGROUND: The human microbiome is linked to multiple metabolic disorders such as obesity and diabetes. Obstructive sleep apnoea (OSA) is a common sleep disorder with several metabolic risk factors. We investigated the associations between the gut microbiome composition and function, and measures of OSA severity in participants from a prospective community-based cohort study: the Hispanic Community Health Study/Study of Latinos (HCHS/SOL).

METHODS: Bacterial-Wide Association Analysis (BWAS) of gut microbiome measured via metagenomics with OSA measures was performed adjusting for clinical, lifestyle and co-morbidities. This was followed by functional analysis of the OSA-enriched bacteria. We utilized additional metabolomic and transcriptomic associations to suggest possible mechanisms explaining the microbiome effects on OSA.

FINDINGS: Several uncommon anaerobic human pathogens were associated with OSA severity. These belong to the Lachnospira, Actinomyces, Kingella and Eubacterium genera. Functional analysis revealed enrichment in 49 processes including many anaerobic-related ones. Severe OSA was associated with the depletion of the amino acids glycine and glutamine in the blood, yet neither diet nor gene expression revealed any changes in the production or consumption of these amino acids.

INTERPRETATION: We show anaerobic bacterial communities to be a novel component of OSA pathophysiology. These are established in the oxygen-poor environments characteristic of OSA. We hypothesize that these bacteria deplete certain amino acids required for normal human homeostasis and muscle tone, contributing to OSA phenotypes. Future work should test this hypothesis as well as consider diagnostics via anaerobic bacteria detection and possible interventions via antibiotics and amino-acid supplementation.

FUNDING: Described in methods.

Fuentes L de L, Schwander KL, Brown MR, Bentley AR, Winkler TW, Sung YJ, et al. Gene-educational attainment interactions in a multi-population genome-wide meta-analysis identify novel lipid loci.. Frontiers in genetics. 2023;14:1235337.

Introduction: Educational attainment, widely used in epidemiologic studies as a surrogate for socioeconomic status, is a predictor of cardiovascular health outcomes. Methods: A two-stage genome-wide meta-analysis of low-density lipoprotein cholesterol (LDL), high-density lipoprotein cholesterol (HDL), and triglyceride (TG) levels was performed while accounting for gene-educational attainment interactions in up to 226,315 individuals from five population groups. We considered two educational attainment variables: "Some College" (yes/no, for any education beyond high school) and "Graduated College" (yes/no, for completing a 4-year college degree). Genome-wide significant (p < 5 × 10-8) and suggestive (p < 1 × 10-6) variants were identified in Stage 1 (in up to 108,784 individuals) through genome-wide analysis, and those variants were followed up in Stage 2 studies (in up to 117,531 individuals). Results: In combined analysis of Stages 1 and 2, we identified 18 novel lipid loci (nine for LDL, seven for HDL, and two for TG) by two degree-of-freedom (2 DF) joint tests of main and interaction effects. Four loci showed significant interaction with educational attainment. Two loci were significant only in cross-population analyses. Several loci include genes with known or suggested roles in adipose (FOXP1, MBOAT4, SKP2, STIM1, STX4), brain (BRI3, FILIP1, FOXP1, LINC00290, LMTK2, MBOAT4, MYO6, SENP6, SRGAP3, STIM1, TMEM167A, TMEM30A), and liver (BRI3, FOXP1) biology, highlighting the potential importance of brain-adipose-liver communication in the regulation of lipid metabolism. An investigation of the potential druggability of genes in identified loci resulted in five gene targets shown to interact with drugs approved by the Food and Drug Administration, including genes with roles in adipose and brain tissue. Discussion: Genome-wide interaction analysis of educational attainment identified novel lipid loci not previously detected by analyses limited to main genetic effects.

Goodman MO, Dashti HS, Lane JM, Windred DP, Burns A, Jones SE, et al. Causal Association Between Subtypes of Excessive Daytime Sleepiness and Risk of Cardiovascular Diseases.. Journal of the American Heart Association. 2023;12(24):e030568.

BACKGROUND: Excessive daytime sleepiness (EDS), experienced in 10% to 20% of the population, has been associated with cardiovascular disease and death. However, the condition is heterogeneous and is prevalent in individuals having short and long sleep duration. We sought to clarify the relationship between sleep duration subtypes of EDS with cardiovascular outcomes, accounting for these subtypes.

METHODS AND RESULTS: We defined 3 sleep duration subtypes of excessive daytime sleepiness: normal (6-9 hours), short (<6 hours), and long (>9 hours), and compared these with a nonsleepy, normal-sleep-duration reference group. We analyzed their associations with incident myocardial infarction (MI) and stroke using medical records of 355 901 UK Biobank participants and performed 2-sample Mendelian randomization for each outcome. Compared with healthy sleep, long-sleep EDS was associated with an 83% increased rate of MI (hazard ratio, 1.83 [95% CI, 1.21-2.77]) during 8.2-year median follow-up, adjusting for multiple health and sociodemographic factors. Mendelian randomization analysis provided supporting evidence of a causal role for a genetic long-sleep EDS subtype in MI (inverse-variance weighted β=1.995, P=0.001). In contrast, we did not find evidence that other subtypes of EDS were associated with incident MI or any associations with stroke (P>0.05).

CONCLUSIONS: Our study suggests the previous evidence linking EDS with increased cardiovascular disease risk may be primarily driven by the effect of its long-sleep subtype on higher risk of MI. Underlying mechanisms remain to be investigated but may involve sleep irregularity and circadian disruption, suggesting a need for novel interventions in this population.

Hrytsenko Y, Shea B, Elgart M, Kurniansyah N, Lyons G, Morrison AC, et al. Machine learning models for blood pressure phenotypes combining multiple polygenic risk scores.. medRxiv : the preprint server for health sciences. 2023;.

We construct non-linear machine learning (ML) prediction models for systolic and diastolic blood pressure (SBP, DBP) using demographic and clinical variables and polygenic risk scores (PRSs). We developed a two-model ensemble, consisting of a baseline model, where prediction is based on demographic and clinical variables only, and a genetic model, where we also include PRSs. We evaluate the use of a linear versus a non-linear model at both the baseline and the genetic model levels and assess the improvement in performance when incorporating multiple PRSs. We report the ensemble model's performance as percentage variance explained (PVE) on a held-out test dataset. A non-linear baseline model improved the PVEs from 28.1% to 30.1% (SBP) and 14.3% to 17.4% (DBP) compared with a linear baseline model. Including seven PRSs in the genetic model computed based on the largest available GWAS of SBP/DBP improved the genetic model PVE from 4.8% to 5.1% (SBP) and 4.7% to 5% (DBP) compared to using a single PRS. Adding additional 14 PRSs computed based on two independent GWASs further increased the genetic model PVE to 6.3% (SBP) and 5.7% (DBP). PVE differed across self-reported race/ethnicity groups, with primarily all non-White groups benefitting from the inclusion of additional PRSs.

Gueye-Ndiaye S, Hauptman M, Yu X, Li L, Rueschman M, Castro-Diehl C, et al. Multilevel Risk Factors for Sleep-Disordered Breathing-Related Symptom Burden in an Urban Pediatric Community-Based Sample.. CHEST pulmonary. 2023;1(3).

BACKGROUND: Pediatric sleep-disordered breathing (SDB) disproportionately affects children with low socioeconomic status (SES). The multilevel risk factors that drive these associations are not well understood.

RESEARCH QUESTION: What are the associations between SDB risk factors, including individual health conditions (obesity, asthma, and allergies), household SES (maternal education), indoor exposures (environmental tobacco smoke [ETS] and pests), and neighborhood characteristics (neighborhood disadvantage), and pediatric SDB symptoms?

STUDY DESIGN AND METHODS: Cross-sectional analyses were performed on 303 children (aged 6-12 years) enrolled in the Environmental Assessment of Sleep Youth study from 2018 to 2022. Exposures were determined by caregiver reports, assays of measured settled dust from the child's bedroom, and neighborhood-level Census data (deriving the Childhood Opportunity Index to characterize neighborhood disadvantage). The primary outcome was the SDB-related symptom burden assessed by the OSA-18 questionnaire total score. Using linear regression models, we calculated associations between exposures and SDB-related symptom burden, adjusting for sociodemographic factors, then health conditions, indoor environment, and neighborhood factors.

RESULTS: The sample included 303 children (39% Hispanic, Latino, Latina, or Spanish origin; 30% Black or African American; 22% White; and 11% other). Increasing OSA-18 total scores were associated with low household SES after adjustment for demographic factors, and with asthma, allergies, ETS, pests (mouse, cockroach, and rodents), and an indoor environmental index (sum of the presence of pests and ETS; 0-2) after adjusting for sociodemographic factors. Even after further adjusting for asthma, allergies, and neighborhood disadvantage, ETS and pest exposure were associated with OSA-18 (ETS: β = 12.80; 95% CI, 7.07-18.53, also adjusted for pest; pest exposure: β = 3.69; 95% CI, 0.44-6.94, also adjusted for ETS).

INTERPRETATION: In addition to associations with ETS, a novel association was observed for indoor pest exposure and SDB symptom burden. Strategies to reduce household exposure to ETS and indoor allergens should be tested as approaches for reducing sleep health disparities.