Publications by Year: 2021

2021

Kasela S, Ortega VE, Martorella M, Garudadri S, Nguyen J, Ampleford E, et al. Genetic and non-genetic factors affecting the expression of COVID-19-relevant genes in the large airway epithelium.. Genome medicine. 2021;13(1):66.

BACKGROUND: The large airway epithelial barrier provides one of the first lines of defense against respiratory viruses, including SARS-CoV-2 that causes COVID-19. Substantial inter-individual variability in individual disease courses is hypothesized to be partially mediated by the differential regulation of the genes that interact with the SARS-CoV-2 virus or are involved in the subsequent host response. Here, we comprehensively investigated non-genetic and genetic factors influencing COVID-19-relevant bronchial epithelial gene expression.

METHODS: We analyzed RNA-sequencing data from bronchial epithelial brushings obtained from uninfected individuals. We related ACE2 gene expression to host and environmental factors in the SPIROMICS cohort of smokers with and without chronic obstructive pulmonary disease (COPD) and replicated these associations in two asthma cohorts, SARP and MAST. To identify airway biology beyond ACE2 binding that may contribute to increased susceptibility, we used gene set enrichment analyses to determine if gene expression changes indicative of a suppressed airway immune response observed early in SARS-CoV-2 infection are also observed in association with host factors. To identify host genetic variants affecting COVID-19 susceptibility in SPIROMICS, we performed expression quantitative trait (eQTL) mapping and investigated the phenotypic associations of the eQTL variants.

RESULTS: We found that ACE2 expression was higher in relation to active smoking, obesity, and hypertension that are known risk factors of COVID-19 severity, while an association with interferon-related inflammation was driven by the truncated, non-binding ACE2 isoform. We discovered that expression patterns of a suppressed airway immune response to early SARS-CoV-2 infection, compared to other viruses, are similar to patterns associated with obesity, hypertension, and cardiovascular disease, which may thus contribute to a COVID-19-susceptible airway environment. eQTL mapping identified regulatory variants for genes implicated in COVID-19, some of which had pheWAS evidence for their potential role in respiratory infections.

CONCLUSIONS: These data provide evidence that clinically relevant variation in the expression of COVID-19-related genes is associated with host factors, environmental exposures, and likely host genetic variation.

Liu Y, Zhang X, Lee J, Smelser D, Cade B, Chen H, et al. Genome-wide association study of neck circumference identifies sex-specific loci independent of generalized adiposity.. International journal of obesity (2005). 2021;45(7):1532-41.

BACKGROUND/OBJECTIVES: Neck circumference, an index of upper airway fat, has been suggested to be an important measure of body-fat distribution with unique associations with health outcomes such as obstructive sleep apnea and metabolic disease. This study aims to study the genetic bases of neck circumference.

METHODS: We conducted a multi-ethnic genome-wide association study of neck circumference, adjusted and unadjusted for BMI, in up to 15,090 European Ancestry (EA) and African American (AA) individuals. Because sexually dimorphic associations have been observed for anthropometric traits, we conducted both sex-combined and sex-specific analysis.

RESULTS: We identified rs227724 near the Noggin (NOG) gene as a possible quantitative locus for neck circumference in men (N = 8831, P = 1.74 × 10-9) but not in women (P = 0.08). The association was replicated in men (N = 1554, P = 0.045) in an independent dataset. This locus was previously reported to be associated with human height and with self-reported snoring. We also identified rs13087058 on chromosome 3 as a suggestive locus in sex-combined analysis (N = 15090, P = 2.94 × 10-7; replication P =0.049). This locus was also associated with electrocardiogram-assessed PR interval and is a cis-expression quantitative locus for the PDZ Domain-containing ring finger 2 (PDZRN3) gene. Both NOG and PDZRN3 interact with members of transforming growth factor-beta superfamily signaling proteins.

CONCLUSIONS: Our study suggests that neck circumference may have unique genetic basis independent of BMI.

Sofer T, Kurniansyah N, Aguet F, Ardlie K, Durda P, Nickerson DA, et al. Benchmarking association analyses of continuous exposures with RNA-seq in observational studies.. Briefings in bioinformatics. 2021;22(6).

Large datasets of hundreds to thousands of individuals measuring RNA-seq in observational studies are becoming available. Many popular software packages for analysis of RNA-seq data were constructed to study differences in expression signatures in an experimental design with well-defined conditions (exposures). In contrast, observational studies may have varying levels of confounding transcript-exposure associations; further, exposure measures may vary from discrete (exposed, yes/no) to continuous (levels of exposure), with non-normal distributions of exposure. We compare popular software for gene expression-DESeq2, edgeR and limma-as well as linear regression-based analyses for studying the association of continuous exposures with RNA-seq. We developed a computation pipeline that includes transformation, filtering and generation of empirical null distribution of association P-values, and we apply the pipeline to compute empirical P-values with multiple testing correction. We employ a resampling approach that allows for assessment of false positive detection across methods, power comparison and the computation of quantile empirical P-values. The results suggest that linear regression methods are substantially faster with better control of false detections than other methods, even with the resampling method to compute empirical P-values. We provide the proposed pipeline with fast algorithms in an R package Olivia, and implemented it to study the associations of measures of sleep disordered breathing with RNA-seq in peripheral blood mononuclear cells in participants from the Multi-Ethnic Study of Atherosclerosis.

Sarnowski C, Cousminer DL, Franceschini N, Raffield LM, Jia G, Fernández-Rhodes L, et al. Large trans-ethnic meta-analysis identifies AKR1C4 as a novel gene associated with age at menarche.. Human reproduction (Oxford, England). 2021;36(7):1999-2010.

STUDY QUESTION: Does the expansion of genome-wide association studies (GWAS) to a broader range of ancestries improve the ability to identify and generalise variants associated with age at menarche (AAM) in European populations to a wider range of world populations?

SUMMARY ANSWER: By including women with diverse and predominantly non-European ancestry in a large-scale meta-analysis of AAM with half of the women being of African ancestry, we identified a new locus associated with AAM in African-ancestry participants, and generalised loci from GWAS of European ancestry individuals.

WHAT IS KNOWN ALREADY: AAM is a highly polygenic puberty trait associated with various diseases later in life. Both AAM and diseases associated with puberty timing vary by race or ethnicity. The majority of GWAS of AAM have been performed in European ancestry women.

STUDY DESIGN, SIZE, DURATION: We analysed a total of 38 546 women who did not have predominantly European ancestry backgrounds: 25 149 women from seven studies from the ReproGen Consortium and 13 397 women from the UK Biobank. In addition, we used an independent sample of 5148 African-ancestry women from the Southern Community Cohort Study (SCCS) for replication.

PARTICIPANTS/MATERIALS, SETTING, METHODS: Each AAM GWAS was performed by study and ancestry or ethnic group using linear regression models adjusted for birth year and study-specific covariates. ReproGen and UK Biobank results were meta-analysed using an inverse variance-weighted average method. A trans-ethnic meta-analysis was also carried out to assess heterogeneity due to different ancestry.

MAIN RESULTS AND THE ROLE OF CHANCE: We observed consistent direction and effect sizes between our meta-analysis and the largest GWAS conducted in European or Asian ancestry women. We validated four AAM loci (1p31, 6q16, 6q22 and 9q31) with common genetic variants at P < 5 × 10-7. We detected one new association (10p15) at P < 5 × 10-8 with a low-frequency genetic variant lying in AKR1C4, which was replicated in an independent sample. This gene belongs to a family of enzymes that regulate the metabolism of steroid hormones and have been implicated in the pathophysiology of uterine diseases. The genetic variant in the new locus is more frequent in African-ancestry participants, and has a very low frequency in Asian or European-ancestry individuals.

LARGE SCALE DATA: N/A.

LIMITATIONS, REASONS FOR CAUTION: Extreme AAM (<9 years or >18 years) were excluded from analysis. Women may not fully recall their AAM as most of the studies were conducted many years later. Further studies in women with diverse and predominantly non-European ancestry are needed to confirm and extend these findings, but the availability of such replication samples is limited.

WIDER IMPLICATIONS OF THE FINDINGS: Expanding association studies to a broader range of ancestries or ethnicities may improve the identification of new genetic variants associated with complex diseases or traits and the generalisation of variants from European-ancestry studies to a wider range of world populations.

STUDY FUNDING/COMPETING INTEREST(S): Funding was provided by CHARGE Consortium grant R01HL105756-07: Gene Discovery For CVD and Aging Phenotypes and by the NIH grant U24AG051129 awarded by the National Institute on Aging (NIA). The authors have no conflict of interest to declare.

Chen J, Spracklen CN, Marenne G, Varshney A, Corbin LJ, Luan J, et al. The trans-ancestral genomic architecture of glycemic traits.. Nature genetics. 2021;53(6):840-6.

Glycemic traits are used to diagnose and monitor type 2 diabetes and cardiometabolic health. To date, most genetic studies of glycemic traits have focused on individuals of European ancestry. Here we aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available. Trans-ancestry and single-ancestry meta-analyses identified 242 loci (99 novel; P < 5 × 10-8), 80% of which had no significant evidence of between-ancestry heterogeneity. Analyses restricted to individuals of European ancestry with equivalent sample size would have led to 24 fewer new loci. Compared with single-ancestry analyses, equivalent-sized trans-ancestry fine-mapping reduced the number of estimated variants in 99% credible sets by a median of 37.5%. Genomic-feature, gene-expression and gene-set analyses revealed distinct biological signatures for each trait, highlighting different underlying biological pathways. Our results increase our understanding of diabetes pathophysiology by using trans-ancestry studies for improved power and resolution.

Sofer T, Zheng X, Laurie CA, Gogarten SM, Brody JA, Conomos MP, et al. Variant-specific inflation factors for assessing population stratification at the phenotypic variance level.. Nature communications. 2021;12(1):3506.

In modern Whole Genome Sequencing (WGS) epidemiological studies, participant-level data from multiple studies are often pooled and results are obtained from a single analysis. We consider the impact of differential phenotype variances by study, which we term 'variance stratification'. Unaccounted for, variance stratification can lead to both decreased statistical power, and increased false positives rates, depending on how allele frequencies, sample sizes, and phenotypic variances vary across the studies that are pooled. We develop a procedure to compute variant-specific inflation factors, and show how it can be used for diagnosis of genetic association analyses on pooled individual level data from multiple studies. We describe a WGS-appropriate analysis approach, implemented in freely-available software, which allows study-specific variances and thereby improves performance in practice. We illustrate the variance stratification problem, its solutions, and the proposed diagnostic procedure, in simulations and in data from the Trans-Omics for Precision Medicine Whole Genome Sequencing Program (TOPMed), used in association tests for hemoglobin concentrations and BMI.

Keramati AR, Chen MH, Rodriguez BAT, Yanek LR, Bhan A, Gaynor BJ, et al. Genome sequencing unveils a regulatory landscape of platelet reactivity.. Nature communications. 2021;12(1):3626.

Platelet aggregation at the site of atherosclerotic vascular injury is the underlying pathophysiology of myocardial infarction and stroke. To build upon prior GWAS, here we report on 16 loci identified through a whole genome sequencing (WGS) approach in 3,855 NHLBI Trans-Omics for Precision Medicine (TOPMed) participants deeply phenotyped for platelet aggregation. We identify the RGS18 locus, which encodes a myeloerythroid lineage-specific regulator of G-protein signaling that co-localizes with expression quantitative trait loci (eQTL) signatures for RGS18 expression in platelets. Gene-based approaches implicate the SVEP1 gene, a known contributor of coronary artery disease risk. Sentinel variants at RGS18 and PEAR1 are associated with thrombosis risk and increased gastrointestinal bleeding risk, respectively. Our WGS findings add to previously identified GWAS loci, provide insights regarding the mechanism(s) by which genetics may influence cardiovascular disease risk, and underscore the importance of rare variant and regulatory approaches to identifying loci contributing to complex phenotypes.

Li R, Rueschman M, Gottlieb DJ, Redline S, Sofer T. A composite sleep and pulmonary phenotype predicting hypertension.. EBioMedicine. 2021;68:103433.

BACKGROUND: Multiple aspects of sleep and Sleep Disordered Breathing (SDB) have been linked to hypertension. However, the standard measure of SDB, the Apnoea Hypopnea Index (AHI), has not identified patients likely to experience large improvements in blood pressure with SDB treatment.

METHODS: To use machine learning to select sleep and pulmonary measures associated with hypertension development when considered jointly, we applied feature screening followed by Elastic Net penalized regression in association with incident hypertension using a wide array of polysomnography measures, and lung function, derived for the Sleep Heart Health Study (SHHS).

FINDINGS: At baseline, n=860 SHHS individuals with complete data were age 61 years, on average. Of these, 291 developed hypertension  5 years later. A combination of pulmonary function and 18 sleep phenotypes predicted incident hypertension (OR=1.43, 95% confidence interval [1.14, 1.80] per 1 standard deviation (SD) of the phenotype), while the apnoea-hypopnea index (AHI) had low evidence of association with incident hypertension (OR =1.13, 95% confidence interval [0.97, 1.33] per 1 SD). In a generalization analysis in 923 individuals from the Multi-Ethnic Study of Atherosclerosis, aged 65 on average with 615 individuals with hypertension, the new phenotype was cross-sectionally associated with hypertension (OR=1.26, 95% CI [1.10, 1.45]).

INTERPRETATION: A unique combination of sleep and pulmonary function measures better predicts hypertension compared to the AHI. The composite measure included indices capturing apnoea and hypopnea event durations, with shorter event lengths associated with increased risk of hypertension.

FUNDING: This research was supported by National Heart, Lung, and Blood Institute (NHLBI) contracts HHSN268201500003I, N01-HC-95159, N01-HC-95160, N01-HC-95161, N01-HC-95162, N01-HC-95163, N01-HC-95164, N01-HC-95165, N01-HC-95166, N01-HC-95167, N01-HC-95168, and N01-HC-95169 and by National Center for Advancing Translational Sciences grants UL1-TR- 000040, UL1-TR-001079, and UL1-TR-001420. The MESA Sleep ancillary study was supported by NHLBI grant HL-56984. Pulmonary phenotyping in MESA was funded by NHLBI grants R01-HL077612 and R01-HL093081. This work was supported by NHLBI grant R35HL135818 to Susan Redline.