Publications

2019

Hidalgo BA, Sofer T, Qi Q, Schneiderman N, Chen YDI, Kaplan RC, et al. Associations between SLC16A11 variants and diabetes in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL).. Scientific reports. 2019;9(1):843.

Five sequence variants in SLC16A11 (rs117767867, rs13342692, rs13342232, rs75418188, and rs75493593), which occur in two non-reference haplotypes, were recently shown to be associated with diabetes in Mexicans from the SIGMA consortium. We aimed to determine whether these previous findings would replicate in the HCHS/SOL Mexican origin group and whether genotypic effects were similar in other HCHS/SOL groups. We analyzed these five variants in 2492 diabetes cases and 5236 controls from the Hispanic Community Health Study/Study of Latinos (HCHS/SOL), which includes U.S. participants from six diverse background groups (Mainland groups: Mexican, Central American, and South American; and Caribbean groups: Puerto Rican, Cuban, and Dominican). We estimated the SNP-diabetes association in the six groups and in the combined sample. We found that the risk alleles occur in two non-reference haplotypes in HCHS/SOL, as in the SIGMA Mexicans. The haplotype frequencies were very similar between SIGMA Mexicans and the HCHS/SOL Mainland groups, but different in the Caribbean groups. The SLC16A11 sequence variants were significantly associated with risk for diabetes in the Mexican origin group (P = 0.025), replicating the SIGMA findings. However, these variants were not significantly associated with diabetes in a combined analysis of all groups, although the power to detect such effects was 85% (assuming homogeneity of effects among the groups). Additional analyses performed separately in each of the five non-Mexican origin groups were not significant. We also analyzed (1) exclusion of young controls and, (2) SNP by BMI interactions, but neither was significant in the HCHS/SOL data. The previously reported effects of SLC16A11 variants on diabetes in Mexican samples was replicated in a large Mexican-American sample, but these effects were not significant in five non-Mexican Hispanic/Latino groups sampled from U.S. populations. Lack of replication in the HCHS/SOL non-Mexicans, and in the entire HCHS/SOL sample combined may represent underlying genetic heterogeneity. These results indicate a need for future genetic research to consider heterogeneity of the Hispanic/Latino population in the assessment of disease risk, but add to the evidence suggesting SLC16A11 as a potential therapeutic target for type 2 diabetes.

2018

Zekavat SM, Ruotsalainen S, Handsaker RE, Alver M, Bloom J, Poterba T, et al. Deep coverage whole genome sequences and plasma lipoprotein(a) in individuals of European and African ancestries.. Nature communications. 2018;9(1):2606.

Lipoprotein(a), Lp(a), is a modified low-density lipoprotein particle that contains apolipoprotein(a), encoded by LPA, and is a highly heritable, causal risk factor for cardiovascular diseases that varies in concentrations across ancestries. Here, we use deep-coverage whole genome sequencing in 8392 individuals of European and African ancestry to discover and interpret both single-nucleotide variants and copy number (CN) variation associated with Lp(a). We observe that genetic determinants between Europeans and Africans have several unique determinants. The common variant rs12740374 associated with Lp(a) cholesterol is an eQTL for SORT1 and independent of LDL cholesterol. Observed associations of aggregates of rare non-coding variants are largely explained by LPA structural variation, namely the LPA kringle IV 2 (KIV2)-CN. Finally, we find that LPA risk genotypes confer greater relative risk for incident atherosclerotic cardiovascular diseases compared to directly measured Lp(a), and are significantly associated with measures of subclinical atherosclerosis in African Americans.

Burkart KM, Sofer T, London SJ, Manichaikul A, Hartwig FP, Yan Q, et al. A Genome-Wide Association Study in Hispanics/Latinos Identifies Novel Signals for Lung Function. The Hispanic Community Health Study/Study of Latinos.. American journal of respiratory and critical care medicine. 2018;198(2):208-19.

RATIONALE: Lung function and chronic obstructive pulmonary disease (COPD) are heritable traits. Genome-wide association studies (GWAS) have identified numerous pulmonary function and COPD loci, primarily in cohorts of European ancestry.

OBJECTIVES: Perform a GWAS of COPD phenotypes in Hispanic/Latino populations to identify loci not previously detected in European populations.

METHODS: GWAS of lung function and COPD in Hispanic/Latino participants from a population-based cohort. We performed replication studies of novel loci in independent studies.

MEASUREMENTS AND MAIN RESULTS: Among 11,822 Hispanic/Latino participants, we identified eight novel signals; three replicated in independent populations of European Ancestry. A novel locus for FEV1 in ZSWIM7 (rs4791658; P = 4.99 × 10-9) replicated. A rare variant (minor allele frequency = 0.002) in HAL (rs145174011) was associated with FEV1/FVC (P = 9.59 × 10-9) in a region previously identified for COPD-related phenotypes; it remained significant in conditional analyses but did not replicate. Admixture mapping identified a novel region, with a variant in AGMO (rs41331850), associated with Amerindian ancestry and FEV1, which replicated. A novel locus for FEV1 identified among ever smokers (rs291231; P = 1.92 × 10-8) approached statistical significance for replication in admixed populations of African ancestry, and a novel SNP for COPD in PDZD2 (rs7709630; P = 1.56 × 10-8) regionally replicated. In addition, loci previously identified for lung function in European samples were associated in Hispanic/Latino participants in the Hispanic Community Health Study/Study of Latinos at the genome-wide significance level.

CONCLUSIONS: We identified novel signals for lung function and COPD in a Hispanic/Latino cohort. Including admixed populations when performing genetic studies may identify variants contributing to genetic etiologies of COPD.

Wyss AB, Sofer T, Lee MK, Terzikhan N, Nguyen JN, Lahousse L, et al. Multiethnic meta-analysis identifies ancestry-specific and cross-ancestry loci for pulmonary function.. Nature communications. 2018;9(1):2976.

Nearly 100 loci have been identified for pulmonary function, almost exclusively in studies of European ancestry populations. We extend previous research by meta-analyzing genome-wide association studies of 1000 Genomes imputed variants in relation to pulmonary function in a multiethnic population of 90,715 individuals of European (N = 60,552), African (N = 8429), Asian (N = 9959), and Hispanic/Latino (N = 11,775) ethnicities. We identify over 50 additional loci at genome-wide significance in ancestry-specific or multiethnic meta-analyses. Using recent fine-mapping methods incorporating functional annotation, gene expression, and differences in linkage disequilibrium between ethnicities, we further shed light on potential causal variants and genes at known and newly identified loci. Several of the novel genes encode proteins with predicted or established drug targets, including KCNK2 and CDK12. Our study highlights the utility of multiethnic and integrative genomics approaches to extend existing knowledge of the genetics of lung function and clinical relevance of implicated loci.

Zekavat SM, Ruotsalainen S, Handsaker RE, Alver M, Bloom J, Poterba T, et al. Publisher Correction: Deep coverage whole genome sequences and plasma lipoprotein(a) in individuals of European and African ancestries.. Nature communications. 2018;9(1):3493.

The original version of this article contained an error in the name of the author Ramachandran S. Vasan, which was incorrectly given as Vasan S. Ramachandran. This has now been corrected in both the PDF and HTML versions of the article.

Sofer T, Moon JY, Isasi CR, Qi Q, Shah NA, Kaplan RC, et al. Relationship of genetic determinants of height with cardiometabolic and pulmonary traits in the Hispanic Community Health Study/Study of Latinos.. International journal of epidemiology. 2018;47(6):2059-6.

BACKGROUND: Associations of adult height with cardiometabolic and pulmonary traits have been studied in majority European ancestry populations using Mendelian randomization and polygenic risk score (PRS) analysis. The standard PRS approach entails creating a PRS for height using variants identified in prior genome-wide association studies (GWAS). It is unclear how well the standard PRS approach performs in non-European populations and whether height-trait associations observed in Europeans are also observed in other populations.

METHODS: In the Hispanic Community Health Study/Study of Latinos (HCHS/SOL), we used: (i) the standard approach to create a PRS for height (PRS1) and (ii) a novel approach to optimize the selection of variants from previously established height association loci to better explain height in HCHS/SOL (PRS2). We also estimated the extent to which PRS-trait associations were independent or mediated by the PRS effect on height.

RESULTS: In 7539 women and 5245 men, PRS1 and PRS2 explained 9 and 29% of the variance in measured height, respectively. Both PRS1 and PRS2 were associated with forced expiratory volume in 1 second (FEV1), forced vital capacity (FVC), FEV1/ FVC ratio, total cholesterol and 2-hour oral glucose-tolerance test insulin levels. Additionally, PRS2 was associated with estimated glomerular filtration rate and ankle brachial index. Both PRS1 and PRS2 had pleiotropic associations with FEV1/ FVC ratio in mediation analyses.

CONCLUSIONS: Associations of polygenic scores of height with measures of lung function and cholesterol were consistent with those observed in prior studies of majority European ancestry populations. Mediation analysis may augment standard PRS approaches to disentangle pleiotropic and mediated effects.

Natarajan P, Peloso GM, Zekavat SM, Montasser M, Ganna A, Chaffin M, et al. Deep-coverage whole genome sequences and blood lipids among 16,324 individuals.. Nature communications. 2018;9(1):3391.

Large-scale deep-coverage whole-genome sequencing (WGS) is now feasible and offers potential advantages for locus discovery. We perform WGS in 16,324 participants from four ancestries at mean depth >29X and analyze genotypes with four quantitative traits-plasma total cholesterol, low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol, and triglycerides. Common variant association yields known loci except for few variants previously poorly imputed. Rare coding variant association yields known Mendelian dyslipidemia genes but rare non-coding variant association detects no signals. A high 2M-SNP LDL-C polygenic score (top 5th percentile) confers similar effect size to a monogenic mutation ( 30 mg/dl higher for each); however, among those with severe hypercholesterolemia, 23% have a high polygenic score and only 2% carry a monogenic mutation. At these sample sizes and for these phenotypes, the incremental value of WGS for discovery is limited but WGS permits simultaneous assessment of monogenic and polygenic models to severe hypercholesterolemia.

Sarnowski C, Satizabal CL, DeCarli C, Pitsillides AN, Cupples A, Vasan RS, et al. Whole genome sequence analyses of brain imaging measures in the Framingham Study.. Neurology. 2018;90(3):e188-e196.

OBJECTIVE: We sought to identify rare variants influencing brain imaging phenotypes in the Framingham Heart Study by performing whole genome sequence association analyses within the Trans-Omics for Precision Medicine Program.

METHODS: We performed association analyses of cerebral and hippocampal volumes and white matter hyperintensity (WMH) in up to 2,180 individuals by testing the association of rank-normalized residuals from mixed-effect linear regression models adjusted for sex, age, and total intracranial volume with individual variants while accounting for familial relatedness. We conducted gene-based tests for rare variants using (1) a sliding-window approach, (2) a selection of functional exonic variants, or (3) all variants.

RESULTS: We detected new loci in 1p21 for cerebral volume (minor allele frequency [MAF] 0.005, p = 10-8) and in 16q23 for hippocampal volume (MAF 0.05, p = 2.7 × 10-8). Previously identified associations in 12q24 for hippocampal volume (rs7294919, p = 4.4 × 10-4) and in 17q25 for WMH (rs7214628, p = 2.0 × 10-3) were confirmed. Gene-based tests detected associations (p ≤ 2.3 × 10-6) in new loci for cerebral (5q13, 8p12, 9q31, 13q12-q13, 15q24, 17q12, 19q13) and hippocampal volumes (2p12) and WMH (3q13, 4p15) including Alzheimer disease- (UNC5D) and Parkinson disease-associated genes (GBA). Pathway analyses evidenced enrichment of associated genes in immunity, inflammation, and Alzheimer disease and Parkinson disease pathways.

CONCLUSIONS: Whole genome sequence-wide search reveals intriguing new loci associated with brain measures. Replication of novel loci is needed to confirm these findings.

Sung YJ, Winkler TW, Fuentes L de L, Bentley AR, Brown MR, Kraja AT, et al. A Large-Scale Multi-ancestry Genome-wide Study Accounting for Smoking Behavior Identifies Multiple Significant Loci for Blood Pressure.. American journal of human genetics. 2018;102(3):375-400.

Genome-wide association analysis advanced understanding of blood pressure (BP), a major risk factor for vascular conditions such as coronary heart disease and stroke. Accounting for smoking behavior may help identify BP loci and extend our knowledge of its genetic architecture. We performed genome-wide association meta-analyses of systolic and diastolic BP incorporating gene-smoking interactions in 610,091 individuals. Stage 1 analysis examined ∼18.8 million SNPs and small insertion/deletion variants in 129,913 individuals from four ancestries (European, African, Asian, and Hispanic) with follow-up analysis of promising variants in 480,178 additional individuals from five ancestries. We identified 15 loci that were genome-wide significant (p < 5 × 10-8) in stage 1 and formally replicated in stage 2. A combined stage 1 and 2 meta-analysis identified 66 additional genome-wide significant loci (13, 35, and 18 loci in European, African, and trans-ancestry, respectively). A total of 56 known BP loci were also identified by our results (p < 5 × 10-8). Of the newly identified loci, ten showed significant interaction with smoking status, but none of them were replicated in stage 2. Several loci were identified in African ancestry, highlighting the importance of genetic studies in diverse populations. The identified loci show strong evidence for regulatory features and support shared pathophysiology with cardiometabolic and addiction traits. They also highlight a role in BP regulation for biological candidates such as modulators of vascular structure and function (CDKN1B, BCAR1-CFDP1, PXDN, EEA1), ciliopathies (SDCCAG8, RPGRIP1L), telomere maintenance (TNKS, PINX1, AKTIP), and central dopaminergic signaling (MSRA, EBF2).

Chen H, Cade BE, Gleason KJ, Bjonnes AC, Stilp AM, Sofer T, et al. Multiethnic Meta-Analysis Identifies RAI1 as a Possible Obstructive Sleep Apnea-related Quantitative Trait Locus in Men.. American journal of respiratory cell and molecular biology. 2018;58(3):391-40.

Obstructive sleep apnea (OSA) is a common heritable disorder displaying marked sexual dimorphism in disease prevalence and progression. Previous genetic association studies have identified a few genetic loci associated with OSA and related quantitative traits, but they have only focused on single ethnic groups, and a large proportion of the heritability remains unexplained. The apnea-hypopnea index (AHI) is a commonly used quantitative measure characterizing OSA severity. Because OSA differs by sex, and the pathophysiology of obstructive events differ in rapid eye movement (REM) and non-REM (NREM) sleep, we hypothesized that additional genetic association signals would be identified by analyzing the NREM/REM-specific AHI and by conducting sex-specific analyses in multiethnic samples. We performed genome-wide association tests for up to 19,733 participants of African, Asian, European, and Hispanic/Latino American ancestry in 7 studies. We identified rs12936587 on chromosome 17 as a possible quantitative trait locus for NREM AHI in men (N = 6,737; P = 1.7 × 10-8) but not in women (P = 0.77). The association with NREM AHI was replicated in a physiological research study (N = 67; P = 0.047). This locus overlapping the RAI1 gene and encompassing genes PEMT1, SREBF1, and RASD1 was previously reported to be associated with coronary artery disease, lipid metabolism, and implicated in Potocki-Lupski syndrome and Smith-Magenis syndrome, which are characterized by abnormal sleep phenotypes. We also identified gene-by-sex interactions in suggestive association regions, suggesting that genetic variants for AHI appear to vary by sex, consistent with the clinical observations of strong sexual dimorphism.