Soren Germer
Profile Url: soren-germer
Researcher at New York Genome Center
The Trans-Omics for Precision Medicine (TOPMed) program seeks to elucidate the genetic architecture and disease biology of heart, lung, blood, and sleep disorders, with the ultimate goal of improving diagnosis, treatment, and prevention. The initial phases of the program focus on whole genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here, we describe TOPMed goals and design as well as resources and early insights from the sequence data. The resources include a variant browser, a genotype imputation panel, and sharing of genomic and phenotypic data via dbGaP. In 53,581 TOPMed samples, >400 million single-nucleotide and insertion/deletion variants were detected by alignment with the reference genome. Additional novel variants are detectable through assembly of unmapped reads and customized analysis in highly variable loci. Among the >400 million variants detected, 97% have frequency <1% and 46% are singletons. These rare variants provide insights into mutational processes and recent human evolutionary history. The nearly complete catalog of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and non-coding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and extends the reach of nearly all genome-wide association studies to include variants down to ~0.01% in frequency.
Large-scale gene sequencing studies for complex traits have the potential to identify causal genes with therapeutic implications. We performed gene-based association testing of blood lipid levels with rare (minor allele frequency<1%) predicted damaging coding variation using sequence data from >170,000 individuals from multiple ancestries: 97,493 European, 30,025 South Asian, 16,507 African, 16,440 Hispanic/Latino, 10,420 East Asian, and 1,182 Samoan. We identified 35 genes associated with circulating lipid levels. Ten of these: ALB, SRSF2, JAK2, CREB3L3, TMEM136, VARS, NR1H3, PLA2G12A, PPARG and STAB1 have not been implicated for lipid levels using rare coding variation in population-based samples. We prioritize 32 genes identified in array-based genome-wide association study (GWAS) loci based on gene-based associations, of which three: EVI5, SH2B3, and PLIN1, had no prior evidence of rare coding variant associations. Most of the associated genes showed evidence of association in multiple ancestries. Also, we observed an enrichment of gene-based associations for low-density lipoprotein cholesterol drug target genes, and for genes closest to GWAS index single nucleotide polymorphisms (SNP). Our results demonstrate that gene-based associations can be beneficial for drug target development and provide evidence that the gene closest to the array-based GWAS index SNP is often the functional gene for blood lipid levels.
American Journal of Respiratory and Critical Care Medicine, 2018-06-15
Asthma is the most common chronic disease of children, with significant racial/ethnic differences in prevalence, morbidity, mortality and therapeutic response. Albuterol, a bronchodilator medication, is the first-line therapy for asthma treatment worldwide. We performed the largest whole genome sequencing (WGS) pharmacogenetics study to date using data from 1,441 minority children with asthma who had extremely high or low bronchodilator drug response (BDR). We identified population-specific and shared pharmacogenetic variants associated with BDR, including genome-wide significant (p < 3.53 x 10-7) and suggestive (p < 7.06 x 10-6) loci near genes previously associated with lung capacity (DNAH5), immunity (NFKB1 and PLCB1), and β-adrenergic signaling pathways (ADAMTS3 and COX18). Functional analyses centered on NFKB1 revealed potential regulatory function of our BDR-associated SNPs in bronchial smooth muscle cells. Specifically, these variants are in linkage disequilibrium with SNPs in a functionally active enhancer, and are also expression quantitative trait loci (eQTL) for a neighboring gene, SLC39A8. Given the lack of other asthma study populations with WGS data on minority children, replication of our rare variant associations is infeasible. We attempted to replicate our common variant findings in five independent studies with GWAS data. The age-specific associations previously found in asthma and asthma-related traits suggest that the over-representation of adults in our replication populations may have contributed to our lack of statistical replication, despite the functional relevance of the NFKB1 variants demonstrated by our functional assays. Our study expands the understanding of pharmacogenetic analyses in racially/ethnically diverse populations and advances the foundation for precision medicine in at-risk and understudied minority populations.