Konstantin Strauch
Profile Url: konstantin-strauch
Researcher at Institute of Genetic Epidemiology, Helmholtz Zentrum Munchen - German Research Center for Environmental Health, Neuherberg, Germany
Reproductive longevity is critical for fertility and impacts healthy ageing in women, yet insights into the underlying biological mechanisms and treatments to preserve it are limited. Here, we identify 290 genetic determinants of ovarian ageing, assessed using normal variation in age at natural menopause (ANM) in ~200,000 women of European ancestry. These common alleles influence clinical extremes of ANM; women in the top 1% of genetic susceptibility have an equivalent risk of premature ovarian insufficiency to those carrying monogenic FMR1 premutations. Identified loci implicate a broad range of DNA damage response (DDR) processes and include loss-of-function variants in key DDR genes. Integration with experimental models demonstrates that these DDR processes act across the life-course to shape the ovarian reserve and its rate of depletion. Furthermore, we demonstrate that experimental manipulation of DDR pathways highlighted by human genetics increase fertility and extend reproductive life in mice. Causal inference analyses using the identified genetic variants indicates that extending reproductive life in women improves bone health and reduces risk of type 2 diabetes, but increases risks of hormone-sensitive cancers. These findings provide insight into the mechanisms governing ovarian ageing, when they act across the life-course, and how they might be targeted by therapeutic approaches to extend fertility and prevent disease.
The timing of puberty is a highly polygenic childhood trait that is epidemiologically associated with various adult diseases. Here, we analyse 1000-Genome reference panel imputed genotype data on up to ~370,000 women and identify 389 independent signals (all P<5x10-8) for age at menarche, a notable milestone in female pubertal development. In Icelandic data from deCODE, these signals explain ~7.4% of the population variance in age at menarche, corresponding to one quarter of the estimated heritability. We implicate over 250 genes via coding variation or associated gene expression, and demonstrate enrichment across genes active in neural tissues. We identify multiple rare variants near the imprinted genes MKRN3 and DLK1 that exhibit large effects on menarche only when paternally inherited. Disproportionate effects of variants on early or late puberty timing are observed: single variant and heritability estimates are larger for early than late puberty timing in females. The opposite pattern is seen in males, with larger estimates for late than early puberty timing. Mendelian randomization analyses indicate causal inverse associations, independent of BMI, between puberty timing and risks for breast and endometrial cancers in women, and prostate cancer in men. In aggregate, our findings reveal new complexity in the genetic regulation of puberty timing and support new causal links with adult cancer risks.
Protein-coding genetic variants that strongly affect disease risk can provide important clues into disease pathogenesis. Here we report an exome sequence analysis of 20,791 type 2 diabetes (T2D) cases and 24,440 controls from five ancestries. We identify rare (minor allele frequency<0.5%) variant gene-level associations in (a) three genes at exome-wide significance, including a T2D protective series of >30 SLC30A8 alleles, and (b) within 12 gene sets, including those corresponding to T2D drug targets (p=6.1Ă—10-3) and candidate genes from knockout mice (p=5.2Ă—10-3). Within our study, the strongest T2D rare variant gene-level signals explain at most 25% of the heritability of the strongest common single variant signals, and the rare variant gene-level effect sizes we observe in established T2D drug targets will require 110K-180K sequenced cases to exceed exome-wide significance. To help prioritize genes using associations from current smaller sample sizes, we present a Bayesian framework to recalibrate association p-values as posterior probabilities of association, estimating that reaching p<0.05 (p<0.005) in our study increases the odds of causal T2D association for a nonsynonymous variant by a factor of 1.8 (5.3). To help guide target or gene prioritization efforts, our data are freely available for analysis at www.type2diabetesgenetics.org.
We assembled an ancestrally diverse collection of genome-wide association studies of type 2 diabetes (T2D) in 180,834 cases and 1,159,055 controls (48.9% non-European descent). We identified 277 loci at genome-wide significance (p<5x10-8), including 237 attaining a more stringent trans-ancestry threshold (p<5x10-9), which were delineated to 338 distinct association signals. Trans-ancestry meta-regression offered substantial enhancements to fine-mapping, with 58.6% of associations more precisely localised due to population diversity, and 54.4% of signals resolved to a single variant with >50% posterior probability. This improved fine-mapping enabled systematic assessment of candidate causal genes and molecular mechanisms through which T2D associations are mediated, laying foundations for functional investigations. Trans-ancestry genetic risk scores enhanced transferability across diverse populations, providing a step towards more effective clinical translation to improve global health.
Epigenetics & Chromatin, 2017-01-03
Genetic data are known to harbor information about human demographics, and genotyping data are commonly used for capturing ancestry information by leveraging genome-wide differences between populations. In contrast, it is not clear to what extent population structure is captured by whole-genome DNA methylation data. We demonstrate, using three large cohort 450K methylation array data sets, that ancestry information signal is mirrored in genome-wide DNA methylation data, and that it can be further isolated more effectively by leveraging the correlation structure of CpGs with cis-located SNPs. Based on these insights, we propose a method, EPISTRUCTURE, for the inference of ancestry from methylation data, without the need for genotype data. EPISTRUCTURE can be used to infer ancestry information of individuals based on their methylation data in the absence of corresponding genetic data. Although genetic data are often collected in epigenetic studies of large cohorts, these are typically not made publicly available, making the application of EPISTRUCTURE especially useful for anyone working on public data. Implementation of EPISTRUCTURE is available in GLINT, our recently released toolset for DNA methylation analysis at: http://glint-epigenetics.readthedocs.io.