Hailiang Huang
Profile Url: hailiang-huang
Researcher at Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
Nature Genetics, 2016-11-21
Genomic copy number variants (CNVs) have been strongly implicated in the etiology of schizophrenia (SCZ). However, apart from a small number of risk variants, elucidation of the CNV contribution to risk has been difficult due to the rarity of risk alleles, all occurring in less than 1% of cases. We sought to address this obstacle through a collaborative effort in which we applied a centralized analysis pipeline to a SCZ cohort of 21,094 cases and 20,227 controls. We observed a global enrichment of CNV burden in cases (OR=1.11, P=5.7e-15), which persisted after excluding loci implicated in previous studies (OR=1.07, P=1.7e-6). CNV burden is also enriched for genes associated with synaptic function (OR = 1.68, P = 2.8e-11) and neurobehavioral phenotypes in mouse (OR = 1.18, P= 7.3e-5). We identified genome-wide significant support for eight loci, including 1q21.1, 2p16.3 (NRXN1), 3q29, 7q11.2, 15q13.3, distal 16p11.2, proximal 16p11.2 and 22q11.2. We find support at a suggestive level for nine additional candidate susceptibility and protective loci, which consist predominantly of CNVs mediated by non-allelic homologous recombination (NAHR).
Recent advances in genotyping and sequencing technologies have made detecting rare variants in large cohorts possible. Various analytic methods for associating disease to rare variants have been proposed, including burden tests, C-alpha and SKAT. Most of these methods, however, assume that samples come from a homogeneous population, which is not realistic for analyses of large samples. Not correcting for population stratification causes inflated p-values and false-positive associations. Here we propose a population-informed bootstrap resampling method that controls for population stratification (Bootstrat) in rare variant tests. In essence, the Bootstrat procedure uses genetic distance to create a phenotype probability for each sample. We show that this empirical approach can effectively correct for population stratification while maintaining statistical power comparable to established methods of controlling for population stratification. The Bootstrat scheme can be easily applied to existing rare variant testing methods with reasonable computational complexity.
Cell, 2018-06-14
Schizophrenia (SCZ) and bipolar disorder (BD) are highly heritable disorders that share a significant proportion of common risk variation. Understanding the genetic factors underlying the specific symptoms of these disorders will be crucial for improving diagnosis, intervention and treatment. In case-control data consisting of 53,555 cases (20,129 BD, 33,426 SCZ) and 54,065 controls, we identified 114 genome-wide significant loci (GWS) when comparing all cases to controls, of which 41 represented novel findings. Two genome-wide significant loci were identified when comparing SCZ to BD and a third was found when directly incorporating functional information. Regional joint association identified a genomic region of overlapping association in BD and SCZ with disease-independent causal variants indicating a fourth region contributing to differences between these disorders. Regional SNP-heritability analyses demonstrated that the estimated heritability of BD based on the SCZ GWS regions was significantly higher than that based on the average genomic region (91 regions, p = 1.2x10-6) while the inverse was not significant (19 regions, p=0.89). Using our BD and SCZ GWAS we calculated polygenic risk scores and identified several significant correlations with: 1) SCZ subphenotypes: negative symptoms (SCZ, p=3.6x10-6) and manic symptoms (BD, p=2x10-5), 2) BD subphenotypes: psychotic features (SCZ p=1.2x10-10, BD p=5.3x10-5) and age of onset (SCZ p=7.9x10-4). Finally, we show that psychotic features in BD has significant SNP-heritability (h2snp=0.15, SE=0.06), and a significant genetic correlation with SCZ (rg=0.34) in addition there is a significant sign test result between SCZ GWAS and a GWAS of BD cases contrasting those with and without psychotic features (p=0.0038, one-side binomial test). For the first time, we have identified specific loci pointing to a potential role of 4 genes (DARS2, ARFGEF2, DCAKD and GATAD2A) that distinguish between BD and SCZ, providing an opportunity to understand the biology contributing to clinical differences of these disorders. Our results provide the best evidence so far of genomic components distinguishing between BD and SCZ that contribute directly to specific symptom dimensions.
Nature Genetics, 2019-02-25
Autism spectrum disorder (ASD) is a highly heritable and heterogeneous group of neurodevelopmental phenotypes diagnosed in more than 1% of children. Common genetic variants contribute substantially to ASD susceptibility, but to date no individual variants have been robustly associated with ASD. With a marked sample size increase from a unique Danish population resource, we report a genome-wide association meta-analysis of 18,381 ASD cases and 27,969 controls that identifies five genome-wide significant loci. Leveraging GWAS results from three phenotypes with significantly overlapping genetic architectures (schizophrenia, major depression, and educational attainment), seven additional loci shared with other traits are identified at equally strict significance levels. Dissecting the polygenic architecture we find both quantitative and qualitative polygenic heterogeneity across ASD subtypes, in contrast to what is typically seen in other complex disorders. These results highlight biological insights, particularly relating to neuronal function and corticogenesis and establish that GWAS performed at scale will be much more productive in the near term in ASD, just as it has been in a broad range of important psychiatric and diverse medical phenotypes.
Nature Neuroscience, 2017-11-28
As technology advances, whole genome sequencing (WGS) is likely to supersede other genotyping technologies. The rate of this change depends on its relative cost and utility. Variants identified uniquely through WGS may reveal novel biological pathways underlying complex disorders and provide high-resolution insight into when, where, and in which cell type these pathways are affected. Alternatively, cheaper and less computationally intensive approaches may yield equivalent insights. Understanding the role of rare variants in the noncoding gene-regulating genome, through pilot WGS projects, will be critical to determine which of these two extremes best represents reality. With large cohorts, well-defined risk loci, and a compelling need to understand the underlying biology, psychiatric disorders have a role to play in this preliminary WGS assessment. The WGSPD consortium will integrate data for 18,000 individuals with psychiatric disorders, beginning with autism spectrum disorder, schizophrenia, bipolar disorder, and major depressive disorder, along with over 150,000 controls.
Multiple mRNA isoforms can be generated from a single gene locus through alternative splicing. Abnormality in alternative splicing has been linked to many human disorders. Here using RNA-seq data from 48 tissues from GTEx v7 release and summary statistics from GWAS of complex diseases and traits, we present a study to identify genomic variants regulating junction-skipping with the goal to understand their contribution to complex diseases and traits. For each tissue, we found 48 - 575 junction-skipping events regulated by genomic variants. We performed fine-mapping on both the junction-skipping association and 23 complex disease and trait associations and mapped them to 95% credible sets. We found 13 - 279 junction-skipping regulations were mapped to a credible set with ≤5 variants. On the genome-wide scale, we noted a clear disease-tissue specificity. Results from this approach provided critical insights into the functional mechanism of the genetic disease associations and contributed to our understanding of the genetic architecture of human complex disorders.