Emily Flynn - SciTok

01:37

Sex-specific genetic effects across biomarkers

Emily Flynn ,

Yosuke Tanigawa ,

Fatima Rodriguez ,

Russ Altman ,

Nasa Sinnott-Armstrong ,

Manuel A Rivas

European Journal of Human Genetics, 2020-09-01

Video Abstract (AI generated)

Paper

Preprint

European Journal of Human Genetics

Sex differences have been shown in laboratory biomarkers; however, the extent to which this is due to genetics is unknown. In this study, we infer sex-specific genetic parameters (heritability and genetic correlation) across 33 quantitative biomarker traits in 181,064 females and 156,135 males from the UK Biobank study. We apply a Bayesian mixture model, Sex Effects Mixture Model, to Genome-wide Association Study summary statistics in order to (1) estimate the contributions of sex to the genetic variance of these biomarkers and (2) identify variants whose statistical association with these traits is sex-specific. We find that the genetics of most biomarker traits are shared between males and females, with the notable exception of testosterone, where we identify 119 female and 444 male-specific variants. These include protein-altering variants in steroid hormone production genes (POR, CYP3A43, UGT2B7). Using the sex-specific variants as genetic instruments for Mendelian Randomization, we find evidence for causal links between testosterone levels and height, body mass index, waist circumference, and type 2 diabetes. We also show that sex-specific polygenic risk score models for testosterone outperform a combined model. Overall, these results demonstrate that while sex has a limited role in the genetics of most biomarker traits, sex plays an important role in testosterone genetics.

01:39

GRep: Gene Set Representation via Gaussian Embedding

Sheng Wang ,

Emily Flynn ,

Russ Altman

Video Abstract (AI generated)

Paper

Preprint

Molecular interaction networks are our basis for understanding functional interdependencies among genes. Network embedding approaches analyze these complicated networks by representing genes as low-dimensional vectors based on the network topology. These low-dimensional vectors have recently become the building blocks for a larger number of systems biology applications. Despite the success of embedding genes in this way, it remains unclear how to effectively represent gene sets, such as protein complexes and signaling pathways. The direct adaptation of existing gene embedding approaches to gene sets cannot model the diverse functions of genes in a set. Here, we propose GRep, a novel gene set embedding approach, which represents each gene set as a multivariate Gaussian distribution rather than a single point in the low-dimensional space. The diversity of genes in a set, or the uncertainty of their contribution to a particular function, is modeled by the covariance matrix of the multivariate Gaussian distribution. By doing so, GRep produces a highly informative and compact gene set representation. Using our representation, we analyze two major pharmacogenomics studies and observe substantial improvement in drug target identification from expression-derived gene sets. Overall, the GRep framework provides a novel representation of gene sets that can be used as input features to off-the-shelf machine learning classifiers for gene set analysis.

02:06

Large-Scale Labeling and Assessment of Sex Bias in Publicly Available Expression Data

Emily Flynn ,

Annie Chang ,

Russ Altman

Video Abstract (AI generated)

Paper

Preprint

Supplemental Figures and Table Legends

Supplemental Tables

Women are at more than 1.5-fold higher risk for clinically relevant adverse drug events. While this higher prevalence is partially due to gender-related effects, biological sex differences likely also impact drug response. Publicly available gene expression databases provide a unique opportunity for examining drug response at a cellular level. However, missingness and heterogeneity of metadata prevent large-scale identification of drug exposure studies and limit assessments of sex bias. To address this, we trained organism-specific models to infer sample sex from gene expression data, and used entity normalization to map metadata cell line and drug mentions to existing ontologies. Using this method, we infer sex labels for 450,371 human and 245,107 mouse microarray and RNA-seq samples from refine.bio. Overall, we find slight female bias (52.1%) in human samples and (62.5%) male bias in mouse samples; this corresponds to a majority of single sex studies, split between female-only and male-only (33.3% vs 18.4% in human and 31.0% vs 30.4% in mouse respectively). In drug studies, we find limited evidence for sex-sampling bias overall; however, specific categories of drugs, including human cancer and mouse nervous system drugs, are enriched in female-only and male-only studies respectively. Our expression-based sex labels allow us to further examine the complexity of cell line sex and assess the frequency of metadata sex label misannotations (2-5%). We make our inferred and normalized labels, along with flags for misannotated samples, publicly available to catalyze the routine use of sex as a study variable in future analyses. ### Competing Interest Statement The authors have declared no competing interest.

Bulletin Board

Emily hasn't posted anything yet.

Processing...

Sex-specific genetic effects across biomarkers

GRep: Gene Set Representation via Gaussian Embedding

Large-Scale Labeling and Assessment of Sex Bias in Publicly Available Expression Data