Genes with differential expression across ancestries are enriched in ancestry-specific disease effects likely due to gene-by-environment interactions

Am J Hum Genet. 2024 Oct 3;111(10):2117-2128. doi: 10.1016/j.ajhg.2024.07.021. Epub 2024 Aug 26.

Abstract

Multi-ancestry genome-wide association studies (GWASs) have highlighted the existence of variants with ancestry-specific effect sizes. Understanding where and why these ancestry-specific effects occur is fundamental to understanding the genetic basis of human diseases and complex traits. Here, we characterized genes differentially expressed across ancestries (ancDE genes) at the cell-type level by leveraging single-cell RNA-sequencing data in peripheral blood mononuclear cells for 21 individuals with East Asian (EAS) ancestry and 23 individuals with European (EUR) ancestry (172,385 cells); then, we tested whether variants surrounding those genes were enriched in disease variants with ancestry-specific effect sizes by leveraging ancestry-matched GWASs of 31 diseases and complex traits (average n ∼ 90,000 and ∼ 267,000 in EAS and EUR, respectively). We observed that ancDE genes tended to be cell-type specific and enriched in genes interacting with the environment and in variants with ancestry-specific disease effect sizes, which suggests cell-type-specific, gene-by-environment interactions shared between regulatory and disease architectures. Finally, we illustrated how different environments might have led to ancestry-specific myeloid cell leukemia 1 (MCL1) expression in B cells and ancestry-specific allele effect sizes in lymphocyte count GWASs for variants surrounding MCL1. Our results imply that large single-cell and GWAS datasets from diverse ancestries are required to improve our understanding of human diseases.

MeSH terms

  • Asian People / genetics
  • Gene Expression Regulation
  • Gene-Environment Interaction*
  • Genetic Predisposition to Disease
  • Genome-Wide Association Study*
  • Humans
  • Leukocytes, Mononuclear / metabolism
  • Myeloid Cell Leukemia Sequence 1 Protein / genetics
  • Polymorphism, Single Nucleotide
  • Single-Cell Analysis
  • White People* / genetics

Substances

  • Myeloid Cell Leukemia Sequence 1 Protein