Integrating Genome-Wide Association and eQTLs Studies Identifies the Genes and Gene Sets Associated with Diabetes

Biomed Res Int. 2017:2017:1758636. doi: 10.1155/2017/1758636. Epub 2017 Jun 28.

Abstract

Aim: To identify novel candidate genes and gene sets for diabetes.

Methods: We performed an integrative analysis of genome-wide association studies (GWAS) and expression quantitative trait loci (eQTLs) data for diabetes. Summary data was driven from a large-scale GWAS of diabetes, totally involving 58,070 individuals. eQTLs dataset included 923,021 cis-eQTL for 14,329 genes and 4,732 trans-eQTL for 2,612 genes. Integrative analysis of GWAS and eQTLs data was conducted by summary data-based Mendelian randomization (SMR). To identify the gene sets associated with diabetes, the SMR single gene analysis results were further subjected to gene set enrichment analysis (GSEA). A total of 13,311 annotated gene sets were analyzed in this study.

Results: SMR analysis identified 6 genes significantly associated with fasting glucose, such as C11ORF10 (p value = 6.04 × 10-8), MRPL33 (p value = 1.24 × 10-7), and FADS1 (p value = 2.39 × 10-7). Gene set analysis identified HUANG_FOXA2_TARGETS_UP (false discovery rate = 0.047) associated with fasting glucose.

Conclusion: Our study provides novel clues for clarifying the genetic mechanism of diabetes. This study also illustrated the good performance of SMR approach and extended it to gene set association analysis for complex diseases.

MeSH terms

  • Blood Glucose / metabolism
  • Delta-5 Fatty Acid Desaturase
  • Diabetes Mellitus / genetics*
  • Fasting / blood
  • Gene Expression Regulation*
  • Genetic Predisposition to Disease*
  • Genome-Wide Association Study*
  • Humans
  • Insulin / blood
  • Mendelian Randomization Analysis
  • Quantitative Trait Loci / genetics*

Substances

  • Blood Glucose
  • Delta-5 Fatty Acid Desaturase
  • Insulin
  • FADS1 protein, human