SpaGCN: Integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network

Nat Methods. 2021 Nov;18(11):1342-1351. doi: 10.1038/s41592-021-01255-8. Epub 2021 Oct 28.

Abstract

Recent advances in spatially resolved transcriptomics (SRT) technologies have enabled comprehensive characterization of gene expression patterns in the context of tissue microenvironment. To elucidate spatial gene expression variation, we present SpaGCN, a graph convolutional network approach that integrates gene expression, spatial location and histology in SRT data analysis. Through graph convolution, SpaGCN aggregates gene expression of each spot from its neighboring spots, which enables the identification of spatial domains with coherent expression and histology. The subsequent domain guided differential expression (DE) analysis then detects genes with enriched expression patterns in the identified domains. Analyzing seven SRT datasets using SpaGCN, we show it can detect genes with much more enriched spatial expression patterns than competing methods. Furthermore, genes detected by SpaGCN are transferrable and can be utilized to study spatial variation of gene expression in other datasets. SpaGCN is computationally fast, platform independent, making it a desirable tool for diverse SRT studies.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Animals
  • Brain / metabolism*
  • Cluster Analysis
  • Computational Biology
  • Dorsolateral Prefrontal Cortex / metabolism*
  • Gene Expression Regulation
  • Genes*
  • Humans
  • Mice
  • Neural Networks, Computer
  • Pancreatic Neoplasms / genetics*
  • Pancreatic Neoplasms / pathology
  • Software*
  • Spatial Analysis
  • Transcriptome*
  • Visual Cortex / metabolism*

Associated data

  • Dryad/10.5061/dryad.8t8s248