Bayesian latent variable models for hierarchical clustered count outcomes with repeated measures in microbiome studies

Genet Epidemiol. 2017 Apr;41(3):221-232. doi: 10.1002/gepi.22031. Epub 2017 Jan 22.

Abstract

Motivated by the multivariate nature of microbiome data with hierarchical taxonomic clusters, counts that are often skewed and zero inflated, and repeated measures, we propose a Bayesian latent variable methodology to jointly model multiple operational taxonomic units within a single taxonomic cluster. This novel method can incorporate both negative binomial and zero-inflated negative binomial responses, and can account for serial and familial correlations. We develop a Markov chain Monte Carlo algorithm that is built on a data augmentation scheme using Pólya-Gamma random variables. Hierarchical centering and parameter expansion techniques are also used to improve the convergence of the Markov chain. We evaluate the performance of our proposed method through extensive simulations. We also apply our method to a human microbiome study.

Keywords: Bayesian latent variable model; microbiome; multivariate model; repeated measures; zero-inflated count outcomes.

Publication types

  • Comparative Study

MeSH terms

  • Adolescent
  • Adult
  • Age Factors
  • Aged
  • Algorithms*
  • Bayes Theorem*
  • Cluster Analysis*
  • DNA, Bacterial / genetics*
  • Feces / microbiology
  • Female
  • Humans
  • Markov Chains
  • Microbiota / genetics*
  • Middle Aged
  • Models, Genetic*
  • Models, Statistical
  • Monte Carlo Method
  • Obesity / genetics
  • Research Design
  • Thinness / genetics
  • Twin Studies as Topic
  • Young Adult

Substances

  • DNA, Bacterial