MegaLMM: Mega-scale linear mixed models for genomic predictions with thousands of traits

Genome Biol. 2021 Jul 23;22(1):213. doi: 10.1186/s13059-021-02416-w.

Abstract

Large-scale phenotype data can enhance the power of genomic prediction in plant and animal breeding, as well as human genetics. However, the statistical foundation of multi-trait genomic prediction is based on the multivariate linear mixed effect model, a tool notorious for its fragility when applied to more than a handful of traits. We present MegaLMM, a statistical framework and associated software package for mixed model analyses of a virtually unlimited number of traits. Using three examples with real plant data, we show that MegaLMM can leverage thousands of traits at once to significantly improve genetic value prediction accuracy.

Keywords: Genomic prediction; High-throughput phenotyping; Multi-environment trial; Multi-trait Linear Mixed Model.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Arabidopsis / genetics*
  • Bayes Theorem
  • Gene-Environment Interaction
  • Genome, Plant*
  • Genomics
  • Genotype
  • Humans
  • Models, Genetic*
  • Phenotype
  • Plant Breeding
  • Quantitative Trait, Heritable*
  • Software*
  • Triticum / genetics*
  • Zea mays / genetics*