Molecular digitization of a botanical garden: high-depth whole-genome sequencing of 689 vascular plant species from the Ruili Botanical Garden

Gigascience. 2019 Apr 1;8(4):giz007. doi: 10.1093/gigascience/giz007.

Abstract

Background: Genome sequencing has been widely used in plant research to construct reference genomes and provide evolutionary insights. However, few plant species have had their whole genome sequenced, thus restraining the utility of these data. We collected 1,093 samples of vascular plant species growing in the Ruili Botanical Garden, located in southwest China. Of these, we sequenced 761 samples and collected voucher specimens stored in the Herbarium of China National GeneBank.

Results: The 761 sequenced samples represented 689 vascular plant species from 137 families belonging to 49 orders. Of these, 257 samples were identified to the species level and 504 to the family level, using specimen and chloroplast sequences. In total, we generated 54 Tb of sequencing data, with an average sequencing depth of 60X per species, as estimated from genome sizes. A reference phylogeny was reconstructed with 78 chloroplast genes for molecular identification and other possible applications.

Conclusions: The large dataset of vascular plant genomes generated in this study, which includes both high-depth whole-genome sequencing data and associated voucher specimens, is valuable for plant genome research and other applications. This project also provides insight into the feasibility and technical requirements for "planetary-scale" projects such as the 10,000 Plant Genomes Project and the Earth BioGenome Project.

Keywords: Ruili Botanical Garden; phylogeny; vascular plants; voucher specimens; whole-genome sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • China
  • Gardens / classification*
  • Genome Size
  • Genome, Plant*
  • Genomics* / methods
  • Heterozygote
  • Phylogeny
  • Plants / classification*
  • Plants / genetics*
  • Repetitive Sequences, Nucleic Acid
  • Whole Genome Sequencing