Metagenomic Data Assembly - The Way of Decoding Unknown Microorganisms

Front Microbiol. 2021 Mar 23:12:613791. doi: 10.3389/fmicb.2021.613791. eCollection 2021.

Abstract

Metagenomics is a segment of conventional microbial genomics dedicated to the sequencing and analysis of combined genomic DNA of entire environmental samples. The most critical step of the metagenomic data analysis is the reconstruction of individual genes and genomes of the microorganisms in the communities using metagenomic assemblers - computational programs that put together small fragments of sequenced DNA generated by sequencing instruments. Here, we describe the challenges of metagenomic assembly, a wide spectrum of applications in which metagenomic assemblies were used to better understand the ecology and evolution of microbial ecosystems, and present one of the most efficient microbial assemblers, SPAdes that was upgraded to become applicable for metagenomics.

Keywords: SPAdes; algorithms; metagenomic assembly; metagenomics; microbiota.

Publication types

  • Review