tidybulk: an R tidy framework for modular transcriptomic data analysis

Genome Biol. 2021 Jan 22;22(1):42. doi: 10.1186/s13059-020-02233-7.

Abstract

Recently, efforts have been made toward the harmonization of transcriptomic data structures and workflows using the concept of data tidiness, to facilitate modularisation. We present tidybulk, a modular framework for bulk transcriptional analyses that introduces a tidy transcriptomic data structure paradigm and analysis grammar. Tidybulk covers a wide variety of analysis procedures and integrates a large ecosystem of publicly available analysis algorithms under a common framework. Tidybulk decreases coding burden, facilitates reproducibility, increases efficiency for expert users, lowers the learning curve for inexperienced users, and bridges transcriptional data analysis with the tidyverse. Tidybulk is available at R/Bioconductor bioconductor.org/packages/tidybulk .

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods
  • Data Analysis*
  • Ecosystem
  • Gene Expression Profiling / methods
  • Genomics / methods
  • Reproducibility of Results
  • Software
  • Transcriptome*