Analysis of whole-genome microarray replicates using mixed models

Bioinformatics. 2003 Jan;19(1):53-61. doi: 10.1093/bioinformatics/19.1.53.

Abstract

Motivation: Microarray experiments are inherently noisy. Replication is the key to estimating realistic fold-changes despite such noise. In the analysis of the various sources of noise the dependency structure of the replication needs to be taken into account.

Results: We analyzed replicate data sets from a Mycobacterium tuberculosis trcS mutant in order to identify differentially expressed genes and suggest new methods for filtering and normalizing raw array data and for imputing missing values. Mixed ANOVA models are applied to quantify the various sources of error. Such analysis also allows us to determine the optimal number of samples and arrays. Significance values for differential expression are obtained by a hierarchical bootstrapping scheme on scaled residuals. Four highly upregulated genes, including bfrB, were analyzed further. We observed an artefact, where transcriptional readthrough from these genes led to apparent upregulation of adjacent genes.

Availability: All methods and data discussed are available in the package YASMAhttp://www.cryst.bbk.ac.uk/wernisch/yasma.html for the statistical data analysis system R (http://www.R-project.org).

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Analysis of Variance
  • Cells, Cultured
  • DNA, Bacterial / analysis
  • DNA, Bacterial / classification
  • DNA, Bacterial / genetics*
  • Drosophila Proteins*
  • Gene Expression Regulation, Bacterial / genetics*
  • Genome, Bacterial
  • Humans
  • Models, Genetic*
  • Models, Statistical
  • Mutagenesis, Site-Directed
  • Mycobacterium tuberculosis / genetics
  • Oligonucleotide Array Sequence Analysis / methods*
  • Protein Serine-Threonine Kinases / genetics*
  • Quality Control
  • Reproducibility of Results
  • Sensitivity and Specificity

Substances

  • DNA, Bacterial
  • Drosophila Proteins
  • trc protein, Drosophila
  • Protein Serine-Threonine Kinases