Genetic diversity contribution to errors in short oligonucleotide microarray analysis

Plant Biotechnol J. 2006 Sep;4(5):489-98. doi: 10.1111/j.1467-7652.2006.00198.x.

Abstract

DNA arrays based on short oligonucleotide (< or = 25-mer) probes are being developed for many species, and are being applied to quantify transcript abundance variation in species with high genetic diversity. To define the parameters necessary to design short oligo arrays for maize (Zea mays L.), a species with particularly high nucleotide (single nucleotide polymorphism, SNP) and insertion-deletion (indel) polymorphism frequencies, we analysed gene expression estimates generated for four maize inbred lines using a custom Affymetrix DNA array, and identified biases associated with high levels of polymorphism between lines. Statistically significant interactions between probes and maize inbreds were detected, affecting five or more probes (out of 30 probes per transcript) in the majority of cases. SNPs and indels were identified by re-sequencing; they are the primary source of probe-by-line interactions, affecting probeset level estimates and reducing the power of detecting transcript level variation between maize inbreds. This analysis identified 36,196 probes in 5118 probesets containing markers that may be used for genotyping in natural and segregating populations for association gene analysis and genetic mapping.

MeSH terms

  • Base Sequence
  • DNA Probes
  • Gene Expression Profiling
  • Genes, Plant
  • Genotype
  • Molecular Sequence Data
  • Nucleic Acid Hybridization
  • Oligonucleotide Array Sequence Analysis*
  • Polymorphism, Genetic*
  • Sequence Homology, Nucleic Acid
  • Zea mays / genetics

Substances

  • DNA Probes