Gene expression-based classification and regulatory networks of pediatric acute lymphoblastic leukemia

Blood. 2009 Nov 12;114(20):4486-93. doi: 10.1182/blood-2009-04-218123. Epub 2009 Sep 15.

Abstract

Pediatric acute lymphoblastic leukemia (ALL) contains cytogenetically distinct subtypes that respond differently to cytotoxic drugs. Subtype classification can also be achieved through gene expression profiling. However, how to apply such classifiers to a single patient and correctly diagnose the disease subtype in an independent patient group has not been addressed. Furthermore, the underlying regulatory mechanisms responsible for the subtype-specific gene expression patterns are still largely unknown. Here, by combining 3 published microarray datasets on 535 mostly white children's samples and generating a new dataset on 100 Chinese children's ALL samples, we were able to (1) identify a 62-gene classifier with 97.6% accuracy from the white children's samples and validate it on the completely independent set of 100 Chinese samples, and (2) uncover potential regulatory networks of ALL subtypes. The classifier we identified was, thus far, the only one that could be applied directly to a single sample and that sustained validation in a large independent patient group. Our results also suggest that the etiology of ALL is largely the same among different ethnic groups, and that the transcription factor hubs in the predicted regulatory network might play important roles in regulating gene expression and the development of ALL.
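
The workflow described above (train a subtype classifier on one cohort, apply it to one sample at a time, then measure accuracy on an independent cohort) can be illustrated with a minimal sketch. This is not the authors' 62-gene classifier: the nearest-centroid rule, the function names, the subtype labels, and the three-gene toy expression values are all assumptions chosen only to show the general idea.

```python
from math import sqrt

def train_centroids(samples, labels):
    """Average the expression vectors of each subtype into one centroid."""
    sums, counts = {}, {}
    for vec, lab in zip(samples, labels):
        acc = sums.setdefault(lab, [0.0] * len(vec))
        for i, value in enumerate(vec):
            acc[i] += value
        counts[lab] = counts.get(lab, 0) + 1
    return {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}

def classify_sample(sample, centroids):
    """Assign a single sample to the subtype with the nearest centroid."""
    def dist(a, b):
        return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return min(centroids, key=lambda lab: dist(sample, centroids[lab]))

def accuracy(samples, labels, centroids):
    """Fraction of samples whose predicted subtype matches the known label."""
    hits = sum(classify_sample(s, centroids) == l for s, l in zip(samples, labels))
    return hits / len(labels)

# Hypothetical toy data: 3 genes per sample, 2 made-up subtypes.
train_x = [[9.0, 1.0, 1.0], [8.0, 2.0, 1.0], [1.0, 9.0, 2.0], [2.0, 8.0, 1.0]]
train_y = ["subtype_A", "subtype_A", "subtype_B", "subtype_B"]

# Stand-in for an independent validation cohort, never used in training.
valid_x = [[8.5, 1.5, 0.5], [1.5, 8.5, 1.5]]
valid_y = ["subtype_A", "subtype_B"]

centroids = train_centroids(train_x, train_y)
print(classify_sample(valid_x[0], centroids))  # classify one sample on its own
print(accuracy(valid_x, valid_y, centroids))   # accuracy on the held-out cohort
```

Because each prediction depends only on the stored centroids and the one sample being classified, the validation cohort never influences training, which is the property the abstract emphasizes.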

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Child
  • Data Mining
  • Databases, Genetic
  • Gene Expression
  • Gene Expression Profiling / methods*
  • Humans
  • Oligonucleotide Array Sequence Analysis
  • Precursor Cell Lymphoblastic Leukemia-Lymphoma / classification*
  • Precursor Cell Lymphoblastic Leukemia-Lymphoma / genetics*
  • Reproducibility of Results
  • Reverse Transcriptase Polymerase Chain Reaction