Principal component analysis based feature extraction approach to identify circulating microRNA biomarkers

PLoS One. 2013 Jun 24;8(6):e66714. doi: 10.1371/journal.pone.0066714. Print 2013.

Abstract

The discovery and characterization of blood-based disease biomarkers are clinically important because blood collection is easy and involves relatively little stress for the patient. However, blood generally reflects not only targeted diseases, but also the whole body status of patients. Thus, the selection of biomarkers may be difficult. In this study, we considered miRNAs as biomarker candidates for several reasons. First, since miRNAs were discovered relatively recently, they have not yet been tested extensively. Second, since the number of miRNAs is relatively limited, selection is expected to be easy. Third, since they are known to play critical roles in a wide range of biological processes, their expression may be disease specific. We applied a newly proposed method to select combinations of miRNAs that discriminate between healthy controls and each of 14 diseases that include 5 cancers. A new feature selection method is based on principal component analysis. Namely this method does not require knowledge of whether each sample was derived from a disease patient or a healthy control. Using this method, we found that hsa-miR-425, hsa-miR-15b, hsa-miR-185, hsa-miR-92a, hsa-miR-140-3p, hsa-miR-320a, hsa-miR-486-5p, hsa-miR-16, hsa-miR-191, hsa-miR-106b, hsa-miR-19b, and hsa-miR-30d were potential biomarkers; combinations of 10 of these miRNAs allowed us to discriminate each disease included in this study from healthy controls. These 12 miRNAs are significantly up- or downregulated in most cancers and other diseases, albeit in a cancer- or disease-specific combinatory manner. Therefore, these 12 miRNAs were also previously reported to be cancer- and disease-related miRNAs. Many disease-specific KEGG pathways were also significantly enriched by target genes of up-/downregulated miRNAs within several combinations of 10 miRNAs among these 12 miRNAs. We also selected miRNAs that could discriminate one disease from another or from healthy controls. These miRNAs were found to be largely overlapped with miRNAs that discriminate each disease from healthy controls.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers / blood
  • Biomarkers / metabolism
  • Computational Biology / methods*
  • Humans
  • MicroRNAs / blood*
  • MicroRNAs / genetics
  • MicroRNAs / metabolism
  • Neoplasms / blood
  • Neoplasms / diagnosis
  • Neoplasms / genetics
  • Principal Component Analysis / methods*
  • Probability
  • Transcriptome

Substances

  • Biomarkers
  • MicroRNAs

Grants and funding

This study was supported by KAKENHI (23300357) (http://www.jsps.go.jp/english/e-grants/index.html). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.