Cardiovascular genomics: a biomarker identification pipeline

IEEE Trans Inf Technol Biomed. 2012 Sep;16(5):809-22. doi: 10.1109/TITB.2012.2199570. Epub 2012 May 16.

Abstract

Genomic biomarkers are essential for understanding the underlying molecular basis of human diseases such as cardiovascular disease. In this review, we describe a biomarker identification pipeline for cardiovascular disease, which includes 1) high-throughput genomic data acquisition, 2) preprocessing and normalization of data, 3) exploratory analysis, 4) feature selection, 5) classification, and 6) interpretation and validation of candidate biomarkers. We review each step in the pipeline, presenting current and widely used bioinformatics methods. Furthermore, we analyze several publicly available cardiovascular genomics datasets to illustrate the pipeline. Finally, we summarize the current challenges and opportunities for further research.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Biomarkers / analysis
  • Cardiovascular Diseases / genetics*
  • Cardiovascular Diseases / metabolism
  • Cluster Analysis
  • Gene Expression Profiling
  • Genomics / methods*
  • Humans
  • Oligonucleotide Array Sequence Analysis

Substances

  • Biomarkers