Universal method for robust detection of circadian state from gene expression

Rosemary Braun; William L Kath; Marta Iwanaszko; Elzbieta Kula-Eversole; Sabra M Abbott; Kathryn J Reid; Phyllis C Zee; Ravi Allada

doi:10.1073/pnas.1800314115

Universal method for robust detection of circadian state from gene expression

Proc Natl Acad Sci U S A. 2018 Sep 25;115(39):E9247-E9256. doi: 10.1073/pnas.1800314115. Epub 2018 Sep 10.

Authors

Rosemary Braun^{1

2

3}, William L Kath^{2

3

4}, Marta Iwanaszko^{5

3}, Elzbieta Kula-Eversole⁴, Sabra M Abbott^{6

7}, Kathryn J Reid^{6

7}, Phyllis C Zee^{4

6}, Ravi Allada^{3

4}

Affiliations

¹ Biostatistics Division, Department of Preventive Medicine, Northwestern University, Chicago, IL 60611; rbraun@northwestern.edu.
² Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, IL 60208.
³ NSF-Simons Center for Quantitative Biology, Northwestern University, Evanston, IL 60208.
⁴ Department of Neurobiology, Northwestern University, Evanston, IL 60208.
⁵ Biostatistics Division, Department of Preventive Medicine, Northwestern University, Chicago, IL 60611.
⁶ Department of Neurology, Northwestern University, Chicago, IL 60611.
⁷ the Center for Circadian and Sleep Medicine, Northwestern University, Chicago, IL 60611.

Abstract

Circadian clocks play a key role in regulating a vast array of biological processes, with significant implications for human health. Accurate assessment of physiological time using transcriptional biomarkers found in human blood can significantly improve diagnosis of circadian disorders and optimize the delivery time of therapeutic treatments. To be useful, such a test must be accurate, minimally burdensome to the patient, and readily generalizable to new data. A major obstacle in development of gene expression biomarker tests is the diversity of measurement platforms and the inherent variability of the data, often resulting in predictors that perform well in the original datasets but cannot be universally applied to new samples collected in other settings. Here, we introduce TimeSignature, an algorithm that robustly infers circadian time from gene expression. We demonstrate its application in data from three independent studies using distinct microarrays and further validate it against a new set of samples profiled by RNA-sequencing. Our results show that TimeSignature is more accurate and efficient than competing methods, estimating circadian time to within 2 h for the majority of samples. Importantly, we demonstrate that once trained on data from a single study, the resulting predictor can be universally applied to yield highly accurate results in new data from other studies independent of differences in study population, patient protocol, or assay platform without renormalizing the data or retraining. This feature is unique among expression-based predictors and addresses a major challenge in the development of generalizable, clinically useful tests.

Keywords: circadian rhythms; cross-platform prediction; gene expression dynamics; machine learning.

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Biomarkers / blood
Circadian Clocks / genetics*
Circadian Rhythm / genetics
Gene Expression
Gene Expression Profiling / methods*
Genes / genetics
Humans
Machine Learning*
Models, Statistical
Reproducibility of Results
Sleep
Transcriptome

Substances

Biomarkers

Abstract

Publication types

MeSH terms

Substances

Grants and funding