Toward a Mobile Platform for Real-world Digital Measurement of Depression: User-Centered Design, Data Quality, and Behavioral and Clinical Modeling

Stefanie Nickels; Matthew D Edwards; Sarah F Poole; Dale Winter; Jessica Gronsbell; Bella Rozenkrants; David P Miller; Mathias Fleck; Alan McLean; Bret Peterson; Yuanwei Chen; Alan Hwang; David Rust-Smith; Arthur Brant; Andrew Campbell; Chen Chen; Collin Walter; Patricia A Arean; Honor Hsin; Lance J Myers; William J Marks Jr; Jessica L Mega; Danielle A Schlosser; Andrew J Conrad; Robert M Califf; Menachem Fromer

doi:10.2196/27589

Toward a Mobile Platform for Real-world Digital Measurement of Depression: User-Centered Design, Data Quality, and Behavioral and Clinical Modeling

JMIR Ment Health. 2021 Aug 10;8(8):e27589. doi: 10.2196/27589.

Authors

Stefanie Nickels¹, Matthew D Edwards¹, Sarah F Poole¹, Dale Winter¹, Jessica Gronsbell¹, Bella Rozenkrants¹, David P Miller¹, Mathias Fleck¹, Alan McLean¹, Bret Peterson¹, Yuanwei Chen¹, Alan Hwang¹, David Rust-Smith¹, Arthur Brant¹, Andrew Campbell², Chen Chen¹, Collin Walter¹, Patricia A Arean³, Honor Hsin¹, Lance J Myers¹, William J Marks Jr¹, Jessica L Mega¹, Danielle A Schlosser¹, Andrew J Conrad¹, Robert M Califf¹, Menachem Fromer¹

Affiliations

¹ Verily Life Sciences, South San Francisco, CA, United States.
² Department of Computer Science, Dartmouth College, Hanover, NH, United States.
³ Department of Psychiatry & Behavioral Sciences, University of Washington, Seattle, WA, United States.

PMID: 34383685
PMCID: PMC8386379
DOI: 10.2196/27589

Abstract

Background: Although effective mental health treatments exist, the ability to match individuals to optimal treatments is poor, and timely assessment of response is difficult. One reason for these challenges is the lack of objective measurement of psychiatric symptoms. Sensors and active tasks recorded by smartphones provide a low-burden, low-cost, and scalable way to capture real-world data from patients that could augment clinical decision-making and move the field of mental health closer to measurement-based care.

Objective: This study tests the feasibility of a fully remote study on individuals with self-reported depression using an Android-based smartphone app to collect subjective and objective measures associated with depression severity. The goals of this pilot study are to develop an engaging user interface for high task adherence through user-centered design; test the quality of collected data from passive sensors; start building clinically relevant behavioral measures (features) from passive sensors and active inputs; and preliminarily explore connections between these features and depression severity.

Methods: A total of 600 participants were asked to download the study app to join this fully remote, observational 12-week study. The app passively collected 20 sensor data streams (eg, ambient audio level, location, and inertial measurement units), and participants were asked to complete daily survey tasks, weekly voice diaries, and the clinically validated Patient Health Questionnaire (PHQ-9) self-survey. Pairwise correlations between derived behavioral features (eg, weekly minutes spent at home) and PHQ-9 were computed. Using these behavioral features, we also constructed an elastic net penalized multivariate logistic regression model predicting depressed versus nondepressed PHQ-9 scores (ie, dichotomized PHQ-9).

Results: A total of 415 individuals logged into the app. Over the course of the 12-week study, these participants completed 83.35% (4151/4980) of the PHQ-9s. Applying data sufficiency rules for minimally necessary daily and weekly data resulted in 3779 participant-weeks of data across 384 participants. Using a subset of 34 behavioral features, we found that 11 features showed a significant (P<.001 Benjamini-Hochberg adjusted) Spearman correlation with weekly PHQ-9, including voice diary-derived word sentiment and ambient audio levels. Restricting the data to those cases in which all 34 behavioral features were present, we had available 1013 participant-weeks from 186 participants. The logistic regression model predicting depression status resulted in a 10-fold cross-validated mean area under the curve of 0.656 (SD 0.079).

Conclusions: This study finds a strong proof of concept for the use of a smartphone-based assessment of depression outcomes. Behavioral features derived from passive sensors and active tasks show promising correlations with a validated clinical measure of depression (PHQ-9). Future work is needed to increase scale that may permit the construction of more complex (eg, nonlinear) predictive models and better handle data missingness.

Keywords: GPS; adherence; app usage; depression; digital phenotyping; engagement; location; mHealth; mental health; mobile phone; mobile sensing; mobility; physical activity; sleep; user-centered design; voice diaries.

©Stefanie Nickels, Matthew D Edwards, Sarah F Poole, Dale Winter, Jessica Gronsbell, Bella Rozenkrants, David P Miller, Mathias Fleck, Alan McLean, Bret Peterson, Yuanwei Chen, Alan Hwang, David Rust-Smith, Arthur Brant, Andrew Campbell, Chen Chen, Collin Walter, Patricia A Arean, Honor Hsin, Lance J Myers, William J Marks Jr, Jessica L Mega, Danielle A Schlosser, Andrew J Conrad, Robert M Califf, Menachem Fromer. Originally published in JMIR Mental Health (https://mental.jmir.org), 10.08.2021.