Statistical Methodology for the Analysis of Repeated Duration Data in Behavioral Studies

J Speech Lang Hear Res. 2018 Mar 15;61(3):561-582. doi: 10.1044/2017_JSLHR-S-17-0135.

Abstract

Purpose: Repeated duration data are frequently used in behavioral studies. Classical linear or log-linear mixed models are often inadequate to analyze such data, because they usually consist of nonnegative and skew-distributed variables. Therefore, we recommend use of a statistical methodology specific to duration data.

Method: We propose a methodology based on Cox mixed models and written under the R language. This semiparametric model is indeed flexible enough to fit duration data. To compare log-linear and Cox mixed models in terms of goodness-of-fit on real data sets, we also provide a procedure based on simulations and quantile-quantile plots.

Results: We present two examples from a data set of speech and gesture interactions, which illustrate the limitations of linear and log-linear mixed models, as compared to Cox models. The linear models are not validated on our data, whereas Cox models are. Moreover, in the second example, the Cox model exhibits a significant effect that the linear model does not.

Conclusions: We provide methods to select the best-fitting models for repeated duration data and to compare statistical methodologies. In this study, we show that Cox models are best suited to the analysis of our data set.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Behavioral Research / methods*
  • Compulsive Behavior
  • Data Interpretation, Statistical*
  • Gestures
  • Humans
  • Models, Statistical
  • Software
  • Speech
  • Time Factors