Reference range: Which statistical intervals to use?

Stat Methods Med Res. 2021 Feb;30(2):523-534. doi: 10.1177/0962280220961793. Epub 2020 Oct 14.

Abstract

Reference ranges, which are data-based intervals aiming to contain a pre-specified large proportion of the population values, are powerful tools to analyse observations in clinical laboratories. Their main point is to classify any future observations from the population which fall outside them as atypical and thus may warrant further investigation. As a reference range is constructed from a random sample from the population, the event 'a reference range contains (100 P)% of the population' is also random. Hence, all we can hope for is that such event has a large occurrence probability. In this paper we argue that some intervals, including the P prediction interval, are not suitable as reference ranges since there is a substantial probability that these intervals contain less than (100 P)% of the population, especially when the sample size is large. In contrast, a (P,γ) tolerance interval is designed to contain (100 P)% of the population with a pre-specified large confidence γ so it is eminently adequate as a reference range. An example based on real data illustrates the paper's key points.

Keywords: Nonparametric prediction interval; nonparametric tolerance interval; prediction interval; reference range; tolerance interval.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Confidence Intervals
  • Probability
  • Reference Values*
  • Sample Size