Empirically Estimable Classification Bounds Based on a Nonparametric Divergence Measure

IEEE Trans Signal Process. 2016 Feb 1;64(3):580-591. doi: 10.1109/TSP.2015.2477805.

Abstract

Information divergence functions play a critical role in statistics and information theory. In this paper we show that a non-parametric f-divergence measure can be used to provide improved bounds on the minimum binary classification probability of error for the case when the training and test data are drawn from the same distribution and for the case where there exists some mismatch between training and test distributions. We confirm the theoretical results by designing feature selection algorithms using the criteria from these bounds and by evaluating the algorithms on a series of pathological speech classification tasks.

Keywords: Bayes error rate; classification; divergence measures; domain adaptation; non-parametric divergence estimator.