In genomic selection choosing the statistical machine learning model is of paramount importance. In this paper, we present an application of a zero altered random forest model with two versions (ZAP_RF and ZAPC_RF) to deal with excess zeros in count response variables. The proposed model was compared with the conventional random forest (RF) model and with the conventional Generalized Poisson Ridge regression (GPR) using two real datasets, and we found that, in terms of prediction performance, the proposed zero inflated random forest model outperformed the conventional RF and GPR models.
Keywords: GenPred; Genomic Prediction; Shared Data Resource; count data; genomic selection; plant breeding; random forest; zero altered Poisson.
© The Author(s) 2020. Published by Oxford University Press on behalf of Genetics Society of America.