Predicting goal probabilities with improved xG models using event sequences in association football

PLoS One. 2024 Oct 30;19(10):e0312278. doi: 10.1371/journal.pone.0312278. eCollection 2024.

Abstract

In association football, predicting the likelihood and outcome of a shot at a goal is useful but challenging. Expected goal (xG) models can be used in a variety of ways including evaluating performance and designing offensive strategies. This study proposed a novel framework that uses the events preceding a shot, to improve the accuracy of the expected goals (xG) metric. A combination of previously explored and unexplored temporal features is utilized in the proposed framework. The new features include; "advancement factor", and "player position column". A random forest model was used, which performed better than published single-event-based models in the literature. Results further demonstrated a significant improvement in model performance with the inclusion of preceding event information. The proposed framework and model enable the discovery of event sequences that improve xG, which include; opportunities built up from the sides of the 18-yard box, shots attempted from in front of the goal within the opposition's 18-yard box, and shots from successful passes to the far post.

MeSH terms

  • Athletic Performance* / physiology
  • Goals
  • Humans
  • Male
  • Probability
  • Soccer*

Grants and funding

This work is part of first Author’s PhD. PhD is funded by Deakin University, Melbourne under Deakin-Coventry cotutelle scholarship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.