TY - JOUR
T1 - Bayesian prediction of winning times for elite swimming events
AU - Wu, Paul Pao-Yen
AU - Garufi, Lawrence
AU - Drovandi, Christopher
AU - Mengersen, Kerrie
AU - Mitchell, Lachlan J G
AU - Osborne, Mark A
AU - Pyne, David B
N1 - Funding Information:
This research was conducted by the Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers (project number CE140100049) and funded by the Australian Government.
Publisher Copyright:
© 2021 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2022
Y1 - 2022
N2 - To develop a statistical model of winning times for international swimming events with the aim of predicting winning time distributions and the probability of winning for the 2020 and 2024 Olympic Games. The data set included first and third place times from all individual swimming events from the Olympics and World Championships from 1990 to 2019. We compared different model formulations fitted with Bayesian inference to obtain predictive distributions; comparisons were based on mean percentage error in out-of-sample predictions of Olympics and World Championships winning swim times from 2011 to 2019. The Bayesian time series regression model, comprising auto-regressive and moving average terms and other predictors, had the smallest mean prediction error of 0.57% (CI 0.46-0.74%). For context, using the respective previous Olympics or World Championships winning time resulted in a mean prediction error of 0.70% (CI 0.59-0.82%). The Olympics were on average 0.5% (CI 0.3-0.7%) faster than World Championships over the study period. The model computes the posterior predictive distribution, which allows coaches and athletes to evaluate the probability of winning given an individual's swim time, and the probability of being faster or slower than the previous winning time or even the world record.
AB - To develop a statistical model of winning times for international swimming events with the aim of predicting winning time distributions and the probability of winning for the 2020 and 2024 Olympic Games. The data set included first and third place times from all individual swimming events from the Olympics and World Championships from 1990 to 2019. We compared different model formulations fitted with Bayesian inference to obtain predictive distributions; comparisons were based on mean percentage error in out-of-sample predictions of Olympics and World Championships winning swim times from 2011 to 2019. The Bayesian time series regression model, comprising auto-regressive and moving average terms and other predictors, had the smallest mean prediction error of 0.57% (CI 0.46-0.74%). For context, using the respective previous Olympics or World Championships winning time resulted in a mean prediction error of 0.70% (CI 0.59-0.82%). The Olympics were on average 0.5% (CI 0.3-0.7%) faster than World Championships over the study period. The model computes the posterior predictive distribution, which allows coaches and athletes to evaluate the probability of winning given an individual's swim time, and the probability of being faster or slower than the previous winning time or even the world record.
KW - Data
KW - Olympics
KW - performance analysis
KW - quantitative analysis
KW - statistics
UR - http://www.scopus.com/inward/record.url?scp=85116242123&partnerID=8YFLogxK
U2 - 10.1080/02640414.2021.1976485
DO - 10.1080/02640414.2021.1976485
M3 - Article
C2 - 34544331
SN - 0264-0414
VL - 40
SP - 1
EP - 8
JO - Journal of Sports Sciences
JF - Journal of Sports Sciences
IS - 1
ER -