A generative approach to frame-level multi-competitor races

Multi-competitor races often feature complicated within-race strategies that are difficult to capture when training models on race-outcome-level data. Models that do not account for race-level strategy may suffer from confounded inferences and predictions. We develop a generative model for multi-competitor races that explicitly models race-level effects such as drafting and separates strategy from competitor ability. The model allows one to simulate full races from any real or constructed starting position, opening new avenues for attributing value to within-race actions and performing counterfactual analyses. The methodology is sufficiently general to apply to any track-based multi-competitor race for which tracking data are available and competitor movement is well described by simultaneous forward and lateral movements. We apply it to one-mile horse races using frame-level tracking data provided by the New York Racing Association (NYRA) and the New York Thoroughbred Horsemen's Association (NYTHA) for the Big Data Derby 2022 Kaggle Competition. We demonstrate how the model can yield new inferences, such as estimates of horse-specific speed profiles, and give examples of posterior predictive counterfactual simulations that answer questions of interest, such as how starting lane affects race outcomes.
© Copyright 2024 Journal of Quantitative Analysis in Sports. de Gruyter. All rights reserved.

Bibliographic Details
Subjects:
Notations: training science
Published in: Journal of Quantitative Analysis in Sports
Language: English
Published: 2024
Online Access: https://doi.org/10.1515/jqas-2023-0091
Volume: 20
Issue: 4
Pages: 365-383
Document types: article
Level: advanced