Imputation of non-participated race results

(Imputation von nicht teilgenommenen Rennergebnissen)

Most current solutions in cycling analytics focus on one specific race or participant, while a sports-wide system could render huge benefits of scale, by automating certain processes. The development of such a system is, however, heavily inflicted by the large number of non-participations as most riders do not compete in all races. Therefore, value imputation is required. Most popular value imputation techniques are developed for cases where part of the data is fully observed, which is not the case for cycling race results. While some methods are adapted to situations without complete cases, this is not the case for the cross-sectional imputation algorithm suggested by multiple previous studies (i.e., KNN imputation). We therefore suggest an adaptation to the KNN imputation algorithm which uses expert knowledge on race similarity in order to facilitate the deployment of the algorithm in situations without complete cases. The method is shown to be the most performant predictive model and does this within a competitive computation time.
© Copyright 2022 Machine Learning and Data Mining for Sports Analytics. MLSA 2021. Communications in Computer and Information Science. Veröffentlicht von Springer. Alle Rechte vorbehalten.

Bibliographische Detailangaben
Schlagworte:
Notationen:Naturwissenschaften und Technik Ausdauersportarten
Tagging:Algorithmus
Veröffentlicht in:Machine Learning and Data Mining for Sports Analytics. MLSA 2021. Communications in Computer and Information Science
Sprache:Englisch
Veröffentlicht: Cham Springer 2022
Schriftenreihe:Communications in Computer and Information Science, 1571
Online-Zugang:https://doi.org/10.1007/978-3-031-02044-5_13
Seiten:155-166
Dokumentenarten:Artikel
Level:hoch