Illness prediction of elite youth soccer players based on blood-based biomarker data and machine learning models
(Krankheitsvorhersage bei Elite-Jugendfußballern auf der Grundlage von blutbasierten Biomarkerdaten und maschinellen Lernmodellen)
INTRODUCTION: Minimizing the risk of illness is of great interest in sport, e.g. soccer, as illnesses can lead to long absences during the season. The risk for developing illness has already been explained in studies by low fitness, high training load, or subjectively reported factors [1, 2]. The aim of our work was to investigate whether an emerging illness in soccer players can be predicted by a comprehensive panel of blood-based biomarkers collected on the days preceding the onset of illness using machine learning models. METHODS: Data were collected from 23 soccer players of an elite European youth soccer team over a three-month period during the 2021/2022 regular season. In addition to 40 blood parameters (covering aspects of e.g., muscle damage, iron status, inflammation and immune response), information on the illness status of the players was regularly collected. To predict illness, five players were randomly selected as the training dataset and the remaining players were used as the test dataset for validation. The training dataset contained 10 illness and the test dataset contained two illness data points. To increase the number of illness cases in the training dataset, we used ADASYN [3] for oversampling. For prediction, we compared three machine learning models: random forest, linear support vector machines and a naïve Bayes classifier. To investigate the important variables for classification, we use the Receiving Operating Curve. RESULTS: Linear supported vector machine achieved the best classification results. With this model one of the two illnesses present in the test data could be explained, which resulted in an accuracy value of 85%, recall of 50% and a Cohens Kappa of 0.10. Analysing the most important variables for classification reveals that illness is best predicted by Eosinophil, Ferritin, Glucose and C-reactive protein. CONCLUSION: Although it is possible to predict illness of soccer players by using blood-based biomarker data and applying a linear support vector machine, the number of wrongly predicted illnesses is high. Nevertheless, when having more illnesses to train a model, machine learning approaches could help to detect illness signs in advance and thus react early if needed.
© Copyright 2023 28th Annual Congress of the European College of Sport Science, 4-7 July 2023, Paris, France. Veröffentlicht von European College of Sport Science. Alle Rechte vorbehalten.
| Schlagworte: | |
|---|---|
| Notationen: | Nachwuchssport Biowissenschaften und Sportmedizin Spielsportarten |
| Tagging: | maschinelles Lernen Biomarker |
| Veröffentlicht in: | 28th Annual Congress of the European College of Sport Science, 4-7 July 2023, Paris, France |
| Sprache: | Englisch |
| Veröffentlicht: |
Paris
European College of Sport Science
2023
|
| Online-Zugang: | https://www.ecss.mobi/DATA/EDSS/C28/28-0760.pdf |
| Dokumentenarten: | Kongressband, Tagungsbericht |
| Level: | hoch |