Machine learning in rugby union: predicting and identifying key performance indicators for professional rugby union players in match play based workload
(Maschinelles Lernen im Rugby Union: Prognose und Identifizierung zentraler Leistungsindikatoren für professionelle Rugby-Union-Spieler im wettkampfbasierten Belastungskontext)
Rugby union is an intermittent high-intensity contact sport requiring the analysis of various training and match metrics. Time-motion analysis and video analysis have enhanced the understanding of the interplay between these two factors. However, limited studies have investigated the effect of workload on key performance indicators (KPIs) during matches. In this study, data collected from the global positioning system (GPS) were used to calculate cumulative workload values over 7, 14, and 21 days prior to each game. After dimensionality reduction through principal component analysis (PCA), these workload values were employed as features, with game KPIs as target variables. Modeling was conducted using linear regression (LR), support vector regression (SVR), random forest regression (RFR), and light gradient boosting machine (LightGBM) for regression tasks. The superiority of the model was assessed by coefficient of determination (R2), root mean square error (RMSE), and correlation coefficient (R). The findings revealed that although individual GPS metrics exhibited weak correlations with KPIs, machine learning (ML) models particularly RFR, successfully captured complex interactions and nonlinear relationships. These models achieved significantly improved predictive performance, with R2 values ranging from 0.40 to 0.72 for certain KPIs. Using SHapley Additive exPlanations (SHAP) analysis and partial dependence plots, this study enhanced the interpretability of ML models by identifying the influence of GPS features on KPIs and exploring their underlying mechanisms. These findings offer actionable insights for workload management, emphasizing critical factors that affect player performance.
Highlights
- Machine learning improved prediction accuracy compared to single-feature correlation, with random forest performing best overall.
- Key workload features influenced different KPIs. For forwards, training load and heart rate exertion were most important for tackles and carries. For backs, sprinting intensity and deceleration metrics strongly impacted passes, kicks, and receipts.
© Copyright 2025 European Journal of Sport Science. Wiley. Alle Rechte vorbehalten.
| Schlagworte: | |
|---|---|
| Notationen: | Spielsportarten Naturwissenschaften und Technik |
| Tagging: | maschinelles Lernen Monitoring |
| Veröffentlicht in: | European Journal of Sport Science |
| Sprache: | Englisch |
| Veröffentlicht: |
2025
|
| Online-Zugang: | https://doi.org/10.1002/ejsc.70042 |
| Jahrgang: | 25 |
| Heft: | 9 |
| Seiten: | e70042 |
| Dokumentenarten: | Artikel |
| Level: | hoch |