Exploring competition performance in decathlon using semi-parametric latent variable models

In this paper, we explore competition performance in decathlon based on competition, training and personal data. Our data set comprises 3103 competition results from the decathlon world's best performance lists from 1998 to 2009. The aim of our analysis is to estimate latent factors describing the performance results and—at the same time—to model effects of age, season, and year of the competition on the results. Thus, we apply a new statistical method, semi-parametric latent variable models (LVMs), which can be seen as a synthesis between classical factor analysis and semi-parametric regression. LVMs are especially well-suited for modeling decathlon data, because (i) they permit the assumption of latent factors and therefore take the correlation structure between the ten performance results into account, and (ii) they enable us to model (potentially non-linear) relationships between response variables and covariates—contrary to classical factor analysis. In our analysis, we apply LVMs with a semi-parametric predictor allowing for non-linear covariate effects on the latent factors. Thereby, we obtain well interpretable results: four latent factors standing for sprint, jumping, throwing, and endurance abilities, as well as interesting nonlinear effects of age and season on these latent factors. We also compare our results from LVMs to those obtained from classical factor analysis.
© Copyright 2011 Journal of Quantitative Analysis in Sports. de Gruyter. All rights reserved.

Bibliographic Details
Subjects:
Notations:strength and speed sports training science technical and natural sciences
Published in:Journal of Quantitative Analysis in Sports
Language:English
Published: 2011
Online Access:http://www.degruyter.com/view/j/jqas.2011.7.issue-4/1559-0410.1307/1559-0410.1307.xml?format=INT
Volume:7
Issue:4
Pages:Art. 6
Document types:article
Level:advanced