4040369

Exploring chance in NCAA basketball

There seems to be an upper limit to predicting the outcome of matches in (semi-)professional sports. Recent work has proposed that this is due to chance and attempts have been made to simulate the distribution of win percentages to identify the most likely proportion of matches decided by chance. We argue that the approach that has been chosen so far makes some simplifying assumptions that cause its result to be of limited practical value. Instead, we propose to use clustering of statistical team profiles and observed scheduling information to derive limits on the predictive accuracy for particular seasons, which can be used to assess the performance of predictive models on those seasons. We show that the resulting simulated distributions are much closer to the observed distributions and give higher assessments of chance and tighter limits on predictive accuracy.
© Copyright 2015 Machine Learning and Data Mining for Sports Analytics ECML/PKDD 2015 workshop. Published by Department of Computer Science, KU Leuven. All rights reserved.

Bibliographic Details
Subjects:
Notations:technical and natural sciences sport games
Published in:Machine Learning and Data Mining for Sports Analytics ECML/PKDD 2015 workshop
Language:English
Published: Leuven Department of Computer Science, KU Leuven 2015
Online Access:https://arxiv.org/pdf/1508.04688v1.pdf
Pages:63-74
Document types:article
Level:advanced