Contrastive learning for sports video: Unsupervised player classification

We address the problem of unsupervised classification of players in a team sport according to their team affiliation, when jersey colours and design are not known a priori. We adopt a contrastive learning approach in which an embedding network learns to maximize the distance between representations of players on different teams relative to players on the same team, in a purely unsupervised fashion, without any labelled data. We evaluate the approach using a new hockey dataset and find that it outperforms prior unsupervised approaches by a substantial margin, particularly for real-time application when only a small number of frames are available for unsupervised learning before team assignments must be made. Remarkably, we show that our contrastive method achieves 94% accuracy after unsupervised training on only a single frame, with accuracy rising to 97% within 500 frames (17 seconds of game time). We further demonstrate how accurate team classification allows accurate team-conditional heat maps of player positioning to be computed.
© Copyright 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE. All rights reserved.

Bibliographic Details
Subjects:
Notations:sport games technical and natural sciences
Tagging:maschinelles Lernen
Published in:IEEE/CVF Conference on Computer Vision and Pattern Recognition
Language:English
Published: 2021
Online Access:https://openaccess.thecvf.com/content/CVPR2021W/CVSports/html/Koshkina_Contrastive_Learning_for_Sports_Video_Unsupervised_Player_Classification_CVPRW_2021_paper.html
Pages:4528-4536
Document types:article
Level:advanced