Contrastive learning for sports video: Unsupervised player classification
We address the problem of unsupervised classification of players in a team sport according to their team affiliation, when jersey colours and design are not known a priori. We adopt a contrastive learning approach in which an embedding network learns to maximize the distance between representations of players on different teams relative to players on the same team, in a purely unsupervised fashion, without any labelled data. We evaluate the approach using a new hockey dataset and find that it outperforms prior unsupervised approaches by a substantial margin, particularly for real-time application when only a small number of frames are available for unsupervised learning before team assignments must be made. Remarkably, we show that our contrastive method achieves 94% accuracy after unsupervised training on only a single frame, with accuracy rising to 97% within 500 frames (17 seconds of game time). We further demonstrate how accurate team classification allows accurate team-conditional heat maps of player positioning to be computed.
© Copyright 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE. All rights reserved.
| Subjects: | |
|---|---|
| Notations: | sport games technical and natural sciences |
| Tagging: | maschinelles Lernen |
| Published in: | IEEE/CVF Conference on Computer Vision and Pattern Recognition |
| Language: | English |
| Published: |
2021
|
| Online Access: | https://openaccess.thecvf.com/content/CVPR2021W/CVSports/html/Koshkina_Contrastive_Learning_for_Sports_Video_Unsupervised_Player_Classification_CVPRW_2021_paper.html |
| Pages: | 4528-4536 |
| Document types: | article |
| Level: | advanced |