Toward improving the visual characterization of sport activities with abstracted scene graphs
(Zur Verbesserung der visuellen Charakterisierung von Sportaktivitäten mit abstrahierten Szenengrafiken)
We present techniques for abstracting relevant information from scene graph features to improve action recognition in sports videos. Feature representation with relevant information can dramatically increase machine learning's utility across many tasks. Despite the advantages of incorporating objects and relations as building blocks of semantic information, we still encounter too many irrelevant objects and relations in sports videos, adding uncertainty to the classifiers. This paper describes four fundamentally different scene abstraction techniques, each searching for the relevant information within aggregated features from pixel-level to object-level. In each method, we formulate relevancy through co-occurrence statistics, semantic similarity, feature decomposition, and correlation-based mapping and evaluate each technique's efficacy through performance gains in action recognition and decay rate of training loss. We demonstrate that by creating a relevant and more concise knowledge representation, we improve performance (mAP) of action recognition in sports by 26.6% and achieve faster converging models due to higher representation power.
© Copyright 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE. Alle Rechte vorbehalten.
| Schlagworte: | |
|---|---|
| Notationen: | Spielsportarten Naturwissenschaften und Technik |
| Tagging: | maschinelles Lernen |
| Veröffentlicht in: | IEEE/CVF Conference on Computer Vision and Pattern Recognition |
| Sprache: | Englisch |
| Veröffentlicht: |
2021
|
| Online-Zugang: | https://openaccess.thecvf.com/content/CVPR2021W/CVSports/html/Rahimi_Toward_Improving_the_Visual_Characterization_of_Sport_Activities_With_Abstracted_CVPRW_2021_paper.html |
| Seiten: | 4500-4507 |
| Dokumentenarten: | Artikel |
| Level: | hoch |