Football action recognition using hierarchical LSTM

We present a hierarchical recurrent network for understanding team sports activity in image and location sequences. In the hierarchical model, we integrate proposed multiple person-centered features over a temporal sequence based on LSTM's outputs. To achieve this scheme, we introduce the Keeping state in LSTM as one of externally controllable states, and extend the Hierarchical LSTMs to include mechanism for the integration. Experimental results demonstrate effectiveness of the proposed framework involving hierarchical LSTM and person-centered feature. In this study, we demonstrate improvement over the reference model. Specifically, by incorporating the person-centered feature with meta-information (e.g., location data) in our proposed late fusion framework, we also demonstrate increased discriminability of action categories and enhanced robustness against fluctuation in the number of observed players.
© Copyright 2017 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE. Published by IEEE. All rights reserved.

Bibliographic Details
Subjects:
Notations:technical and natural sciences sport games
Tagging:deep learning
Published in:IEEE/CVF Conference on Computer Vision and Pattern Recognition
Language:English
Published: Honolulu IEEE 2017
Online Access:https://doi.org/10.1109/CVPRW.2017.25
Pages:155-163
Document types:article
Level:advanced