Three-stream 3D/1D CNN for fine-grained action classification and segmentation in table tennis

Martin, P.-E. , Benois-Pineau, J. , Péteri, R. , Morlier, J.

4073439

Source

This paper proposes a method for fusing modalities extracted from video through a three-stream network with spatio-temporal and temporal convolutions for fine-grained action classification in sport. It is applied to the TTStroke-21 dataset, which consists of untrimmed videos of table tennis games. The goal is to detect and classify table tennis strokes in the videos, the first step of a larger scheme that aims to give players feedback for improving their performance. The three modalities are raw RGB data, the computed optical flow, and the estimated pose of the player. The network consists of three branches with attention blocks. Features are fused at the last stage of the network using bilinear layers. Compared to previous approaches, the use of three modalities allows faster convergence and better performance on both tasks: classification of strokes with known temporal boundaries, and joint segmentation and classification. The pose is also investigated further in order to offer richer feedback to the athletes.
© Copyright 2021 ACM International Conference Proceeding Series. Published by Association for Computing Machinery. All rights reserved.
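
The late fusion with bilinear layers mentioned in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the record gives no feature sizes or fusion order, so the pairwise order (RGB with optical flow, then the result with pose), the feature dimensions, the 21 output scores (suggested only by the dataset name), and the helper names `bilinear`, `init_bilinear` and `fuse_three_streams` are all assumptions made for illustration. Each bilinear layer computes y[k] = sum over i, j of x[i] * W[k][i][j] * z[j], plus b[k], the same form used by standard bilinear layers in deep learning libraries.

```python
import random


def bilinear(x, z, weight, bias):
    """Bilinear layer: y[k] = sum_i sum_j x[i] * weight[k][i][j] * z[j] + bias[k]."""
    return [
        sum(
            x[i] * weight[k][i][j] * z[j]
            for i in range(len(x))
            for j in range(len(z))
        )
        + bias[k]
        for k in range(len(weight))
    ]


def init_bilinear(in1, in2, out, rng):
    """Random (weight, bias) pair for a bilinear layer (hypothetical helper)."""
    weight = [
        [[rng.uniform(-0.1, 0.1) for _ in range(in2)] for _ in range(in1)]
        for _ in range(out)
    ]
    bias = [0.0] * out
    return weight, bias


def fuse_three_streams(rgb, flow, pose, params):
    """Late fusion of three per-stream feature vectors.

    Assumption: RGB and optical-flow features are fused first, and the
    result is then fused with the pose features. The paper may differ.
    """
    rgb_flow = bilinear(rgb, flow, *params["rgb_flow"])
    return bilinear(rgb_flow, pose, *params["fused_pose"])


# Illustrative sizes only: 8 features per stream, 21 output scores.
rng = random.Random(0)
FEATURES, HIDDEN, NUM_CLASSES = 8, 16, 21
params = {
    "rgb_flow": init_bilinear(FEATURES, FEATURES, HIDDEN, rng),
    "fused_pose": init_bilinear(HIDDEN, FEATURES, NUM_CLASSES, rng),
}

# Stand-ins for the features produced by the three branches.
rgb_feat = [rng.random() for _ in range(FEATURES)]
flow_feat = [rng.random() for _ in range(FEATURES)]
pose_feat = [rng.random() for _ in range(FEATURES)]

scores = fuse_three_streams(rgb_feat, flow_feat, pose_feat, params)
predicted_class = max(range(len(scores)), key=scores.__getitem__)
```

Unlike concatenation followed by a linear layer, a bilinear layer multiplies every feature of one stream with every feature of the other, so the fused representation can capture pairwise interactions between modalities at the cost of more parameters.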

Bibliographic Details
Subjects:	table tennis; measuring and information system; mathematic-logical model; measuring procedure; technology; movement co-ordination; video; motion capturing; USA; sport games; technical and natural sciences
Notations:	sport games; technical and natural sciences
Tagging:	deep learning; neural networks; machine learning
Published in:	ACM International Conference Proceeding Series
Language:	English
Published:	New York: Association for Computing Machinery, 2021
Online Access:	https://dl.acm.org/doi/10.1145/3475722.3482793
Pages:	35-41
Document types:	article
Level:	advanced
