Synthesising 2D videos from 3D data: enlarging sparse 2D video datasets for machine learning applications

This study outlines a technique to repurpose widely available high resolution three-dimensional (3D) motion capture data for training a machine learning model to estimate the ground reaction forces from two-dimensional (2D) pose estimation keypoints. Keypoints describe anatomically related landmarks in 2D image coordinates. The landmarks can be calculated from 3D motion capture data and projected to different image planes, serving to synthesise a near-infinite number of 2D camera views. This highly efficient method of synthesising 2D camera views can be used to enlarge sparse 2D video databases of sporting movements. We show the feasibility of this approach using a sidestepping dataset and evaluate the optimal camera number and location required to estimate 3D ground reaction forces. The method presented and the additional insights gained from this approach can be used to optimise corporeal data capture by sports practitioners.
© Copyright 2022 ISBS Proceedings Archive (Michigan). Northern Michigan University. Published by International Society of Biomechanics in Sports. All rights reserved.

Bibliographic Details
Subjects:
Notations:technical and natural sciences
Tagging:2D maschinelles Lernen
Published in:ISBS Proceedings Archive (Michigan)
Language:English
Published: Liverpool International Society of Biomechanics in Sports 2022
Online Access:https://commons.nmu.edu/isbs/vol40/iss1/121/
Volume:40
Issue:1
Pages:Article 121
Document types:congress proceedings
Level:advanced