📄 Paper 🔗 Website ▶︎ Video 📊 Data
Unsupervised self-rehabilitation exercises and physical training can cause serious injuries if performed incorrectly. We introduce a learning-based framework that identifies the mistakes made by a user and proposes corrective measures for easier and safer individual training. Our framework does not rely on hard-coded, heuristic rules. Instead, it learns them from data, which facilitates its adaptation to specific user needs. To this end, we use a Graph Convolutional Network (GCN) architecture acting on the user's pose sequence to model the relationship between the body joints' trajectories. To evaluate our approach, we introduce a dataset with 3 different physical exercises. Our approach yields 90.9% mistake identification accuracy and successfully corrects 94.2% of the mistakes.
Figure 1: GIF of our results. The red poses correspond to incorrectly performed exercises, while the green poses correspond to our corrections.
Figure 2: Our framework consists of a classification branch and a correction branch. The two branches share several graph convolutional layers and then split, such that the classification branch identifies the type of mistake made by the user and the correction branch outputs a corrected pose sequence. The result of the classification branch is fed to the correction branch via a feedback module.
Our framework for providing exercise feedback relies on GCNs, which can learn to exploit the relationships between the trajectories of individual joints. The overall model consists of two branches: the classification branch, which predicts whether the input motion is correct or incorrect and, in the latter case, identifies the mistake being made; and the correction branch, which outputs a corrected 3D pose sequence, providing detailed feedback to the user. A “feedback module” feeds the action labels predicted by the classification branch to the correction branch. This explicitly provides label information to the correction branch, which further improves the accuracy of the corrected motion.
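The data flow described above can be sketched in a few lines of NumPy. This is only an illustrative forward pass under stated assumptions, not the implementation in this repository: the layer sizes, the single shared layer, the dense learnable adjacency matrices, the use of class probabilities as the feedback signal, and all names (`forward`, `graph_conv`, `params`, and so on) are hypothetical, and the weights are random rather than trained.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration.
NUM_NODES = 25     # skeleton nodes
IN_FEATURES = 16   # length of each node's trajectory descriptor (assumed)
HIDDEN = 32        # hidden feature width (assumed)
NUM_CLASSES = 4    # number of action/mistake labels (assumed)


def init(*shape):
    """Small random weights standing in for learned parameters."""
    return rng.normal(scale=0.1, size=shape)


params = {
    "A_shared": init(NUM_NODES, NUM_NODES),
    "W_shared": init(IN_FEATURES, HIDDEN),
    "W_cls": init(NUM_NODES * HIDDEN, NUM_CLASSES),
    "A_corr": init(NUM_NODES, NUM_NODES),
    "W_corr": init(HIDDEN + NUM_CLASSES, IN_FEATURES),
}


def graph_conv(x, adjacency, weight):
    """Graph convolution: mix information across nodes, then across features."""
    return adjacency @ x @ weight


def softmax(z):
    """Numerically stable softmax over a 1D vector."""
    e = np.exp(z - z.max())
    return e / e.sum()


def forward(x, p):
    """Run one motion (nodes x features) through both branches."""
    # Shared graph convolutional layer (a single layer here for brevity).
    h = np.tanh(graph_conv(x, p["A_shared"], p["W_shared"]))   # (nodes, hidden)

    # Classification branch: flatten the node features and predict label probabilities.
    probs = softmax(h.reshape(-1) @ p["W_cls"])                # (classes,)

    # Feedback module: append the predicted label information to every node's features.
    feedback = np.tile(probs, (x.shape[0], 1))                 # (nodes, classes)
    h_fb = np.concatenate([h, feedback], axis=1)               # (nodes, hidden + classes)

    # Correction branch: map the label-aware features back to the input feature size.
    corrected = graph_conv(h_fb, p["A_corr"], p["W_corr"])     # (nodes, features)
    return probs, corrected


x = rng.normal(size=(NUM_NODES, IN_FEATURES))
probs, corrected = forward(x, params)
print(probs.shape, corrected.shape)  # (4,) (25, 16)
```

Concatenating the label information to every node is one simple way to condition the correction branch on the classifier's output; other conditioning schemes would fit the same description.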
Examples of acquired images for each action, subject, and camera, and the computation of the 3D ground-truth poses.
You can access the Exercise Correction in 3D (EC3D) dataset here!
Description of the data:
- data_3D.pickle is the data source used in this paper. It contains the labels and 3D coordinates of all action sequences. The coordinates have shape (29789, 3, 25), where 29789 is the number of frames extracted from all action sequences, 3 is the number of coordinates (x, y, z), and 25 is the number of skeleton nodes.
- data.pickle contains all the raw data, including the camera parameters and the 2D and 3D coordinates of each skeleton node for each frame.
- If you wish to reproduce the results of this project, you may directly use data_3D.pickle.
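Before using the files, it can help to check what they contain. The sketch below loads a pickle file and prints a short summary of its top-level structure. It deliberately makes no assumption about the key names or container types inside data_3D.pickle; `load_pickle` and `summarize` are hypothetical helper names, not part of this repository. Unpickling may also require the libraries used to create the file to be installed.

```python
import pickle


def load_pickle(path):
    """Load a pickle file. Only unpickle files from a source you trust."""
    with open(path, "rb") as f:
        return pickle.load(f)


def summarize(obj, indent=0, max_depth=2):
    """Return a short, indented description of a (possibly nested) object."""
    pad = " " * indent
    # Array-like objects (e.g. NumPy arrays, DataFrames) expose a `shape` attribute.
    shape = getattr(obj, "shape", None)
    if shape is not None:
        return f"{pad}{type(obj).__name__} shape={tuple(shape)}"
    # Descend into dictionaries until max_depth is exhausted.
    if isinstance(obj, dict) and max_depth > 0:
        lines = [f"{pad}dict with {len(obj)} keys"]
        for key, value in obj.items():
            lines.append(f"{pad}  {key!r}:")
            lines.append(summarize(value, indent + 4, max_depth - 1))
        return "\n".join(lines)
    # Other containers are reported by type and length only.
    if isinstance(obj, (list, tuple, dict)):
        return f"{pad}{type(obj).__name__} of length {len(obj)}"
    return f"{pad}{type(obj).__name__}"
```

Running `print(summarize(load_pickle("data_3D.pickle")))` from the folder that contains the file should show where the labels and the (29789, 3, 25) coordinates are stored.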
- CUDA==10.1.168
- numpy==1.21.5
- pandas==1.3.5
- python==3.7.12
- tensorboard==2.8.0
- torch==1.4.0
- torch_dct==0.1.5
- tqdm==4.63.1
3D_Pose_Based_Feedback_for_Physical_Exercises has been implemented and tested on Ubuntu 18.04 with Python >= 3.7. Our model is trained on a GPU. If you don't have a suitable device, try running on Google Colab.
Clone the repo:
git clone https://github.com/Jacoo-Zhao/3D-Pose-Based-Feedback-For-Physical-Exercises.git
Install the requirements using virtualenv or conda:
# pip
source scripts/install_pip.sh
# conda
source scripts/install_conda.sh
Start TensorBoard (--logdir is your local log folder):
tensorboard --logdir='Running_logs' --port=8001 --bind_all
Change to the root directory and run:
python main.py --ckpt='check_point' --hidden=256 --epoch=200
Then select the model version and wait for the data to be loaded or generated. Finally, choose whether to train the model or use the pre-trained weights. The results are saved automatically to the checkpoint folder.
This repository holds the code for the following paper:
3D-Pose-Based-Feedback-For-Physical-Exercises. ACCV, 2022.
If you find our work useful, please cite it as:
@inproceedings{zhao2022exercise,
author = {Zhao, Ziyi and Kiciroglu, Sena and Vinzant, Hugues and Cheng, Yuan and Katircioglu, Isinsu and Salzmann, Mathieu and Fua, Pascal},
booktitle = {ACCV},
title = {3D Pose Based Feedback for Physical Exercises},
year = {2022}
}
- Some of our data processing code for NTU RGB+D was adapted/ported from SGN by Microsoft.
- Our theoretical framework draws on the work in Learning Trajectory Dependencies for Human Motion Prediction by Wei Mao et al.