Project Page • Arxiv Paper • HuggingFace Demo • FAQ • Citation
| Teaser Video | Demo Video |
| --- | --- |
| teaser_video.mp4 | demo_video.mp4 |
MotionGPT is a unified and user-friendly motion-language model that learns the semantic coupling of the two modalities and generates high-quality motions and text descriptions for multiple motion tasks.
Technical details
Although pre-trained large language models continue to advance, the exploration of building a unified model for language and other multi-modal data, such as motion, remains challenging and largely untouched. Fortunately, human motion displays a semantic coupling akin to human language, often perceived as a form of body language. By fusing language data with large-scale motion models, motion-language pre-training that enhances the performance of motion-related tasks becomes feasible. Driven by this insight, we propose MotionGPT, a unified, versatile, and user-friendly motion-language model to handle multiple motion-relevant tasks. Specifically, we employ discrete vector quantization for human motion and transfer 3D motion into motion tokens, similar to the generation process of word tokens. Building upon this "motion vocabulary", we perform language modeling on both motion and text in a unified manner, treating human motion as a specific language. Moreover, inspired by prompt learning, we pre-train MotionGPT with a mixture of motion-language data and fine-tune it on prompt-based question-and-answer tasks. Extensive experiments demonstrate that MotionGPT achieves state-of-the-art performance on multiple motion tasks, including text-driven motion generation, motion captioning, motion prediction, and motion in-between.
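As a rough illustration of the tokenization step described above, the sketch below shows how per-frame motion features could be mapped to discrete codebook indices and then treated like word tokens. The function names, shapes, and prompt format are assumptions for illustration, not the actual MotionGPT code.

```python
# Illustrative sketch only: names, shapes, and the prompt format are assumptions,
# not the actual MotionGPT implementation.
import torch

def quantize_motion(features: torch.Tensor, codebook: torch.Tensor) -> torch.Tensor:
    """Map per-frame motion features to discrete codebook indices ("motion tokens").

    features: (nframe, dim) encoder output for one motion clip
    codebook: (codebook_size, dim) learned VQ codebook
    """
    # Standard vector quantization: nearest-neighbour lookup in the codebook.
    distances = torch.cdist(features, codebook)  # (nframe, codebook_size)
    return distances.argmin(dim=-1)              # (nframe,) integer token ids

# The resulting motion tokens can then be interleaved with ordinary text tokens and
# modeled by a single language model, e.g. a prompt such as
#   "Generate a motion matching the text: <caption>" -> "<motion_12> <motion_407> ..."
```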
- [2023/09/22] MotionGPT got accepted by NeurIPS 2023!
- [2023/09/11] Released the HuggingFace demo 🔥🔥🔥
- [2023/09/09] Released the training code of MotionGPT V1.0 🔥🔥🔥
- [2023/06/20] Uploaded the paper and initialized the project
Setup and download
conda create python=3.10 --name mgpt
conda activate mgpt
Install the packages in requirements.txt and install PyTorch 2.0:
pip install -r requirements.txt
python -m spacy download en_core_web_sm
We test our code on Python 3.10.6 and PyTorch 2.0.0.
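Optionally, you can sanity-check the environment before downloading the data. The snippet below is a hedged helper, not part of the repository:

```python
# Optional environment check (not part of the repo): verifies that PyTorch and the
# spaCy model installed above are importable.
import torch
import spacy

print(torch.__version__)                    # expect a 2.0.x build
nlp = spacy.load("en_core_web_sm")          # raises OSError if the model is missing
print(nlp("MotionGPT sanity check.").text)
```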
Run the scripts to download the dependency materials:
bash prepare/download_smpl_model.sh
bash prepare/prepare_t5.sh
For Text-to-Motion Evaluation:
bash prepare/download_t2m_evaluators.sh
Run the script to download the pre-trained models:
bash prepare/download_pretrained_models.sh
Visit the Google Drive to download the above dependencies.
Visit Hugging Face to download the pretrained models.
Batch demo
We support txt file input; the output motions are saved as npy files and the output texts as txt files. Please check configs/assets.yaml for the path configuration; TEST.FOLDER specifies the output folder.
Then, run the following script:
python demo.py --cfg ./configs/config_h3d_stage3.yaml --example ./demos/t2m.txt
Some parameters:
- --example=./demos/t2m.txt: input file of text prompts
- --task=t2m: evaluation task, one of t2m, m2t, pred, inbetween
The outputs:
- npy file: the generated motions with the shape of (nframe, 22, 3)
- txt file: the input text prompt or text output
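To inspect the batch-demo outputs, a minimal sketch like the following can be used; the file names below are hypothetical and depend on your prompts and the TEST.FOLDER setting in configs/assets.yaml.

```python
# Hedged example of loading the demo outputs; the paths are placeholders.
import numpy as np

joints = np.load("results/t2m/sample_0.npy")   # a generated motion
print(joints.shape)                            # expected (nframe, 22, 3) joint positions

with open("results/t2m/sample_0.txt") as f:    # the input prompt or generated text
    print(f.read())
```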
Training guidance
- Please refer to HumanML3D for text-to-motion dataset setup.
- Put the instruction data in prepare/instructions into the same folder as the HumanML3D dataset.
Please first check the parameters in configs/config_h3d_stage1.yaml, e.g. NAME and DEBUG.
Then, run the following command:
python -m train --cfg configs/config_h3d_stage1.yaml --nodebug
Please update the parameters in configs/config_h3d_stage2.yaml, e.g. NAME, DEBUG, and PRETRAINED_VAE (change it to the path of your latest checkpoint from the previous step).
Then, run the following command to store all motion tokens of the training set for convenience:
python -m scripts.get_motion_code --cfg configs/config_h3d_stage2.yaml
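If you want to verify the extracted motion tokens, a quick check like the sketch below should show one-dimensional arrays of integer codebook indices; the output directory is an assumption, so consult the stage-2 config for where scripts.get_motion_code actually writes them.

```python
# Illustrative check only; the token directory below is a guess, see the stage-2
# config for the actual output location of scripts.get_motion_code.
import numpy as np

tokens = np.load("datasets/humanml3d/motion_tokens/000001.npy")  # hypothetical path
print(tokens.dtype, tokens.shape)  # expected: a 1-D array of integer codebook indices
```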