Motion-Prompt-Tracking

PyTorch implementation of "Efficient Motion Prompt Learning for Robust Visual Tracking" (ICML2025)

Paper can be found here.

Introduction

In this paper, we propose a lightweight and plug-and-play motion prompt tracking method (MPT). It can be easily integrated into existing vision-based trackers to build a joint tracking framework leveraging both motion and vision cues, thereby achieving robust tracking through efficient prompt learning. A motion encoder with three different positional encodings is proposed to encode the long-term motion trajectory into the visual embedding space, while a fusion decoder and an adaptive weight mechanism are designed to dynamically fuse visual and motion features.

We integrate our motion module (MPT) into three different trackers with five models in total. Experiments on seven challenging tracking benchmarks demonstrate that the proposed motion module significantly improves the robustness of vision-based trackers, with minimal training costs and negligible speed sacrifice.

We also make a video to show qualitative comparisons on three challenging sequences, which can be found here[pwd:wxct].

Installation the environment and set project paths

The environment installation and training configurations (like project path, pretrained models) are similar to the baseline trackers, e.g., OSTrack, please refer to OSTrack.

Data Preparation

Download the trajectory data of the three training datasets (LaSOT, GOT-10K, TrackingNet) here[pwd:qght], and put them in ./traj_data. It should look like this:

${PROJECT_ROOT}
  -- traj_data
    |-- lasot
    |-- got10k
    |-- tn

Put pretrained baseline models in ./pretrained_models. The baseline model name must match MODEL.PRETRAIN_FILE in your configuration file (./experiments/ostrack/xxx.yaml), e.g.:

MODEL:
  PRETRAIN_FILE: OSTrack_mae256_ep0300.pth.tar

Training and Testing

Please see eval.sh to find the commands for training and testing.

Models and Results

Models and results can be found here[pwd:8j2g].

Acknowledgments

We acknowledge the Segment Anything Model (SAM) for inspiring our prompt tracking methodology. In addition, our work is integrated into three visual trackers, i.e., OSTrack, SeqTrack, and ARTrack. Thanks for these concise and effective SOT frameworks.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
experiments		experiments
lib		lib
tracking		tracking
.DS_Store		.DS_Store
Framework.png		Framework.png
LICENSE		LICENSE
README.md		README.md
Results1.png		Results1.png
Results2.png		Results2.png
eval.sh		eval.sh
install.sh		install.sh
ostrack_cuda113_env.yaml		ostrack_cuda113_env.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Motion-Prompt-Tracking

Introduction

Installation the environment and set project paths

Data Preparation

Training and Testing

Models and Results

Acknowledgments

About

Uh oh!

Releases

Packages

Languages

License

zj5559/Motion-Prompt-Tracking

Folders and files

Latest commit

History

Repository files navigation

Motion-Prompt-Tracking

Introduction

Installation the environment and set project paths

Data Preparation

Training and Testing

Models and Results

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages