- [2025.4.30] 🔥🔥🔥 We have released our Panoramic Animator model and the inference code for the full HoloTime pipeline. You are welcome to download it from Huggingface and try it out!
We propose HoloTime, a framework that integrates video diffusion models to generate panoramic videos from a single prompt or reference image, along with a 360-degree 4D scene reconstruction method that seamlessly transforms the generated panoramic video into 4D assets, enabling a fully immersive 4D experience for users.
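The pipeline operates on equirectangular panoramas, where each pixel corresponds to a direction on the viewing sphere. As background, here is a minimal sketch of the standard pixel-to-view-direction mapping for such images (our own illustration under the conventional layout, not code from this repository):

```python
import math

def equirect_dir(u: float, v: float, width: int, height: int):
    """Map an equirectangular pixel (u, v) to a unit view direction.

    Assumes the conventional layout: longitude spans [-pi, pi] from left
    to right, latitude spans [pi/2, -pi/2] from top to bottom.
    """
    lon = (u / width - 0.5) * 2.0 * math.pi
    lat = (0.5 - v / height) * math.pi
    # Unit vector: x forward, y up, z right (one common convention).
    return (math.cos(lat) * math.cos(lon),
            math.sin(lat),
            math.cos(lat) * math.sin(lon))
```

The image center maps to the forward direction `(1, 0, 0)`, and every output vector has unit length, which is what makes a panoramic video a full 360-degree observation of the scene.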
Panorama → 4D Scene demos: ocean2.mp4, cyberpunk.mp4, temple.mp4
Panorama → Panoramic Video demos: car.mp4, aurora.mp4, fire.mp4, firework.mp4
```shell
git clone https://github.com/PKU-YuanGroup/HoloTime --recursive
cd HoloTime

conda create -n holotime python=3.10 -y
conda activate holotime
conda install -c nvidia cuda-toolkit=12.4 -y

pip install torch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 --index-url https://download.pytorch.org/whl/cu124
pip install -r requirements.txt
```
After installation, please follow the instructions provided here to modify a few lines in some of the installed libraries.
- The input directory structure should look like:

```
📦 input/
├── 🖼️ panorama1.png
├── 🖼️ panorama2.png
├── 🖼️ panorama3.png
├── ...
└── 📄 text_prompts.txt
```
The txt file contains one text description per line, each line corresponding to one panorama, with the lines ordered by the natural sort order of the PNG filenames. You can use a text-driven panorama generation model (PanFusion or FLUX) to create the input data, or you can use the files we provide.
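"Natural sort order" means `panorama2.png` comes before `panorama10.png`, unlike plain lexicographic sorting. A minimal sketch of pairing prompt lines with panoramas under that ordering (`pair_inputs` is a hypothetical helper, not part of this repository):

```python
import re
from pathlib import Path

def natural_key(name: str):
    # Split into digit and non-digit runs so "panorama10" sorts after "panorama2".
    return [int(t) if t.isdigit() else t.lower() for t in re.split(r"(\d+)", name)]

def pair_inputs(input_dir: str):
    """Pair each PNG (in natural sort order) with its prompt line."""
    root = Path(input_dir)
    pngs = sorted((p.name for p in root.glob("*.png")), key=natural_key)
    prompts = (root / "text_prompts.txt").read_text().splitlines()
    assert len(pngs) == len(prompts), "one prompt line per panorama"
    return list(zip(pngs, prompts))
```

If the pairing looks wrong, check that the prompt lines were written in the same natural order as the filenames.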
- Download the Panoramic Animator model from Huggingface and put the checkpoint in the `checkpoints/holotime` directory. (This step is optional, as the download can happen automatically.)

- Run the following command:

```shell
sh run_animator.sh
```
The Panoramic Animator needs 24 GB of GPU memory. VEnhancer, which performs the optional super-resolution and frame-interpolation pass, needs 80 GB of GPU memory.
After generating the panoramic video, you can transform it into a 4D scene by running the following command:

```shell
sh run_reconstruction.sh
```
Reconstruction from the refined video needs 24 GB of GPU memory; reconstruction from the enhanced video needs 48 GB.
Run the following command:

```shell
sh run_render.sh
```
We provide some preset trajectories here.
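A rendering trajectory is just a sequence of per-frame camera poses inside the reconstructed scene. As an illustration of what a preset might encode, here is a minimal sketch of a small circular orbit around the scene center (`orbit_trajectory` is a hypothetical helper and does not reflect this repository's actual trajectory format):

```python
import math

def orbit_trajectory(num_frames: int = 60, radius: float = 0.3):
    """Camera positions on a small horizontal circle around the origin,
    one pose per frame; each camera looks back at the scene center."""
    poses = []
    for i in range(num_frames):
        theta = 2.0 * math.pi * i / num_frames
        pos = (radius * math.cos(theta), 0.0, radius * math.sin(theta))
        poses.append({"position": pos, "look_at": (0.0, 0.0, 0.0)})
    return poses
```

Keeping the radius small relative to the scene matters for a panoramic reconstruction, since the scene is only observed from (near) a single viewpoint.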
Special thanks to DynamiCrafter, 360DVD, VEnhancer, DreamScene360, and Spacetime Gaussian for their codebases and pre-trained weights.
```bibtex
@misc{zhou2025holotimetamingvideodiffusion,
  title={HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation},
  author={Haiyang Zhou and Wangbo Yu and Jiawen Guan and Xinhua Cheng and Yonghong Tian and Li Yuan},
  year={2025},
  eprint={2504.21650},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2504.21650},
}
```