Lists (1)
Sort Name ascending (A-Z)
Stars
🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Vector (and Scalar) Quantization, in Pytorch
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
XeLaTeX template for writing thesis aiming to apply degrees from Xidian University.
Open-Sora: Democratizing Efficient Video Production for All
[CVPR 2022] Pre-Training 3D Point Cloud Transformers with Masked Point Modeling
Official GitHub repo for VecKM. A very efficient and descriptive local geometry encoder / point tokenizer / patch embedder. ICML2024.
[ECCV-2024] DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition
[ECCV2022] Masked Autoencoders for Point Cloud Self-supervised Learning
[NeurIPS 2023] PointGPT: Auto-regressively Generative Pre-training from Point Clouds
Code for the paper "PointAttN: You Only Need Attention for Point Cloud Completion"
[CVPR'24 Highlight] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.
The source code for "LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction"
Official repository for CVPR 2023 paper "Event-Based Frame Interpolation with Ad-hoc Deblurring"
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling
PyTorch implementation of E-Motion: Future Motion Simulation via Event Sequence Diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning