Starred repositories
OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.
Project Page of Paper "Drive in Corridors: Enhancing the Safety of End-to-end Autonomous Driving via Corridor Learning and Planning"
Democratizing Reinforcement Learning for LLMs
Integrate the DeepSeek API into popular software
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
Official implementation of Continuous 3D Perception Model with Persistent State
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
[CVPR 2025] StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
3DGS-LM accelerates Gaussian-Splatting optimization by replacing the ADAM optimizer with Levenberg-Marquardt.
[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
[NeurIPS 2024] DeMo: Decoupling Motion Forecasting into Directional Intentions and Dynamic States
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.