8000 GitHub - sophiahhuang/Cosmos: Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
forked from NVIDIA/Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.

License

Notifications You must be signed in to change notification settings

sophiahhuang/Cosmos

 
 

Repository files navigation

NVIDIA Cosmos Header

Cosmos World Foundation Model Family
Cosmos-Predict1 Code Models Paper
Cosmos-Transfer1 Code Models Paper
Cosmos-Reason1 Coming soon Coming soon Paper

NVIDIA Cosmos is a developer-first world foundation model platform designed to help Physical AI developers build their Physical AI systems better and faster. Cosmos contains

  1. Pre-trained models (available via Hugging Face) under the NVIDIA Open Model License that allows free commercial use.
  2. Pre-training, post-training, and inference code (available in native PyTorch) under the Apache 2 License.

There are three main model families in Cosmos World Foundation Model Platform.

  1. Cosmos Predict: a collection of general-purpose world models for future state prediction.

  2. Cosmos Transfer: a collection of multimodal conditional world generation model for various domain transfer applications such as Sim2Real.

  3. Cosmos Reason: a collection of Physical AI reasoning models for planning and critics.

Being a minimalist, we have these individual models in individual repositories under nvidia-github.

Example Model Behavior

Cosmos-Predict Text2World

predict1_text2world.mp4

Cosmos-Predict Video2World

predict1_video2world.mp4

Cosmos-Transfer LiDAR + HDMap Conditional Inputs -> World

transfer1_lidarhdmap.mp4

Cosmos-Transfer Multimodal Conditional Inputs -> World

transfer1_multimodal.mp4

Cosmos-Reason Physical AI Planning

reason1_example1.mp4

Cosmos Publication

Paper Title Summary Authors Date
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Introduces a reasoning model for physical AI that combines common sense knowledge with embodied reasoning capabilities. NVIDIA 2025-03-19
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control Presents a multimodal model for conditional world generation with adaptive control mechanisms. NVIDIA 2025-03-18
Cosmos World Foundation Model Platform for Physical AI Overview of the Cosmos platform, its architecture, and applications in physical AI systems. Introduction of Cosmos-Predict1 world models. NVIDIA 2025-01-06

Developer

For native PyTorch developers, we provide native PyTorch training and inference scripts in nvidia-github. For Nemo developers, please refer to README_CES2025.md.

About

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 86.0%
  • Python 14.0%
0