Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Stars
Official PyTorch implementation for "Large Language Diffusion Models"
Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory
High-Resolution Image Synthesis with Latent Diffusion Models
Repository for predictive dual-arm reactive motion planning
Implementing DeepSeek R1's GRPO algorithm from scratch
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
This is the repository for example code from Prof. Boon Thau Loo's Operating System Course
Official repository of Agent Attention (ECCV2024)
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
Official repository for CVPR 2024 highlight paper 4D-DRESS: A 4D Dataset of Real-world Human Clothing with Semantic Annotations.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
Autoregressive policies for continuous control reinforcement learning
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Official implementation of Diffusion Policy Policy Optimization, arxiv 2024