Stars
Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory
A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play.
Open-source code for Neural Internal Model Control (Neural-IMC)
What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study
Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning
Codes accompanying the paper "Bayesian Design Principles for Offline-to-Online Reinforcement Learning" (ICML 2024)
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Easily train a good VC model with voice data <= 10 mins!
The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
gen-robot / openvla
Forked from openvla/openvlaOpenVLA: An open-source vision-language-action model for robotic manipulation.
gen-robot / SimplerEnv
Forked from simpler-env/SimplerEnvEvaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.
Train transformer language models with reinforcement learning.
DelinQu / SimplerEnv-OpenVLA
Forked from simpler-env/SimplerEnvEvaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo, and OpenVLA) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
破解MobaXterm的高级版,生成密钥,支持几乎所有版本。
A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC