Stars
My learning notes/codes for ML SYS.
OpenArm: an open-source robotic arm for human manipulation data collection
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory
SJTU-IAAR / verl
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
This repository collects awesome survey, resource, and paper for lifelong learning LLM agents