Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Jupyter Notebook 689 112 Updated Mar 28, 2025

Physical-Intelligence / openpi

Python 3,959 474 Updated Jul 10, 2025

SakanaAI / RLT

Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.

Python 295 43 Updated Jun 23, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,989 226 Updated Jul 12, 2025

niuzaisheng / ScreenExplorer

ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World

Python 17 Updated Jun 17, 2025

RUCAIBox / R1-Searcher

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Python 592 40 Updated May 25, 2025

RUCAIBox / R1-Searcher-plus

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

Python 45 1 Updated May 25, 2025

ustcwhy / BitVLA

Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation

Python 54 4 Updated Jul 6, 2025

PRIME-RL / Entropy-Mechanism-of-RL

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 247 8 Updated Jul 11, 2025

PRIME-RL / SimpleVLA-RL

Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory

Python 275 10 Updated Jun 20, 2025

SJTU-IAAR / verl

Forked from volcengine/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 1 1 Updated Jun 26, 2025

TIGER-AI-Lab / verl-tool

A version of verl to support tool use

Python 291 25 Updated Jul 12, 2025

Zanette-Labs / speed-rl

Python 11 Updated Jun 30, 2025

open-thought / reasoning-gym

procedural reasoning datasets

Python 947 77 Updated Jul 7, 2025

QingyangZhang / Label-Free-RLVR

239 5 Updated Jul 6, 2025

RLHFlow / GVM

Python 13 Updated May 7, 2025

Gen-Verse / ReasonFlux

ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling

Python 452 32 Updated Jul 3, 2025

huggingface / Math-Verify

Python 828 38 Updated Jul 2, 2025

RAGEN-AI / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,117 164 Updated Jul 6, 2025

mlfoundations / evalchemy

Automatic evals for LLMs

HTML 463 57 Updated Jun 27, 2025

qianlima-lab / awesome-lifelong-llm-agent

This repository collects awesome survey, resource, and paper for lifelong learning LLM agents

Python 205 15 Updated May 30, 2025

ByteDance-Seed / Seed-Thinking-v1.5

800 17 Updated Jun 9, 2025

google-deepmind / open_x_embodiment

Jupyter Notebook 1,281 84 Updated Nov 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vance0124

Achievements

Achievements

Block or report Vance0124

Stars

zhaochenyang20 / Awesome-ML-SYS-Tutorial

enactic / openarm

bytedance / trae-agent

jidiai / ai_lib

allenzren / open-pi-zero

VITA-MLLM / VITA

GuanxingLu / vlarl

simpler-env / SimplerEnv