8000 Vance0124 / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Vance0124's full-sized avatar

Block or report Vance0124

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

My learning notes/codes for ML SYS.

Python 2,867 178 Updated Jul 12, 2025

OpenArm: an open-source robotic arm for human manipulation data collection

C 339 38 Updated Jul 11, 2025

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 7,626 688 Updated Jul 13, 2025
JavaScript 166 85 Updated Oct 9, 2023

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 1,016 66 Updated Jan 31, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,354 172 Updated Mar 28, 2025

Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.

Python 163 1 Updated Jul 7, 2025

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Jupyter Notebook 689 112 Updated Mar 28, 2025

Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.

Python 295 43 Updated Jun 23, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,989 226 Updated Jul 12, 2025

ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World

Python 17 Updated Jun 17, 2025

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Python 592 40 Updated May 25, 2025

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

Python 45 1 Updated May 25, 2025

Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation

Python 54 4 Updated Jul 6, 2025

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 247 8 Updated Jul 11, 2025

Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory

Python 275 10 Updated Jun 20, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 1 1 Updated Jun 26, 2025

A version of verl to support tool use

Python 291 25 Updated Jul 12, 2025
Python 11 Updated Jun 30, 2025

procedural reasoning datasets

Python 947 77 Updated Jul 7, 2025
Python 13 Updated May 7, 2025

ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling

Python 452 32 Updated Jul 3, 2025
Python 828 38 Updated Jul 2, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,117 164 Updated Jul 6, 2025

Automatic evals for LLMs

HTML 463 57 Updated Jun 27, 2025

This repository collects awesome survey, resource, and paper for lifelong learning LLM agents

Python 205 15 Updated May 30, 2025
Jupyter Notebook 1,281 84 Updated Nov 27, 2024
Next
0