8000 UbeCc (Haoran Wang) · GitHub

More Web Proxy on the site http://driver.im/

UbeCc

Follow

Haoran Wang UbeCc

Follow

I am not a beast of burden. I am a LLaMA! 不是牛马是拉马（我不是奶龙） (Junior@Tsinghua University)

53 followers · 77 following

Tsinghua University
Beijing, China
12:13 (UTC +08:00)
ubecwang@gmail.com
@UbecWang

Achievements

Achievements

Highlights

Pro

Organizations

Pinned Loading

OpenRLHF/OpenRLHF OpenRLHF/OpenRLHF Public

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6.7k 656
volcengine/verl volcengine/verl Public

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8.1k 963
THUDM/SWE-Dev THUDM/SWE-Dev Public

[ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.

Python 26
Generalization-of-Transformers Generalization-of-Transformers Public

[ICLR'25] Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Python 3

0