Yueeeeeeee

🌏

Working from home

Zhenrui Yueeeeeeee

UIUC's PhD sorcerer in training | Casting LLM spells, navigating RecSys realms, unlocking IR secrets

72 followers · 35 following

UIUC
Champaign, IL
UTC -05:00
yueeeeeeee.github.io
in/zhenrui-yue
@Yueeeeeeee2837

Highlights

Developer Program Member
Pro

Stars

Lightning-AI / forked-pdb

Python pdb for multiple processes

Python 48 6 Updated May 24, 2025

Parallel-Reasoning / APR

Code for Paper: Learning Adaptive Parallel Reasoning with Language Models

Python 98 5 Updated Apr 23, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,227 147 Updated Jun 2, 2025

kaiwenzha / rl-tango

RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning

23 Updated May 27, 2025

Yueeeeeeee / HRPO

Hybrid Latent Reasoning via Reinforcement Learning

Python 76 18 Updated May 27, 2025

allenai / open-instruct

AllenAI's post-training codebase

Python 3,003 400 Updated Jun 10, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,958 1,467 Updated Jun 9, 2025

shangshang-wang / Tina

Tina: Tiny Reasoning Models via LoRA

Python 254 30 Updated May 29, 2025

llm-lab-org / Multimodal-RAG-Survey

A Survey on Multimodal Retrieval-Augmented Generation

213 9 Updated Jun 3, 2025

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 9,560 1,036 Updated Jun 10, 2025

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,619 271 Updated Apr 10, 2025

HansiZeng / scaling-retriever

[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"

Python 15 1 Updated Mar 31, 2025

kanishkg / cognitive-behaviors

Python 184 11 Updated Mar 26, 2025

qixucen / atom

Atom of Thoughts for Markov LLM Test-Time Scaling

Python 574 48 Updated May 28, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,549 183 Updated Jun 6, 2025

uclaml / SPPO

The official implementation of Self-Play Preference Optimization (SPPO)

Python 565 46 Updated Jan 23, 2025

Alibaba-NLP / OmniSearch

Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Python 333 22 Updated Apr 22, 2025

RyanLiu112 / compute-optimal-tts

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

Python 262 21 Updated Feb 19, 2025

microsoft / rStar

Python 564 50 Updated Apr 15, 2025

shawnricecake / Heima

Code for Heima

Python 45 3 Updated Apr 21, 2025

weizhepei / InstructRAG

[ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales

Python 97 6 Updated Feb 6, 2025

natolambert / rlhf-book

Textbook on reinforcement learning from human feedback

TeX 1,003 84 Updated Jun 10, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 11,883 1,489 Updated Apr 24, 2025

Yushi-Hu / VisualSketchpad

Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Jupyter Notebook 227 12 Updated Oct 28, 2024

FreedomIntelligence / HuatuoGPT-o1

Medical o1, Towards medical complex reasoning with LLMs

Python 1,127 112 Updated Jan 20, 2025

sunnynexus / Search-o1

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 910 88 Updated May 13, 2025

facebookresearch / blt

Code for BLT research paper

Python 1,679 141 Updated May 22, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)

Python 7,027 683 Updated Jun 10, 2025

lyogavin / airllm

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 5,788 458 Updated May 6, 2025

AkariAsai / OpenScholar

This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs.

Python 693 70 Updated Apr 13, 2025
