Stars
Pixel-Level Reasoning Model trained with RL
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
HoPE: Hybrid of Position Embedding for Length Generalization in Vision-Language Models
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
A powerful tool for creating fine-tuning datasets for LLMs
Hackable and optimized Transformers building blocks, supporting a composable construction.
[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, realistic, and adaptive scene generation for applications in…
verl: Volcano Engine Reinforcement Learning for LLMs
Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Witness the aha moment of VLM with less than $3.
A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄), available to read online at: https://datawhalechina.github.io/easy-rl/
🚀🚀 Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏
🍒 Cherry Studio is a desktop client that supports multiple LLM providers.
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
[ICML 2025 Oral] An official implementation of VideoRoPE: What Makes for Good Video Rotary Position Embedding?
Fully open data curation for reasoning models
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance
[EMNLP 2024 🔥] Video-LLaVA: Learning United Visual Representation by Alignment Before Projection