qingzhenduyu

✍️

Jianhua Zhu qingzhenduyu

✍️

26 followers · 34 following

Highlights

Stars

open-compass / VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,324 356 Updated May 9, 2025

ByteDance-Seed / Seed-Thinking-v1.5

741 10 Updated Apr 20, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

1,220 49 Updated May 10, 2025

tianyi-lab / MiP-Overthinking

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

Python 27 1 Updated Apr 10, 2025

KellerJordan / Muon

Muon optimizer: +>30% sample efficiency with <3% wallclock overhead

Python 617 32 Updated Mar 25, 2025

Osilly / Vision-R1

This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…

Python 550 13 Updated May 7, 2025

TapXWorld / ChinaTextbook

所有小初高、大学PDF教材。

Roff 2,320 591 Updated Apr 3, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,347 2,237 Updated May 9, 2025

YangLing0818 / IterComp

[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

Python 184 11 Updated Feb 19, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,316 164 Updated May 9, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 4,893 304 Updated Apr 21, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,642 286 Updated Mar 1, 2025

TideDra / lmm-r1

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 749 46 Updated May 4, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 7,817 887 Updated May 10, 2025

PKU-YuanGroup / MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Python 2,154 133 Updated Dec 3, 2024

Anduin2017 / HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 84,021 9,897 Updated May 10, 2025

tongyx361 / Awesome-LLM-Research

Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers get the gist as quickly as possible.

53 1 Updated Jul 12, 2024