- Peking University
- https://qingzhenduyu.github.io/
- https://orcid.org/0009-0000-3982-2739
Stars
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
An Open-source RL System from ByteDance Seed and Tsinghua AIR
Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
Muon optimizer: +>30% sample efficiency with <3% wallclock overhead
This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…
Fully open reproduction of DeepSeek-R1
[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Solve Visual Understanding with Reinforced VLMs
Witness the aha moment of VLM with less than $3.
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
verl: Volcano Engine Reinforcement Learning for LLMs
Mixture-of-Experts for Large Vision-Language Models
Programmer's guide to cooking at home (Simplified Chinese only).
Curated resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied by carefully written, concise descriptions that help readers get the gist as quickly as possible.
Acceptance rates for the major AI conferences
Xiaomi Home Integration for Home Assistant
DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
List of Computer Science courses with video lectures.
Get your documents ready for gen AI
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.