8000 zhilizju (LI ZHI) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View zhilizju's full-sized avatar
  • Zhejiang University
  • Zhejiang, HangZhou

Block or report zhilizju

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Python 47 3 Updated May 22, 2025

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 122 16 Updated Jun 19, 2025

This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…

Python 608 13 Updated Jun 13, 2025

ReasonFlux Series - Open-Sourced LLM Family for Reasoning, Coding, Reward Modeling and Data Selection

Python 408 31 Updated Jun 9, 2025
Python 297 18 Updated May 31, 2025

A flexible and efficient training framework for large-scale alignment tasks

Python 383 32 Updated Jun 19, 2025

Simple RL training for reasoning

Python 3,634 271 Updated Apr 10, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,115 1,487 Updated Jun 13, 2025

s1: Simple test-time scaling

Python 6,453 749 Updated May 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,693 1,576 Updated Jun 19, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,355 160 Updated Mar 20, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,604 6,438 Updated Jun 19, 2025

Zhejiang University Graduation Thesis LaTeX Template

TeX 3,063 671 Updated Jan 6, 2025
Python 24 Updated Jun 5, 2024
Python 520 46 Updated Nov 20, 2024

code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning

Python 16 Updated Jul 16, 2024

Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

Python 84 8 Updated Jun 28, 2024

Production-ready data processing made easy and shareable

Python 352 27 Updated May 28, 2024

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,341 268 Updated Jun 19, 2025

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 634 45 Updated Feb 14, 2025

[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Python 119 13 Updated Apr 22, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,641 312 Updated Jun 19, 2025

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,421 732 Updated Jun 7, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,830 2,520 Updated Aug 12, 2024
Python 3,884 250 Updated Mar 15, 2024

Diffusion Model-Based Image Editing: A Survey (TPAMI 2025)

633 40 Updated Mar 23, 2025

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 56,743 12,036 Updated Jun 15, 2025

MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning

Python 135 5 Updated Jun 20, 2023
Next
0