8000 zzh8241102 (Zihan Zhou) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View zzh8241102's full-sized avatar
:shipit:
work
:shipit:
work

Highlights

  • Pro

Block or report zzh8241102

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 239 14 Updated Jun 10, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 659 23 Updated May 27, 2025

🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"

Python 15 2 Updated Feb 9, 2025

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Python 211 20 Updated Apr 10, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

920 43 Updated Jun 18, 2025

Witness the aha moment of VLM with less than $3.

Python 3,780 288 Updated May 19, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,177 315 Updated May 11, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,201 50 Updated Nov 16, 2024

Minimal reproduction of DeepSeek R1-Zero

Python 11,925 1,489 Updated Apr 24, 2025

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

3,145 290 Updated Dec 17, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,756 379 Updated Jun 18, 2025

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"

HTML 9,454 1,474 Updated Apr 15, 2023

Official implementation of paper "Controllable 3D Outdoor Scene Generation via Scene Graphs"

26 Updated Mar 11, 2025

👨‍💻 An awesome and curated list of best code-LLM for research.

1,200 67 Updated Dec 10, 2024

Efficient Triton Kernels for LLM Training

Python 5,248 355 Updated Jun 20, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,104 804 Updated May 15, 2025

Ongoing research training transformer models at scale

Python 12,635 2,857 Updated Jun 19, 2025

Robust recipes to align language models with human and AI preferences

Python 5,232 449 Updated Apr 30, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,614 218 Updated Aug 11, 2024

Train transformer language models with reinforcement learning.

Python 14,290 1,982 Updated Jun 22, 2025

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

2,873 202 Updated Jun 6, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,551 179 Updated Jun 25, 2024
Python 97 7 Updated Mar 20, 2024

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,193 131 Updated Jun 23, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 8,214 708 Updated Jun 23, 2025

Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.

Python 5,019 406 Updated Jun 20, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,993 448 Updated Aug 7, 2024
Next
0