Zihan Zhou zzh8241102

work

What I cannot create, I do not understand.

16 followers · 28 following

MSCS@UCSD
La Jolla, CA
in/zihan-zhou-cs

Starred repositories

qiancheng0 / ToolRL

Python 239 14 Updated Jun 10, 2025

ModalMinds / MM-EUREKA

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 659 23 Updated May 27, 2025

visual-haystacks / mirage

🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"

Python 15 2 Updated Feb 9, 2025

TIGER-AI-Lab / ScholarCopilot

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Python 211 20 Updated Apr 10, 2025

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

920 43 Updated Jun 18, 2025

taeukkang / ca-dmv-appointment-bot

Python 5 1 Updated Oct 27, 2024

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,780 288 Updated May 19, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,177 315 Updated May 11, 2025

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,201 50 Updated Nov 16, 2024

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 11,925 1,489 Updated Apr 24, 2025

deepseek-ai / DeepSeek-V3

Python 97,788 15,906 Updated Jun 16, 2025

OpenDriveLab / End-to-end-Autonomous-Driving

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

3,145 290 Updated Dec 17, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,756 379 Updated Jun 18, 2025

chiphuyen / machine-learning-systems-design

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"

HTML 9,454 1,474 Updated Apr 15, 2023

yuhengliu02 / control-3d-scene

Official implementation of paper "Controllable 3D Outdoor Scene Generation via Scene Graphs"

26 Updated Mar 11, 2025

ibm-granite / granite-3.0-language-models

259 24 Updated Dec 4, 2024

huybery / Awesome-Code-LLM

👨‍💻 An awesome and curated list of best code-LLM for research.

1,200 67 Updated Dec 10, 2024

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 5,248 355 Updated Jun 20, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,104 804 Updated May 15, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 12,635 2,857 Updated Jun 19, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,232 449 Updated Apr 30, 2025

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,614 218 Updated Aug 11, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 14,290 1,982 Updated Jun 22, 2025

eosphoros-ai / Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.

2,873 202 Updated Jun 6, 2025

FasterDecoding / Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,551 179 Updated Jun 25, 2024

mutonix / RefGPT

Python 97 7 Updated Mar 20, 2024

modelscope / evalscope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,193 131 Updated Jun 23, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 8,214 708 Updated Jun 23, 2025

QwenLM / Qwen2.5-Coder

Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.

Python 5,019 406 Updated Jun 20, 2025

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,993 448 Updated Aug 7, 2024
