MSCS@UCSD
- La Jolla, CA
- in/zihan-zhou-cs
Starred repositories
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations
This repository provides a valuable reference for researchers in multimodality. Start your exploration of RL-based reasoning MLLMs here!
Witness the aha moment of VLM with less than $3.
Solve Visual Understanding with Reinforced VLMs
A bibliography and survey of the papers surrounding o1
Minimal reproduction of DeepSeek R1-Zero
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
Official implementation of paper "Controllable 3D Outdoor Scene Generation via Scene Graphs"
👨‍💻 An awesome, curated list of the best code LLMs for research.
Efficient Triton Kernels for LLM Training
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Ongoing research training transformer models at scale
Robust recipes to align language models with human and AI preferences
Reference implementation for DPO (Direct Preference Optimization)
Train transformer language models with reinforcement learning.
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.