-
University of Macau & Shanghai AI Lab
- Macau & Shanghai
-
12:45
(UTC +08:00) - https://yczhou001.github.io/
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
OmniGen2: Exploration to Advanced Multimodal Generation.
Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
Demo of a customer service use case implemented with the OpenAI Agents SDK
Official repository for VisionZip (CVPR 2025)
Official repo and evaluation implementation of VSI-Bench
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"
SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks
This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies
MAM: ModularMulti-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration
Must-read papers on Repository-level Code Generation & Issue Resolution π₯
This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search".
Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation
The official code repository for the FullFront benchmark
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache).
paper list, tutorial, and nano code snippet for Diffusion Large Language Models.
Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
A lightweight, powerful framework for multi-agent workflows
Official Repository of "Learning to Reason under Off-Policy Guidance"
Align Anything: Training All-modality Model with Feedback
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning
[ICCV 2025] π₯π₯ UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond