- PhD, University of Trento & FBK
- https://yhlleo.github.io/
Stars
official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"
[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4ā¦
FlashMLA: Efficient MLA decoding kernels
A very simple GRPO implement for reproducing r1-like LLM thinking.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Official inference repo for FLUX.1 models
The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchersš„
Official implementation of OneDiffusion paper (CVPR 2025)
[NeurIPS 2024 Best Paper Award][GPT beats diffusionš„] [scaling laws in visual generationš] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". Aā¦
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Official code base of "Perception-Oriented Video Frame Interpolation via Asymmetric Blending" (CVPR 2024), also denoted as ''PerVFI''.
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editinā¦
Official repo for consistency models.
Metric depth estimation from a single image
Neural Light Transport for Relighting and View Synthesis
A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using either Automatic or ComfyUI as a backend.
Stable Diffusion web UI
An official implementation of MobileStyleGAN in PyTorch
A latent text-to-image diffusion model
Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.net/forum?id=3jooF27-0Wy