8000 Sg11100 (Zonglin Zhao) / Starred · GitHub

More Web Proxy on the site http://driver.im/

Sg11100

Follow

Zonglin Zhao Sg11100

Follow

3 followers · 2 following

Highlights

Pro

Stars

naver / dust3r

DUSt3R: Geometric 3D Vision Made Easy

Python 6,493 690 Updated Jul 1, 2025

YihangChen-ee / FCGS

🚀 [ICLR 2025] Pytorch implementation of 'Fast Feedforward 3D Gaussian Splatting Compression'

Python 174 12 Updated May 31, 2025

graphdeco-inria / reduced-3dgs

The code for the paper "Reducing the Memory Footprint of 3D Gaussian Splatting"

Python 189 14 Updated Aug 20, 2024

XavierCHEN34 / UniReal

Jupyter Notebook 18 Updated Jun 30, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 23,511 1,696 Updated Jul 1, 2025

fenghora / personalize-anything

Personalize Anything for Free with Diffusion Transformer

Jupyter Notebook 334 9 Updated Mar 20, 2025

Yuanshi9815 / OminiControl

[ICCV 2025] OminiControl: Minimal and Universal Control for Diffusion Transformer

Python 1,687 119 Updated Jul 3, 2025

bytedance / UNO

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,158 71 Updated Apr 17, 2025

bytedance / DreamO

DreamO: A Unified Framework for Image Customization

Python 1,631 123 Updated Jul 4, 2025

bytedance / XVerse

Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".

Python 524 43 Updated Jul 10, 2025

I2-Multimedia-Lab / Magnet

Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function" [NeurIPS 2024]

Jupyter Notebook 27 1 Updated Dec 2, 2024

hutaiHang / ToMe

[NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis

Python 73 5 Updated Feb 3, 2025

mit-han-lab / fastcomposer

[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Python 706 41 Updated Jan 10, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,313 324 Updated Jun 26, 2025

Charmve / Surface-Defect-Detection

📈 目前最大的工业缺陷检测数据库及论文集 Constantly summarizing open source dataset and critical papers in the field of surface defect research which are of great importance.

Python 3,670 576 Updated May 27, 2024

Episoode / Breast-Cancer-Ultrasound-Classification

Two neural network models built based on ConvNeXT and DenseNet, respectively, for the BIRADS six-class classification and feature recognition tasks, along with the data processing and training code

Python 3 1 Updated Jun 2, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,314 521 Updated May 18, 2025

datawhalechina / fantastic-matplotlib

Matplotlib中文教程，在线阅读地址：https://datawhalechina.github.io/fantastic-matplotlib/

Python 497 108 Updated Jul 31, 2022

facebookresearch / SLIP

Code release for SLIP Self-supervision meets Language-Image Pre-training

Python 773 71 Updated Feb 9, 2023

TingsongYu / PyTorch-Tutorial-2nd

《Pytorch实用教程》（第二版）无论是零基础入门，还是CV、NLP、LLM项目应用，或是进阶工程化部署落地，在这里都有。相信在本书的帮助下，读者将能够轻松掌握 PyTorch 的使用，成为一名优秀的深度学习工程师。

Jupyter Notebook 3,768 418 Updated Jan 27, 2025

Sg11100 / Open_CLIP-LLaVA

Python 1 Updated Mar 24, 2025

sarahESL / AlignCLIP

AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)

Python 42 1 Updated Mar 1, 2025

mlfoundations / open_clip

An open source implementation of CLIP.

Python 12,176 1,129 Updated Jun 10, 2025

facebookresearch / mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,905 1,292 Updated Jul 23, 2024

lucidrains / transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,171 52 Updated Jun 18, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,462 2,242 Updated Feb 1, 2025

showlab / Show-o

[ICLR 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,596 68 Updated Jul 16, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,574 663 Updated Jul 16, 2025

facebookresearch / chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,033 116 Updated Jul 29, 2024

MuiseDestiny / zotero-gpt

GPT Meet Zotero.

TypeScript 6,376 265 Updated Jul 8, 2025

0