8000 Sg11100 (Zonglin Zhao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Sg11100's full-sized avatar

Highlights

  • Pro

Block or report Sg11100

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DUSt3R: Geometric 3D Vision Made Easy

Python 6,493 690 Updated Jul 1, 2025

🚀 [ICLR 2025] Pytorch implementation of 'Fast Feedforward 3D Gaussian Splatting Compression'

Python 174 12 Updated May 31, 2025

The code for the paper "Reducing the Memory Footprint of 3D Gaussian Splatting"

Python 189 14 Updated Aug 20, 2024
Jupyter Notebook 18 Updated Jun 30, 2025

Official inference repo for FLUX.1 models

Python 23,511 1,696 Updated Jul 1, 2025

Personalize Anything for Free with Diffusion Transformer

Jupyter Notebook 334 9 Updated Mar 20, 2025

[ICCV 2025] OminiControl: Minimal and Universal Control for Diffusion Transformer

Python 1,687 119 Updated Jul 3, 2025

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,158 71 Updated Apr 17, 2025

DreamO: A Unified Framework for Image Customization

Python 1,631 123 Updated Jul 4, 2025

Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".

Python 524 43 Updated Jul 10, 2025

Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function" [NeurIPS 2024]

Jupyter Notebook 27 1 Updated Dec 2, 2024

[NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis

Python 73 5 Updated Feb 3, 2025

[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Python 706 41 Updated Jan 10, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,313 324 Updated Jun 26, 2025

📈 目前最大的工业缺陷检测数据库及论文集 Constantly summarizing open source dataset and critical papers in the field of surface defect research which are of great importance.

Python 3,670 576 Updated May 27, 2024

Two neural network models built based on ConvNeXT and DenseNet, respectively, for the BIRADS six-class classification and feature recognition tasks, along with the data processing and training code

Python 3 1 Updated Jun 2, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,314 521 Updated May 18, 2025

Matplotlib中文教程,在线阅读地址:https://datawhalechina.github.io/fantastic-matplotlib/

Python 497 108 Updated Jul 31, 2022

Code release for SLIP Self-supervision meets Language-Image Pre-training

Python 773 71 Updated Feb 9, 2023

《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。

Jupyter Notebook 3,768 418 Updated Jan 27, 2025
Python 1 Updated Mar 24, 2025

AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)

Python 42 1 Updated Mar 1, 2025

An open source implementation of CLIP.

Python 12,176 1,129 Updated Jun 10, 2025

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,905 1,292 Updated Jul 23, 2024

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,171 52 Updated Jun 18, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,462 2,242 Updated Feb 1, 2025

[ICLR 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,596 68 Updated Jul 16, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,574 663 Updated Jul 16, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,033 116 Updated Jul 29, 2024

GPT Meet Zotero.

TypeScript 6,376 265 Updated Jul 8, 2025
Next
0