dongbo811 (TimZ) / Starred · GitHub

TimZ dongbo811

14 followers · 4 following

Achievements

Lists (4)

3DOD

occupanc predication

seg

Tools

Stars

XueZeyue / DanceGRPO

173 3 Updated May 12, 2025

AvaLovelace1 / LegoGPT

Official repository for LegoGPT, the first approach for generating physically stable LEGO brick models from text prompts.

Python 1,053 53 Updated May 17, 2025

leeruibin / RORem

[CVPR2025] RORem: Training a Robust Object Remover with Human-in-the-Loop

Python 39 2 Updated Mar 15, 2025

lzyhha / VisualCloze

VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into official pipelines of diffusers.)

Python 215 10 Updated May 18, 2025

sandyresearch / chipmunk

🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× vs cuBLAS

Cuda 59 2 Updated May 8, 2025

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

761 34 Updated May 18, 2025

tyxsspa / AnyText2

Official implementation code of the paper <AnyText2: Visual Text Generation and Editing With Customizable Attributes>

Python 102 13 Updated Mar 3, 2025

muzishen / IMAGGarment-1

🎨 IMAGGarment-1: Fine-Grained Garment Generation with Controllable Structure, Color, and Logo. It supports precise and customizable garment synthesis guided by multi-conditions (e.g., sketch, colo…

Python 39 2 Updated Apr 22, 2025

song-wensong / insert-anything

Python 354 10 Updated May 14, 2025

SandAI-org / MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python 3,035 163 Updated May 14, 2025

Eureka-Maggie / MIGE

Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing

Python 59 4 Updated Mar 5, 2025

dvlab-research / Seg-Zero

Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"

Python 346 11 Updated May 18, 2025

wei-cheng777 / PS-Diffusion

Official implementations for paper: PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention

Python 9 Updated Apr 21, 2025

ali-vilab / UniAnimate-DiT

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

Python 574 44 Updated Apr 27, 2025

modelscope / ImagePulse

Open Image Curation Tools

Python 26 1 Updated Apr 22, 2025

SkyworkAI / Skywork-OR1

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 569 35 Updated May 14, 2025

THUDM / CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 1,031 74 Updated Mar 29, 2025

PRIS-CV / Omnieraser

Python 52 Updated May 8, 2025

bytedance / UNO

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,043 55 Updated Apr 17, 2025

zhhoper / DPR

Code for Deep Single-image Portrait Image Relighting

Python 556 86 Updated Dec 22, 2022

HiDream-ai / HiDream-I1

Python 2,078 195 Updated Apr 28, 2025

SherryXTChen / Instruct-CLIP

Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning (CVPR 2025)

Python 19 Updated Mar 28, 2025

CodeGoat24 / DreamText

[CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.

Python 62 1 Updated Mar 24, 2025

PicoTrex / GPT-ImgEval

GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities

Python 261 4 Updated May 3, 2025

ModalMinds / MM-EUREKA

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 598 23 Updated May 3, 2025

tanhuajie / Reason-RFT

⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.

Python 137 6 Updated May 7, 2025

yaotingwangofficial / Awesome-MCoT

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

572 15 Updated May 9, 2025

SkyworkAI / SkyReels-A2

SkyReels-A2: Compose anything in video diffusion transformers

Python 512 44 Updated Apr 22, 2025

Alpha-VLLM / Lumina-mGPT-2.0

Lumina-mGPT 2.0: Stand-alone Autoregressive Image Modeling

Python 690 40 Updated May 1, 2025

adobe-research / MagicFixup

Python 155 10 Updated Nov 26, 2024
