dongbo811 (TimZ) / Starred · GitHub

Official repository for LegoGPT, the first approach for generating physically stable LEGO brick models from text prompts.

Python 1,053 53 Updated May 17, 2025

[CVPR2025] RORem: Training a Robust Object Remover with Human-in-the-Loop

Python 39 2 Updated Mar 15, 2025

VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into official pipelines of diffusers.)

Python 215 10 Updated May 18, 2025

🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× vs cuBLAS

Cuda 59 2 Updated May 8, 2025

This repository provides a valuable reference for researchers in the field of multimodality; please start your exploration of RL-based Reasoning MLLMs!

761 34 Updated May 18, 2025

Official implementation code of the paper <AnyText2: Visual Text Generation and Editing With Customizable Attributes>

Python 102 13 Updated Mar 3, 2025

🎨 IMAGGarment-1: Fine-Grained Garment Generation with Controllable Structure, Color, and Logo. It supports precise and customizable garment synthesis guided by multi-conditions (e.g., sketch, colo…

Python 39 2 Updated Apr 22, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,035 163 Updated May 14, 2025

Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing

Python 59 4 Updated Mar 5, 2025

Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"

Python 346 11 Updated May 18, 2025

Official implementations for paper: PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention

Python 9 Updated Apr 21, 2025

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

Python 574 44 Updated Apr 27, 2025

Open Image Curation Tools

Python 26 1 Updated Apr 22, 2025

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 569 35 Updated May 14, 2025

CogView4, CogView3-Plus and CogView3 (ECCV 2024)

Python 1,031 74 Updated Mar 29, 2025
Python 52 Updated May 8, 2025

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,043 55 Updated Apr 17, 2025

Code for Deep Single-image Portrait Image Relighting

Python 556 86 Updated Dec 22, 2022
Python 2,078 195 Updated Apr 28, 2025

Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning (CVPR 2025)

Python 19 Updated Mar 28, 2025

[CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.

Python 62 1 Updated Mar 24, 2025

GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities

Python 261 4 Updated May 3, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 598 23 Updated May 3, 2025

⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.

Python 137 6 Updated May 7, 2025

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

572 15 Updated May 9, 2025

SkyReels-A2: Compose anything in video diffusion transformers

Python 512 44 Updated Apr 22, 2025

Lumina-mGPT 2.0: Stand-alone Autoregressive Image Modeling

Python 690 40 Updated May 1, 2025
Python 155 10 Updated Nov 26, 2024