8000 Martinser (Ge Wu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Martinser's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Martinser

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[NeurIPS 2024 spotlight] Offical implementation of MSFA and release of SARDet_100K dataset for Large-Scale Synthetic Aperture Radar (SAR) Object Detection

Python 565 32 Updated May 6, 2025

Offical implementation of "SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection"

Python 217 6 Updated Feb 2, 2025

(IJCV2024 & ICCV2023) LSKNet: A Foundation Lightweight Backbone for Remote Sensing

Python 589 47 Updated Feb 10, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

617 32 Updated Jun 27, 2025

Resources and paper list for "Image Generation with Thinking", particular focus on the utilizing of reinforcement learning.

13 1 Updated Jul 14, 2025

An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.

Python 347 9 Updated Jul 8, 2025

OmniGen2: Exploration to Advanced Multimodal Generation.

Jupyter Notebook 3,385 264 Updated Jul 5, 2025

[ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥

Python 585 26 Updated Jun 26, 2025

DDT: Decoupled Diffusion Transformer

DB87 Python 264 15 Updated Jul 3, 2025
Python 55 6 Updated Jul 10, 2025

🚀🚀🚀A curated list of papers on controllable video generation.

299 22 Updated Jul 8, 2025

Liquid: Language Models are Scalable and Unified Multi-modal Generators

Python 601 34 Updated Apr 8, 2025

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 636 20 Updated Jul 1, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,924 130 Updated Oct 30, 2024

A Collection of Papers on Diffusion Language Models

90 Updated Jul 4, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,191 55 Updated Jun 13, 2025
Python 1,274 49 Updated Jul 11, 2025

Awesome Unified Multimodal Models

454 11 Updated Jul 2, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,450 2,243 Updated Feb 1, 2025

Open-source unified multimodal model

Python 4,550 384 Updated Jul 2, 2025

This is a repo to track the latest autoregressive visual generation papers.

369 5 Updated Jun 25, 2025

[CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow

Python 130 1 Updated Apr 5, 2025

USP: Unified Self-Supervised Pretraining for Image Generation and Understanding

Python 72 Updated Jun 30, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,239 781 Updated Dec 17, 2024

Collections of Papers and Projects for Multimodal Reasoning.

105 9 Updated Apr 25, 2025

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 908 55 Updated Mar 12, 2024

PyTorch Implementation of "LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding"

19 1 Updated Mar 7, 2025

[ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"

Python 116 7 Updated Jul 5, 2025

The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". MMFuser addresses the limitations of current MLLMs in captur…

Python 56 4 Updated Nov 5, 2024
Python 9 Updated Feb 6, 2025
Next
0