8000 zhw-zhang (littlewei) / Starred Β· GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View zhw-zhang's full-sized avatar

Block or report zhw-zhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2025] Official lmplementation of SPM-Diff: Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On

Python 21 3 Updated Mar 3, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 889 66 Updated May 15, 2025

[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,176 46 Updated Apr 20, 2025

Official implementation of Inductive Moment Matching

Python 467 11 Updated Mar 12, 2025

Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"

Python 346 16 Updated Apr 22, 2025

This is a repo to track the latest autoregressive visual generation papers.

322 5 Updated May 13, 2025

[CVPR 2025] Official Implementation of MotionPro: A Precise Motion Controller for Image-to-Video Generation

Python 60 8 Updated Apr 11, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,038 164 Updated May 14, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 2,354 262 Updated May 16, 2025

Lets make video diffusion practical!

Python 13,261 1,137 Updated May 4, 2025

πŸš€πŸš€πŸš€A curated list of papers on controllable video generation.

235 19 Updated Apr 22, 2025

SkyReels-A2: Compose anything in video diffusion transformers

Python 512 44 Updated Apr 22, 2025

STeP: a general and scalable framework for solving video inverse problems with spatiotemporal diffusion priors

Python 20 1 Updated Apr 15, 2025
Python 484 28 Updated Apr 29, 2025
Python 2,080 197 Updated Apr 28, 2025

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Python 1,257 78 Updated Apr 26, 2025

SpatialLM: Large Language Model for Spatial Understanding

Python 3,174 245 Updated Mar 28, 2025

Multimodal Models in Real World

Jupyter Notebook 506 21 Updated Feb 24, 2025

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

Python 2,282 193 Updated Apr 11, 2025

Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 1,795 87 Updated May 15, 2025

[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"

Python 363 20 Updated Apr 8, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,415 1,294 Updated May 17, 2025

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Python 107 8 Updated Apr 27, 2025

[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,839 299 Updated Feb 19, 2025

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 404 18 Updated Mar 17, 2025

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,788 442 Updated Mar 18, 2025

πŸš€ Cross attention map tools for huggingface/diffusers

Python 283 21 Updated Jan 18, 2025
Next
0