Stars
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.
A summary of related works about flow matching, stochastic interpolants
[ACM MM 2024] Frame Interpolation with Consecutive Brownian Bridge Diffusion Model
LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨
[CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation)
[ECCV-2024] This is the official implementation of ZeST.
MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation
Unofficial Implementation of Animate Anyone
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Consistent Subject Generation via Contrastive Instantiated Concepts
[ACL 2024] Official code for "IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation" (TheatreMaker)
A beautiful, simple, clean, and responsive Jekyll theme for academics
An up-to-date list of works on Multi-Task Learning
vickywu1022 / NLPer-Arsenal
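Forked from TingFree/NLPer-Arsenal
A collection of NLP competition strategy implementations, baselines for various tasks, and competition write-ups (current, past, and practice competitions), plus NLP conference dates, commonly followed media accounts, GPU recommendations, and more; continuously updated.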
[CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Dramatron uses large language models to generate coherent scripts and screenplays.
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
Deep Learning-based Image Fusion: A Survey
Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation"
[CVPR 2024] Official implementation for "Equivariant Multi-Modality Image Fusion."