-
The University of Maryland, College Park
- College Park
- https://sukritipaul5.github.io/
Stars
An extremely fast Python package and project manager, written in Rust.
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?
HyMPS will be a platform-indipendent software suite for advanced audio/video contents production.
Mumbai slum segmentation and change detection on statellite images.
Variational Autoencoder (VAE) with perception loss implementation in pytorch
[CVPR 2025 Oral & Best Paper Award Candidate] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
This is a repo to track the latest autoregressive visual generation papers.
Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"
[ICLR 2025] HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
[TMLR 2025🔥] A survey for the autoregressive models in vision.
A suite of image and video neural tokenizers
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
SEED-Voken: A Series of Powerful Visual Tokenizers
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
CUDA accelerated rasterization of gaussian splatting
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
🏠[ECCV 2024] GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]
[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacch…
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
A general fine-tuning kit geared toward diffusion models.
Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"
A Data Streaming Library for Efficient Neural Network Training
A PyTorch native platform for training generative AI models