Stars
RSNN is a repository for academic research on memorization in spiking neural networks.
This repository provides guidelines and best practices for starting a new deep learning project.
Enjoy the magic of Diffusion models!
🎓 Daily updates of talking-face research papers, now integrated with LLM analysis.
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
Official implementation of "A Noise is Worth Diffusion Guidance"; code and weights will be available soon.
[CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
Official implementation of OneDiffusion paper (CVPR 2025)
[CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training sc…
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
AIHawk aims to ease the job hunt by automating the job application process. Using artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Install PyTorch distributions with computation backend auto-detection
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
[CVPR 2024] On the Content Bias in Fréchet Video Distance
The official PyTorch implementation of ElasticDiffusion: Training-free Arbitrary Size Image Generation through Global-Local Content Separation (CVPR 2024)
The most reliable AI agent framework that supports MCP.
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Open-Sora: Democratizing Efficient Video Production for All
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Codebase for evaluation of deep generative models as presented in "Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models"
📖 A curated list of resources dedicated to talking face.
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
PyTorch code for Group Orthogonalization regularization
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)