Stars
OmniGen2: Exploration to Advanced Multimodal Generation.
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
cjeen / LoRAEdit
Forked from tdrussell/diffusion-pipe
We achieve high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additional reference conditions.
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Official implementation of "Text-Aware Image Restoration with Diffusion Models"
An extension for tracking your activities on myanimelist.net
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers
Any-to-Bokeh is a novel one-step video bokeh framework that converts arbitrary input videos into temporally coherent, depth-aware bokeh effects.
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
SkyReels-A2: Compose anything in video diffusion transformers
In-context subject-driven image generation while preserving foreground fidelity
PixelHacker: Image Inpainting with Structural and Semantic Consistency
Robust Speech Recognition via Large-Scale Weak Supervision
colinurbs / FramePack-Studio
Forked from lllyasviel/FramePack
Expanding FramePack into a multifunction video creation tool
Self-hosted game stream host for Moonlight.
Quantized Attention achieves speedups of 2-5x and 3-11x compared to FlashAttention and xformers, respectively, without losing end-to-end metrics across language, image, and video models.
Let's make video diffusion practical!
Translate a video from one language to another and add dubbing. Also supports speech recognition transcription, speech synthesis, and subtitle translation.
Translate manga/images: one-click translation of text in all kinds of images. https://cotrans.touhou.ai/
Browser extension and cross-platform desktop application for select-to-translate translation based on the ChatGPT API.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
A High-Quality Real Time Upscaler for Anime Video
Welcome to TheAnimeScripter – the ultimate tool for video upscaling, interpolation, and much more. Available as a CLI, GUI, and Adobe extension.