Stars
WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket-FLV/HTTP-TS/HTTP-fMP4/WebSocket-TS/WebSocket-fMP4/GB28181/SRT server and client framework based on C++11
🔥 作者:常炎隆(Author: ChangYanlong):HEVC/H.265 网页直播/点播播放器 支持硬解! 支持H.265的HttpFLV/HLS/MP4/TS/FLV/M3U8/Websocket播放。 🔥 A HEVC/H.265 Web Player, Support hard-decoding! for LIVE/VOD stream. Support H.265 Code…
MAGI-1: Autoregressive Video Generation at Scale
[AAAI 2025] Event-Enhanced Blurry Video Super-Resolution
A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GRPC services and as a standalone library, providing highly ef…
Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference
A Rust library integrated with ONNXRuntime, providing a collection of Computer Vison and Vision-Language models.
Simple no frills AWS S3 Golang Library using REST with V4 Signing (without AWS Go SDK)
Analyze videos using LLMs, Computer Vision and Automatic Speech Recognition
FastVideo is a unified framework for accelerated video generation.
This is the official implementation of our Señorita-2M [Weights and Dataset] : A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists
Create narrated video story from book chapter using NLP, OpenAI and StableDiffusion.
A tool which is uses to remove Windows Defender in Windows 8.x, Windows 10 (every version) and Windows 11.
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.
cross-platform (Qt), open-source (GPLv3) video editor
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。
Dockerized API for converting HTML to Markdown using ReaderLM-v2, powered by llama.cpp
orderedmap is a golang map where the keys keep the order that they're added. It can be de/serialized from/to JSON. It's based closely on the python collections.OrderedDict.
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
AigcPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。