-
minimind-v Public
Forked from jingyaogong/minimind-v🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
Python Apache License 2.0 UpdatedApr 27, 2025 -
LLMs-Zero-to-Hero Public
Forked from bbruceyuan/LLMs-Zero-to-HeroJupyter Notebook Apache License 2.0 UpdatedFeb 22, 2025 -
docling Public
Forked from docling-project/doclingGet your documents ready for gen AI
Python MIT License UpdatedJan 27, 2025 -
-
LLaSA_training Public
Forked from zhenye234/LLaSA_trainingLLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesis
Python Other UpdatedJan 25, 2025 -
memo Public
Forked from memoavatar/memoMemory-Guided Diffusion for Expressive Talking Video Generation
Python Apache License 2.0 UpdatedJan 24, 2025 -
minimind Public
Forked from jingyaogong/minimind🚀🚀 「大模型」3小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 3 hours!
Python Apache License 2.0 UpdatedDec 13, 2024 -
GLM-4-Voice Public
Forked from THUDM/GLM-4-VoiceGLM-4-Voice | 端到端中英语音对话模型
Python Apache License 2.0 UpdatedOct 25, 2024 -
mini-omni2 Public
Forked from gpt-omni/mini-omni2Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
Python MIT License UpdatedOct 18, 2024 -
RealtimeSTT Public
Forked from KoljaB/RealtimeSTTA robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Python MIT License UpdatedOct 15, 2024 -
swarm Public
Forked from openai/swarmEducational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Python MIT License UpdatedOct 15, 2024 -
KnowStreaming Public
Forked from didi/KnowStreaming一站式云原生实时流数据平台,通过0侵入、插件化构建企业级Kafka服务,极大降低操作、存储和管理实时流数据门槛
Java GNU Affero General Public License v3.0 UpdatedOct 12, 2024 -
video-subtitle-extractor Public
Forked from YaoFANGUK/video-subtitle-extractor视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
Python Apache License 2.0 UpdatedOct 9, 2024 -
SafeEar Public
Forked from LetterLiGo/SafeEarThe Official Code Repo of SafeEar (Accepted by CCS 2024)
Python Other UpdatedOct 1, 2024 -
langchain Public
Forked from langchain-ai/langchain🦜🔗 Build context-aware reasoning applications
Jupyter Notebook MIT License UpdatedSep 30, 2024 -
libks Public
Forked from signalwire/libksFoundational support for signalwire C products
C Other UpdatedSep 26, 2024 -
FireRedTTS Public
Forked from FireRedTeam/FireRedTTSPython Mozilla Public License 2.0 UpdatedSep 20, 2024 -
SenseVoice-OneApi Public
Forked from LuckLittleBoy/SenseVoice-OneApi基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi
Python UpdatedSep 5, 2024 -
-
Deep-Live-Cam Public
Forked from hacksider/Deep-Live-Camreal time face swap and one-click video deepfake with only a single image (uncensored)
Python GNU Affero General Public License v3.0 UpdatedAug 11, 2024 -
supervoice-vall-e-2 Public
Forked from ex3ndr/supervoice-vall-e-2VALL-E 2 reproduction
Jupyter Notebook UpdatedJul 14, 2024 -
stable-diffusion Public
Forked from CompVis/stable-diffusionA latent text-to-image diffusion model
Jupyter Notebook Other UpdatedJun 18, 2024 -
-
DeepFilterNet Public
Forked from Rikorose/DeepFilterNetNoise supression using deep filtering
Python Other UpdatedMay 30, 2024 -
Quantization-Tutorials Public
Forked from OscarSavolainen/Quantization-TutorialsA bunch of coding tutorials for my Youtube videos on Neural Network Quantization.
-
VAF_2 Public
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
-
dict Public
Forked from kajweb/dict英语字典 英语词库 字典词库 四级单词 六级单词 考研单词 雅思 托福 SAT GMAT TOEFL GRE
Python UpdatedApr 5, 2024 -
audio-preprocess Public
Forked from fishaudio/audio-preprocessPreprocess Audio for training
Python Apache License 2.0 UpdatedApr 1, 2024 -
vall-e Public
Forked from lifeiteng/vall-ePyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Python Apache License 2.0 UpdatedMar 31, 2024 -