bookong22 / Starred · GitHub

A tool for extracting plain text from Wikipedia dumps

Python 3,869 983 Updated May 23, 2024
Python 1,433 184 Updated Feb 11, 2024

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,147 97 Updated Mar 2, 2025

Convert Chinese characters to pinyin (pypinyin)

Python 5,077 625 Updated Mar 30, 2025

Jieba Chinese word segmentation

Python 34,144 6,733 Updated Aug 21, 2024

A Conversational Speech Generation Model

Python 13,475 1,296 Updated May 27, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,591 297 Updated Jun 8, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,746 241 Updated Jun 3, 2025

A locally deployable AI speech toolbox | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion, etc.

Python 782 69 Updated Jun 7, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,099 240 Updated May 28, 2025

A python package to build AI-powered real-time audio applications

Python 1,320 103 Updated Feb 12, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,415 614 Updated May 27, 2025

A curated collection of digital human resources

879 104 Updated Jan 8, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,998 895 Updated May 21, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 22,368 1,880 Updated Mar 26, 2025

Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message se…

TypeScript 26,315 4,619 Updated Jun 8, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,136 721 Updated May 27, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 6,266 716 Updated Jun 6, 2025

Share a single keyboard and mouse between multiple computers.

C++ 17,925 4,038 Updated Jun 8, 2025

Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community

MDX 4,485 650 Updated Jan 14, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 9,307 1,591 Updated Jun 8, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 79,183 8,745 Updated Jun 8, 2025

NLP Datasets for Indonesian

Python 116 13 Updated Feb 11, 2023

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,956 364 Updated Jan 7, 2025

Text Normalization & Inverse Text Normalization

Python 592 81 Updated Nov 11, 2024

Noise suppression using deep filtering

Python 3,106 290 Updated Oct 17, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,897 229 Updated May 23, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,345 2,630 Updated Jun 3, 2025

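An introductory PyTorch Lightning tutorial in Chinese; please credit the source when reposting. (Originally written for fun; reading through the MNIST example before getting started is recommended.)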

Jupyter Notebook 217 19 Updated Dec 6, 2020