nangongmu / Starred · GitHub

Face recognition based on InsightFace

Python 84 25 Updated Sep 3, 2020

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

Python 1,082 342 Updated Jun 8, 2024

Python interface to the WebRTC Voice Activity Detector

C 2,246 418 Updated Jul 4, 2024

Deep Speaker: an End-to-End Neural Speaker Embedding System https://arxiv.org/pdf/1705.02304.pdf

Python 1 Updated Jul 30, 2018

This repo contains code for speech vs music vs noise classification

Python 7 4 Updated Jan 10, 2020

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Python 708 155 Updated Apr 6, 2023

You can find the speech algorithms you want here

C 806 249 Updated Jan 1, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,596 883 Updated May 21, 2025

Python code to extract MFCC and FBANK speech features for Kaldi

Python 65 18 Updated Nov 28, 2018

A Convolutional Neural Network based Voice Activity Detector for Smartphones

Jupyter Notebook 71 23 Updated Apr 30, 2019

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,936 1,913 Updated May 26, 2025

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Python 316 64 Updated Nov 11, 2020

Port of the VAD (voice activity detector) from the 3GPP 26.073 specification

C 14 8 Updated Feb 14, 2019

100+ pretrained Chinese word vectors

Python 12,019 2,329 Updated Oct 30, 2023
Python 55 27 Updated Jun 15, 2020

Speech Recognition using DeepSpeech2.

Python 2,118 624 Updated Dec 13, 2022

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Python 6,382 1,284 Updated Aug 31, 2024

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Python 759 177 Updated Jul 6, 2023

GPT2 for multiple languages, including pretrained models: multilingual GPT2 support and a 1.5-billion-parameter Chinese pretrained model

Python 1,714 333 Updated May 22, 2023
Python 6 Updated May 14, 2020

Mandarin automatic speech recognition

Python 1,943 483 Updated Jul 25, 2024

Kaldi model converter to ONNX

Python 244 59 Updated Jan 27, 2023

Dockerfile for compiling Kaldi for Android.

Shell 66 24 Updated Feb 4, 2019

Kaldi-based iOS on-device speech recognition (local real-time streaming)

Objective-C 72 29 Updated Sep 13, 2021

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Python 728 102 Updated Feb 15, 2025

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…

Python 2,387 445 Updated Mar 14, 2022

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,876 5,359 Updated Apr 28, 2025

Code samples used on cloud.google.com

Jupyter Notebook 7,708 6,549 Updated May 29, 2025

Facebook AI Research's Automatic Speech Recognition Toolkit

C++ 6,427 1,012 Updated Nov 23, 2024