8000 superYangwenwen (yangwenwen) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View superYangwenwen's full-sized avatar

Block or report superYangwenwen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Command line utility for forced alignment using Kaldi

Python 1,469 254 Updated Mar 25, 2025

Phonetisaurus G2P

Shell 473 123 Updated Jun 1, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 39,972 5,100 Updated Aug 16, 2024

End-to-End Speech Processing Toolkit

Python 9,087 2,258 Updated May 6, 2025

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,129 92 Updated Mar 2, 2025

A curated list of awesome papers on contextualizing E2E ASR outputs

77 9 Updated May 10, 2023

Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding

Python 43 8 Updated Mar 12, 2023

This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.

108 6 Updated Aug 4, 2023

TigerBot: A multi-language multi-task LLM

Python 2,258 191 Updated Dec 28, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,827 1,891 Updated Apr 30, 2024

【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。

2,040 200 Updated Mar 30, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,550 4,702 Updated Apr 12, 2025

LLM training code for Databricks foundation models

Python 4,242 559 Updated May 13, 2025

⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡

Python 2,943 198 Updated Nov 26, 2023

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,439 405 Updated Feb 12, 2025

A self-supervised learning framework for audio-visual speech

Python 902 141 Updated Dec 7, 2023

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

208 22 Updated Apr 16, 2023

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

Python 500 65 Updated Mar 29, 2025

Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

1,862 225 Updated Jun 27, 2022

Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"

Python 109 20 Updated Nov 9, 2020

MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf

Python 294 33 Updated Sep 11, 2021

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,188 1,174 Updated May 28, 2023

End-to-end ASR/LM implementation with PyTorch

Python 596 139 Updated Aug 30, 2021

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

Python 107 25 Updated Mar 19, 2024

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Python 648 135 Updated Apr 5, 2022

Python interface to the WebRTC Voice Activity Detector

C 2,231 417 Updated Jul 4, 2024

This project is real-time visualization of a network recognizing digits from user's input.

Processing 584 67 Updated Dec 30, 2019

DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990

Python 118 21 Updated Dec 10, 2020

The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild

Python 25 11 Updated Nov 23, 2018
Next
0