Stars
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
骆驼(Luotuo): Open Sourced Chinese Language Models. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Code for fintune ChatGLM-6b using low-rank adaptation (LoRA)
使用 Prompts 和 Chains 让 ChatGPT 成为神奇的生产力工具!Unlocking the power of LLMs.
Real-time microphone noise suppression on Linux.
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
In defence of metric learning for speaker recognition
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
阿布量化交易系统(股票,期权,期货,比特币,机器学习) 基于python的开源量化交易,量化投资架构
List of articles related to deep learning applied to music
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.
Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)