Stars
Awesome speech/audio LLMs, representation learning, and codec models
This repo. contains our implementation for Federated Learning with PEFT methods (e.g. Adapters) integrated with frozen WavLM
This is the official implementation of " Enhancing Embeddings for Speech Classification in Noisy Conditions"
The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)
This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture…
This is the official implementation of " Enhancing Embeddings for Speech Classification in Noisy Conditions"
An anthology of recent continual learning papers, where people interested in this fascinating topic can start discovering its multidimensional representations.
"An Investigation of the Combination of Rehearsal and Knowledge Distillation in Continual Learning for Spoken Language Understanding", accepted at INTERSPEECH 2023.
Keras/Pytorch neural network size, operations and parameters counter
The official implementation of the paper "Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models"
Fine-tune wav2vec2-xls-r on data from low-resource-languages
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Script to calculate SNR and SDR using python
kaldi-asr/kaldi is the official location of the Kaldi project.