Demucs ONNX Denoiser

A PyTorch-based speech denoising project built on top of Facebook Research's denoiser repository. This repository provides training pipelines, model definitions, and ONNX export scripts for a Demucs-inspired denoiser that you can integrate into your applications via ONNX modules. The original project can be found at https://github.com/facebookresearch/denoiser.

Features

Two-step training pipeline:
1. Valentini dataset pretraining
2. MS-SNSD fine-tuning
Custom augmentation, loss functions, and model definitions
Exports encoder and decoder to ONNX for cross-platform inference
Simple Python application (app.py) for real-time denoising

Getting Started

Prerequisites

Python 3.8 or higher
pip for managing Python packages
Datasets:
1. Valentini dataset (Edinburgh DataShare)
2. Microsoft MS-SNSD repository

Installation

Clone this repository:

git clone https://github.com/GitStroberi/demucs-onnx.git
cd demucs-onnx

Create and activate a virtual environment:

conda env create -f environment.yml -n demucs-onnx
conda activate demucs-onnx

Dataset Preparation

Download the Valentini and MS-SNSD datasets.
Unzip and organize them into folders on your local machine.
Update the file paths in both training scripts (Denoiser-Valentini.py and Denoiser-MS-SNSD.py) to point to your dataset directories.

Training (optional)

Valentini dataset training

Run the first-stage training on the Valentini dataset:

python Denoiser-Valentini.py

This script will:

Load and preprocess Valentini noisy and clean pairs
Apply augmentation (via augmentation.py)
Train the Demucs causal model (defined in model_def.py)
Save checkpoints

MS-SNSD Fine-tuning

After completing Valentini training, fine-tune on MS-SNSD:

python Denoiser-MS-SNSD.py

This script will:

Load the pretrained weights from the Valentini stage
Continue training on MS-SNSD data for better generalization
Save final model checkpoint (sample model checkpoint present in the repository as demucs_model_finetune.pth)

Inference & ONNX Export

Before running the demo app, you must export the encoder and decoder:

python inference.py

This script will:

Split the trained model into three parts: encoder, LSTM bottleneck, and decoder
Export the encoder and decoder to ONNX format (.onnx files)
Start a simple real time inference streaming loop using the model

Running the App

Once you have your encoder.onnx and decoder.onnx files, run the demo application:

python app.py

License

This project incorporates code from facebookresearch/denoiser, which is licensed under CC BY-NC 4.0 (see LICENSE).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Demucs ONNX Denoiser

Features

Getting Started

Installation

Training (optional)

Valentini dataset training

MS-SNSD Fine-tuning

Inference & ONNX Export

Running the App

License

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Denoiser-MS-SNSD.py		Denoiser-MS-SNSD.py
Denoiser-Valentini.py		Denoiser-Valentini.py
LICENSE		LICENSE
README.md		README.md
app.py		app.py
augmentation.py		augmentation.py
demucs_decoder.onnx		demucs_decoder.onnx
demucs_encoder.onnx		demucs_encoder.onnx
demucs_model_finetune.pth		demucs_model_finetune.pth
environment.yml		environment.yml
inference.py		inference.py
losses.py		losses.py
model_def.py		model_def.py
training.py		training.py

License

GitStroberi/demucs-onnx

Folders and files

Latest commit

History

Repository files navigation

Demucs ONNX Denoiser

Features

Getting Started

Installation

Training (optional)

Valentini dataset training

MS-SNSD Fine-tuning

Inference & ONNX Export

Running the App

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages