Performative Latents for Adaptive Unsupervised DDSP (PLAUD)

PLAUD is a PyTorch-based modular synthesis framework that extends DDSP: Differentiable Digital Signal Processing for real-time, performance-oriented, and latent-variable-controlled sound generation. It is specifically designed for small, personal datasets, and prioritizes playability, modularity, and exploration over strict reconstruction quality.

At its core, PLAUD is a reconfigurable DDSP synthesizer where every aspect of the synthesis and training process is steered by a structured latent space. It supports intervenable architectures, allowing real-time constraints (e.g., number of oscillators) and flexible loss objectives.

Key Features

Exclusively latent-based control: The entire generation process is conditioned on a regularized latent space—there is no explicit audio feature extraction.
Smoothed latent trajectories: Temporal structure and controllability are improved via latent smoothing (e.g., average pooling over time).
Modular synthesis blocks:
- Sinusoidal additive synthesis
- Harmonic oscillator banks (optional)
- NoiseBandNet-based residual modeling (NoiseBandNet paper)
Customizable loss functions:
- Multi-resolution STFT loss
- Perceptual CLAP loss (audio-text contrastive model)
- Perceptual M2L loss (mel-to-latent for realism)
Optional attribute regularization: Add task-specific structure to latent space for controlled generation. (Attribute Regularization paper
Highly customizable: Modular training interface for swapping losses, synthesis blocks, and regularization strategies.
Real-time deployment: Compatible with nn~ externals for Max/MSP and PureData for live performance environments.

Installation

Clone the repository and install the package locally with pip:

pip install -r requirements.txt
pip install -e .

Training

The training is done in two steps:

Preprocess the dataset

python utils/dataset_converter.py --input_dir <path_to_dataset> --output_dir <path_to_output_dir>

Train the model

python cli/train.py\
        --latent_size 8\
        --model_name <model_name>\
        --dataset_path <path_to_preprocessed_dataset>

The training process is highly customisable. To see all the options run:

python cli/train.py --help

Inference

Max/MSP and PureData

The model is compatibile with nn~ externals for Max/MSP and PureData. In order to use trained model, you need to install the extensions following the instructions from original nn~ repository.

Model export

In order to export the model to be used with nn~ externals, run:

python cli/export.py --model_directory <path_to_model_training> --output_dir <path_to_output_dir>

Colab Notebook

The project is also available as this colab notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 101 Commits
cli		cli
ddsp		ddsp
utils		utils
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Performative Latents for Adaptive Unsupervised DDSP (PLAUD)

Key Features

Installation

Training

Inference

Max/MSP and PureData

Model export

Colab Notebook

About

Uh oh!

Releases

Packages

Uh oh!

Languages

blazejkotowski/plaud

Folders and files

Latest commit

History

Repository files navigation

Performative Latents for Adaptive Unsupervised DDSP (PLAUD)

Key Features

Installation

Training

Inference

Max/MSP and PureData

Model export

Colab Notebook

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages