Esteban Gutiérrez1 and Lonce Wyse1
1 Department of Information and Communications Technologies, Universitat Pompeu Fabra
This repository contains an implementation of all algorithms and models introduced in the thesis "Statistics-Driven Texture Sound Synthesis Using Differentiable Digital Signal Processing-Based Architectures", authored by Esteban Gutiérrez and advised by Lonce Wyse at the Universitat Pompeu Fabra.
The thesis explores adapting Differentiable Digital Signal Processing (DDSP) architectures, first introduced by Engel et al. [1], to synthesizing and controlling texture sounds, which are complex and noisy compared to traditional pitched-instrument timbres. It introduces two synthesizers: the
This repository contains a variety of functions, each demonstrated in one or more of the provided tester
Jupyter notebooks.
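
As background for the models in this repository, the snippet below sketches the filtered-noise building block of DDSP [1]: white noise shaped frame by frame by a time-varying magnitude response. It is only a minimal illustration, not one of the thesis's synthesizers, and the frame size, hop size, and low-pass example are assumptions.

```python
import numpy as np

def filtered_noise(magnitudes, frame_size=1024, hop=512, seed=0):
    """Shape white noise with per-frame filter magnitude responses.

    magnitudes: array of shape (n_frames, frame_size // 2 + 1).
    Returns a mono signal of length n_frames * hop.
    """
    rng = np.random.default_rng(seed)
    n_frames, _ = magnitudes.shape
    window = np.hanning(frame_size)
    out = np.zeros(n_frames * hop + frame_size)
    for i in range(n_frames):
        noise = rng.uniform(-1.0, 1.0, frame_size)              # white noise frame
        spectrum = np.fft.rfft(noise * window) * magnitudes[i]  # shape the spectrum
        frame = np.fft.irfft(spectrum, n=frame_size)            # back to the time domain
        out[i * hop : i * hop + frame_size] += frame            # overlap-add
    return out[: n_frames * hop]

# Example: a low-pass response whose cutoff opens slowly over 200 frames.
n_frames, n_bins = 200, 1024 // 2 + 1
cutoff = np.linspace(10, n_bins, n_frames)
magnitudes = (np.arange(n_bins)[None, :] < cutoff[:, None]).astype(float)
audio = filtered_noise(magnitudes)
```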
To train a model, follow these steps:

- Prepare a Configuration File: Create and fill out a JSON configuration file.
- Run Training: Execute the training process using the following command:

  ```
  python main.py train configuration.json
  ```

- Continue Training from a Checkpoint: Resume a previous run from its model folder using the following command:

  ```
  python main.py retrain model_folder
  ```
For detailed examples of the training process, refer to the training/wrapper_tester.ipynb notebook. To see a sample configuration file, check out auxiliar/config_template_pvae.json.
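
For orientation, a configuration file of this kind might be assembled as in the sketch below; the field names and values are illustrative assumptions only, so consult auxiliar/config_template_pvae.json for the actual keys the training wrapper expects.

```python
import json

# Hypothetical configuration -- every key below is an assumption for illustration;
# see auxiliar/config_template_pvae.json for the real schema.
config = {
    "dataset_path": "data/textures",      # folder containing the training audio (assumed key)
    "sample_rate": 16000,                 # audio sample rate in Hz (assumed key)
    "batch_size": 8,                      # training batch size (assumed key)
    "epochs": 100,                        # number of training epochs (assumed key)
    "output_folder": "runs/experiment_1", # where checkpoints are written (assumed key)
}

# Write the file passed to `python main.py train configuration.json`.
with open("configuration.json", "w") as f:
    json.dump(config, f, indent=4)
```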
[1] J. Engel, L. Hantrakul, C. Gu, and A. Roberts, “DDSP: Differentiable digital signal processing,” in International Conference on Learning Representations, 2020.
[2] N. Saint-Arnaud, “Classification of Sound Textures,” Master’s thesis, Massachusetts Institute of Technology, Cambridge, MA, 1995.
[3] J. H. McDermott and E. P. Simoncelli, “Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis,” Neuron, vol. 71, pp. 926–940, 2011.