8000 GitHub - cordutie/ddsp_textures: Repository designed to showcase the results of the paper "A Statistics-Driven Differentiable Approach for Sound Texture Synthesis and Analysis" presented atDAFx 2025 in Ancona, Italy.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Repository designed to showcase the results of the paper "A Statistics-Driven Differentiable Approach for Sound Texture Synthesis and Analysis" presented atDAFx 2025 in Ancona, Italy.

License

Notifications You must be signed in to change notification settings

cordutie/ddsp_textures

Repository files navigation

ddsp_textures

Esteban Gutiérrez1 and Lonce Wyse1

1 Department of Information and Communications Technologies, Universitat Pompeu Fabra

1. Introduction

This repository contains an implementation of all algorithm and models introduced in the thesis titled "Statistics-Driven Texture Sound Synthesis Using Differentiable Digital Signal Processing-Based Architectures" authored by Esteban Gutiérrez and advised by Lonce Wyse at the Universitat Pompeu Fabra.

DDSP architecture

Figure 1. DDSP architecture modified to synthesize texture sounds.

The thesis explored adapting Differentiable Digital Signal Processing (DDSP) architectures, first introduced by Engel et al. in [1], for synthesizing and controlling texture sounds, which are complex and noisy compared to traditional pitched instrument timbres. It introduces two innovative synthesizers: the $\texttt{TexEnv\ Synth}$, which applies amplitude envelopes to subband decompositions of white noise, and the $\texttt{P-VAE\ Synth}$, which integrates a Poisson process with a Variational Autoencoder (VAE) to handle time and event-based aspects of texture sounds based on the early conceptions of a texture sound introduced by Saint-Arnaud in [2]. Additionally, the $\texttt{TextStat}$ loss function is presented, inspired in McDermott and Simoncelli's work [3] and designed to evaluate texture sounds based on their statistical properties rather than short-term perceptual similarity. The thesis demonstrates the application of these synthesizers and the loss function within DDSP-based frameworks, highlights mixed success in resynthesizing texture sounds, and identifies challenges, particularly with the $\texttt{P-VAE\ Synth}$. Future work will focus on optimizing the $\texttt{TextStat}$ loss function, reassessing the VAE component, and exploring real-time implementations. This research lays the groundwork for advancing texture sound synthesis and provides valuable insights for both theoretical and practical developments in audio signal processing.

Latent space exploration

Figure 2. Latent space exploration.

2. How to Use

This repository contains a variety of functions, each demonstrated in one or more of the provided tester Jupyter notebooks.

To train a model, follow these steps:

  1. Prepare a Configuration File: Create and fill out a JSON configuration file.
  2. Run Training: Execute the training process using the following command:
python main.py train configuration.json
  1. Continue Training from Checkpoint: Execute the training process using the following command:
python main.py retrain model_folder

For detailed examples of the training process, refer to the training/wrapper_tester.ipynb notebook. To see a sample configuration file, check out auxiliar/config_template_pvae.json.

3. References

[1] J. Engel, L. Hantrakul, C. Gu, and A. Roberts, “Ddsp: Differentiable digital signal processing,” in International Conference on Learning Representations, 2020.
[2] N. Saint-Arnaud, “Classification of Sound Textures,” Master’s thesis, Massachusetts Institute of Technology, Cambridge, MA, 1995.
[3] J. H. McDermott and E. P. Simoncelli, “Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis,” Neuron, vol. 71, pp. 926–940, 2011.\

About

Repository designed to showcase the results of the paper "A Statistics-Driven Differentiable Approach for Sound Texture Synthesis and Analysis" presented atDAFx 2025 in Ancona, Italy.

https://cordutie.github.io/ddsp_textures/

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0