Audio Bit-Depth Super Resolution

Our project focuses on the goal of adapting WaveNet, an audio prediction CNN architecture, to superresolve 8-bit audio clips into 16-bit audio clips, trying to restore lossed dynamic range and as a result cleaning compression artifacts.

We evaluate several modifications to WaveNet, including:

Discriminative rather than autoregressive prediction
Non-causal dilations - both input samples from past and future are available during prediction
Delta prediction - assuming 8-bit audio mostly preserves the 16-bit audio data, we aim to only predict the delta between the two waveforms
Real-valued prediction - since the amplitude space is inherently continuous (discretized during compression), a real-valued number space is a more natural model than a categorical output passed through softmax.

Improvements are subtle but include audible muffling of the background noise, though we terminated training early due to resource constraints and observed that loss was still decreasing approximately linearly at time of evaluation. We believe there is further improvements to be had with our architecture given sufficient training.

Final Write up

Audio Bit Depth Super Resolution Paper

Presentation

Project Presentation

Generated Samples

Source code

Source Code

Maintained by Taylor Lundy, Thomas Liu, and William Qi.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
data		data
docs		docs
pytorch-wavenet		pytorch-wavenet
.gitignore		.gitignore
README.md		README.md
_config.yml		_config.yml
baseline_mehri.py		baseline_mehri.py
baseline_vctk.py		baseline_vctk.py
data.py		data.py
mehri_data_prep.py		mehri_data_prep.py
metrics.py		metrics.py
vctk_data_prep.py		vctk_data_prep.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Audio Bit-Depth Super Resolution

Final Write up

Presentation

Generated Samples

Source code

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

wqi/bdsr

Folders and files

Latest commit

History

Repository files navigation

Audio Bit-Depth Super Resolution

Final Write up

Presentation

Generated Samples

Source code

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages