Need help with DNP?
Click the “chat” button below for chat support from the developer who created it, or find similar developers for support.

About the developer

mosheman5
144 Stars 30 Forks 23 Commits 2 Opened issues

Description

Audio Denoising with Deep Network Priors

Services available

!
?

Need anything else?

Contributors list

No Data

DNP

Speech Denoising by Accumulating Per-Frequency Modeling Fluctuations

This repository provides a PyTorch implementation of "Speech Denoising by Accumulating Per-Frequency Modeling Fluctuations" (paper)

The method trains on noisy audio signal and provides a the clean underlying signal. Comparison with traditional unsupervised methods can be found here

The method is completely unsupervised and only trains on the specific audio clip that is being denoised.

The algorithm is based on the observation that modeling noise in the signal is harder than a clean signal. During the fitting we observe flactuations in different stages of the train. By calculating the amount of difference between outputs in the time-frequency domain we create a robust spectral mask used for denoising the noisy output. Accumulator

Dependencies

A conda environment file is available in the repository. * Python 3.6 + * Pytorch 1.0 * Torchvision * librosa * tqdm * scipy * soundfile

Usage

1. Cloning the repository & setting up conda environment

$ git clone https://github.com/mosheman5/DNP.git
$ cd DNP/

For creating and activating the conda environment:

$ conda env create -f environment.yml
$ conda activate DNP

2. Testing

To test on the demo speech file:

$ python DNP.py --run_name demo --noisy_file demo.wav --samples_dir samples --save_every 50 --num_iter 5000 --LR 0.001

To test on any other audio file insert the file path after the

--noisy_file
option.

A jupyter notebook with visualization is available:

dnp.ipynb

Reference

If you found this code useful, please cite the following paper:

@article{michelashvili2020denoising,
  title={Speech Denoising by Accumulating Per-Frequency Modeling Fluctuations},
  author={Michael Michelashvili and Lior Wolf},
  journal={arXiv preprint arXiv:1904.07612},
  year={2020}
}

Acknowledgement

The implemantation of the network architecture is taken from Wave-U-Net

We use cookies. If you continue to browse the site, you agree to the use of cookies. For more information on our use of cookies please see our Privacy Policy.