by usernaamee

usernaamee / keras-wavenet

Keras implementation of deepmind's wavenet paper

415 Stars 83 Forks Last release: Not found GNU General Public License v3.0 29 Commits 0 Releases

Available items

No Items, yet!

The developer of this repository has not created any items for sale yet. Need a bug fixed? Help with integration? A different license? Create a request here:


Keras implementation of deepmind's wavenet paper

Link to paper

Dataset used

I have used Librispeech corpus. I have concatenated all audio files in dev-clean to create train.wav and all files in test-clean to create validate.wav. I have resampled the audio files to 8000 Hz. Here is how you can create train.wav & validate.wav using vlc on linux:

cvlc -vvv --sout-keep --sout-all --sout "#gather:transcode{acodec=s16l,channels=1,samplerate=8000}:std{access=file,mux=wav,dst=validate.wav}" `find LibriSpeech/test-clean/ -name "*.flac"` vlc://quit
cvlc -vvv --sout-keep --sout-all --sout "#gather:transcode{acodec=s16l,channels=1,samplerate=8000}:std{access=file,mux=wav,dst=train.wav}" `find LibriSpeech/dev-clean/ -name "*.flac"` vlc://quit


  • [x] The basic generative model
  • [ ] Conditioning logic (speaker)
  • [ ] Conditioning logic (TTS)

We use cookies. If you continue to browse the site, you agree to the use of cookies. For more information on our use of cookies please see our Privacy Policy.