Incredible WaveNet Speech Synthesis
Posted by Daniel on Monday, 12 September 2016
Yaaaay, there is certainly some magic in deep neural networks - after mastering Go or making huge progress in the field of Spoken Language Recognition, Google now presents WaveNet, a deep neural networks-based approach to Speech Synthesis. It sound astoundingly real and even can compose music or fictional languge-like sounds. Amazing. And spooky.
WaveNet changes this paradigm by directly modelling the raw waveform of the audio signal, one sample at a time. As well as yielding more natural-sounding speech, using raw waveforms means that WaveNet can model any kind of audio, including music.
Heise: http://www.heise.de/newsticker/meldung/Google-DeepMind-Sprachsynthese-so...
DeepWave: https://deepmind.com/blog/wavenet-generative-model-raw-audio/