LoginSign Up
WaveNet

WaveNet

A deep neural network for generating realistic voices for Google Assistant.

WaveNet is a deep neural network designed to generate raw audio waveforms. It generates realistic-sounding voices for Google Assistant globally.

It mimics the human voice and sounds more natural than the best existing Text-to-Speech systems, reducing the gap with the human performance by over 50% and creating higher quality audio.

DeepMind's WaveNet is a type of feedforward neural network, convolutional neural network (CNN). It is composed of layers of interconnected nodes, CNN uses a raw signal as input and synthesizes an output. The trained network creates new speech-like waveforms at 16,000 samples per second. The output waveforms include realistic breaths and lip smacks.

It was created by researchers at DeepMind in London in 2016. Other Text-to-speech systems (TTSs) are Apple's Siri, Microsoft’s Cortana, Amazon Alexa among others.

Timeline

Currently, no events have been added to this timeline yet.
Be the first one to add some.

People

Name
Role
Related Golden topics

Further reading

Author
Title
Link
Type

Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew Seniorn and Koray Kavukcuoglu

WAVENET: A GENERATIVE MODEL FOR RAW AUDIO

Academic paper

Jesse Engel, Cinjon Resnick, Adam Roberts, Sander Dieleman, Douglas Eck, Karen Simonyan, Mohammad Norouzi

Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders

Academic paper

Documentaries, videos and podcasts

Title
Date
Link

Companies

Company
CEO
Location
Products/Services