Tacotron 2

A neural network system for speech synthesis that combines WaveNet and Tacotron, developed for Google Assistant.

Tacotron 2 is a speech synthesis architecture built from multiple neural networks. It combines two text-to-speech (TTS) systems, WaveNet and Tacotron. The system was developed for Google Assistant.

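It is an end-to-end TTS system in which a sequence-to-sequence recurrent network predicts mel spectrograms from text, and a modified WaveNet vocoder converts those spectrograms into audio waveforms. It can be trained directly from data and achieves state-of-the-art sound quality close to natural human speech: the authors report a mean opinion score (MOS) of 4.53, compared with 4.58 for professionally recorded speech.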
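To make the intermediate representation concrete, the following minimal Python sketch shows what a mel spectrogram target looks like in terms of shape and frequency spacing. It is an illustration only, not the system's implementation. The constants (80 mel channels, a 125 Hz to 7.6 kHz range, a 12.5 ms frame hop) are assumptions taken from the Tacotron 2 paper rather than from this page, the mel formula is one common variant among several, and the function names are hypothetical.

```python
import math

# Assumed values from the Tacotron 2 paper, not stated on this page.
N_MELS = 80        # mel channels per spectrogram frame
F_MIN_HZ = 125.0   # lowest frequency covered by the filterbank
F_MAX_HZ = 7600.0  # highest frequency covered by the filterbank
HOP_MS = 12.5      # time step between consecutive frames


def hz_to_mel(hz: float) -> float:
    """Convert Hz to mels using one common (HTK-style) formula."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_center_frequencies(n_mels=N_MELS, f_min=F_MIN_HZ, f_max=F_MAX_HZ):
    """Center frequencies (Hz) of n_mels bands spaced evenly on the mel scale."""
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_mels + 1)
    return [mel_to_hz(lo + step * (i + 1)) for i in range(n_mels)]


def spectrogram_shape(duration_s: float, hop_ms=HOP_MS, n_mels=N_MELS):
    """(frames, channels) the spectrogram predictor would output for an utterance."""
    frames = math.ceil(duration_s * 1000.0 / hop_ms)
    return frames, n_mels
```

Under these assumptions, `spectrogram_shape(2.0)` gives 160 frames of 80 channels for two seconds of speech. The bands returned by `mel_center_frequencies()` are packed closely at low frequencies and spread out at high ones, which is why the vocoder has to reconstruct the fine detail that this compact representation discards.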

Alphabet Inc. researchers developed Tacotron 2 to power Google Assistant, building on DeepMind's WaveNet, which it uses in modified form as its vocoder. It is Google's second-generation AI-powered speech synthesis system and uses multiple neural networks to produce speech almost indistinguishable from a human voice.

Further reading

Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, RJ Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis and Yonghui Wu. Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. Academic paper.

Documentaries, videos and podcasts

Tacotron 2 - THE BEST TEXT TO SPEECH AI YET! (20 January 2018)
