Tacotron 2

A neural network system for speech synthesis that combines WaveNet and Tacotron, developed for Google Assistant.

Tacotron 2 is a speech synthesis architecture built from multiple neural networks. It combines two earlier text-to-speech (TTS) systems, WaveNet and Tacotron. The system was developed for Google Assistant.

It is an end-to-end TTS system in which a sequence-to-sequence recurrent network predicts mel spectrograms from text, and a modified WaveNet vocoder converts those spectrograms into audio waveforms. It can be trained directly from data and achieves state-of-the-art sound quality close to that of natural human speech.
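
The mel spectrogram is the intermediate representation passed between the two networks, so it helps to see how one frame of it can be computed. The following is a minimal sketch, not the authors' implementation: it assumes the common HTK-style mel formula, and the function names, the FFT size of 2048, and the 24 kHz sample rate are illustrative choices. The 80 channels, the 125 Hz to 7.6 kHz range, and the 0.01 clipping floor follow the values reported in the Tacotron 2 paper.

```python
import math

def hz_to_mel(f_hz):
    # HTK-style mel scale (an assumption; other mel variants exist).
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(m):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft, sample_rate, f_min, f_max):
    """Build n_mels triangular filters over the n_fft // 2 + 1 linear bins."""
    n_bins = n_fft // 2 + 1
    m_lo, m_hi = hz_to_mel(f_min), hz_to_mel(f_max)
    # n_mels + 2 edge frequencies, evenly spaced on the mel scale.
    edges_hz = [
        mel_to_hz(m_lo + (m_hi - m_lo) * i / (n_mels + 1))
        for i in range(n_mels + 2)
    ]
    # Centre frequency of each linear STFT bin.
    bin_hz = [k * sample_rate / n_fft for k in range(n_bins)]
    bank = []
    for j in range(n_mels):
        lo, mid, hi = edges_hz[j], edges_hz[j + 1], edges_hz[j + 2]
        row = []
        for f in bin_hz:
            if lo < f <= mid:
                row.append((f - lo) / (mid - lo))   # rising slope
            elif mid < f < hi:
                row.append((hi - f) / (hi - mid))   # falling slope
            else:
                row.append(0.0)
        bank.append(row)
    return bank

def log_mel_frame(magnitudes, bank, floor=0.01):
    """Project one STFT magnitude frame onto the mel bank, clip, then log."""
    return [
        math.log(max(floor, sum(w * x for w, x in zip(row, magnitudes))))
        for row in bank
    ]

# Illustrative settings: n_fft and sample_rate are assumptions.
bank = mel_filterbank(n_mels=80, n_fft=2048, sample_rate=24000,
                      f_min=125.0, f_max=7600.0)
silent_frame = [0.0] * (2048 // 2 + 1)   # an all-zero magnitude frame
features = log_mel_frame(silent_frame, bank)
print(len(features))  # one value per mel channel
```

In a full system, the sequence-to-sequence network would be trained to predict frames like `features` from text, and the vocoder would be trained to turn a sequence of such frames back into audio samples.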

Researchers at Alphabet Inc. developed Tacotron 2, building on DeepMind's WaveNet, to power Google Assistant. It is Google's second-generation AI-powered speech synthesis system. It uses multiple neural networks to produce speech that is almost indistinguishable from human speech.

Further reading

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Authors: Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, RJ Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis and Yonghui Wu
Type: Academic paper

Documentaries, videos and podcasts

Tacotron 2 - THE BEST TEXT TO SPEECH AI YET!
Date: 20 January 2018
