Log in
Enquire now
Deep voice 2

Deep voice 2

A multi-speaker neural artificial speech synthesis system based on Deep voice 1.

OverviewStructured DataIssuesContributors

Contents

Other attributes

Industry
Software development
Software development
Machine learning
Machine learning

Deep Voice 2 is an artificial system synthesis commonly called text-to-speech system (TTS). It is based on Deep voice 1 but constructed with higher performance building blocks and introduces a post-processing neural vocoder. It demonstrates a significant audio quality improvement.

Deep voice 2 can generate several hundred voices and accents. It can learn from hundreds of voices and imitate them perfectly. It can learn from hundreds of unique voices from less than half an hour of data per speaker, while achieving high audio quality synthesis and preserving the speaker identities.

It was released in May 2017 by Baidu Research.

Timeline

No Timeline data yet.

Further Resources

Title
Author
Link
Type
Date

Deep Voice 2: Multi-Speaker Neural Text-to-Speech

Sercan Arik, Gregory Diamos, Andrew Gibiansky, John Miller, Kainan Peng, Wei Ping, Jonathan Raiman and Yanqi Zhou

https://arxiv.org/pdf/1705.08947.pdf

Academic paper

References

Find more entities like Deep voice 2

Use the Golden Query Tool to find similar entities by any field in the Knowledge Graph, including industry, location, and more.
Open Query Tool
Access by API
Golden Query Tool
Golden logo

Company

  • Home
  • Press & Media
  • Blog
  • Careers
  • WE'RE HIRING

Products

  • Knowledge Graph
  • Query Tool
  • Data Requests
  • Knowledge Storage
  • API
  • Pricing
  • Enterprise
  • ChatGPT Plugin

Legal

  • Terms of Service
  • Enterprise Terms of Service
  • Privacy Policy

Help

  • Help center
  • API Documentation
  • Contact Us
By using this site, you agree to our Terms of Service.