Speech synthesis

Artificial simulation of human speech using computers or other devices

Overview Structured Data Issues Contributors

All edits by Joshua Holley

Edits on 9 Mar, 2018

"Added a basic explanation of Hidden Markov models and how they relate to speech synthesis"

Joshua Holley

edited on 9 Mar, 2018

Edits made to:

Article (+1 videos) (+1896 characters)

Table (+1 rows) (+3 cells) (+204 characters)

Article

The simple method of speech synthesis relies on a machine analyzing the words of input phrases and grouping letters based on common usage together. These letters are then matched to a specific sound in the machine's database, which creates the synthesized audio. In this version of speech synthesis, the machine is merely converting the most common sounds that letters make together into audio, which results in the uneven and robotic tones and odd mispronunciations present in simpler systems.

In order to introduce smooth and more natural speech patterns, modern speech synthesis systems have begun to deploy Hidden Markov models to determine the most likely phrase that needs to be "spoken" by the synthesizer. Hidden Markov models are finite state machines that can be used to analyze segments of text that are broken down into a series based on time. The state machine determines the actual word that has been typed using phonetic analysis and its place within the typed phrase based on probability. This allows the machine to string the sounds along in a more naturally paced manner that matches the intent of the text to the audio being produced.

HMM-Based Speech Synthesis: Fundamentals and Its Recent Advances

The four states of analysis to produce audio based on Hidden Markov models are text, phonetic, prosodic, and speech. Text analysis converts the text into a form usable by the machine and utilizes probability to determine the linguistic meaning of the text and the context of the text. Phonetic analysis converts the literal typed letters into phonetic symbols that the machine can relate to certain sounds. Prosodic analysis seeks to use the linguistic meaning in conjunction with the context and phonetic sounds to determine the most probable rhythms, stress patterns, and intonation. Speech analysis combines the results of the previous states to generate the speech signal.

Table

Author

Title

Link

Sangramsing Kayte, Monica Mundada, Jayesh Gujrathi

Hidden Markov Model based Speech Synthesis: A Review

https://www.researchgate.net/publication/284139182_Hidden_Markov_Model_based_Speech_Synthesis_A_Review

Find more entities like Speech synthesis

Use the Golden Query Tool to find similar entities by any field in the Knowledge Graph, including industry, location, and more.

Open Query Tool

Access by API

By using this site, you agree to our Terms of Service.