Log in
Enquire now
Whisper (OpenAI)

Whisper (OpenAI)

An automatic speech recognition (ASR) system from OpenAI.

OverviewStructured DataIssuesContributors

Contents

openai.com/blog/whisper/
Is a
‌
AI Project

AI Project attributes

Industry
Speech recognition
Speech recognition
Artificial Intelligence (AI)
Artificial Intelligence (AI)
AI Project Parent Organization
OpenAI
OpenAI

Other attributes

Competitors
Massively Multilingual Speech (MMS)
Massively Multilingual Speech (MMS)
Launch Date
September 21, 2022

Whisper is an automatic speech recognition (ASR) system that is approaching human levels of accuracy for the English language. The model is trained on 680,000 hours of multilingual and multitask supervised data collected from the internet. Using such a large training dataset helps Whisper improve its robustness to accents, background noise and technical language, enabling transcription in multiple languages as well as translating from various languages into English.

Whisper's architecture uses an end-to-end approach implemented as an encoder-decoder transformer. Audio is divided into 30-second chunks before being converted into a log-Mel spectrogram and then passed into an encoder. A decoder is trained to predict the corresponding text caption as well as perform specific tasks such as language identification, phrase-level timestamps, multilingual speech transcription, and to-English speech translation. While specialized models show better speech recognition performance, using a large and diverse dataset allows Whisper to be used for a variety of tasks with fewer errors. Roughly a third of Whisper's audio dataset is non-English. OpenAI open-sourced the Whisper model and inference code.

Timeline

No Timeline data yet.

Further Resources

Title
Author
Link
Type
Date

Introducing Whisper

https://openai.com/blog/whisper/

Web

September 21, 2022

References

Find more entities like Whisper (OpenAI)

Use the Golden Query Tool to find similar entities by any field in the Knowledge Graph, including industry, location, and more.
Open Query Tool
Access by API
Golden Query Tool
Golden logo

Company

  • Home
  • Press & Media
  • Blog
  • Careers
  • WE'RE HIRING

Products

  • Knowledge Graph
  • Query Tool
  • Data Requests
  • Knowledge Storage
  • API
  • Pricing
  • Enterprise
  • ChatGPT Plugin

Legal

  • Terms of Service
  • Enterprise Terms of Service
  • Privacy Policy

Help

  • Help center
  • API Documentation
  • Contact Us
By using this site, you agree to our Terms of Service.