Log in
Enquire now
‌

US Patent 11475898 Low-latency multi-speaker speech recognition

OverviewStructured DataIssuesContributors

Contents

Is a
Patent
Patent

Patent attributes

Patent Applicant
Apple (company)
Apple (company)
Current Assignee
Apple (company)
Apple (company)
Patent Jurisdiction
United States Patent and Trademark Office
United States Patent and Trademark Office
Patent Number
11475898
Date of Patent
October 18, 2022
Patent Application Number
16534902
Date Filed
August 7, 2019
Patent Citations
‌
US Patent 10049663 Intelligent automated assistant for media exploration
‌
US Patent 10001817 User interface for manipulating user interface objects with magnetic properties
‌
US Patent 10032455 Configurable speech recognition system using a pronunciation alignment between multiple recognizers
‌
US Patent 10037758 Device and method for understanding user intent
‌
US Patent 10216351 Device configuration user interface
‌
US Patent 10248308 Devices, methods, and graphical user interfaces for manipulating user interfaces with physical gestures
‌
US Patent 10249305 Permutation invariant training for talker-independent multi-talker speech separation
‌
US Patent 10296160 Method for extracting salient dialog usage from live data
...
Patent Primary Examiner
‌
Edwin S Leland, III
CPC Code
‌
G10L 17/18
‌
G10L 21/0272
‌
G10L 17/00
‌
G10L 15/20
‌
G10L 17/02
‌
G10L 17/04

Systems and processes for operating an intelligent automated assistant are provided. In one example, a method includes receiving mixed speech data representing utterances of a target speaker and utterances of one or more interfering audio sources. The method further includes obtaining a target speaker representation, which represents speech characteristics of the target speaker; and determining, using a learning network, probability distributions of phonetic elements directly from the mixed speech data. The inputs of the learning network include the mixed speech data and the target speaker representation. An output of the learning network includes the probability distributions of phonetic elements. The method further includes generating text corresponding to the utterances of the target speaker based on the probability distributions of the phonetic elements; and providing a response to the target speaker based on the text corresponding to the utterances of the target speaker.

Timeline

No Timeline data yet.

Further Resources

Title
Author
Link
Type
Date
No Further Resources data yet.

References

Find more entities like US Patent 11475898 Low-latency multi-speaker speech recognition

Use the Golden Query Tool to find similar entities by any field in the Knowledge Graph, including industry, location, and more.
Open Query Tool
Access by API
Golden Query Tool
Golden logo

Company

  • Home
  • Press & Media
  • Blog
  • Careers
  • WE'RE HIRING

Products

  • Knowledge Graph
  • Query Tool
  • Data Requests
  • Knowledge Storage
  • API
  • Pricing
  • Enterprise
  • ChatGPT Plugin

Legal

  • Terms of Service
  • Enterprise Terms of Service
  • Privacy Policy

Help

  • Help center
  • API Documentation
  • Contact Us
By using this site, you agree to our Terms of Service.