Log in
Enquire now
‌

US Patent 12001791 Systems and methods for screening data instances based on a target text of a target corpus

OverviewStructured DataIssuesContributors

Contents

Is a
Patent
Patent
0

Patent attributes

Patent Jurisdiction
United States Patent and Trademark Office
United States Patent and Trademark Office
0
Patent Number
120017910
Patent Inventor Names
Mina Naghshnejad0
Harsh Singhal0
Vijayan Nair0
Agus Sudjianto0
Angelina Yang0
Tarun Joshi0
Date of Patent
June 4, 2024
0
Patent Application Number
180456890
Date Filed
October 11, 2022
0
Patent Citations
‌
US Patent 8155950 Method and system for providing a personalized electronic dictionary and vocabulary builder
0
‌
US Patent 8620658 Voice chat system, information processing apparatus, speech recognition method, keyword data electrode detection method, and program for speech recognition
0
‌
US Patent 8719006 Combined statistical and rule-based part-of-speech tagging for text-to-speech synthesis
0
‌
US Patent 8849665 System and method of providing machine translation from a source language to a target language
0
‌
US Patent 9020806 Generating sentence completion questions
0
‌
US Patent 9239827 Identifying collocations in a corpus of text in a distributed computing environment
0
‌
US Patent 9292487 Discriminative language model pruning
0
‌
US Patent 9436760 Measuring accuracy of semantic graphs with exogenous datasets
0
...
Patent Primary Examiner
‌
Gerald Gauthier
0
CPC Code
‌
H04L 63/08
0
‌
G06F 16/258
0
‌
G06F 16/3329
0
‌
G06F 16/3344
0
‌
G06F 16/3347
0
‌
G06F 16/374
0
‌
G06F 16/783
0
‌
G06F 40/117
0
...
Patent abstract

Systems, apparatuses, methods, and computer program products are disclosed for screening data instances based on a target text of a target corpus. A screening device analyzes a target corpus to generate at least two term dictionaries for the target corpus. The screening apparatus, based on a frequency of a term in the target corpus, determines a term weight for the term; for each data instance, determines term scores for the data instance and the target text based on the term weights; filters the data instances based on the term scores, to generate a short list of data instances; determines term similarity scores between each data instance of the short list and target text based on the term weights; and provides a data instance determined to likely correspond to the target text and an indication of the corresponding term similarity score(s). A term is a word or an n-gram.

Timeline

No Timeline data yet.

Further Resources

Title
Author
Link
Type
Date
No Further Resources data yet.

References

Find more entities like US Patent 12001791 Systems and methods for screening data instances based on a target text of a target corpus

Use the Golden Query Tool to find similar entities by any field in the Knowledge Graph, including industry, location, and more.
Open Query Tool
Access by API
Golden Query Tool
Golden logo

Company

  • Home
  • Press & Media
  • Blog
  • Careers
  • WE'RE HIRING

Products

  • Knowledge Graph
  • Query Tool
  • Data Requests
  • Knowledge Storage
  • API
  • Pricing
  • Enterprise
  • ChatGPT Plugin

Legal

  • Terms of Service
  • Enterprise Terms of Service
  • Privacy Policy

Help

  • Help center
  • API Documentation
  • Contact Us
By using this site, you agree to our Terms of Service.