Log in
Enquire now
‌

US Patent 11188821 Control policies for collective robot learning

Patent 11188821 was granted and assigned to X (company) on November, 2021 by the United States Patent and Trademark Office.

OverviewStructured DataIssuesContributors

Contents

Is a
Patent
Patent
0

Patent attributes

Patent Applicant
X (company)
X (company)
0
Current Assignee
X (company)
X (company)
0
Patent Jurisdiction
United States Patent and Trademark Office
United States Patent and Trademark Office
0
Patent Number
111888210
Patent Inventor Names
Sergey Vladimir Levine0
Adrian Ling Hin Li0
Ali Hamid Yahya Valdovinos0
Mrinal Kalakrishnan0
Yevgen Chebotar0
Date of Patent
November 30, 2021
0
Patent Application Number
157056010
Date Filed
September 15, 2017
0
Patent Citations Received
0
‌
US Patent 11373466 Data recorders of autonomous vehicles
‌
US Patent 11392796 Feature dictionary for bandwidth enhancement
‌
US Patent 11410475 Autonomous vehicle data recorders
‌
US Patent 11836650 Artificial intelligence engine for mixing and enhancing features from one or more trained pre-existing machine-learning models
0
‌
US Patent 11841789 Visual aids for debugging
0
‌
US Patent 11842172 Graphical user interface to an artificial intelligence engine utilized to generate one or more trained artificial intelligence models
0
‌
US Patent 11868896 Interface for working with simulations on premises
0
...
Patent Primary Examiner
‌
Miranda M Huang
0
Patent abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, of training a global policy neural network. One of the methods includes initializing an instance of the robotic task for multiple local workers, generating a trajectory of state-action pairs by selecting actions to be performed by the robotic agent while performing the instance of the robotic task, optimizing a local policy controller on the trajectory, generating an optimized trajectory using the optimized local controller, and storing the optimized trajectory in a replay memory associated with the local worker. The method includes sampling, for multiple global workers, an optimized trajectory from one of one or more replay memories associated with the global worker, and training the replica of the global policy neural network maintained by the global worker on the sampled optimized trajectory to determine delta values for the parameters of the global policy neural network.

Timeline

No Timeline data yet.

Further Resources

Title
Author
Link
Type
Date
No Further Resources data yet.

References

Find more entities like US Patent 11188821 Control policies for collective robot learning

Use the Golden Query Tool to find similar entities by any field in the Knowledge Graph, including industry, location, and more.
Open Query Tool
Access by API
Golden Query Tool
Golden logo

Company

  • Home
  • Press & Media
  • Blog
  • Careers
  • WE'RE HIRING

Products

  • Knowledge Graph
  • Query Tool
  • Data Requests
  • Knowledge Storage
  • API
  • Pricing
  • Enterprise
  • ChatGPT Plugin

Legal

  • Terms of Service
  • Enterprise Terms of Service
  • Privacy Policy

Help

  • Help center
  • API Documentation
  • Contact Us
By using this site, you agree to our Terms of Service.