US Patent 12061964 Modulating agent behavior to optimize learning progress

US Patent 12061964 was granted by the United States Patent and Trademark Office on August 13, 2024, and assigned to Deepmind Technologies Limited.

Is a
Patent

Patent attributes

Patent Applicant
Deepmind Technologies Limited
Current Assignee
Deepmind Technologies Limited
Patent Jurisdiction
United States Patent and Trademark Office
Patent Number
12061964
Patent Inventor Names
Diana Luiza Borsa
Georg Ostrovski
William Clinton Dabney
Fengning Ding
Simon Osindero
Tom Schaul
David Szepesvari
Date of Patent
August 13, 2024
Patent Application Number
17032562
Date Filed
September 25, 2020
Patent Citations
US Patent 10242666 Method of performing multi-modal dialogue between a humanoid robot and user, computer program product and humanoid robot for implementing said method
US Patent 11537439 Intelligent compute resource selection for machine learning training jobs
Patent Primary Examiner
William L Bashore
CPC Code
G06V 10/764
G06V 40/20
G06V 10/82
G06N 3/08
G06N 3/006
Patent abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent. One of the methods includes sampling a behavior modulation in accordance with a current probability distribution; for each of one or more time steps: processing an input comprising an observation characterizing a current state of the environment at the time step using an action selection neural network to generate a respective action score for each action in a set of possible actions that can be performed by the agent; modifying the action scores using the sampled behavior modulation; and selecting the action to be performed by the agent at the time step based on the modified action scores; determining a fitness measure corresponding to the sampled behavior modulation; and updating the current probability distribution over the set of possible behavior modulations using the fitness measure corresponding to the behavior modulation.
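The loop described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the abstract does not say what a behavior modulation is, how fitness is measured, or how the distribution is updated. Here the modulation is assumed to be a softmax temperature, the fitness measure is assumed to be the episode return, and the distribution update is a gradient-bandit rule swapped in for illustration. The function `action_scores` is a hypothetical stand-in for the action selection neural network, and the toy reward is invented.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def action_scores(observation):
    # Hypothetical stand-in for the action selection neural network:
    # returns one score per action in a set of three possible actions.
    return [0.1 * observation, 0.5, 1.0]


def run_episode(temperature, rng, num_steps=5):
    """Act for a few time steps under one sampled modulation; return the fitness."""
    total_reward = 0.0
    for step in range(num_steps):
        observation = float(step)  # toy observation of the environment state
        scores = action_scores(observation)
        # Modify the action scores using the sampled behavior modulation
        # (assumption: the modulation is a softmax temperature).
        modified = [s / temperature for s in scores]
        probs = softmax(modified)
        # Select the action based on the modified scores.
        action = rng.choices(range(len(scores)), weights=probs)[0]
        # Invented toy reward: only action 2 pays off.
        total_reward += 1.0 if action == 2 else 0.0
    # Assumption: fitness is the episode return.
    return total_reward


def train(num_episodes=200, seed=0, step_size=0.1):
    rng = random.Random(seed)
    temperatures = [0.1, 1.0, 10.0]  # hypothetical set of possible modulations
    preferences = [0.0] * len(temperatures)
    baseline = 0.0
    for episode in range(1, num_episodes + 1):
        # Sample a modulation from the current probability distribution.
        dist = softmax(preferences)
        idx = rng.choices(range(len(temperatures)), weights=dist)[0]
        fitness = run_episode(temperatures[idx], rng)
        # Update the distribution using the fitness measure
        # (assumption: gradient-bandit update with a running-mean baseline).
        baseline += (fitness - baseline) / episode
        advantage = fitness - baseline
        for i in range(len(preferences)):
            indicator = 1.0 if i == idx else 0.0
            preferences[i] += step_size * advantage * (indicator - dist[i])
    return temperatures, softmax(preferences)


if __name__ == "__main__":
    temps, probs = train()
    for t, p in zip(temps, probs):
        print(f"temperature={t}: probability={p:.3f}")
```

Running the script prints the learned probability for each candidate temperature. Other modulations (for example, an exploration rate or a per-action bias) would slot into `run_episode` in the same place where the scores are modified.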
