Patent attributes
An apparatus includes a processor and a memory that stores a deep Q reinforcement learning (DQN) algorithm configured to generate an action, based on a state. Each action includes a recommendation associated with a computational resource. Each state identifies at least a role within an enterprise. The processor receives information associated with a first user, including an identification of a first role assigned to the user and computational resource information associated with the user. The processor applies the DQN algorithm to a first state, which includes an identification of the first role, to generate a first action, which includes a recommendation associated with a first computational resource. In response to applying the DQN algorithm, the processor generates a reward value based on the alignment between the first recommendation and the computational resource information associated with the first user. The processor uses the reward value to update the DQN algorithm.