Patent attributes
An apparatus includes a memory and processor. The memory stores a machine learning algorithm configured to decide whether to use a virtual shopping cart to verify all or a portion of a transaction performed with an algorithmic shopping cart. The processor receives feedback for a decision made by the algorithm, indicating whether the algorithmic and virtual carts match. The processor assigns a reward to the feedback. A first positive reward is assigned when the virtual shopping cart is not used for verification, and the feedback indicates that the carts match. A second positive reward is assigned when the virtual cart is used for verification, and the feedback indicates that the carts do not match. A first negative reward is assigned when the virtual shopping cart is not used for verification, and the feedback indicates that the carts do not match. A second negative reward is assigned when the virtual cart is used for verification, and the feedback indicates that the carts match. The processor uses the reward to update the algorithm.