SBIR/STTR Award attributes
This proposal is to extend into Phase II for our development of Team Oriented Resource Management and Control (TORMAC), as a solution to the problem of distributed decision-making for resource constrained platforms in complex, adversarial environments. We are joined in this effort by Harvard University’s Center for Research on Computation and Society (CRCS), Teamcore Group. In Phase I we developed a scenario where rangers with UAVs and ground vehicles patrolled a wildlife reserve for poachers. The rangers were supported by ground sensors listening for vehicles and drones-in-a-box placed strategically in the environment. The objective of the rangers was to minimize poaching by finding poachers and tracking them when possible. The adversarial poachers aimed to poach as many animals as possible, without being caught by the rangers. This scenario features much of the complexity of more general problems, with spatial and temporal constraints on rangers, uncertainty about poachers, adversarial behaviors, and the need to react to stimulus. In Phase II our work will focus specifically on the game theoretic aspects of the cooperative reasoning. There has been extensive study of adversarial bandit problems, where algorithms like Exp3 and its variants have proven excellent theoretical guarantees of low-regret learning against an adversary.