A collaborative video camera system of an edge node can be used to track and predict objects, locations of the objects, and events associated therewith. For example, multiple cameras can be utilized to determine the direct in which an object is heading. This data can be used to activate and/or dispatch other cameras that may be at or near the predicted location of the object. Additionally, sound associated with the object can be used to predict and/or active cameras that are at or near the predicted location of the object.