An apparatus includes a camera and a processor. The camera may be configured to capture video data of an area of interest. The processor may be configured to (A) process the video data, (B) generate control signals used to initiate an external stimulus and (C) execute computer readable instructions. The computer readable instructions may be executed by the processor to (a) stream the video data to an external server, (b) receive facial recognition results from the external server and (c) if the facial recognition results indicate that a face of a detected person could not be detected, determine which of the control signals to generate. The external stimulus may be implemented to encourage the detected person to look in a direction of the camera.
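
The decision in step (c) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the source: the `RecognitionResult` fields, the two `ControlSignal` values, and the round-robin rule for choosing between them. The source does not state which stimuli exist or how one is selected.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class ControlSignal(Enum):
    # Hypothetical stimuli; the source does not enumerate them.
    PLAY_SOUND = "play_sound"
    FLASH_LIGHT = "flash_light"


@dataclass
class RecognitionResult:
    # Hypothetical shape of the results returned by the external server.
    person_detected: bool
    face_detected: bool


def select_control_signal(
    result: RecognitionResult, attempts: int
) -> Optional[ControlSignal]:
    """Return a control signal only when a person is seen but no face is found.

    `attempts` counts stimuli already tried for this person. Cycling through
    the available signals is an assumed policy, not one taken from the source.
    """
    if not result.person_detected or result.face_detected:
        return None  # nothing to do: nobody present, or face already captured
    signals = list(ControlSignal)  # Enum iteration follows definition order
    return signals[attempts % len(signals)]
```

For example, under these assumptions `select_control_signal(RecognitionResult(True, False), 0)` would pick the first stimulus, and a call with `face_detected=True` would return `None` so that no stimulus is initiated.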