Patent attributes
A processing system can include a processor that includes circuitry. The circuitry can be configured to: receive far-end and near-end audio signals; detect silence events and voice activities from the audio signals; determine whether an audio event in the audio signals is an interference event or a speaker event based on the detected silence events and voice activities, and further based on localized acoustic source data and faces or motion detected from an image; and generate a mute or unmute indication based on whether the audio event is the interference event or the speaker event. The system can include a near-end microphone array to output the near-end audio signals, one or more far-end microphones to output the far-end audio signals, and one or more cameras to capture the image of the environment.