A method and an apparatus for processing voice are provided. The method is applied to a decision-making device in communication with a distributed microphone array, the distributed microphone array comprising a plurality of sub-arrays. The method comprises: obtaining, for each sub-array, an awakening voice signal received by each microphone of the sub-array; determining, for each sub-array, a frequency domain signal corresponding to each awakening voice signal of the sub-array, and a first cross-correlation function between every two of the frequency domain signals; and determining an awakened sub-array based on the first cross-correlation functions of each sub-array.
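The steps above can be illustrated with a minimal sketch. It is not the claimed implementation: the abstract does not state how the awakened sub-array is derived from the first cross-correlation functions, so the sketch assumes, purely for illustration, that the sub-array with the largest mean cross-correlation peak is selected. The function names (`dft`, `cross_correlation`, `subarray_score`, `select_awakened_subarray`), the plain DFT, and the toy signals are all hypothetical.

```python
import cmath
from itertools import combinations


def dft(x):
    """Naive discrete Fourier transform: time-domain samples -> frequency domain signal."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Naive inverse discrete Fourier transform."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def cross_correlation(spec_a, spec_b):
    """Circular cross-correlation of two signals, computed from their spectra."""
    cross_spectrum = [a * b.conjugate() for a, b in zip(spec_a, spec_b)]
    return [v.real for v in idft(cross_spectrum)]


def subarray_score(signals):
    """Mean cross-correlation peak over every pair of microphones in one sub-array.

    Assumption: this scoring rule is illustrative only; it requires at least
    two microphones per sub-array.
    """
    spectra = [dft(s) for s in signals]
    peaks = [max(cross_correlation(a, b)) for a, b in combinations(spectra, 2)]
    return sum(peaks) / len(peaks)


def select_awakened_subarray(subarrays):
    """Return the name of the highest-scoring sub-array and all scores."""
    scores = {name: subarray_score(signals) for name, signals in subarrays.items()}
    return max(scores, key=scores.get), scores


# Toy data: the same pulse reaches two microphones one sample apart.
# The "far" sub-array receives an attenuated copy (factor 0.2).
pulse = [0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0]
delayed = [0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0]
subarrays = {
    "near": [pulse, delayed],
    "far": [[0.2 * v for v in pulse], [0.2 * v for v in delayed]],
}

best, scores = select_awakened_subarray(subarrays)
print(best, scores)
```

In this toy case the louder sub-array yields the larger correlation peak and is selected. A practical system would likely use an FFT and some form of normalization so that the score reflects signal coherence rather than raw level; that choice is outside what the abstract specifies.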