Systems and methods for analyzing audio data are provided. The audio data may be analyzed to identify speech prosody. For example, the audio data may be analyzed to select a portion of the audio data containing speech produced by a first speaker. The selected portion may be further analyzed to identify the prosody of the speech within it. Feedback and reports may be provided based on the identified speech prosody.
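
By way of illustration only, the flow described above (select the portion attributed to a first speaker, estimate prosodic features of that portion, and produce a report) may be sketched as follows. The sketch assumes that speaker-labeled time segments are already available from some separate diarization step. The function names, the segment format, and the choice of features (an autocorrelation pitch estimate and RMS energy) are hypothetical and are not taken from the disclosure.

```python
import math

def select_speaker_samples(samples, sample_rate, segments, speaker):
    """Concatenate the samples of every segment labeled with `speaker`.

    `segments` is an assumed format: (start_seconds, end_seconds, label).
    """
    selected = []
    for start_s, end_s, label in segments:
        if label == speaker:
            start = round(start_s * sample_rate)
            end = round(end_s * sample_rate)
            selected.extend(samples[start:end])
    return selected

def estimate_pitch_hz(samples, sample_rate, fmin=75.0, fmax=400.0):
    """Rough pitch estimate: the lag with the largest autocorrelation.

    Only lags corresponding to fmin..fmax are searched. Returns 0.0 when
    no lag has positive correlation (for example, on silence).
    """
    min_lag = int(sample_rate / fmax)
    max_lag = min(int(sample_rate / fmin), len(samples) - 1)
    best_lag, best_score = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        score = sum(samples[i] * samples[i + lag]
                    for i in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else 0.0

def rms_energy(samples):
    """Root-mean-square amplitude, a simple proxy for loudness."""
    if not samples:
        return 0.0
    return math.sqrt(sum(x * x for x in samples) / len(samples))

def prosody_report(samples, sample_rate, segments, speaker):
    """Select the speaker's portion and summarize two prosodic features."""
    portion = select_speaker_samples(samples, sample_rate, segments, speaker)
    return {
        "speaker": speaker,
        "duration_s": len(portion) / sample_rate,
        "pitch_hz": estimate_pitch_hz(portion, sample_rate),
        "rms_energy": rms_energy(portion),
    }

# Synthetic input: a 200 Hz tone attributed to speaker "A", then silence for "B".
rate = 8000
tone = [math.sin(2 * math.pi * 200 * n / rate) for n in range(1000)]
audio = tone + [0.0] * 1000
segments = [(0.0, 0.125, "A"), (0.125, 0.25, "B")]
report = prosody_report(audio, rate, segments, "A")
```

A practical system would likely estimate pitch per short frame rather than over the whole portion, and would track further features such as speaking rate and pauses; the single-estimate form is used here only to keep the sketch short.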