Technical solutions are described for generating structured conversational data. An example method includes receiving an utterance that is part of a conversation and identifying the utterance as part of an adjacency pair. The adjacency pair includes two utterances, each produced by different speakers. The method also includes associating the utterance with a label from a predetermined set of labels based on the identifying of the adjacency pair.