Patent 11288593 was granted and assigned to Baidu on March, 2022 by the United States Patent and Trademark Office.
A method, apparatus and device are for extracting information. The method includes: acquiring an annotated corpus, which includes a plurality of sample sentences and annotated information sets corresponding to the sample sentences, constructing an input sequence and an output sequence based on the sample sentences and the annotated information sets corresponding to the sample sentences, obtaining an information extraction model generating the output sequence from the input sequence by carrying out training with a deep learning method, and inputting a to-be-processed sentence into the information extraction model to extract a knowledge information set included in the to-be-processed sentence. The annotated information set includes information of at least one piece of the following types of knowledge to be extracted from corresponding sample sentences: knowledge based on verbs or prepositions, knowledge based on noun attributes, knowledge of entity description, and knowledge of a relationship between an entity and a concept.