Patent attributes
Disclosed are systems, methods, and non-transitory computer-readable media for structuring data in a knowledge graph. A data management system determines known concepts that are related to a data snippet. The data management system determines cosine similarity values indicating an intrinsic similarity between the data snippet and the known concepts, as well as pertinence values indicating a measure of topical similarity between the data snippet and the known concepts. The data management system determines, based on the cosine similarity values and the pertinence values, that the data snippet is related to a first known concept, and in response, assigns a concept identifier for the first known concept to the data snippet. Score indicating a strength of connection between the concepts added to the knowledge graph are determined and used to derive insights between the concepts.