Patent attributes
Systems and methods for coreference resolution are disclosed. In one embodiment, a method includes locating, for each of a selected plurality of chains of coreferent mentions, a particular context-based name from the respective chain, wherein the coreferent mentions correspond to entities and the context-based name is a longest name in the respective chain, a last name in the respective chain, or a most frequently occurring name in the respective chain. The method also includes determining an entity category for each respective one of the plurality of chains and determining one or more entity attributes from structured data and unstructured data. The method further includes, based on the located particular context-based name, the entity category, and the one or more attributes, assigning high-probability coreferent chains to high-confidence buckets, such as to produce a Zipfian-like distribution having a head region and a tail region.