Patent 10509809 was granted and assigned to Amperity on December, 2019 by the United States Patent and Trademark Office.
The present disclosure relates to evaluating whether two data records reflect the same entity using a classifier in the absence of ground truth. Without ground truth, it is difficult to determine the precision or recall of a classifier. The present disclosure generates output data comprising a list of unique signatures generated from a set of records that are compared with each other. The output data may also comprise corresponding record pairs limited to a predetermined sample size for each unique feature signature.