The present disclosure relates to evaluating whether two data records reflect the same entity using a classifier in the absence of ground truth. Without ground truth, it is difficult to determine the precision or recall of a classifier. The present disclosure generates a list comprising a series of unique feature signatures and a set of sample record pairs for each unique feature signature. In some embodiments, users may provide labels for the set of sample record pairs for each unique feature signature.