Patent attributes
A similar files management apparatus displays files similar to a specified file together with their respective degrees of similarity. The apparatus includes a unit specific information generation means for deriving, by a predetermined computation formula, information specific to each unit contained in a file from the contents of that unit; a file similarity degree computation means for computing the degree of similarity between files by comparing the pieces of unit-specific information on a unit-by-unit basis; and a display means for displaying, for each file other than the specified file, its degree of similarity to the specified file together with its file identification information. The information specific to each unit may be a hash value, a checksum value or a CRC value. The units may be pages, chapters, sections or paragraphs.
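The three means described above can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not state how the unit-by-unit comparison is turned into a single degree, so the sketch assumes paragraphs as units, SHA-256 as the computation formula, and a degree defined as the number of shared unit fingerprints divided by the unit count of the larger file. The function names `split_paragraphs`, `unit_fingerprints`, `similarity_degree` and `rank_similar_files` are hypothetical.

```python
import hashlib
from collections import Counter


def split_paragraphs(text):
    """Split a file's text into units (assumption: blank-line-separated paragraphs)."""
    return [p.strip() for p in text.split("\n\n") if p.strip()]


def unit_fingerprints(units):
    """Derive one fingerprint per unit (assumption: SHA-256 of the UTF-8 text)."""
    return [hashlib.sha256(u.encode("utf-8")).hexdigest() for u in units]


def similarity_degree(specified_units, other_units):
    """Compare two files unit by unit via their fingerprints.

    Assumed measure: shared fingerprints (counted with multiplicity)
    divided by the unit count of the larger file, giving a value in [0, 1].
    """
    a = Counter(unit_fingerprints(specified_units))
    b = Counter(unit_fingerprints(other_units))
    total = max(sum(a.values()), sum(b.values()))
    if total == 0:
        return 0.0
    shared = sum((a & b).values())  # multiset intersection
    return shared / total


def rank_similar_files(specified, files):
    """Return (file name, degree) pairs for every file except the specified one,
    ordered from most to least similar.

    `files` maps a file name to that file's list of unit strings.
    """
    target = files[specified]
    results = [
        (name, similarity_degree(target, units))
        for name, units in files.items()
        if name != specified
    ]
    results.sort(key=lambda r: (-r[1], r[0]))
    return results


if __name__ == "__main__":
    files = {
        "a.txt": split_paragraphs("one\n\ntwo\n\nthree\n\nfour"),
        "b.txt": split_paragraphs("one\n\ntwo\n\nthree\n\nchanged"),
        "c.txt": split_paragraphs("unrelated\n\ntext"),
    }
    # Plays the role of the display means: file identification plus degree.
    for name, degree in rank_similar_files("a.txt", files):
        print(f"{name}: {degree:.0%}")
```

Comparing fixed-size fingerprints rather than raw unit contents keeps each comparison cheap, and a CRC or checksum could be substituted in `unit_fingerprints` at the cost of a higher collision risk.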