Patent attributes
Described herein is a computer implemented method for processing an electronic document. The method comprises accessing a comparison set of reference document codifications, each reference document codification in the comparison set comprising a plurality of canonical feature codifications. Each canonical feature codification in each reference document codification in the comparison set is processed by determining whether the electronic document has one or more text rectangles in a potential position of the canonical feature and, in response determining that the electronic document has one or more text rectangles in a potential position of the canonical feature, recording a preliminary association between the or each text rectangle and the canonical feature. For each text rectangle preliminarily associated with one or more canonical features, a final canonical feature assignment is determined for the text rectangle based on the one or more preliminarily associated canonical features.