Patent attributes
A system for extracting attributes can analyze text from data sources, extract n-grams from the text as candidate pairs of an attribute and a service or product, prompt a human operator to rate the suitability of each candidate pair, and, based on the ratings, add candidate pairs to an attribute dictionary. In embodiments, an attribute extraction system includes an n-gram extractor, an attribute selector interface, and a dictionary builder. Data sources may include product titles, category descriptions, product descriptions, and similar data from one or more product databases. In embodiments, the attribute dictionary is analyzed to determine canonical names for products or services and name variants for those products or services.
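The flow described above (n-gram extraction, operator rating, dictionary building) can be illustrated with a minimal Python sketch. It is not the claimed implementation: the function names `extract_ngrams` and `build_dictionary`, the whitespace tokenizer, the `max_n` limit, and the boolean `rate` callback standing in for the human operator's rating are all assumptions made for illustration.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Set


def extract_ngrams(text: str, max_n: int = 2) -> List[str]:
    """Return all word n-grams of length 1..max_n from the lowercased text.

    Assumption: tokens are split on whitespace; a real system would likely
    use a more careful tokenizer.
    """
    tokens = text.lower().split()
    return [
        " ".join(tokens[i:i + n])
        for n in range(1, max_n + 1)
        for i in range(len(tokens) - n + 1)
    ]


def build_dictionary(
    texts_by_product: Dict[str, Iterable[str]],
    rate: Callable[[str, str], bool],
    max_n: int = 2,
) -> Dict[str, Set[str]]:
    """Map each product to the candidate attributes that the rater accepted.

    `rate(attribute, product)` is a hypothetical stand-in for the human
    operator; it returns True when the pair is judged suitable.
    """
    dictionary: Dict[str, Set[str]] = defaultdict(set)
    for product, texts in texts_by_product.items():
        # Collect the distinct candidate n-grams across all texts for this product.
        candidates: Set[str] = set()
        for text in texts:
            candidates.update(extract_ngrams(text, max_n))
        # Keep only the (attribute, product) pairs the rater accepts.
        for candidate in sorted(candidates):
            if rate(candidate, product):
                dictionary[product].add(candidate)
    return dict(dictionary)
```

In use, `texts_by_product` might hold product titles keyed by product name, and `rate` would be replaced by whatever mechanism collects the operator's judgment, for example a review interface that returns a yes/no answer per candidate pair.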

