Patent attributes
Extracting data from web pages. User input is received defining a tabular form. User input is received correlating portions of the form with user-selected data items contained in one or more first web pages. The user input is correlated to create an ontology defining relationships between the user-selected data items based on the definition of the tabular form. One or more other web pages are accessed and, based on a context of one or more data items in the one or more other web pages being similar to a context of the user-selected data items in the one or more first web pages, one or more similar data items are extracted from the one or more other web pages. The extracted data items are correlated to each other in accordance with the ontology and are output as a user-searchable data structure.
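The flow described above can be illustrated with a minimal sketch. It is not the patented implementation. Several simplifications are assumed: the "context" of a data item is taken to be its tag path plus class attribute, "similar" context is reduced to an exact match on that path, and the "ontology" is reduced to a mapping from form columns to contexts, with items grouped into rows by order of appearance. All names (`ContextParser`, `build_ontology`, `extract_rows`, `search`) and the sample pages are hypothetical.

```python
from html.parser import HTMLParser


class ContextParser(HTMLParser):
    """Collect (context, text) pairs from a page.

    Assumption: the context of an item is its tag path plus class name.
    The sketch expects well-formed HTML without void elements such as <br>.
    """

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open tags
        self.items = []   # (context, text) pairs in document order

    def handle_starttag(self, tag, attrs):
        cls = dict(attrs).get("class") or ""
        self.stack.append(f"{tag}.{cls}" if cls else tag)

    def handle_endtag(self, tag):
        if self.stack:
            self.stack.pop()

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.items.append(("/".join(self.stack), text))


def parse_items(html):
    parser = ContextParser()
    parser.feed(html)
    parser.close()
    return parser.items


def build_ontology(form_columns, selections, first_page):
    """Map each column of the tabular form to the context of the item
    the user selected for it on the first page."""
    contexts = {text: ctx for ctx, text in parse_items(first_page)}
    return {col: contexts[selections[col]] for col in form_columns}


def extract_rows(ontology, page):
    """Extract items whose context matches a column's context, then
    correlate them into rows by order of appearance."""
    by_column = {col: [] for col in ontology}
    for ctx, text in parse_items(page):
        for col, col_ctx in ontology.items():
            if ctx == col_ctx:  # exact match stands in for "similar"
                by_column[col].append(text)
    return [dict(zip(by_column, values)) for values in zip(*by_column.values())]


def search(rows, column, value):
    """Trivial lookup over the output data structure."""
    return [row for row in rows if row[column] == value]


# Hypothetical sample pages and user input.
first_page = (
    '<ul><li><span class="name">Widget</span>'
    '<span class="price">$5</span></li></ul>'
)
other_page = (
    '<ul><li><span class="name">Gadget</span><span class="price">$7</span></li>'
    '<li><span class="name">Gizmo</span><span class="price">$9</span></li></ul>'
)
form_columns = ["product", "price"]                   # the tabular form
selections = {"product": "Widget", "price": "$5"}     # user-selected items

ontology = build_ontology(form_columns, selections, first_page)
rows = extract_rows(ontology, other_page)
```

Here `rows` holds one dictionary per extracted record, keyed by the form's columns, so `search(rows, "product", "Gizmo")` returns the matching record. A fuller treatment would likely need a tolerant similarity measure for contexts rather than exact path equality.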