Patents Assigned to Pronto, Inc. - Justia Patents Search

Patents Assigned to Pronto, Inc.

Method and system for identifying product-related information on a web page

Patent number: 7912755

Abstract: A method and system is provided that in a fully automated manner crawls web sites and identifies specific types of web pages, then extracts targeted data from those web pages. One or more text nodes containing product-related information on a first web page are first identified, and the locations of those text nodes are described using one or more vectors. The vectors are then analyzed to identify one or more patterns and to generate a model from those patterns that discriminates between text nodes that contain product-related information and text nodes that do not contain product-related information on a second web page. The model can then be used to crawl web sites to identify and extract targeted data, or the model can be installed on a user's computer to identify and extract targeted information from web sites as the user is browsing.

Type: Grant

Filed: September 23, 2005

Date of Patent: March 22, 2011

Assignee: Pronto, Inc.

Inventors: Bradley John Perry, Nancy Ann Perry, Daniel Carl Marriott