Abstract: Exemplary methods, apparatuses, and systems parse a form from a webpage to identify a plurality of input areas and corresponding input types. A multi-sequence form including a plurality of stages is generated. Each stage of the multi-sequence form is to be displayed and submitted independently of other stages of the multi-sequence form. Each stage of the multi-sequence form corresponds to a subset of the parsed form including one or more of the identified input areas. One of the plurality of identified input areas is identified to be of an input type categorized as having a higher likelihood of being completed and submitted by a user. The multi-sequence form is ordered such that the identified input area is ordered first. An updated version of the webpage is generated including the generated multi-sequence form in the determined order in place of the parsed form.
Type:
Grant
Filed:
March 12, 2014
Date of Patent:
February 21, 2017
Assignee:
CAPTORA INC.
Inventors:
Srihari P. Sampath-Kumar, Anindo Mukherjee
Abstract: Web pages of a website are parsed and a set of n-grams are generated from the parsed web pages. A relevancy value is determined for each n-gram and a second set of n-grams is generated by removing any n-gram in the first set whose relevancy value is below a threshold. A third set of n-grams is generated at least by removing those of the second set of n-grams that have been determined to be similar to another one of the second set of n-grams. Responsive to determining that there is not a web page that is directed at an n-gram, a web page is automatically created with content directed at that n-gram including reusing existing content of the website that is related to the n-gram. One or more links to the created page are added to web pages so that the created page is not an orphan page.
Type:
Grant
Filed:
March 12, 2014
Date of Patent:
October 21, 2014
Assignee:
Captora Inc.
Inventors:
Srihari P. Sampath-Kumar, Anindo Mukherjee