Patents Assigned to Alexa Internaet
  • Patent number: 7954053
    Abstract: An extraction-rule generation and training system uses information obtained from multiple markup language documents (e.g. web pages) of similar structure to generate an extraction rule for extracting datapoints from markup language documents. Where the structures of two or more documents are not sufficiently similar, the system maintains separate extraction rules for the same datapoint, and applies these separate extraction rules in combination to particular markup language documents to extract the datapoint.
    Type: Grant
    Filed: January 6, 2010
    Date of Patent: May 31, 2011
    Assignee: Alexa Internaet
    Inventors: Greger J. Orelind, August A. Jaenicke