Patents by Inventor Maximilian Ibel

Maximilian Ibel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8037055
    Abstract: Methods and systems for a sitemap generating client for web crawlers are described. The client accesses one or more sources of document information about the documents available on a website, such as the file system, access logs, or pre-made URL lists. Document information is extracted from the sources and one or more sitemaps are generated based on the extracted document information. A notification is transmitted to a remote computer, informing that the sitemap(s) are available for access and likely have been updated. If the remote computer is associated with a web crawler, the remote computer may access the sitemap(s) and use the sitemaps to schedule a crawl of documents included or available on the website.
    Type: Grant
    Filed: August 23, 2010
    Date of Patent: October 11, 2011
    Assignee: Google Inc.
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Patent number: 8032518
    Abstract: Web crawlers crawl websites to access documents of the website for purposes of indexing the documents for search engines. The web crawlers crawl a specified website at a crawl rate that is based on multiple factors. One of the factors is a pre-set crawl rate limit. According to certain embodiments, an owner for a specified website is enabled to modify the crawl rate limit for the specified website when one or more pre-set criteria are met.
    Type: Grant
    Filed: September 4, 2009
    Date of Patent: October 4, 2011
    Assignee: Google Inc.
    Inventors: Vanessa Fox, Amanda Ann Camp, Maximilian Ibel, Patrik Rene Celeste Reali, Jeremy J. Lilley, Katherine Jane Lai, Ted J. Bonkenburg, Neal Douglas Cardwell
  • Publication number: 20110093533
    Abstract: Methods, systems, and apparatus, including computer program products, for generating sitemaps. The method includes scanning network traffic between a server and one or more clients requesting resources from the server, the network traffic including resource request messages from the one or more clients and resources served by the server in response to the resource request messages. The method also includes automatically extracting data from the traffic served by the server to the one or more clients, the extracted data including one or more Uniform Resource Locators that identify the resources served by the server to the one or more clients. The method automatically generates a sitemap from the extracted data, and stores the sitemap in a computer-readable memory.
    Type: Application
    Filed: April 17, 2008
    Publication date: April 21, 2011
    Inventors: Rupinder Kataria, Maximilian Ibel, Gangjiang Li, Narayanan Shivakumar
  • Publication number: 20100318508
    Abstract: Methods and systems for a sitemap generating client for web crawlers are described. The client accesses one or more sources of document information about the documents available on a website, such as the file system, access logs, or pre-made URL lists. Document information is extracted from the sources and one or more sitemaps are generated based on the extracted document information. A notification is transmitted to a remote computer, informing that the sitemap(s) are available for access and likely have been updated. If the remote computer is associated with a web crawler, the remote computer may access the sitemap(s) and use the sitemaps to schedule a crawl of documents included or available on the website.
    Type: Application
    Filed: August 23, 2010
    Publication date: December 16, 2010
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Publication number: 20100262592
    Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
    Type: Application
    Filed: June 25, 2010
    Publication date: October 14, 2010
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Patent number: 7801881
    Abstract: Methods and systems for a sitemap generating client for web crawlers are described. The client accesses one or more sources of document information about the documents available on a website, such as the file system, access logs, or pre-made URL lists. Document information is extracted from the sources and one or more sitemaps are generated based on the extracted document information. A notification is transmitted to a remote computer, informing that the sitemap(s) are available for access and likely have been updated. If the remote computer is associated with a web crawler, the remote computer may access the sitemap(s) and use the sitemaps to schedule a crawl of documents included or available on the website.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: September 21, 2010
    Assignee: Google Inc.
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Patent number: 7769742
    Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: August 3, 2010
    Assignee: Google Inc.
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Publication number: 20100125564
    Abstract: A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
    Type: Application
    Filed: January 25, 2010
    Publication date: May 20, 2010
    Applicant: GOOGLE INC.
    Inventors: Alan C. Strohm, Feng Hu, Sascha B. Brawer, Maximilian Ibel, Ralph M. Keller, Narayanan Shivakumar, Elad Gil
  • Publication number: 20100077098
    Abstract: Web crawlers crawl websites to access documents of the website for purposes of indexing the documents for search engines. The web crawlers crawl a specified website at a crawl rate that is based on multiple factors. One of the factors is a pre-set crawl rate limit. According to certain embodiments, an owner for a specified website is enabled to modify the crawl rate limit for the specified website when one or more pre-set criteria are met.
    Type: Application
    Filed: September 4, 2009
    Publication date: March 25, 2010
    Inventors: Vanessa Fox, Amanda Camp, Maximilian Ibel, Patrik Reali, Jeremy J. Lilley, Katherine Jane Lai, Ted J. Bonkenburg, Neal Douglas Cardwell
  • Patent number: 7653617
    Abstract: A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
    Type: Grant
    Filed: May 1, 2006
    Date of Patent: January 26, 2010
    Assignee: Google Inc.
    Inventors: Alan C. Strohm, Feng Hu, Sascha B. Brawer, Maximilian Ibel, Ralph M. Keller, Narayanan Shivakumar, Elad Gil
  • Patent number: 7599920
    Abstract: Web crawlers crawl websites to access documents of the website for purposes of indexing the documents for search engines. The web crawlers crawl a specified website at a crawl rate that is based on multiple factors. One of the factors is a pre-set crawl rate limit. According to certain embodiments, an owner for a specified website is enabled to modify the crawl rate limit for the specified website when one or more pre-set criteria are met.
    Type: Grant
    Filed: October 12, 2006
    Date of Patent: October 6, 2009
    Assignee: Google Inc.
    Inventors: Vanessa Fox, Amanda Ann Camp, Maximilian Ibel, Patrik Rene Celeste Reali, Jeremy J. Lilley, Katherine Jane Lai, Ted J. Bonkenburg, Neal Douglas Cardwell
  • Publication number: 20070050338
    Abstract: A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
    Type: Application
    Filed: May 1, 2006
    Publication date: March 1, 2007
    Inventors: Alan Strohm, Feng Hu, Sascha Brawer, Maximilian Ibel, Ralph Keller, Narayanan Shivakumar, Elad Gil