Patents by Inventor Ralph Michael Keller

Ralph Michael Keller has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Web crawler scheduler that utilizes sitemaps from websites

Patent number: 9355177

Abstract: Systems and methods for scheduling documents for crawling are disclosed in which sitemap information is updated for a first website identified by a sitemap by downloading updated sitemap information for the first website and scheduling documents for crawling in accordance with the updated sitemap information for the first website. The sitemap information includes one or more sitemap indexes, where each respective sitemap index in the one or more sitemap indexes includes a list of URLs corresponding to documents stored at a corresponding website in a plurality of websites, the plurality of websites including the first website, and each sitemap index in the one or more sitemap indexes includes information identifying one or more of: a last modification date of a URL in the list of URLs, a change frequency of a document specified by the URL, a document title, an authority of the document, and a priority of the document.
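
As an illustration of the kind of per-URL metadata this abstract lists, the following minimal Python sketch reads a sitemap in the public sitemaps.org XML format and orders its URLs for crawling. The ordering rule (higher priority first, then more recent modification date) and the names `SitemapEntry`, `parse_sitemap`, and `crawl_order` are assumptions made for this example; they are not taken from the patent.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass

# Namespace used by the public sitemaps.org protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


@dataclass
class SitemapEntry:
    url: str
    lastmod: str      # ISO 8601 date string, "" when absent
    changefreq: str   # e.g. "daily", "" when absent
    priority: float   # sitemaps.org default is 0.5 when absent


def parse_sitemap(xml_text: str) -> list[SitemapEntry]:
    """Extract per-URL metadata from a sitemaps.org <urlset> document."""
    root = ET.fromstring(xml_text)
    entries = []
    for node in root.findall("sm:url", NS):
        entries.append(SitemapEntry(
            url=node.findtext("sm:loc", default="", namespaces=NS).strip(),
            lastmod=node.findtext("sm:lastmod", default="", namespaces=NS).strip(),
            changefreq=node.findtext("sm:changefreq", default="", namespaces=NS).strip(),
            priority=float(node.findtext("sm:priority", default="0.5", namespaces=NS)),
        ))
    return entries


def crawl_order(entries: list[SitemapEntry]) -> list[SitemapEntry]:
    """Hypothetical policy: highest priority first, then most recently modified.

    ISO 8601 dates (YYYY-MM-DD) sort correctly as plain strings.
    """
    return sorted(entries, key=lambda e: (e.priority, e.lastmod), reverse=True)


SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/a</loc><lastmod>2015-01-01</lastmod><priority>0.8</priority></url>
  <url><loc>https://example.com/b</loc><lastmod>2015-01-20</lastmod><priority>0.8</priority></url>
  <url><loc>https://example.com/c</loc><lastmod>2015-01-25</lastmod></url>
</urlset>"""

ordered_urls = [e.url for e in crawl_order(parse_sitemap(SAMPLE))]
```

A production scheduler would presumably also weigh change frequency and crawl budget; this sketch only shows how the sitemap fields could feed an ordering decision.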

Type: Grant

Filed: January 27, 2015

Date of Patent: May 31, 2016

Assignee: Google, Inc.

Inventors: Sascha Benjamin Brawer, Max Ibel, Ralph Michael Keller, Narayanan Shivakumar

Web Crawler Scheduler that Utilizes Sitemaps from Websites

Publication number: 20150242508

Abstract: Systems and methods for scheduling documents for crawling are disclosed in which sitemap information is updated for a first website identified by a sitemap by downloading updated sitemap information for the first website and scheduling documents for crawling in accordance with the updated sitemap information for the first website. The sitemap information includes one or more sitemap indexes, where each respective sitemap index in the one or more sitemap indexes includes a list of URLs corresponding to documents stored at a corresponding website in a plurality of websites, the plurality of websites including the first website, and each sitemap index in the one or more sitemap indexes includes information identifying one or more of: a last modification date of a URL in the list of URLs, a change frequency of a document specified by the URL, a document title, an authority of the document, and a priority of the document.

Type: Application

Filed: January 27, 2015

Publication date: August 27, 2015

Inventors: Sascha Benjamin Brawer, Max Ibel, Ralph Michael Keller, Narayanan Shivakumar

Web crawler scheduler that utilizes sitemaps from websites

Patent number: 9002819

Abstract: Systems and methods for scheduling documents for crawling are disclosed. In some implementations, a method includes obtaining sitemap information for a plurality of websites; and analyzing the sitemap information to identify a website, in the plurality of websites. The website has sitemap information that is at least potentially out of date. The method also includes updating the sitemap information for the identified website by downloading updated sitemap information for the identified website; and scheduling documents for crawling in accordance with the updated sitemap information for the identified website.
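
The steps in this abstract (find websites whose sitemap information may be out of date, then download fresh copies) can be sketched roughly as follows. The staleness rule used here (a fixed maximum age since the last download), the `SitemapRecord` type, and the caller-supplied `download` function are all assumptions for illustration; the abstract does not say how "potentially out of date" is determined.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Callable


@dataclass
class SitemapRecord:
    site: str
    fetched_at: datetime  # when this site's sitemap was last downloaded


def potentially_stale(records: list[SitemapRecord], now: datetime,
                      max_age: timedelta = timedelta(days=7)) -> list[str]:
    """Return sites whose stored sitemap is older than max_age (assumed rule)."""
    return [r.site for r in records if now - r.fetched_at > max_age]


def refresh_stale(records: list[SitemapRecord], now: datetime,
                  download: Callable[[str], str],
                  max_age: timedelta = timedelta(days=7)) -> dict[str, str]:
    """Re-download sitemaps for stale sites and record the new fetch time.

    `download` is a hypothetical callable mapping a site name to sitemap text.
    """
    updated = {}
    for record in records:
        if now - record.fetched_at > max_age:
            updated[record.site] = download(record.site)
            record.fetched_at = now
    return updated


records = [
    SitemapRecord("a.example", datetime(2015, 1, 1)),
    SitemapRecord("b.example", datetime(2015, 1, 25)),
]
now = datetime(2015, 1, 27)
stale_sites = potentially_stale(records, now)
refreshed = refresh_stale(records, now, download=lambda site: f"<urlset for {site}>")
```

The refreshed sitemap text would then be handed to whatever component schedules documents for crawling.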

Type: Grant

Filed: April 8, 2013

Date of Patent: April 7, 2015

Assignee: Google Inc.

Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar

Web crawler scheduler that utilizes sitemaps from websites

Patent number: 8417686

Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
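
The notification-driven flow described in this abstract can be sketched as a small in-memory scheduler. The request shape (`/ping?sitemap=<url>`), the `CrawlScheduler` class, and the injected `fetch_sitemap_urls` callable are hypothetical choices for this example, not details confirmed by the patent.

```python
from collections import deque
from typing import Callable
from urllib.parse import parse_qs, urlparse


class CrawlScheduler:
    """Toy scheduler: a notification names a sitemap, whose URLs get queued."""

    def __init__(self, fetch_sitemap_urls: Callable[[str], list[str]]):
        # Hypothetical callable: sitemap URL -> list of document URLs.
        self._fetch = fetch_sitemap_urls
        self.queue: deque[str] = deque()
        self._queued: set[str] = set()

    def on_notification(self, request_path: str) -> None:
        """Handle a notification such as '/ping?sitemap=<encoded sitemap URL>'."""
        query = parse_qs(urlparse(request_path).query)
        for sitemap_url in query.get("sitemap", []):
            for doc_url in self._fetch(sitemap_url):
                if doc_url not in self._queued:  # avoid duplicate queue entries
                    self._queued.add(doc_url)
                    self.queue.append(doc_url)

    def crawl_next(self, limit: int) -> list[str]:
        """Pop up to `limit` documents: only a subset is crawled per call."""
        batch = []
        while self.queue and len(batch) < limit:
            url = self.queue.popleft()
            self._queued.discard(url)
            batch.append(url)
        return batch


requested = []


def fake_fetch(sitemap_url: str) -> list[str]:
    # Stand-in for downloading and parsing the sitemap.
    requested.append(sitemap_url)
    return ["https://example.com/1", "https://example.com/2", "https://example.com/3"]


scheduler = CrawlScheduler(fake_fetch)
scheduler.on_notification("/ping?sitemap=https%3A%2F%2Fexample.com%2Fsitemap.xml")
first_batch = scheduler.crawl_next(limit=2)
```

Injecting the fetch function keeps the sketch free of network access and makes the queueing behavior easy to check in isolation.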

Type: Grant

Filed: October 11, 2011

Date of Patent: April 9, 2013

Assignee: Google Inc.

Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar

Web Crawler Scheduler that Utilizes Sitemaps from Websites

Publication number: 20120036118

Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.

Type: Application

Filed: October 11, 2011

Publication date: February 9, 2012

Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar

Sitemap generating client for web crawler

Patent number: 8037055

Abstract: Methods and systems for a sitemap generating client for web crawlers are described. The client accesses one or more sources of document information about the documents available on a website, such as the file system, access logs, or pre-made URL lists. Document information is extracted from the sources and one or more sitemaps are generated based on the extracted document information. A notification is transmitted to a remote computer, informing that the sitemap(s) are available for access and likely have been updated. If the remote computer is associated with a web crawler, the remote computer may access the sitemap(s) and use the sitemaps to schedule a crawl of documents included or available on the website.
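
To make the generation step concrete, here is a minimal Python sketch that uses the file system as the only source of document information and emits a sitemap in the public sitemaps.org XML format. The function name `generate_sitemap`, the mapping of file paths to URLs under `base_url`, and the use of file modification times as `lastmod` are assumptions for this example; access logs, URL lists, and the notification step mentioned in the abstract are not covered.

```python
import os
import xml.etree.ElementTree as ET
from datetime import datetime, timezone
from urllib.parse import quote

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def generate_sitemap(doc_root: str, base_url: str) -> str:
    """Walk doc_root and return a <urlset> document listing every file found."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for dirpath, dirnames, filenames in os.walk(doc_root):
        dirnames.sort()  # make traversal order deterministic
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            # Assumed mapping: path relative to doc_root becomes the URL path.
            rel = os.path.relpath(path, doc_root).replace(os.sep, "/")
            mtime = datetime.fromtimestamp(os.path.getmtime(path), tz=timezone.utc)
            url = ET.SubElement(urlset, "url")
            ET.SubElement(url, "loc").text = base_url.rstrip("/") + "/" + quote(rel)
            ET.SubElement(url, "lastmod").text = mtime.strftime("%Y-%m-%d")
    return ET.tostring(urlset, encoding="unicode")
```

Calling `generate_sitemap("/var/www/html", "https://example.com/")` would return the sitemap as a string, which a client could then write to disk before notifying a crawler that it is available.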

Type: Grant

Filed: August 23, 2010

Date of Patent: October 11, 2011

Assignee: Google Inc.

Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar

Web crawler scheduler that utilizes sitemaps from websites

Patent number: 8037054

Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.

Type: Grant

Filed: June 25, 2010

Date of Patent: October 11, 2011

Assignee: Google Inc.

Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar

Sitemap Generating Client for Web Crawler

Publication number: 20100318508

Abstract: Methods and systems for a sitemap generating client for web crawlers are described. The client accesses one or more sources of document information about the documents available on a website, such as the file system, access logs, or pre-made URL lists. Document information is extracted from the sources and one or more sitemaps are generated based on the extracted document information. A notification is transmitted to a remote computer, informing that the sitemap(s) are available for access and likely have been updated. If the remote computer is associated with a web crawler, the remote computer may access the sitemap(s) and use the sitemaps to schedule a crawl of documents included or available on the website.

Type: Application

Filed: August 23, 2010

Publication date: December 16, 2010

Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar

Web Crawler Scheduler that Utilizes Sitemaps from Websites

Publication number: 20100262592

Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.

Type: Application

Filed: June 25, 2010

Publication date: October 14, 2010

Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar

Sitemap generating client for web crawler

Patent number: 7801881

Abstract: Methods and systems for a sitemap generating client for web crawlers are described. The client accesses one or more sources of document information about the documents available on a website, such as the file system, access logs, or pre-made URL lists. Document information is extracted from the sources and one or more sitemaps are generated based on the extracted document information. A notification is transmitted to a remote computer, informing that the sitemap(s) are available for access and likely have been updated. If the remote computer is associated with a web crawler, the remote computer may access the sitemap(s) and use the sitemaps to schedule a crawl of documents included or available on the website.

Type: Grant

Filed: June 30, 2005

Date of Patent: September 21, 2010

Assignee: Google Inc.

Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
Web crawler scheduler that utilizes sitemaps from websites

Patent number: 7769742

Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.

Type: Grant

Filed: June 30, 2005

Date of Patent: August 3, 2010

Assignee: Google Inc.

Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar