Patents by Inventor Sascha B. Brawer

Sascha B. Brawer has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10133737
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for transforming text strings. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of receiving input string having a plurality of terms, the input string being in a first form; transforming the input string from the first form to a second form including: applying one or more rules to the input string to identify one or more terms for translation, the one or more identified terms being fewer than the plurality of terms, translating the identified one or more terms to one or more translated terms in the second form, and transliterating the remaining terms of the plurality of terms into transliterated terms in the second form; and concatenating the translated and transliterated terms to form a hybrid output string in the second form.
    Type: Grant
    Filed: August 26, 2011
    Date of Patent: November 20, 2018
    Assignee: GOOGLE LLC
    Inventors: Sascha B Brawer, Martin Jansche, Richard Sproat, Hiroshi Takenaka, Yui Terashima
  • Patent number: 9002819
    Abstract: Systems and methods for scheduling documents for crawling are disclosed. In some implementations, a method includes obtaining sitemap information for a plurality of websites; and analyzing the sitemap information to identify a website, in the plurality of websites. The website has sitemap information that is at least potentially out of date. The method also includes updating the sitemap information for the identified website by downloading updated sitemap information for the identified website; and scheduling documents for crawling in accordance with the updated sitemap information for the identified website.
    Type: Grant
    Filed: April 8, 2013
    Date of Patent: April 7, 2015
    Assignee: Google Inc.
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Publication number: 20140163952
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for transforming text strings. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of receiving input string having a plurality of terms, the input string being in a first form; transforming the input string from the first form to a second form including: applying one or more rules to the input string to identify one or more terms for translation, the one or more identified terms being fewer than the plurality of terms, translating the identified one or more terms to one or more translated terms in the second form, and transliterating the remaining terms of the plurality of terms into transliterated terms in the second form; and concatenating the translated and transliterated terms to form a hybrid output string in the second form.
    Type: Application
    Filed: August 26, 2011
    Publication date: June 12, 2014
    Inventors: Sascha B Brawer, Martin Jansche, Richard Sproat, Hiroshi Takenaka, Yui Terashima
  • Patent number: 8655864
    Abstract: A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
    Type: Grant
    Filed: June 11, 2012
    Date of Patent: February 18, 2014
    Assignee: Google Inc.
    Inventors: Alan C. Strohm, Feng Hu, Sascha B. Brawer, Maximilian Ibel, Ralph M. Keller, Narayanan Shivakumar, Elad Gil
  • Publication number: 20140046925
    Abstract: A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
    Type: Application
    Filed: June 11, 2012
    Publication date: February 13, 2014
    Inventors: Alan C. Strohm, Feng Hu, Sascha B. Brawer, Maximilian Ibel, Ralph M. Keller, Narayanan Shivakumar, Elad Gil
  • Patent number: 8417686
    Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
    Type: Grant
    Filed: October 11, 2011
    Date of Patent: April 9, 2013
    Assignee: Google Inc.
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Patent number: 8234266
    Abstract: A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
    Type: Grant
    Filed: January 25, 2010
    Date of Patent: July 31, 2012
    Assignee: Google Inc.
    Inventors: Alan C. Strohm, Feng Hu, Sascha B. Brawer, Maximilian Ibel, Ralph M. Keller, Narayanan Shivakumar, Elad Gil
  • Patent number: 8200652
    Abstract: A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
    Type: Grant
    Filed: January 25, 2010
    Date of Patent: June 12, 2012
    Assignee: Google Inc.
    Inventors: Alan C. Strohm, Feng Hu, Sascha B. Brawer, Maximilian Ibel, Ralph M. Keller, Narayanan Shivakumar, Elad Gil
  • Publication number: 20120036118
    Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
    Type: Application
    Filed: October 11, 2011
    Publication date: February 9, 2012
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Patent number: 8037055
    Abstract: Methods and systems for a sitemap generating client for web crawlers are described. The client accesses one or more sources of document information about the documents available on a website, such as the file system, access logs, or pre-made URL lists. Document information is extracted from the sources and one or more sitemaps are generated based on the extracted document information. A notification is transmitted to a remote computer, informing that the sitemap(s) are available for access and likely have been updated. If the remote computer is associated with a web crawler, the remote computer may access the sitemap(s) and use the sitemaps to schedule a crawl of documents included or available on the website.
    Type: Grant
    Filed: August 23, 2010
    Date of Patent: October 11, 2011
    Assignee: Google Inc.
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Patent number: 8037054
    Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
    Type: Grant
    Filed: June 25, 2010
    Date of Patent: October 11, 2011
    Assignee: Google Inc.
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Publication number: 20100318508
    Abstract: Methods and systems for a sitemap generating client for web crawlers are described. The client accesses one or more sources of document information about the documents available on a website, such as the file system, access logs, or pre-made URL lists. Document information is extracted from the sources and one or more sitemaps are generated based on the extracted document information. A notification is transmitted to a remote computer, informing that the sitemap(s) are available for access and likely have been updated. If the remote computer is associated with a web crawler, the remote computer may access the sitemap(s) and use the sitemaps to schedule a crawl of documents included or available on the website.
    Type: Application
    Filed: August 23, 2010
    Publication date: December 16, 2010
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Publication number: 20100262592
    Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
    Type: Application
    Filed: June 25, 2010
    Publication date: October 14, 2010
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Patent number: 7801881
    Abstract: Methods and systems for a sitemap generating client for web crawlers are described. The client accesses one or more sources of document information about the documents available on a website, such as the file system, access logs, or pre-made URL lists. Document information is extracted from the sources and one or more sitemaps are generated based on the extracted document information. A notification is transmitted to a remote computer, informing that the sitemap(s) are available for access and likely have been updated. If the remote computer is associated with a web crawler, the remote computer may access the sitemap(s) and use the sitemaps to schedule a crawl of documents included or available on the website.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: September 21, 2010
    Assignee: Google Inc.
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Patent number: 7769742
    Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: August 3, 2010
    Assignee: Google Inc.
    Inventors: Sascha B. Brawer, Maximilian Ibel, Ralph Michael Keller, Narayanan Shivakumar
  • Publication number: 20100125564
    Abstract: A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
    Type: Application
    Filed: January 25, 2010
    Publication date: May 20, 2010
    Applicant: GOOGLE INC.
    Inventors: Alan C. Strohm, Feng Hu, Sascha B. Brawer, Maximilian Ibel, Ralph M. Keller, Narayanan Shivakumar, Elad Gil
  • Patent number: 7653617
    Abstract: A method of analyzing documents or relationships between documents includes receiving a notification of an available metadata document containing information about one or more network-accessible documents, obtaining a document format indicator associated with the metadata document, selecting a document crawler using the document format indicator, and crawling at least some of the network-accessible documents using the selected document crawler.
    Type: Grant
    Filed: May 1, 2006
    Date of Patent: January 26, 2010
    Assignee: Google Inc.
    Inventors: Alan C. Strohm, Feng Hu, Sascha B. Brawer, Maximilian Ibel, Ralph M. Keller, Narayanan Shivakumar, Elad Gil