Patents by Inventor Rupesh Kapoor

Rupesh Kapoor has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11328114
    Abstract: Implementations include a batch-optimized render and fetch architecture. An example method performed by the architecture includes receiving a request from a batch process to render a web page and initializing a virtual clock and a task list for rendering the web page. The virtual clock stands still when a request for an embedded item is outstanding and when a task is ready to run. The method may also include generating a rendering result for the web page when the virtual clock matches a run time for a stop task in the task list, and providing the rendering result to the batch process. Another example method includes receiving a request from a batch process to render a web page, identifying an embedded item in the web page, and determining, based on a rewrite rule, that the embedded item has content that is duplicative of content for a previously fetched embedded item.
    Type: Grant
    Filed: May 28, 2018
    Date of Patent: May 10, 2022
    Assignee: GOOGLE LLC
    Inventors: Hui Xu, Rupesh Kapoor, Hao Fang, Cristian Tapus, Erik Arjan Hendriks
  • Publication number: 20180276220
    Abstract: Implementations include a batch-optimized render and fetch architecture. An example method performed by the architecture includes receiving a request from a batch process to render a web page and initializing a virtual clock and a task list for rendering the web page. The virtual clock stands still when a request for an embedded item is outstanding and when a task is ready to run. The method may also include generating a rendering result for the web page when the virtual clock matches a run time for a stop task in the task list, and providing the rendering result to the batch process. Another example method includes receiving a request from a batch process to render a web page, identifying an embedded item in the web page, and determining, based on a rewrite rule, that the embedded item has content that is duplicative of content for a previously fetched embedded item.
    Type: Application
    Filed: May 28, 2018
    Publication date: September 27, 2018
    Inventors: Hui Xu, Rupesh Kapoor, Hao Fang, Cristian Tapus, Erik Arjan Hendriks
  • Patent number: 9984130
    Abstract: Implementations include a batch-optimized render and fetch architecture. An example method performed by the architecture includes receiving a request from a batch process to render a web page and initializing a virtual clock and a task list for rendering the web page. The virtual clock stands still when a request for an embedded item is outstanding and when a task is ready to run. The method may also include generating a rendering result for the web page when the virtual clock matches a run time for a stop task in the task list, and providing the rendering result to the batch process. Another example method includes receiving a request from a batch process to render a web page, identifying an embedded item in the web page, and determining, based on a rewrite rule, that the embedded item has content that is duplicative of content for a previously fetched embedded item.
    Type: Grant
    Filed: October 22, 2014
    Date of Patent: May 29, 2018
    Assignee: GOOGLE LLC
    Inventors: Hui Xu, Rupesh Kapoor, Erik Arjan Hendriks, Hao Fang, Cristian Tapus
  • Patent number: 9424340
    Abstract: A system may identify a set of first documents associated with an organization, and identify clusters to which the first documents belong. Each of a number of the identified clusters may include a group of documents that includes one of the first documents and one or more second documents associated with one or more different organizations. The system may determine a quality score for each of the documents in each of the identified clusters, and determine, for each of the number of the identified clusters, whether the quality score of the one of the first documents in the identified cluster is higher than the quality score of the one or more second documents in the identified cluster. The system may generate a proxy pad score based on the determinations, and store the proxy pad score.
    Type: Grant
    Filed: October 22, 2014
    Date of Patent: August 23, 2016
    Assignee: Google Inc.
    Inventors: Rupesh Kapoor, David Michael Proudfoot, Joachim Kupke
  • Publication number: 20150379014
    Abstract: Implementations include a batch-optimized render and fetch architecture. An example method performed by the architecture includes receiving a request from a batch process to render a web page and initializing a virtual clock and a task list for rendering the web page. The virtual clock stands still when a request for an embedded item is outstanding and when a task is ready to run. The method may also include generating a rendering result for the web page when the virtual clock matches a run time for a stop task in the task list, and providing the rendering result to the batch process. Another example method includes receiving a request from a batch process to render a web page, identifying an embedded item in the web page, and determining, based on a rewrite rule, that the embedded item has content that is duplicative of content for a previously fetched embedded item.
    Type: Application
    Filed: October 22, 2014
    Publication date: December 31, 2015
    Inventors: Hui Xu, Rupesh Kapoor, Erik Arjan Hendriks, Hao Fang, Cristian Tapus
  • Patent number: 8977653
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating modified web documents. In one aspect, a method includes retrieving a web document including markup language code and having references to resources external to the web document and parsing the retrieved web document to interpret the markup language code and identify references to resources external to the retrieved web document. Data relating to at least a portion of the resources external to the retrieved web document are retrieved, and a modified web document including the retrieved data is generated and stored for use in responding to a request for retrieval of content of the web document.
    Type: Grant
    Filed: June 17, 2010
    Date of Patent: March 10, 2015
    Assignee: Google Inc.
    Inventors: Ziga Mahkovec, Rupesh Kapoor
  • Patent number: 8892543
    Abstract: System and method for indexing rendered web page images. A web crawling engine stores the content and crawl time of a web page. A scheduling engine sends the content and crawl time to a rendering engine, and processes requests for embedded objects. If a requested object has been crawled, it sends the contents to the rendering engine. Otherwise it schedules the crawl of the object, and once the object is crawled, it resends the content and crawl time of the web page to the rendering engine. The rendering engine receives the content and crawl time of a web page, requests all embedded objects, and renders the web page to an image once all embedded objects are received.
    Type: Grant
    Filed: September 13, 2012
    Date of Patent: November 18, 2014
    Assignee: Google Inc.
    Inventors: Rupesh Kapoor, Erik Hendriks, Sathayanarayana Giridhar, Andrei Pascovici, Pawel Aleksander Fedorynski
  • Patent number: 8874565
    Abstract: A system may identify a set of first documents associated with an organization, and identify clusters to which the first documents belong. Each of a number of the identified clusters may include a group of documents that includes one of the first documents and one or more second documents associated with one or more different organizations. The system may determine a quality score for each of the documents in each of the identified clusters, and determine, for each of the number of the identified clusters, whether the quality score of the one of the first documents in the identified cluster is higher than the quality score of the one or more second documents in the identified cluster. The system may generate a proxy pad score based on the determinations, and store the proxy pad score.
    Type: Grant
    Filed: December 29, 2008
    Date of Patent: October 28, 2014
    Assignee: Google Inc.
    Inventors: Rupesh Kapoor, David Michael Proudfoot, Joachim Kupke
  • Patent number: 8521746
    Abstract: A system may identify a set of related documents, identify one or more documents in the set of related documents that are sources of redirects, and identify organizations that are targets of the redirects. The system may also determine a redirect score based on the number of the identified documents that are sources of the redirects, determine a spam score based on a number of the organizations that are targets of the redirects, determine whether to classify the set of related documents as a bounce pad based on the redirect score and the spam score, and store information associated with the result of the determination of whether to classify the set of related documents as a bounce pad.
    Type: Grant
    Filed: September 7, 2011
    Date of Patent: August 27, 2013
    Assignee: Google Inc.
    Inventors: Rupesh Kapoor, David Michael Proudfoot, Joachim Kupke
  • Patent number: 8468145
    Abstract: A URL inspector may determine a uniform resource locator (URL) which includes an indexable fragment. A URL separator may separate, from the URL which includes the indexable fragment, a base URL occurring prior to the indexable fragment. An indexer may process content of the base URL to obtain processed content thereof. A rendering system may render the processed content together with the URL which includes the indexable fragment to obtain rendered content. A content converter may convert the rendered content into indexable content.
    Type: Grant
    Filed: November 10, 2011
    Date of Patent: June 18, 2013
    Assignee: Google Inc.
    Inventors: Pawel Aleksander Fedorynski, Rupesh Kapoor
  • Publication number: 20130073536
    Abstract: A URL inspector may determine a uniform resource locator (URL) which includes an indexable fragment. A URL separator may separate, from the URL which includes the indexable fragment, a base URL occurring prior to the indexable fragment. An indexer may process content of the base URL to obtain processed content thereof. A rendering system may render the processed content together with the URL which includes the indexable fragment to obtain rendered content. A content converter may convert the rendered content into indexable content.
    Type: Application
    Filed: November 10, 2011
    Publication date: March 21, 2013
    Applicant: GOOGLE INC.
    Inventors: Pawel Aleksander Fedorynski, Rupesh Kapoor
  • Patent number: 8346755
    Abstract: System and method for indexing rendered web page images. A web crawling engine stores the content and crawl time of a web page. A scheduling engine sends the content and crawl time to a rendering engine, and processes requests for embedded objects. If a requested object has been crawled, it sends the contents to the rendering engine. Otherwise it schedules the crawl of the object, and once the object is crawled, it resends the content and crawl time of the web page to the rendering engine. The rendering engine receives the content and crawl time of a web page, requests all embedded objects, and renders the web page to an image once all embedded objects are received.
    Type: Grant
    Filed: May 4, 2010
    Date of Patent: January 1, 2013
    Assignee: Google Inc.
    Inventors: Rupesh Kapoor, Erik Hendriks, Sathayanarayana Giridhar, Andrei Pascovici, Pawel Aleksander Fedorynski
  • Patent number: 8037073
    Abstract: A system may identify a set of related documents, identify one or more documents in the set of related documents that are sources of redirects, and identify organizations that are targets of the redirects. The system may also determine a redirect score based on the number of the identified documents that are sources of the redirects, determine a spam score based on a number of the organizations that are targets of the redirects, determine whether to classify the set of related documents as a bounce pad based on the redirect score and the spam score, and store information associated with the result of the determination of whether to classify the set of related documents as a bounce pad.
    Type: Grant
    Filed: December 29, 2008
    Date of Patent: October 11, 2011
    Assignee: Google Inc.
    Inventors: Rupesh Kapoor, David Michael Proudfoot, Joachim Kupke
  • Patent number: 6889259
    Abstract: A method for providing Internet protocol (IP) addresses with a domain name server (DNS) for multiple web servers of an Internet host. In one embodiment, each web server of an Internet host having multiple web servers is assigned a relative weight based on the individual processing power of the particular web server. As DNS resolution requests are received from client domains, the DNS returns IP addresses for the web servers such that the total number of times that each IP address of the web server is returned is proportional to the relative weight of each server relative to the total weight of all the servers. In another embodiment, the client domains that have most frequently accessed the web servers according to recent web server access logs are identified. In addition, the total number of accesses of each client domain is considered such that static arrays may be constructed to apportion the web servers among the client domains that most frequently access the web servers.
    Type: Grant
    Filed: February 14, 2001
    Date of Patent: May 3, 2005
    Assignee: Quote LLC
    Inventor: Rupesh Kapoor
  • Publication number: 20030037143
    Abstract: A method for providing Internet protocol (IP) addresses with a domain name server (DNS) for multiple web servers of an Internet host. In one embodiment, each web server of an Internet host having multiple web servers is assigned a relative weight based on the individual processing power of the particular web server. As DNS resolution requests are received from client domains, the DNS returns IP addresses for the web servers such that the total number of times that each IP address of the web server is returned is proportional to the relative weight of each server relative to the total weight of all the servers. In another embodiment, the client domains that have most frequently accessed the web servers according to recent web server access logs are identified. In addition, the total number of accesses of each client domain is considered such that static arrays may be constructed to apportion the web servers among the client domains that most frequently access the web servers.
    Type: Application
    Filed: February 14, 2001
    Publication date: February 20, 2003
    Inventor: Rupesh Kapoor
  • Patent number: 6205489
    Abstract: A method for providing Internet protocol (IP) addresses with a domain name server (DNS) for multiple web servers of an Internet host. In one embodiment, each web server of an Internet host having multiple web servers is assigned a relative weight based on the individual processing power of the particular web server. As DNS resolution requests are received from client domains, the DNS returns IP addresses for the web servers such that the total number of times that each IP address of the web server is returned is proportional to the relative weight of each server relative to the total weight of all the servers. In another embodiment, the client domains that have most frequently accessed the web servers according to recent web server access logs are identified. In addition, the total number of accesses of each client domain is considered such that static arrays may be constructed to apportion the web servers among the client domains that most frequently access the web servers.
    Type: Grant
    Filed: January 5, 1999
    Date of Patent: March 20, 2001
    Assignee: WhoWhere, Inc.
    Inventor: Rupesh Kapoor
  • Patent number: 5884038
    Abstract: A method for providing Internet protocol (IP) addresses with a domain name server (DNS) for multiple web servers of an Internet host. In one embodiment, each web server of an Internet host having multiple web servers is assigned a relative weight based on the individual processing power of the particular web server. As DNS resolution requests are received from client domains, the DNS returns IP addresses for the web servers such that the total number of times that each IP address of the web server is returned is proportional to the relative weight of each server relative to the total weight of all the servers. In another embodiment, the client domains that have most frequently accessed the web servers according to recent web server access logs are identified. In addition, the total number of accesses of each client domain is considered such that static arrays may be constructed to apportion the web servers among the client domains that most frequently access the web servers.
    Type: Grant
    Filed: May 2, 1997
    Date of Patent: March 16, 1999
    Assignee: WhoWhere? Inc.
    Inventor: Rupesh Kapoor
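
The weighted DNS scheme in the last abstract above (patent 5884038 and its related filings) can be illustrated with a short sketch. The abstract only states that each web server's IP address is returned in proportion to its relative weight; it does not say how the next address is chosen. The sketch below assumes smooth weighted round-robin as one possible selection rule. The class name WeightedResolver, the method resolve, and the example addresses and weights are all hypothetical and do not come from the patent.

```python
from collections import Counter


class WeightedResolver:
    """Hypothetical resolver that returns server IPs in proportion to weights."""

    def __init__(self, weights):
        # weights: mapping of IP address -> positive integer relative weight
        # (an assumed stand-in for "individual processing power").
        if not weights or any(w <= 0 for w in weights.values()):
            raise ValueError("weights must be a non-empty mapping of positive values")
        self._weights = dict(weights)
        self._total = sum(self._weights.values())
        self._current = {ip: 0 for ip in self._weights}

    def resolve(self):
        # Smooth weighted round-robin (an assumed technique, not the patent's):
        # raise every counter by its weight, pick the largest counter,
        # then lower the winner by the total weight of all servers.
        for ip, weight in self._weights.items():
            self._current[ip] += weight
        chosen = max(self._current, key=self._current.get)
        self._current[chosen] -= self._total
        return chosen


# Example addresses from the documentation range 192.0.2.0/24; weights are made up.
resolver = WeightedResolver({"192.0.2.1": 3, "192.0.2.2": 2, "192.0.2.3": 1})
counts = Counter(resolver.resolve() for _ in range(600))
```

With weights 3, 2 and 1, every run of six resolutions returns the three addresses three times, twice and once respectively, so counts after 600 resolutions holds 300, 200 and 100. A randomized choice weighted by the same values would meet the proportionality goal only on average; the deterministic rule is used here so the proportions are exact and easy to check.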