Patents by Inventor Himabindu Vuppula

Himabindu Vuppula has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11663257
    Abstract: Transient computing clusters can be temporarily provisioned in cloud-based infrastructure to run data processing tasks. Such tasks may be run by services operating in the clusters that consume and produce data including operational metadata. Techniques are introduced for tracking data lineage across multiple clusters, including transient computing clusters, based on the operational metadata. In some embodiments, operational metadata is extracted from the transient computing clusters and aggregated at a metadata system for analysis. Based on the analysis of the metadata, operations can be summarized at a cluster level even if the transient computing cluster no longer exists. Further relationships between workflows, such as dependencies or redundancies, can be identified and utilized to optimize the provisioning of computing clusters and tasks performed by the computing clusters.
    Type: Grant
    Filed: July 2, 2021
    Date of Patent: May 30, 2023
    Assignee: Cloudera, Inc.
    Inventors: Sudhanshu Arora, Mark Donsky, Guang Yao Leng, Naren Koneru, Chang She, Vikas Singh, Himabindu Vuppula
  • Publication number: 20210334301
    Abstract: Transient computing clusters can be temporarily provisioned in cloud-based infrastructure to run data processing tasks. Such tasks may be run by services operating in the clusters that consume and produce data including operational metadata. Techniques are introduced for tracking data lineage across multiple clusters, including transient computing clusters, based on the operational metadata. In some embodiments, operational metadata is extracted from the transient computing clusters and aggregated at a metadata system for analysis. Based on the analysis of the metadata, operations can be summarized at a cluster level even if the transient computing cluster no longer exists. Further relationships between workflows, such as dependencies or redundancies, can be identified and utilized to optimize the provisioning of computing clusters and tasks performed by the computing clusters.
    Type: Application
    Filed: July 2, 2021
    Publication date: October 28, 2021
    Inventors: Sudhanshu Arora, Mark Donsky, Guang Yao Leng, Naren Koneru, Chang She, Vikas Singh, Himabindu Vuppula
  • Patent number: 11086917
    Abstract: Transient computing clusters can be temporarily provisioned in cloud-based infrastructure to run data processing tasks. Such tasks may be run by services operating in the clusters that consume and produce data including operational metadata. Techniques are introduced for tracking data lineage across multiple clusters, including transient computing clusters, based on the operational metadata. In some embodiments, operational metadata is extracted from the transient computing clusters and aggregated at a metadata system for analysis. Based on the analysis of the metadata, operations can be summarized at a cluster level even if the transient computing cluster no longer exists. Further relationships between workflows, such as dependencies or redundancies, can be identified and utilized to optimize the provisioning of computing clusters and tasks performed by the computing clusters.
    Type: Grant
    Filed: February 26, 2020
    Date of Patent: August 10, 2021
    Assignee: Cloudera, Inc.
    Inventors: Sudhanshu Arora, Mark Donsky, Guang Yao Leng, Naren Koneru, Chang She, Vikas Singh, Himabindu Vuppula
  • Publication number: 20200192926
    Abstract: Transient computing clusters can be temporarily provisioned in cloud-based infrastructure to run data processing tasks. Such tasks may be run by services operating in the clusters that consume and produce data including operational metadata. Techniques are introduced for tracking data lineage across multiple clusters, including transient computing clusters, based on the operational metadata. In some embodiments, operational metadata is extracted from the transient computing clusters and aggregated at a metadata system for analysis. Based on the analysis of the metadata, operations can be summarized at a cluster level even if the transient computing cluster no longer exists. Further relationships between workflows, such as dependencies or redundancies, can be identified and utilized to optimize the provisioning of computing clusters and tasks performed by the computing clusters.
    Type: Application
    Filed: February 26, 2020
    Publication date: June 18, 2020
    Inventors: Sudhanshu Arora, Mark Donsky, Guang Yao Leng, Naren Koneru, Chang She, Vikas Singh, Himabindu Vuppula
  • Patent number: 10635700
    Abstract: Transient computing clusters can be temporarily provisioned in cloud-based infrastructure to run data processing tasks. Such tasks may be run by services operating in the clusters that consume and produce data including operational metadata. Techniques are introduced for tracking data lineage across multiple clusters, including transient computing clusters, based on the operational metadata. In some embodiments, operational metadata is extracted from the transient computing clusters and aggregated at a metadata system for analysis. Based on the analysis of the metadata, operations can be summarized at a cluster level even if the transient computing cluster no longer exists. Further relationships between workflows, such as dependencies or redundancies, can be identified and utilized to optimize the provisioning of computing clusters and tasks performed by the computing clusters.
    Type: Grant
    Filed: April 2, 2018
    Date of Patent: April 28, 2020
    Assignee: Cloudera, Inc.
    Inventors: Sudhanshu Arora, Mark Donsky, Guang Yao Leng, Naren Koneru, Chang She, Vikas Singh, Himabindu Vuppula
  • Publication number: 20190138654
    Abstract: Transient computing clusters can be temporarily provisioned in cloud-based infrastructure to run data processing tasks. Such tasks may be run by services operating in the clusters that consume and produce data including operational metadata. Techniques are introduced for tracking data lineage across multiple clusters, including transient computing clusters, based on the operational metadata. In some embodiments, operational metadata is extracted from the transient computing clusters and aggregated at a metadata system for analysis. Based on the analysis of the metadata, operations can be summarized at a cluster level even if the transient computing cluster no longer exists. Further relationships between workflows, such as dependencies or redundancies, can be identified and utilized to optimize the provisioning of computing clusters and tasks performed by the computing clusters.
    Type: Application
    Filed: April 2, 2018
    Publication date: May 9, 2019
    Inventors: Sudhanshu Arora, Mark Donsky, Guang Yao Leng, Naren Koneru, Chang She, Vikas Singh, Himabindu Vuppula
  • Publication number: 20170060892
    Abstract: An example method includes identifying one or more content items in a first user account according to a first search string. The example method can then include storing a pointer to each of the one or more content items in the first user account in a shared folder, the shared folder allowing the first user account or a second user account to access the one or more content items in the first user account. The example method can then include identifying one or more content items in the second user account according to a second search string. The method can then include storing a pointer to each of the one or more content items in the second user account in the shared folder, the shared folder allowing the first user account or the second user account to access the one or more content items in the second user account.
    Type: Application
    Filed: September 1, 2015
    Publication date: March 2, 2017
    Inventor: Himabindu Vuppula