Patents by Inventor Suparna Bhattacharya

Suparna Bhattacharya has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240135162
    Abstract: Systems and methods are configured to provide lifetime data valuations for a dataset that evolves across multiple machine learning training tasks by providing and updating path-dependent data valuations for data points in the dataset during each training task. A current machine learning training task may include splitting the dataset into multiple random mini-epochs and training the current machine learning model using a first random mini-epoch and an accuracy mini-epoch, which consists of high-value data points from the path-dependent data valuations. The random and accuracy mini-epochs can be iterated a number of times during the training, while a second random mini-epoch is prefetched. During the training, the path-dependent data valuations can be updated based on data valuations during the current training and a similarity between the current machine learning model and prior trained machine learning models.
    Type: Application
    Filed: October 20, 2022
    Publication date: April 25, 2024
    Inventors: Cong Xu, Suparna Bhattacharya, Ryan Beethe, Martin Foltin
  • Publication number: 20240069787
    Abstract: Examples described herein relate to preparing datasets in a storage device for machine learning (ML) applications. Examples include maintaining ML facet mappings between ML facets and dataset preparation tags, deriving ML facets of a dataset stored in the storage device, and generating filtered datasets from the datasets using the ML facets and ML facet mappings. The filtered dataset is associated with improved dataset quality compared to the unfiltered dataset. The storage device transmits the filtered dataset to ML applications requesting the dataset. Some examples include recommending, by the storage device, ML facets to the ML application based on performance metrics.
    Type: Application
    Filed: August 23, 2022
    Publication date: February 29, 2024
    Inventors: Kalapriya Kannan, Chaitra Kallianpur, Bruce Rabe, Suparna Bhattacharya, Krishnaraju Thangaraju
  • Patent number: 11907241
    Abstract: Systems and methods provide a system that gathers information about data as it progresses through data processing pipelines of data analysis projects. The data analytics system derives value indicators and implicit metadata from the data processing pipelines. For example, the data analytics system may derive value indicators and implicit metadata from data-related products themselves, semantic analysis of the code/processing steps used to process the data-related products, the structure of data processing pipelines, and human behavior related to production and usage of data-related products. Once a new data analysis project is initiated, the data analytics system gathers parameters and characteristics about the new data analysis project and references the value indicators and implicit metadata to recommend useful processing steps, datasets, and/or other data-related products for the new data analysis project.
    Type: Grant
    Filed: June 17, 2022
    Date of Patent: February 20, 2024
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Ted Dunning, Suparna Bhattacharya, Glyn Bowden, Lin A. Nease, Janice M. Zdankus, Sonu Sudhakaran
  • Publication number: 20230418792
    Abstract: Systems and methods are provided for automatically constructing data lineage representations for distributed data processing pipelines. These data lineage representations (which are constructed and stored in a central repository shared by the multiple data processing sites) can be used to, among other things, clone the distributed data processing pipeline for quality assurance or debugging purposes. Examples of the presently disclosed technology are able to construct data lineage representations for distributed data processing pipelines by (1) generating a hash content value for universally identifying each data artifact of the distributed data processing pipeline across the multiple processing stages/processing sites of the distributed data processing pipeline; and (2) creating a data processing pipeline abstraction hierarchy for associating each data artifact to input and output events for given executions of given data processing stages (performed by the multiple data processing sites).
    Type: Application
    Filed: June 28, 2022
    Publication date: December 28, 2023
    Inventors: Annmary Justine Koomthanam, Suparna Bhattacharya, Aalap Tripathy, Sergey Serebryakov, Martin Foltin, Paolo Faraboschi
  • Publication number: 20230409586
    Abstract: Systems and methods provide a system that gathers information about data as it progresses through data processing pipelines of data analysis projects. The data analytics system derives value indicators and implicit metadata from the data processing pipelines. For example, the data analytics system may derive value indicators and implicit metadata from data-related products themselves, semantic analysis of the code/processing steps used to process the data-related products, the structure of data processing pipelines, and human behavior related to production and usage of data-related products. Once a new data analysis project is initiated, the data analytics system gathers parameters and characteristics about the new data analysis project and references the value indicators and implicit metadata to recommend useful processing steps, datasets, and/or other data-related products for the new data analysis project.
    Type: Application
    Filed: July 12, 2023
    Publication date: December 21, 2023
    Inventors: Ted Dunning, Suparna Bhattacharya, Glyn Bowden, Lin A. Nease, Janice M. Zdankus, Sonu Sudhakaran
  • Publication number: 20230409587
    Abstract: Systems and methods provide a system that gathers information about data as it progresses through data processing pipelines of data analysis projects. The data analytics system derives value indicators and implicit metadata from the data processing pipelines. For example, the data analytics system may derive value indicators and implicit metadata from data-related products themselves, semantic analysis of the code/processing steps used to process the data-related products, the structure of data processing pipelines, and human behavior related to production and usage of data-related products. Once a new data analysis project is initiated, the data analytics system gathers parameters and characteristics about the new data analysis project and references the value indicators and implicit metadata to recommend useful processing steps, datasets, and/or other data-related products for the new data analysis project.
    Type: Application
    Filed: July 12, 2023
    Publication date: December 21, 2023
    Inventors: Ted Dunning, Suparna Bhattacharya, Glyn Bowden, Lin A. Nease, Janice M. Zdankus, Sonu Sudhakaran
  • Publication number: 20230409585
    Abstract: Systems and methods provide a system that gathers information about data as it progresses through data processing pipelines of data analysis projects. The data analytics system derives value indicators and implicit metadata from the data processing pipelines. For example, the data analytics system may derive value indicators and implicit metadata from data-related products themselves, semantic analysis of the code/processing steps used to process the data-related products, the structure of data processing pipelines, and human behavior related to production and usage of data-related products. Once a new data analysis project is initiated, the data analytics system gathers parameters and characteristics about the new data analysis project and references the value indicators and implicit metadata to recommend useful processing steps, datasets, and/or other data-related products for the new data analysis project.
    Type: Application
    Filed: June 17, 2022
    Publication date: December 21, 2023
    Inventors: Ted Dunning, Suparna Bhattacharya, Glyn Bowden, Lin A. Nease, Janice M. Zdankus, Sonu Sudhakaran
  • Patent number: 11734041
    Abstract: Architectures and techniques for providing persistent volume functionality are disclosed. A storage container having a virtual storage volume to be persisted across multiple applications is created. The multiple applications are hosted in one or more application containers. The storage container is placed within a virtual machine object. The virtual machine object containing the storage container is stored in a computer-readable memory as a persistent virtual storage volume.
    Type: Grant
    Filed: November 23, 2020
    Date of Patent: August 22, 2023
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Prashanto Kochavara, Priyanka Sood, Suparna Bhattacharya
  • Publication number: 20220327376
    Abstract: Systems and methods are configured to split an epoch associated with a training dataset into a plurality of mini-epochs. A machine learning model can be trained with a mini-epoch of the plurality of mini-epochs. The mini-epoch can be iterated a number of times during the training. One or more metrics reflective of at least one of: a training loss, training accuracy, or validation accuracy of the machine learning model associated with the mini-epoch can be received. Whether to terminate iterations of the mini-epoch early, before a number of iterations of the mini-epoch reaches the number of times, can be determined based on the one or more metrics. The number of iterations can be a non-zero number.
    Type: Application
    Filed: April 9, 2021
    Publication date: October 13, 2022
    Inventors: Cong Xu, Suparna Bhattacharya, Paolo Faraboschi
  • Publication number: 20220300712
    Abstract: Artificial-intelligence (AI)-based question-answer (QA) trace analysis of a text corpus that identifies answers to a natural language question and assesses the manner in which those answers evolve over time based on associated context is described herein. A set of QA trace records can be generated that includes a collection of answers derived from a text corpus in response to a posed natural language question along with contextual information relating to the answers. The set of QA trace records can be ordered based on corresponding date attributes gleaned from the contextual information to produce a time-series of QA trace records that can be processed by various types of downstream processing. Such downstream processing can include data visualization, pattern recognition, or the like for assessing how an answer to a natural language question evolves over time, identifying patterns/trends that develop over time with respect to the set of answers, and so forth.
    Type: Application
    Filed: March 22, 2021
    Publication date: September 22, 2022
    Inventors: Suparna Bhattacharya, Mayukh Dutta, Manoj Srivatsav, Sergey Serebryakov
  • Publication number: 20220230024
    Abstract: Systems and methods are provided for reusing machine learning models. For example, the applicability of prior models may be compared using one or more assessment values, including a similarity threshold and/or an accuracy threshold. The similarity threshold may identify a similarity of data between a first data set used to generate a first model and a new data set that is received by the system. When the similarity between these two data sets exceeds the similarity threshold, the system may reuse a model with the highest similarity value. When an accuracy value of the data set does not exceed an accuracy threshold, the system may initiate a retraining process to generate a second ML model associated with the second data.
    Type: Application
    Filed: January 20, 2021
    Publication date: July 21, 2022
    Inventors: Chaitra Kallianpur, Kalapriya Kannan, Suparna Bhattacharya
  • Patent number: 11392541
    Abstract: A source system generates snapshots of collected data. The snapshots have respective associated time references. Responsive to a request from a target system for data collected over a time interval, the source system generates a subset of the data collected by determining a start snapshot and an end snapshot. The start snapshot and the end snapshot are determined as a pair of snapshots that have respective associated time references that are most closely spaced and are inclusive of the time interval. The source system determines a difference in the data included in the end snapshot and the start snapshot and provides the subset of the data as the difference in the data included in the end snapshot and the start snapshot.
    Type: Grant
    Filed: March 22, 2019
    Date of Patent: July 19, 2022
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Suparna Bhattacharya, Lin A. Nease, Peter F. Corbett
  • Publication number: 20220075794
    Abstract: Examples include bypassing a portion of an analytics workflow. In some examples, execution of an analytics workflow may be monitored upon receipt of raw data and the execution may be interrupted at an optimal bypass stage to obtain insights data from the raw data. A similarity analysis may be performed to compare the insights data to stored insights data in an insights data repository. Based, at least in part, on a determination of similarity, a bypass operation may be performed to bypass a remainder of the analytics workflow.
    Type: Application
    Filed: November 19, 2021
    Publication date: March 10, 2022
    Inventors: Kalapriya Kannan, Suparna Bhattacharya, Douglas L. Voigt
  • Patent number: 11204935
    Abstract: Examples include bypassing a portion of an analytics workflow. In some examples, execution of an analytics workflow may be monitored upon receipt of raw data and the execution may be interrupted at an optimal bypass stage to obtain insights data from the raw data. A similarity analysis may be performed to compare the insights data to stored insights data in an insights data repository. Based, at least in part, on a determination of similarity, a bypass operation may be performed to bypass a remainder of the analytics workflow.
    Type: Grant
    Filed: May 27, 2016
    Date of Patent: December 21, 2021
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Kalapriya Kannan, Suparna Bhattacharya, Douglas L. Voigt
  • Publication number: 20210342301
    Abstract: Examples described herein relate to a computing system, a method and a non-transitory machine-readable medium for handling a request directed to a file in a first filesystem having a filesystem instance comprising content addressable storage objects. The computing system may also include a general-purpose second filesystem including its backing store within the filesystem instance of the first filesystem. Moreover, the computing system includes a first filesystem server to receive the request for an operation directed to the file in the first filesystem from an application. The first filesystem server may redirect the request to the second filesystem if the operation is a metadata operation; otherwise, it may redirect the request to the first filesystem.
    Type: Application
    Filed: March 18, 2021
    Publication date: November 4, 2021
    Inventors: Venkataraman Kamalaksha, Suparna Bhattacharya, Ashutosh Kumar
  • Publication number: 20210279088
    Abstract: Architectures and techniques for providing persistent volume functionality are disclosed. A storage container having a virtual storage volume to be persisted across multiple applications is created. The multiple applications are hosted in one or more application containers. The storage container is placed within a virtual machine object. The virtual machine object containing the storage container is stored in a computer-readable memory as a persistent virtual storage volume.
    Type: Application
    Filed: November 23, 2020
    Publication date: September 9, 2021
    Inventors: Prashanto Kochavara, Priyanka Sood, Suparna Bhattacharya
  • Patent number: 11074236
    Abstract: An example implementation may relate to an apparatus that may identify data content of interest from data in buffers, and may store index entries representing the identified data content in a hierarchical index having different performance levels. The apparatus may include a priority manager that maintains an index scoreboard that tracks where index entries are to be stored among the different performance levels of the hierarchical index based on predetermined policies that prioritize data content of interest or functions that use data content of interest.
    Type: Grant
    Filed: November 19, 2015
    Date of Patent: July 27, 2021
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Douglas L. Voigt, Suparna Bhattacharya
  • Patent number: 10991428
    Abstract: Ternary content addressable memory (TCAM) structures and methods of use are disclosed. The memory architecture includes one or more ternary content addressable memory (TCAM) fields, and control logic that applies progressively discriminating data-masking and scores a closeness of a match based on matched and mismatched bits.
    Type: Grant
    Filed: January 15, 2020
    Date of Patent: April 27, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Igor Arsovski, Suparna Bhattacharya, Arvind Kumar
  • Patent number: 10986183
    Abstract: Example techniques of data management in a network environment are described. In an example, a semantic pattern in a data stream transmitted from a source device to an edge device in the network environment is determined. The semantic pattern indicates relevance of data samples in the data stream for analysis of the data stream. The data stream is processed based on the semantic pattern, for storage and transmission in the network environment.
    Type: Grant
    Filed: May 2, 2018
    Date of Patent: April 20, 2021
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Suparna Bhattacharya, Madhumita Bharde, Santigopal Mondal
  • Patent number: 10936637
    Abstract: Some examples relate to associating an insight with data. In an example, data may be received. A determination may be made that the data type of the data is the same as that of earlier data. An insight generated from the earlier data may be identified, wherein the insight may represent intermediate or resultant data generated upon processing of the earlier data by an analytics function, and wherein during generation metadata is associated with the insight. An analytics function used for generating the insight may be identified.
    Type: Grant
    Filed: April 10, 2017
    Date of Patent: March 2, 2021
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Kalapriya Kannan, Suparna Bhattacharya, Douglas L. Voigt, Muthukumar Murugan
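
As an illustration of one technique summarized in this listing, the snapshot selection described in the abstract of patent 11392541 (choosing the most closely spaced pair of snapshots that still covers a requested time interval, then returning the difference between them) might be sketched as follows. This is only a minimal sketch under stated assumptions and is not taken from the patent: the function names `select_snapshot_pair` and `snapshot_diff`, the use of plain numbers as time references, and the modeling of each snapshot as a Python dictionary are all hypothetical.

```python
from bisect import bisect_left, bisect_right


def select_snapshot_pair(times, start, end):
    """Return indices (i, j) of the tightest snapshot pair covering [start, end].

    `times` is assumed to be a sorted list of snapshot time references.
    The start snapshot is the latest one taken at or before `start`;
    the end snapshot is the earliest one taken at or after `end`.
    """
    i = bisect_right(times, start) - 1  # latest snapshot with time <= start
    j = bisect_left(times, end)         # earliest snapshot with time >= end
    if i < 0 or j >= len(times):
        raise ValueError("no snapshot pair covers the requested interval")
    return i, j


def snapshot_diff(start_snap, end_snap):
    """Return entries of the end snapshot that are new or changed.

    Snapshots are modeled here as dicts (a hypothetical representation).
    """
    return {
        key: value
        for key, value in end_snap.items()
        if key not in start_snap or start_snap[key] != value
    }


# Hypothetical snapshots: (time reference, collected data)
snapshots = [
    (10, {"a": 1}),
    (20, {"a": 1, "b": 2}),
    (30, {"a": 1, "b": 3, "c": 4}),
    (40, {"a": 1, "b": 3, "c": 4, "d": 5}),
]
times = [t for t, _ in snapshots]

i, j = select_snapshot_pair(times, start=22, end=28)
subset = snapshot_diff(snapshots[i][1], snapshots[j][1])
print(times[i], times[j], subset)
```

In this example the request for the interval 22 to 28 is served from the snapshots taken at 20 and 30, since no closer pair encloses the interval. Using binary search over sorted time references is a design choice of the sketch; the abstract does not say how the pair is located.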