Patents by Inventor Ritesh Kumar GUPTA

Ritesh Kumar GUPTA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Selecting best cloud computing environment in a hybrid cloud scenario

Patent number: 12393464

Abstract: Embodiments for identifying an optimal cloud computing environment for a computing task is disclosed. Embodiments comprises receiving a computing task to be executed in a cloud computing environment, wherein the computing task requires a set of cloud computing environment parameter values of the cloud computing environment, pre-selecting a set of candidate cloud computing environments, each of which meets the set of cloud computing environment parameter values, ranking the candidate cloud computing environments using reward-based ranking parameter values of the candidate cloud computing environments as an additional selection constraint, and selecting the highest ranking cloud computing environment as the optimal cloud computing environment for the computing task.

Type: Grant

Filed: February 15, 2022

Date of Patent: August 19, 2025

Assignee: International Business Machines Corporation

Inventors: Anton Zorin, Manish Kesarwani, Niels Dominic Pardon, Ritesh Kumar Gupta, Sameep Mehta
Automatically orchestrating a computerized workflow

Patent number: 12248810

Abstract: The method performs at the orchestration interface at which update information, including changes to tasks of a workflow, is received from a task manager system (TMS), where the workflow includes a set of tasks, inputs to the tasks, and outputs from the tasks. The inputs and outputs determine runtime dependencies between the tasks. Based on the update information received, the orchestration interface populates a topology of nodes and edges as a directed acyclic graph (DAG) that maps nodes to tasks and edges to runtime dependencies between tasks, based on node inputs and outputs. The orchestration interface instructs the execution of the tasks and handling dependencies by interacting with a task execution system (TES) and by traversing the DAG, the orchestration interface identifies tasks that depend on completed tasks as per the runtime dependencies and instructs the TES to execute the dependent tasks identified.

Type: Grant

Filed: June 15, 2022

Date of Patent: March 11, 2025

Assignee: International Business Machines Corporation

Inventors: Anton Zorin, Manish Kesarwani, Niels Dominic Pardon, Ritesh Kumar Gupta, Sameep Mehta
Targeted data acquisition for model training

Patent number: 12217195

Abstract: Targeted acquisition of data for model training includes identifying attributes of classified samples of a collection of samples classified by a classification model, and generating at least one query based on the identified attributes, the at least one query tailored, based on the attributes, to retrieve additional training data for training the classification model to more accurately classify samples and avoid incorrect sample classification.

Type: Grant

Filed: December 21, 2023

Date of Patent: February 4, 2025

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Namit Kabra, Ritesh Kumar Gupta, Vijay Ekambaram, Smitkumar Narotambhai Marvaniya
TARGETED DATA ACQUISITION FOR MODEL TRAINING

Publication number: 20240127085

Abstract: Targeted acquisition of data for model training includes identifying attributes of classified samples of a collection of samples classified by a classification model, and generating at least one query based on the identified attributes, the at least one query tailored, based on the attributes, to retrieve additional training data for training the classification model to more accurately classify samples and avoid incorrect sample classification.

Type: Application

Filed: December 21, 2023

Publication date: April 18, 2024

Inventors: Namit Kabra, Ritesh Kumar Gupta, Vijay Ekambaram, Smitkumar Narotambhai MARVANIYA
Targeted data acquisition for model training

Patent number: 11907860

Abstract: Targeted acquisition of data for model training includes automatically generating metadata describing samples, of an initial dataset, in neighborhoods of an embedding space in which the samples are embedded. The samples described by the automatically generated metadata are classified by a classification model, and include both correctly classified samples in the neighborhoods and incorrectly classified samples in the neighborhoods. Additionally, attributes of one or more correctly classified samples of the collection of samples and one or more incorrectly classified samples of the collection of samples are identified, and queries are generated based on the identified attributes, the queries tailored, based on the attributes, to retrieve additional training data for training the classification model to more accurately classify samples and avoid incorrect sample classification.

Type: Grant

Filed: September 26, 2022

Date of Patent: February 20, 2024

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Namit Kabra, Ritesh Kumar Gupta, Vijay Ekambaram, Smitkumar Narotambhai Marvaniya
AUTOMATICALLY ORCHESTRATING A COMPUTERIZED WORKFLOW

Publication number: 20230409386

Abstract: The method performs at the orchestration interface at which update information, including changes to tasks of a workflow, is received from a task manager system (TMS), where the workflow includes a set of tasks, inputs to the tasks, and outputs from the tasks. The inputs and outputs determine runtime dependencies between the tasks. Based on the update information received, the orchestration interface populates a topology of nodes and edges as a directed acyclic graph (DAG) that maps nodes to tasks and edges to runtime dependencies between tasks, based on node inputs and outputs. The orchestration interface instructs the execution of the tasks and handling dependencies by interacting with a task execution system (TES) and by traversing the DAG, the orchestration interface identifies tasks that depend on completed tasks as per the runtime dependencies and instructs the TES to execute the dependent tasks identified.

Type: Application

Filed: June 15, 2022

Publication date: December 21, 2023

Inventors: Anton Zorin, Manish Kesarwani, Niels Dominic Pardon, Ritesh Kumar Gupta, Sameep Mehta
Prioritization and prediction of jobs using cognitive rules engine

Patent number: 11755926

Abstract: A method, computer system, and a computer program product for data pipeline prioritization is provided. Embodiments may include receiving, by a cognitive rules engine, one or more data pipelines. Embodiments may then include analyzing, using a computational method of the cognitive rules engine, the one or more data pipelines. Embodiments may lastly include prioritizing the one or more data pipelines based on a result of the computational method of the cognitive rules engine.

Type: Grant

Filed: February 28, 2019

Date of Patent: September 12, 2023

Assignee: International Business Machines Corporation

Inventors: Ritesh Kumar Gupta, Namit Kabra, Likhitha Maddirala, Eric Allen Jacobson, Scott Louis Brokaw, Jo Ramos
GENERATING SMART AUTOMATED DATA MOVEMENT WORKFLOWS

Publication number: 20230281212

Abstract: A computer-implemented method generates an automated data movement workflow. The method includes transforming a received request for data, which was received in a restricted natural language form, into a form suitable for accessing a metadata repository. The method further includes identifying data and data dependencies using the transformed request for data. The method further includes building a workflow using the identified data and data dependencies. The method further includes, upon applying at least one governance rule to the workflow, modifying the built workflow to be compliant with the at least one governance rule, and if no compliance with the at least one governance rule is achievable, recommending a change to the built workflow.

Type: Application

Filed: March 7, 2022

Publication date: September 7, 2023

Inventors: Anton Zorin, Manish Kesarwani, Niels Dominic Pardon, Ritesh Kumar Gupta, Sameep Mehta
SELECTING BEST CLOUD COMPUTING ENVIRONMENT IN A HYBRID CLOUD SCENARIO

Publication number: 20230259401

Abstract: Embodiments for identifying an optimal cloud computing environment for a computing task is disclosed. Embodiments comprises receiving a computing task to be executed in a cloud computing environment, wherein the computing task requires a set of cloud computing environment parameter values of the cloud computing environment, pre-selecting a set of candidate cloud computing environments, each of which meets the set of cloud computing environment parameter values, ranking the candidate cloud computing environments using reward-based ranking parameter values of the candidate cloud computing environments as an additional selection constraint, and selecting the highest ranking cloud computing environment as the optimal cloud computing environment for the computing task.

Type: Application

Filed: February 15, 2022

Publication date: August 17, 2023

Inventors: Anton Zorin, Manish Kesarwani, Niels Dominic Pardon, Ritesh Kumar Gupta, Sameep Mehta
Automatically completing a pipeline graph in an internet of things network

Patent number: 11675838

Abstract: An approach is provided for completing a pipeline graph. Using a deep learning based sequence model, an initial data pipeline having a sequence of nodes is generated. Mismatch(es) between data formats required by input and output in the sequence of nodes is identified. Virtual gap node(s) that correct the mismatch(es) are added to the initial data pipeline. For a given virtual gap node, tentative graph structures are determined using knowledge graphs and a crowd sourced validation system. Reuse forecast scores and performance scores for the tentative graph structures are calculated. Based on the reuse forecast scores and the performance scores, a final graph structure for implementing the given virtual gap node is determined.

Type: Grant

Filed: May 11, 2021

Date of Patent: June 13, 2023

Assignee: International Business Machines Corporation

Inventors: Namit Kabra, Ritesh Kumar Gupta, Yannick Saillet, Vijay Ekambaram
TARGETED DATA ACQUISITION FOR MODEL TRAINING

Publication number: 20230016082

Abstract: Targeted acquisition of data for model training includes automatically generating metadata describing samples, of an initial dataset, in neighborhoods of an embedding space in which the samples are embedded. The samples described by the automatically generated metadata are classified by a classification model, and include both correctly classified samples in the neighborhoods and incorrectly classified samples in the neighborhoods. Additionally, attributes of one or more correctly classified samples of the collection of samples and one or more incorrectly classified samples of the collection of samples are identified, and queries are generated based on the identified attributes, the queries tailored, based on the attributes, to retrieve additional training data for training the classification model to more accurately classify samples and avoid incorrect sample classification.

Type: Application

Filed: September 26, 2022

Publication date: January 19, 2023

Inventors: Namit Kabra, Ritesh Kumar Gupta, Vijay Ekambaram, Smitkumar Narotambhai MARVANIYA
Insight expansion in smart data retention systems

Patent number: 11556558

Abstract: A computer-implemented method applies insights from a variety of data sources to each of the data sources. The method includes identifying a set of data sources, wherein each of the data sources are associated with a domain. The method includes analyzing documentation for each of the data sources. The method further includes extracting a set of attributes for each data source, and determining a data schema associated with each data source. The method includes mapping each data schema to a common domain schema. The method also includes linking, based on the mapping and on the set of attributes for each data source, common features across each data source. The method includes generating, in response to the linking, a knowledge graph. The method further includes preparing a visual display for a set of domain insights; and forking the set of domain insights into a first data source.

Type: Grant

Filed: January 11, 2021

Date of Patent: January 17, 2023

Assignee: International Business Machines Corporation

Inventors: Namit Kabra, Ritesh Kumar Gupta, Ron Reuben, Vijay Ekambaram, Smitkumar Narotambhai Marvaniya
Targeted data acquisition for model training

Patent number: 11537915

Abstract: Targeted acquisition of data for model training includes automatically generating metadata describing samples, of an initial dataset, in neighborhoods of an embedding space in which the samples are embedded. The samples described by the automatically generated metadata are classified by a classification model, and include both correctly classified samples in the neighborhoods and incorrectly classified samples in the neighborhoods. Additionally, attributes of one or more correctly classified samples of the collection of samples and one or more incorrectly classified samples of the collection of samples are identified, and queries are generated based on the identified attributes, the queries tailored, based on the attributes, to retrieve additional training data for training the classification model to more accurately classify samples and avoid incorrect sample classification.

Type: Grant

Filed: May 14, 2020

Date of Patent: December 27, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Namit Kabra, Ritesh Kumar Gupta, Vijay Ekambaram, Smitkumar Narotambhai Marvaniya
AUTOMATICALLY COMPLETING A PIPELINE GRAPH IN AN INTERNET OF THINGS NETWORK

Publication number: 20220365973

Abstract: An approach is provided for completing a pipeline graph. Using a deep learning based sequence model, an initial data pipeline having a sequence of nodes is generated. Mismatch(es) between data formats required by input and output in the sequence of nodes is identified. Virtual gap node(s) that correct the mismatch(es) are added to the initial data pipeline. For a given virtual gap node, tentative graph structures are determined using knowledge graphs and a crowd sourced validation system. Reuse forecast scores and performance scores for the tentative graph structures are calculated. Based on the reuse forecast scores and the performance scores, a final graph structure for implementing the given virtual gap node is determined.

Type: Application

Filed: May 11, 2021

Publication date: November 17, 2022

Inventors: Namit KABRA, Ritesh Kumar GUPTA, Yannik SAILLET, Vijay EKAMBARAM
Dynamically modifying the parallelism of a task in a pipeline

Patent number: 11461135

Abstract: In an approach to dynamically identifying and modifying the parallelism of a particular task in a pipeline, the optimal execution time of each stage in a dynamic pipeline is calculated. The actual execution time of each stage in the dynamic pipeline is measured. Whether the actual time of completion of the data processing job will exceed a threshold is determined. If it is determined that the actual time of completion of the data processing job will exceed the threshold, then additional instances of the stages are created.

Type: Grant

Filed: October 25, 2019

Date of Patent: October 4, 2022

Assignee: International Business Machines Corporation

Inventors: Yannick Saillet, Namit Kabra, Ritesh Kumar Gupta
INSIGHT EXPANSION IN SMART DATA RETENTION SYSTEMS

Publication number: 20220222265

Abstract: A computer-implemented method applies insights from a variety of data sources to each of the data sources. The method includes identifying a set of data sources, wherein each of the data sources are associated with a domain. The method includes analyzing documentation for each of the data sources. The method further includes extracting a set of attributes for each data source, and determining a data schema associated with each data source. The method includes mapping each data schema to a common domain schema. The method also includes linking, based on the mapping and on the set of attributes for each data source, common features across each data source. The method includes generating, in response to the linking, a knowledge graph. The method further includes preparing a visual display for a set of domain insights; and forking the set of domain insights into a first data source.

Type: Application

Filed: January 11, 2021

Publication date: July 14, 2022

Inventors: Namit Kabra, Ritesh Kumar Gupta, Ron Reuben, Vijay Ekambaram, Smitkumar Narotambhai Marvaniya
Self-learning selection of information-analysis runtimes

Patent number: 11288601

Abstract: A self-learning computer-based system has access to multiple runtime modules that are each capable of performing a particular algorithm. Each runtime module implements the algorithm with different code or runs in a different runtime environment. The system responds to a request to run the algorithm by selecting the runtime module or runtime environment that the system predicts will provide the most desirable results based on parameters like accuracy, performance, cost, resource-efficiency, or policy compliance. The system learns how to make such predictions through training sessions conducted by a machine-learning component. This training teaches the system that previous module selections produced certain types of results in the presence of certain conditions. After determining whether similar conditions currently exist, the system uses rules inferred from the training sessions to select the runtime module most likely to produce desired results.

Type: Grant

Filed: March 21, 2019

Date of Patent: March 29, 2022

Assignee: International Business Machines Corporation

Inventors: Ritesh Kumar Gupta, Namit Kabra, Eric Allen Jacobson, Scott Louis Brokaw, Jo Arao Ramos
Handling expiration of resources allocated by a resource manager running a data integration job

Patent number: 11194629

Abstract: A method includes: receiving, by a computer device, resource request for a data integration job, wherein the resource request is received from a job executor module and defines processes of the data integration job; allocating, by the computer device, containers for the processes of the data integration job; launching, by the computer device, a respective wrapper script on each respective one of the containers after allocating the respective one of the containers; and transmitting, by the computer device and in response to the allocating, node details to the job executor module. In embodiments, the wrapper script running on the container is configured to repeatedly check a predefined location for process commands from a job executor. After the resource manager allocates all the containers for a data integration job according to a resource request, the job executor writes the process commands to the predefined location.

Type: Grant

Filed: December 6, 2018

Date of Patent: December 7, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Krishna Kishore Bonagiri, Eric Allen Jacobson, Ritesh Kumar Gupta, Indrani Ghatare, Scott Louis Brokaw
TARGETED DATA ACQUISITION FOR MODEL TRAINING

Publication number: 20210357779

Abstract: Targeted acquisition of data for model training includes automatically generating metadata describing samples, of an initial dataset, in neighborhoods of an embedding space in which the samples are embedded. The samples described by the automatically generated metadata are classified by a classification model, and include both correctly classified samples in the neighborhoods and incorrectly classified samples in the neighborhoods. Additionally, attributes of one or more correctly classified samples of the collection of samples and one or more incorrectly classified samples of the collection of samples are identified, and queries are generated based on the identified attributes, the queries tailored, based on the attributes, to retrieve additional training data for training the classification model to more accurately classify samples and avoid incorrect sample classification.

Type: Application

Filed: May 14, 2020

Publication date: November 18, 2021

Inventors: Namit KABRA, Ritesh Kumar GUPTA, Vijay EKAMBARAM, Smitkumar Narotambhai MARVANIYA
Resolving container preemption

Patent number: 11150956

Abstract: A set of resources required to process a data integration job is determined. In response to determining that the set of resources is not available, queue occupation, for each queue in the computing environment, is predicted. Queue occupation is a workload of queue resources for a future time based on a previous workload. A best queue is selected based on the predicted queue occupation. The best queue is the queue or queues in the computing environment available to be assigned to process the data integration job without preemption. The data integration job is processed using the best queue. It is determined whether a preemption event occurred causing the removal of resources from the best queue. A checkpoint is created in response to determining that a preemption event occurred. The checkpoint indicates the last successful operation completed and provides a point where processing can resume when resources become available.

Type: Grant

Filed: May 21, 2019

Date of Patent: October 19, 2021

Assignee: International Business Machines Corporation

Inventors: Krishna Kishore Bonagiri, Eric A. Jacobson, Ritesh Kumar Gupta, Scott Louis Brokaw

1 2 next