Patents by Inventor Arunima Chaudhary

Arunima Chaudhary has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11861469
    Abstract: An embodiment of the invention may include a method, computer program product, and system for creating a data analysis tool. The method may include a computing device that generates an AI pipeline based on an input dataset, wherein the AI pipeline is generated using an Automated Machine Learning program. The method may include converting the AI pipeline to a non-native format of the Automated Machine Learning program. This may enable the AI pipeline to be used outside of the Automated Machine Learning program, thereby increasing the usefulness of the created program by not tying it to the Automated Machine Learning program. Additionally, this may increase the efficiency of running the AI pipeline by eliminating unnecessary computations performed by the Automated Machine Learning program.
    Type: Grant
    Filed: July 2, 2020
    Date of Patent: January 2, 2024
    Assignee: International Business Machines Corporation
    Inventors: Peter Daniel Kirchner, Gregory Bramble, Horst Cornelius Samulowitz, Dakuo Wang, Arunima Chaudhary, Gregory Filla
  • Publication number: 20230409838
    Abstract: A method, computer program, and computer system are provided for explaining generation of a flow from natural language utterances. Data corresponding to a natural language utterance is received. One or more constraints corresponding to a flow to be generated are determined based on the received natural language utterance. A flow is constructed based on the determined constraints. An explanation associated with the constructed flow is provided, and the explanation identifies parameters corresponding to constructing the flow.
    Type: Application
    Filed: May 31, 2022
    Publication date: December 21, 2023
    Inventors: Tathagata Chakraborti, Arunima Chaudhary, Michelle Brachman, Qian Pan, James Johnson, Yara Rizk, Burak Aksar
  • Patent number: 11763084
    Abstract: A method comprises receiving a new data set; identifying at least one prior data set of a plurality of prior data sets that matches the new data set; generating a natural language data science problem statement for the new data set based on information associated with the at least prior one data set that matches the new data set; outputting the generated natural language data science problem statement for user verification; and in response to receiving user input verifying the natural language generated data science problem statement, generating one or more AutoAI configuration settings for the new data set based on one or more AutoAI configuration settings associated with the at least one prior data set that matches the new data set.
    Type: Grant
    Filed: August 10, 2020
    Date of Patent: September 19, 2023
    Assignee: International Business Machines Corporation
    Inventors: Dakuo Wang, Arunima Chaudhary, Chuang Gan, Mo Yu, Qian Pan, Sijia Liu, Daniel Karl I. Weidele, Abel Valente
  • Patent number: 11688111
    Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.
    Type: Grant
    Filed: July 29, 2020
    Date of Patent: June 27, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
  • Publication number: 20230177032
    Abstract: A computer-implemented method according to one embodiment includes identifying a data set and meta information; and augmenting the data set with additional features in response to an automatic analysis of the data set in view of the meta information.
    Type: Application
    Filed: December 8, 2021
    Publication date: June 8, 2023
    Inventors: Daniel Karl I. Weidele, Lisa Amini, Udayan Khurana, Kavitha Srinivas, Horst Cornelius Samulowitz, Takaaki Tateishi, Carolina Maria Spina, Dakuo Wang, Abel Valente, Arunima Chaudhary, Toshihiro Takahashi
  • Patent number: 11663851
    Abstract: An implicit bias monitoring system is provided. The system receives sensor data including video and audio that are captured during an interaction between a first individual and a second individual. The system determines a set of attributes associated with the interaction based on the received sensor data, the set of attributes comprising at least one of race and gender of the second individual. The system detects an implicit bias based on a combination of the determined set of attributes and a bias profile. The system generates an alert based on the detected implicit bias.
    Type: Grant
    Filed: February 13, 2020
    Date of Patent: May 30, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Justin David Weisz, Abhishek Bhandwaldar, Maryam Ashoori, Arunima Chaudhary, Benjamin Hoover
  • Publication number: 20230153634
    Abstract: A domain of an input dataset is identified and one or more archived domain knowledge features corresponding to the identified domain are identified. One or more user feature definitions for one or more user features defined by a user are inputted. The identified archived domain knowledge features and the user features are processed to generate a set of candidate features for presentation to the user. A selection of a subset of the candidate features is obtained from the user and one or more predictive models are generated based on the selected features.
    Type: Application
    Filed: November 14, 2021
    Publication date: May 18, 2023
    Inventors: Dakuo Wang, Udayan Khurana, Chuang Gan, Gregory Bramble, Abel Valente, Arunima Chaudhary, Carolina Maria Spina, Micah Smith
  • Patent number: 11620550
    Abstract: Embodiments relate to a system, program product, and method for leveraging cognitive systems to facilitate the automated data table discovery for automated machine learning, and, more specifically, to leveraging a trained cognitive system to automatically search for additional data in an external data source that may be merged with an initial user-selected data table to generate a more robust machine learning model. Manual efforts to find and validate data appropriate for building and training a particular model for a particular task are significantly reduced. Specifically, a learning-based approach to leverage with machine learning models to automatically discover related datasets and join the datasets for a given initial dataset is disclosed herein. Operations that include dataset selection facilitate continued reinforcement learning of the systems.
    Type: Grant
    Filed: August 10, 2020
    Date of Patent: April 4, 2023
    Assignee: International Business Machines Corporation
    Inventors: Dakuo Wang, Mo Yu, Arunima Chaudhary, Chuang Gan, Qian Pan, Daniel Karl I. Weidele, Abel Valente, Ji Hui Yang
  • Patent number: 11556816
    Abstract: Systems, computer-implemented methods, and computer program products to facilitate conditional parallel coordinates in automated artificial intelligence with constraints are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise a visualization component that renders a pipeline constraint as a constraint axis having constraint scores of machine learning pipelines in a conditional parallel coordinates visualization. The computer executable components can further comprise a model generation component that generates a machine learning model based on the constraint scores of the machine learning pipelines.
    Type: Grant
    Filed: March 27, 2020
    Date of Patent: January 17, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Daniel Karl I. Weidele, Parikshit Ram, Dakuo Wang, Abel Nicolas Valente, Arunima Chaudhary
  • Publication number: 20220366269
    Abstract: A dataset including features and values associated with the features can be received. Each of the features in the dataset can be mapped to a corresponding node in a knowledge graph based on the concept represented by the corresponding node. The knowledge graph can be traversed to find a candidate node connected to at least one mapped node, the candidate node not being mapped to a feature in the dataset. A concept associated with the candidate node can be identified as a new feature. A machine learning model pipeline can use the features in the dataset and the new feature to select a subset of features for training a machine learning model.
    Type: Application
    Filed: May 11, 2021
    Publication date: November 17, 2022
    Inventors: Dakuo Wang, Udayan Khurana, Daniel Karl I. Weidele, Arunima Chaudhary, Carolina Maria Spina, Abel Valente, Chuang Gan, Horst Cornelius Samulowitz, Lisa Amini
  • Publication number: 20220300821
    Abstract: A computer-implemented method of automatically generating a machine learning model includes identifying one or more visualization features of a dataset associated with a machine learning model selection process. A plurality of candidate machine learning pipelines are configured to perform respective optimizing strategies in parallel based on the identified visualization features. A machine learning model is automatically generated based on at least one of the generated candidate machine learning pipelines.
    Type: Application
    Filed: March 20, 2021
    Publication date: September 22, 2022
    Inventors: Dakuo Wang, Kiran A. Kate, Arunima Chaudhary, Abel Valente, Ioannis Katsis, Chuang Gan, Bei Chen
  • Patent number: 11379710
    Abstract: In accordance with an embodiment of the invention, a method is provided for personalizing machine learning models for users of an automated machine learning system, the machine learning models being generated by an automated machine learning system. The method includes obtaining a first set of datasets for training first, second, and third neural networks, inputting the training datasets to the neural networks, tuning hyperparameters for the first, second, and third neural networks for testing and training the neural networks, inputting a second set of datasets to the trained neural networks and the third neural network generating a third output data including a relevance score for each of the users for each of the machine learning models, and displaying a list of machine learning models associated with each of the users, with each of the machine learning models showing the relevance score.
    Type: Grant
    Filed: February 28, 2020
    Date of Patent: July 5, 2022
    Assignee: International Business Machines Corporation
    Inventors: Dakuo Wang, Chuang Gan, Ming Tan, Arunima Chaudhary, Lin Ju
  • Publication number: 20220164698
    Abstract: A method to automatically assess data quality of data input into a machine learning model and remediate the data includes receiving input data for an automated machine learning model. Selections for a multiple data quality metrics are displayed. A selection for data quality metrics is received. The data quality metrics are determined according to the selection. Selections for data remediation strategies based on the selection of the data quality metrics are displayed. A selection for remediation recommendation strategies is received. The selected data remediation strategies are performed on the input data. Learning from the selection of the data quality metrics and the selection for the remediation strategies is performed. A new customized machine learning model is generated based on the learning.
    Type: Application
    Filed: November 25, 2020
    Publication date: May 26, 2022
    Inventors: Arunima Chaudhary, Dakuo Wang, Abel Valente, Carolina Maria Spina, Hima Patel, Nitin Gupta, Gregory Bramble, Horst Cornelius Samulowitz, Sameep Mehta, Theodoros Salonidis, Daniel M. Gruen, Chaung Gan
  • Publication number: 20220083881
    Abstract: An automated analytic tool (AAT) apparatus analyzes a machine learning system (MLS). The tool comprises a processor configured to receive experiment parameters associated with an experiment performed on the MLS, and captures information from a plurality of stages of the experiment. The information comprises information regarding MLS results and choices made during the experiment. The tool automatically revise the captured information into revised information utilizing a knowledge base comprising information from prior experiments. The tool then presents the revised information to a user.
    Type: Application
    Filed: September 14, 2020
    Publication date: March 17, 2022
    Inventors: Arunima Chaudhary, Dakuo Wang, David John Piorkowski, Daniel M. Gruen, Chuang Gan, Peter Daniel Kirchner, Gregory Bramble, Bei Chen, Abel Valente, Carolina Maria Spina, John Thomas Richards, Abhishek Bhandwaldar
  • Publication number: 20220076144
    Abstract: The exemplary embodiments disclose a method, a computer program product, and a computer system for determining that one or more model pipelines satisfy one or more constraints. The exemplary embodiments may include detecting a user uploading data, one or more constraints, and one or more model pipelines, collecting the data, the one or more constraints, and the one or more model pipelines, and determining that one or more of the model pipelines satisfies all of the one or more constraints based on applying one or more algorithms to the collected data, constraints, and model pipelines.
    Type: Application
    Filed: September 9, 2020
    Publication date: March 10, 2022
    Inventors: Parikshit Ram, Dakuo Wang, Deepak Vijaykeerthy, Vaibhav Saxena, Sijia Liu, Arunima Chaudhary, Gregory Bramble, Horst Cornelius Samulowitz, Alexander Gray
  • Publication number: 20220051112
    Abstract: Systems, computer-implemented methods, and computer program products to facilitate automated model pipeline generation with entity monitoring, interaction, and/or intervention are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that provides a recommended input action corresponding to a model pipeline candidate being evaluated in an automated model pipeline generation process. The computer executable components can further comprise a visualization render component that renders an input visualization corresponding to the model pipeline candidate based on the recommended input action.
    Type: Application
    Filed: August 17, 2020
    Publication date: February 17, 2022
    Inventors: Dakuo Wang, Arunima Chaudhary, Ji Hui Yang, Bei Chen, Gregory Bramble, Chuang Gan, Uri Kartoun, Long Vu
  • Publication number: 20220044136
    Abstract: Embodiments relate to a system, program product, and method for leveraging cognitive systems to facilitate the automated data table discovery for automated machine learning, and, more specifically, to leveraging a trained cognitive system to automatically search for additional data in an external data source that may be merged with an initial user-selected data table to generate a more robust machine learning model. Manual efforts to find and validate data appropriate for building and training a particular model for a particular task are significantly reduced. Specifically, a learning-based approach to leverage with machine learning models to automatically discover related datasets and join the datasets for a given initial dataset is disclosed herein. Operations that include dataset selection facilitate continued reinforcement learning of the systems.
    Type: Application
    Filed: August 10, 2020
    Publication date: February 10, 2022
    Inventors: Dakuo Wang, Mo Yu, Arunima Chaudhary, Chuang Gan, Qian Pan, Daniel Karl I. Weidele, Abel Valente, Ji Hui Yang
  • Publication number: 20220043978
    Abstract: A method comprises receiving a new data set; identifying at least one prior data set of a plurality of prior data sets that matches the new data set; generating a natural language data science problem statement for the new data set based on information associated with the at least prior one data set that matches the new data set; outputting the generated natural language data science problem statement for user verification; and in response to receiving user input verifying the natural language generated data science problem statement, generating one or more AutoAI configuration settings for the new data set based on one or more AutoAI configuration settings associated with the at least one prior data set that matches the new data set.
    Type: Application
    Filed: August 10, 2020
    Publication date: February 10, 2022
    Inventors: Dakuo Wang, Arunima Chaudhary, Chuang Gan, Mo Yu, Qian Pan, Sijia Liu, Daniel Karl I. Weidele, Abel Valente
  • Publication number: 20220036610
    Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.
    Type: Application
    Filed: July 29, 2020
    Publication date: February 3, 2022
    Inventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
  • Publication number: 20220004914
    Abstract: An embodiment of the invention may include a method, computer program product, and system for creating a data analysis tool. The method may include a computing device that generates an AI pipeline based on an input dataset, wherein the AI pipeline is generated using an Automated Machine Learning program. The method may include converting the AI pipeline to a non-native format of the Automated Machine Learning program. This may enable the AI pipeline to be used outside of the Automated Machine Learning program, thereby increasing the usefulness of the created program by not tying it to the Automated Machine Learning program. Additionally, this may increase the efficiency of running the AI pipeline by eliminating unnecessary computations performed by the Automated Machine Learning program.
    Type: Application
    Filed: July 2, 2020
    Publication date: January 6, 2022
    Inventors: Peter Daniel Kirchner, Gregory Bramble, Horst Cornelius Samulowitz, Dakuo Wang, Arunima Chaudhary, Gregory Filla