Patents by Inventor Arunima Chaudhary
Arunima Chaudhary has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11763084Abstract: A method comprises receiving a new data set; identifying at least one prior data set of a plurality of prior data sets that matches the new data set; generating a natural language data science problem statement for the new data set based on information associated with the at least prior one data set that matches the new data set; outputting the generated natural language data science problem statement for user verification; and in response to receiving user input verifying the natural language generated data science problem statement, generating one or more AutoAI configuration settings for the new data set based on one or more AutoAI configuration settings associated with the at least one prior data set that matches the new data set.Type: GrantFiled: August 10, 2020Date of Patent: September 19, 2023Assignee: International Business Machines CorporationInventors: Dakuo Wang, Arunima Chaudhary, Chuang Gan, Mo Yu, Qian Pan, Sijia Liu, Daniel Karl I. Weidele, Abel Valente
-
Patent number: 11688111Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.Type: GrantFiled: July 29, 2020Date of Patent: June 27, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
-
Publication number: 20230177032Abstract: A computer-implemented method according to one embodiment includes identifying a data set and meta information; and augmenting the data set with additional features in response to an automatic analysis of the data set in view of the meta information.Type: ApplicationFiled: December 8, 2021Publication date: June 8, 2023Inventors: Daniel Karl I. Weidele, Lisa Amini, Udayan Khurana, Kavitha Srinivas, Horst Cornelius Samulowitz, Takaaki Tateishi, Carolina Maria Spina, Dakuo Wang, Abel Valente, Arunima Chaudhary, Toshihiro Takahashi
-
Patent number: 11663851Abstract: An implicit bias monitoring system is provided. The system receives sensor data including video and audio that are captured during an interaction between a first individual and a second individual. The system determines a set of attributes associated with the interaction based on the received sensor data, the set of attributes comprising at least one of race and gender of the second individual. The system detects an implicit bias based on a combination of the determined set of attributes and a bias profile. The system generates an alert based on the detected implicit bias.Type: GrantFiled: February 13, 2020Date of Patent: May 30, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Justin David Weisz, Abhishek Bhandwaldar, Maryam Ashoori, Arunima Chaudhary, Benjamin Hoover
-
Publication number: 20230153634Abstract: A domain of an input dataset is identified and one or more archived domain knowledge features corresponding to the identified domain are identified. One or more user feature definitions for one or more user features defined by a user are inputted. The identified archived domain knowledge features and the user features are processed to generate a set of candidate features for presentation to the user. A selection of a subset of the candidate features is obtained from the user and one or more predictive models are generated based on the selected features.Type: ApplicationFiled: November 14, 2021Publication date: May 18, 2023Inventors: Dakuo Wang, Udayan Khurana, Chuang Gan, Gregory Bramble, Abel Valente, Arunima Chaudhary, Carolina Maria Spina, Micah Smith
-
Patent number: 11620550Abstract: Embodiments relate to a system, program product, and method for leveraging cognitive systems to facilitate the automated data table discovery for automated machine learning, and, more specifically, to leveraging a trained cognitive system to automatically search for additional data in an external data source that may be merged with an initial user-selected data table to generate a more robust machine learning model. Manual efforts to find and validate data appropriate for building and training a particular model for a particular task are significantly reduced. Specifically, a learning-based approach to leverage with machine learning models to automatically discover related datasets and join the datasets for a given initial dataset is disclosed herein. Operations that include dataset selection facilitate continued reinforcement learning of the systems.Type: GrantFiled: August 10, 2020Date of Patent: April 4, 2023Assignee: International Business Machines CorporationInventors: Dakuo Wang, Mo Yu, Arunima Chaudhary, Chuang Gan, Qian Pan, Daniel Karl I. Weidele, Abel Valente, Ji Hui Yang
-
Patent number: 11556816Abstract: Systems, computer-implemented methods, and computer program products to facilitate conditional parallel coordinates in automated artificial intelligence with constraints are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise a visualization component that renders a pipeline constraint as a constraint axis having constraint scores of machine learning pipelines in a conditional parallel coordinates visualization. The computer executable components can further comprise a model generation component that generates a machine learning model based on the constraint scores of the machine learning pipelines.Type: GrantFiled: March 27, 2020Date of Patent: January 17, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Daniel Karl I. Weidele, Parikshit Ram, Dakuo Wang, Abel Nicolas Valente, Arunima Chaudhary
-
Publication number: 20220366269Abstract: A dataset including features and values associated with the features can be received. Each of the features in the dataset can be mapped to a corresponding node in a knowledge graph based on the concept represented by the corresponding node. The knowledge graph can be traversed to find a candidate node connected to at least one mapped node, the candidate node not being mapped to a feature in the dataset. A concept associated with the candidate node can be identified as a new feature. A machine learning model pipeline can use the features in the dataset and the new feature to select a subset of features for training a machine learning model.Type: ApplicationFiled: May 11, 2021Publication date: November 17, 2022Inventors: Dakuo Wang, Udayan Khurana, Daniel Karl I. Weidele, Arunima Chaudhary, Carolina Maria Spina, Abel Valente, Chuang Gan, Horst Cornelius Samulowitz, Lisa Amini
-
Publication number: 20220300821Abstract: A computer-implemented method of automatically generating a machine learning model includes identifying one or more visualization features of a dataset associated with a machine learning model selection process. A plurality of candidate machine learning pipelines are configured to perform respective optimizing strategies in parallel based on the identified visualization features. A machine learning model is automatically generated based on at least one of the generated candidate machine learning pipelines.Type: ApplicationFiled: March 20, 2021Publication date: September 22, 2022Inventors: Dakuo Wang, Kiran A. Kate, Arunima Chaudhary, Abel Valente, Ioannis Katsis, Chuang Gan, Bei Chen
-
Patent number: 11379710Abstract: In accordance with an embodiment of the invention, a method is provided for personalizing machine learning models for users of an automated machine learning system, the machine learning models being generated by an automated machine learning system. The method includes obtaining a first set of datasets for training first, second, and third neural networks, inputting the training datasets to the neural networks, tuning hyperparameters for the first, second, and third neural networks for testing and training the neural networks, inputting a second set of datasets to the trained neural networks and the third neural network generating a third output data including a relevance score for each of the users for each of the machine learning models, and displaying a list of machine learning models associated with each of the users, with each of the machine learning models showing the relevance score.Type: GrantFiled: February 28, 2020Date of Patent: July 5, 2022Assignee: International Business Machines CorporationInventors: Dakuo Wang, Chuang Gan, Ming Tan, Arunima Chaudhary, Lin Ju
-
Publication number: 20220164698Abstract: A method to automatically assess data quality of data input into a machine learning model and remediate the data includes receiving input data for an automated machine learning model. Selections for a multiple data quality metrics are displayed. A selection for data quality metrics is received. The data quality metrics are determined according to the selection. Selections for data remediation strategies based on the selection of the data quality metrics are displayed. A selection for remediation recommendation strategies is received. The selected data remediation strategies are performed on the input data. Learning from the selection of the data quality metrics and the selection for the remediation strategies is performed. A new customized machine learning model is generated based on the learning.Type: ApplicationFiled: November 25, 2020Publication date: May 26, 2022Inventors: Arunima Chaudhary, Dakuo Wang, Abel Valente, Carolina Maria Spina, Hima Patel, Nitin Gupta, Gregory Bramble, Horst Cornelius Samulowitz, Sameep Mehta, Theodoros Salonidis, Daniel M. Gruen, Chaung Gan
-
Publication number: 20220083881Abstract: An automated analytic tool (AAT) apparatus analyzes a machine learning system (MLS). The tool comprises a processor configured to receive experiment parameters associated with an experiment performed on the MLS, and captures information from a plurality of stages of the experiment. The information comprises information regarding MLS results and choices made during the experiment. The tool automatically revise the captured information into revised information utilizing a knowledge base comprising information from prior experiments. The tool then presents the revised information to a user.Type: ApplicationFiled: September 14, 2020Publication date: March 17, 2022Inventors: Arunima Chaudhary, Dakuo Wang, David John Piorkowski, Daniel M. Gruen, Chuang Gan, Peter Daniel Kirchner, Gregory Bramble, Bei Chen, Abel Valente, Carolina Maria Spina, John Thomas Richards, Abhishek Bhandwaldar
-
Publication number: 20220076144Abstract: The exemplary embodiments disclose a method, a computer program product, and a computer system for determining that one or more model pipelines satisfy one or more constraints. The exemplary embodiments may include detecting a user uploading data, one or more constraints, and one or more model pipelines, collecting the data, the one or more constraints, and the one or more model pipelines, and determining that one or more of the model pipelines satisfies all of the one or more constraints based on applying one or more algorithms to the collected data, constraints, and model pipelines.Type: ApplicationFiled: September 9, 2020Publication date: March 10, 2022Inventors: Parikshit Ram, Dakuo Wang, Deepak Vijaykeerthy, Vaibhav Saxena, Sijia Liu, Arunima Chaudhary, Gregory Bramble, Horst Cornelius Samulowitz, Alexander Gray
-
Publication number: 20220051112Abstract: Systems, computer-implemented methods, and computer program products to facilitate automated model pipeline generation with entity monitoring, interaction, and/or intervention are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that provides a recommended input action corresponding to a model pipeline candidate being evaluated in an automated model pipeline generation process. The computer executable components can further comprise a visualization render component that renders an input visualization corresponding to the model pipeline candidate based on the recommended input action.Type: ApplicationFiled: August 17, 2020Publication date: February 17, 2022Inventors: Dakuo Wang, Arunima Chaudhary, Ji Hui Yang, Bei Chen, Gregory Bramble, Chuang Gan, Uri Kartoun, Long Vu
-
Publication number: 20220044136Abstract: Embodiments relate to a system, program product, and method for leveraging cognitive systems to facilitate the automated data table discovery for automated machine learning, and, more specifically, to leveraging a trained cognitive system to automatically search for additional data in an external data source that may be merged with an initial user-selected data table to generate a more robust machine learning model. Manual efforts to find and validate data appropriate for building and training a particular model for a particular task are significantly reduced. Specifically, a learning-based approach to leverage with machine learning models to automatically discover related datasets and join the datasets for a given initial dataset is disclosed herein. Operations that include dataset selection facilitate continued reinforcement learning of the systems.Type: ApplicationFiled: August 10, 2020Publication date: February 10, 2022Inventors: Dakuo Wang, Mo Yu, Arunima Chaudhary, Chuang Gan, Qian Pan, Daniel Karl I. Weidele, Abel Valente, Ji Hui Yang
-
Publication number: 20220043978Abstract: A method comprises receiving a new data set; identifying at least one prior data set of a plurality of prior data sets that matches the new data set; generating a natural language data science problem statement for the new data set based on information associated with the at least prior one data set that matches the new data set; outputting the generated natural language data science problem statement for user verification; and in response to receiving user input verifying the natural language generated data science problem statement, generating one or more AutoAI configuration settings for the new data set based on one or more AutoAI configuration settings associated with the at least one prior data set that matches the new data set.Type: ApplicationFiled: August 10, 2020Publication date: February 10, 2022Inventors: Dakuo Wang, Arunima Chaudhary, Chuang Gan, Mo Yu, Qian Pan, Sijia Liu, Daniel Karl I. Weidele, Abel Valente
-
Publication number: 20220036610Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.Type: ApplicationFiled: July 29, 2020Publication date: February 3, 2022Inventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
-
Publication number: 20220004914Abstract: An embodiment of the invention may include a method, computer program product, and system for creating a data analysis tool. The method may include a computing device that generates an AI pipeline based on an input dataset, wherein the AI pipeline is generated using an Automated Machine Learning program. The method may include converting the AI pipeline to a non-native format of the Automated Machine Learning program. This may enable the AI pipeline to be used outside of the Automated Machine Learning program, thereby increasing the usefulness of the created program by not tying it to the Automated Machine Learning program. Additionally, this may increase the efficiency of running the AI pipeline by eliminating unnecessary computations performed by the Automated Machine Learning program.Type: ApplicationFiled: July 2, 2020Publication date: January 6, 2022Inventors: Peter Daniel Kirchner, Gregory Bramble, Horst Cornelius Samulowitz, Dakuo Wang, Arunima Chaudhary, Gregory Filla
-
Publication number: 20210304028Abstract: Systems, computer-implemented methods, and computer program products to facilitate conditional parallel coordinates in automated artificial intelligence with constraints are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise a visualization component that renders a pipeline constraint as a constraint axis having constraint scores of machine learning pipelines in a conditional parallel coordinates visualization. The computer executable components can further comprise a model generation component that generates a machine learning model based on the constraint scores of the machine learning pipelines.Type: ApplicationFiled: March 27, 2020Publication date: September 30, 2021Inventors: Daniel Karl I. Weidele, Parikshit Ram, Dakuo Wang, Abel Nicolas Valente, Arunima Chaudhary
-
Publication number: 20210271956Abstract: In accordance with an embodiment of the invention, a method is provided for personalizing machine learning models for users of an automated machine learning system, the machine learning models being generated by an automated machine learning system. The method includes obtaining a first set of datasets for training first, second, and third neural networks, inputting the training datasets to the neural networks, tuning hyperparameters for the first, second, and third neural networks for testing and training the neural networks, inputting a second set of datasets to the trained neural networks and the third neural network generating a third output data including a relevance score for each of the users for each of the machine learning models, and displaying a list of machine learning models associated with each of the users, with each of the machine learning models showing the relevance score.Type: ApplicationFiled: February 28, 2020Publication date: September 2, 2021Inventors: Dakuo Wang, Chuang Gan, Ming Tan, Arunima Chaudhary, Lin Ju