Patents by Inventor Abel Valente

Abel Valente has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Automatic formulation of data science problem statements

Patent number: 11763084

Abstract: A method comprises receiving a new data set; identifying at least one prior data set of a plurality of prior data sets that matches the new data set; generating a natural language data science problem statement for the new data set based on information associated with the at least prior one data set that matches the new data set; outputting the generated natural language data science problem statement for user verification; and in response to receiving user input verifying the natural language generated data science problem statement, generating one or more AutoAI configuration settings for the new data set based on one or more AutoAI configuration settings associated with the at least one prior data set that matches the new data set.

Type: Grant

Filed: August 10, 2020

Date of Patent: September 19, 2023

Assignee: International Business Machines Corporation

Inventors: Dakuo Wang, Arunima Chaudhary, Chuang Gan, Mo Yu, Qian Pan, Sijia Liu, Daniel Karl I. Weidele, Abel Valente
Visualization of a model selection process in an automated model selection system

Patent number: 11688111

Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.

Type: Grant

Filed: July 29, 2020

Date of Patent: June 27, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
PERFORMING AUTOMATED SEMANTIC FEATURE DISCOVERY

Publication number: 20230177032

Abstract: A computer-implemented method according to one embodiment includes identifying a data set and meta information; and augmenting the data set with additional features in response to an automatic analysis of the data set in view of the meta information.

Type: Application

Filed: December 8, 2021

Publication date: June 8, 2023

Inventors: Daniel Karl I. Weidele, Lisa Amini, Udayan Khurana, Kavitha Srinivas, Horst Cornelius Samulowitz, Takaaki Tateishi, Carolina Maria Spina, Dakuo Wang, Abel Valente, Arunima Chaudhary, Toshihiro Takahashi
COMPOSITE FEATURE ENGINEERING

Publication number: 20230153634

Abstract: A domain of an input dataset is identified and one or more archived domain knowledge features corresponding to the identified domain are identified. One or more user feature definitions for one or more user features defined by a user are inputted. The identified archived domain knowledge features and the user features are processed to generate a set of candidate features for presentation to the user. A selection of a subset of the candidate features is obtained from the user and one or more predictive models are generated based on the selected features.

Type: Application

Filed: November 14, 2021

Publication date: May 18, 2023

Inventors: Dakuo Wang, Udayan Khurana, Chuang Gan, Gregory Bramble, Abel Valente, Arunima Chaudhary, Carolina Maria Spina, Micah Smith
Automated data table discovery for automated machine learning

Patent number: 11620550

Abstract: Embodiments relate to a system, program product, and method for leveraging cognitive systems to facilitate the automated data table discovery for automated machine learning, and, more specifically, to leveraging a trained cognitive system to automatically search for additional data in an external data source that may be merged with an initial user-selected data table to generate a more robust machine learning model. Manual efforts to find and validate data appropriate for building and training a particular model for a particular task are significantly reduced. Specifically, a learning-based approach to leverage with machine learning models to automatically discover related datasets and join the datasets for a given initial dataset is disclosed herein. Operations that include dataset selection facilitate continued reinforcement learning of the systems.

Type: Grant

Filed: August 10, 2020

Date of Patent: April 4, 2023

Assignee: International Business Machines Corporation

Inventors: Dakuo Wang, Mo Yu, Arunima Chaudhary, Chuang Gan, Qian Pan, Daniel Karl I. Weidele, Abel Valente, Ji Hui Yang
INTERACTIVE FEATURE ENGINEERING IN AUTOMATIC MACHINE LEARNING WITH DOMAIN KNOWLEDGE

Publication number: 20220366269

Abstract: A dataset including features and values associated with the features can be received. Each of the features in the dataset can be mapped to a corresponding node in a knowledge graph based on the concept represented by the corresponding node. The knowledge graph can be traversed to find a candidate node connected to at least one mapped node, the candidate node not being mapped to a feature in the dataset. A concept associated with the candidate node can be identified as a new feature. A machine learning model pipeline can use the features in the dataset and the new feature to select a subset of features for training a machine learning model.

Type: Application

Filed: May 11, 2021

Publication date: November 17, 2022

Inventors: Dakuo Wang, Udayan Khurana, Daniel Karl I. Weidele, Arunima Chaudhary, Carolina Maria Spina, Abel Valente, Chuang Gan, Horst Cornelius Samulowitz, Lisa Amini
HYBRID MODEL AND ARCHITECTURE SEARCH FOR AUTOMATED MACHINE LEARNING SYSTEMS

Publication number: 20220300821

Abstract: A computer-implemented method of automatically generating a machine learning model includes identifying one or more visualization features of a dataset associated with a machine learning model selection process. A plurality of candidate machine learning pipelines are configured to perform respective optimizing strategies in parallel based on the identified visualization features. A machine learning model is automatically generated based on at least one of the generated candidate machine learning pipelines.

Type: Application

Filed: March 20, 2021

Publication date: September 22, 2022

Inventors: Dakuo Wang, Kiran A. Kate, Arunima Chaudhary, Abel Valente, Ioannis Katsis, Chuang Gan, Bei Chen
AUTOMATED DATA QUALITY INSPECTION AND IMPROVEMENT FOR AUTOMATED MACHINE LEARNING

Publication number: 20220164698

Abstract: A method to automatically assess data quality of data input into a machine learning model and remediate the data includes receiving input data for an automated machine learning model. Selections for a multiple data quality metrics are displayed. A selection for data quality metrics is received. The data quality metrics are determined according to the selection. Selections for data remediation strategies based on the selection of the data quality metrics are displayed. A selection for remediation recommendation strategies is received. The selected data remediation strategies are performed on the input data. Learning from the selection of the data quality metrics and the selection for the remediation strategies is performed. A new customized machine learning model is generated based on the learning.

Type: Application

Filed: November 25, 2020

Publication date: May 26, 2022

Inventors: Arunima Chaudhary, Dakuo Wang, Abel Valente, Carolina Maria Spina, Hima Patel, Nitin Gupta, Gregory Bramble, Horst Cornelius Samulowitz, Sameep Mehta, Theodoros Salonidis, Daniel M. Gruen, Chaung Gan
INTERPRETABLE VISUALIZATION SYSTEM FOR GRAPH NEURAL NETWORK

Publication number: 20220101120

Abstract: Use a computerized trained graph neural network model to classify an input instance with a predicted label. With a computerized graph neural network interpretation module, compute a gradient-based saliency matrix based on the input instance and the predicted label, by taking a partial derivative of class prediction with respect to an adjacency matrix of the model. With a computerized user interface, obtain user input responsive to the gradient-based saliency matrix. Optionally, modify the trained graph neural network model based on the user input; and re-classify the input instance with a new predicted label based on the modified trained graph neural network model.

Type: Application

Filed: September 30, 2020

Publication date: March 31, 2022

Inventors: Dakuo Wang, Sijia Liu, Abel Valente, Chuang Gan, Bei Chen, Dongyu Liu, Yi Sun
AUTOMATED ANALYSIS GENERATION FOR MACHINE LEARNING SYSTEM

Publication number: 20220083881

Abstract: An automated analytic tool (AAT) apparatus analyzes a machine learning system (MLS). The tool comprises a processor configured to receive experiment parameters associated with an experiment performed on the MLS, and captures information from a plurality of stages of the experiment. The information comprises information regarding MLS results and choices made during the experiment. The tool automatically revise the captured information into revised information utilizing a knowledge base comprising information from prior experiments. The tool then presents the revised information to a user.

Type: Application

Filed: September 14, 2020

Publication date: March 17, 2022

Inventors: Arunima Chaudhary, Dakuo Wang, David John Piorkowski, Daniel M. Gruen, Chuang Gan, Peter Daniel Kirchner, Gregory Bramble, Bei Chen, Abel Valente, Carolina Maria Spina, John Thomas Richards, Abhishek Bhandwaldar
AUTOMATED DATA TABLE DISCOVERY FOR AUTOMATED MACHINE LEARNING

Publication number: 20220044136

Abstract: Embodiments relate to a system, program product, and method for leveraging cognitive systems to facilitate the automated data table discovery for automated machine learning, and, more specifically, to leveraging a trained cognitive system to automatically search for additional data in an external data source that may be merged with an initial user-selected data table to generate a more robust machine learning model. Manual efforts to find and validate data appropriate for building and training a particular model for a particular task are significantly reduced. Specifically, a learning-based approach to leverage with machine learning models to automatically discover related datasets and join the datasets for a given initial dataset is disclosed herein. Operations that include dataset selection facilitate continued reinforcement learning of the systems.

Type: Application

Filed: August 10, 2020

Publication date: February 10, 2022

Inventors: Dakuo Wang, Mo Yu, Arunima Chaudhary, Chuang Gan, Qian Pan, Daniel Karl I. Weidele, Abel Valente, Ji Hui Yang
AUTOMATIC FORMULATION OF DATA SCIENCE PROBLEM STATEMENTS

Publication number: 20220043978

Abstract: A method comprises receiving a new data set; identifying at least one prior data set of a plurality of prior data sets that matches the new data set; generating a natural language data science problem statement for the new data set based on information associated with the at least prior one data set that matches the new data set; outputting the generated natural language data science problem statement for user verification; and in response to receiving user input verifying the natural language generated data science problem statement, generating one or more AutoAI configuration settings for the new data set based on one or more AutoAI configuration settings associated with the at least one prior data set that matches the new data set.

Type: Application

Filed: August 10, 2020

Publication date: February 10, 2022

Inventors: Dakuo Wang, Arunima Chaudhary, Chuang Gan, Mo Yu, Qian Pan, Sijia Liu, Daniel Karl I. Weidele, Abel Valente
VISUALIZATION OF A MODEL SELECTION PROCESS IN AN AUTOMATED MODEL SELECTION SYSTEM

Publication number: 20220036610

Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.

Type: Application

Filed: July 29, 2020

Publication date: February 3, 2022

Inventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han