Patents by Inventor Cicero Nogueira Dos Santos

Cicero Nogueira Dos Santos has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Feature selection using Sobolev Independence Criterion

Patent number: 11645555

Abstract: A machine learning system that implements Sobolev Independence Criterion (SIC) for feature selection is provided. The system receives a dataset including pairings of stimuli and responses. Each stimulus includes multiple features. The system generates a correctly paired sample of stimuli and responses from the dataset by pairing stimuli and responses according to the pairings of stimuli and responses in the dataset. The system generates an alternatively paired sample of stimuli and responses from the dataset by pairing stimuli and responses differently than the pairings of stimuli and responses in the dataset. The system determines a witness function and a feature importance distribution across the features that optimizes a cost function that is evaluated based on the correctly paired and alternatively paired samples of the dataset. The system selects one or more features based on the computed feature importance distribution.

Type: Grant

Filed: October 12, 2019

Date of Patent: May 9, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Youssef Mroueh, Tom Sercu, Mattia Rigotti, Inkit Padhi, Cicero Nogueira Dos Santos
Generating attribute-based samples

Patent number: 11481626

Abstract: A computer-implemented method according to one aspect includes training a latent variable model (LVM), utilizing labeled data and unlabeled data within a data set; training a classifier, utilizing the labeled data and associated labels within the data set; and generating new data having a predetermined set of labels, utilizing the trained LVM and the trained classifier.

Type: Grant

Filed: October 15, 2019

Date of Patent: October 25, 2022

Assignee: International Business Machines Corporation

Inventors: Payel Das, Tom D. J. Sercu, Kahini Wadhawan, Cicero Nogueira Dos Santos, Inkit Padhi, Sebastian Gehrmann
Question Answering using trained generative adversarial network based modeling of text

Patent number: 11481416

Abstract: Mechanisms are provided for implementing a Question Answering (QA) system utilizing a trained generator of a generative adversarial network (GAN) that generates a bag-of-ngrams (BoN) output representing unlabeled data for performing a natural language processing operation. The QA system obtains a plurality of candidate answers to a natural language question, where each candidate answer comprises one or more ngrams. For each candidate answer, a confidence score is generated based on a comparison of the one or more ngrams in the candidate answer to ngrams in the BoN output of the generator neural network of the GAN. A final answer to the input natural language question is selected from the plurality of candidate answers based on the confidence scores associated with the candidate answers, and is output.

Type: Grant

Filed: July 12, 2018

Date of Patent: October 25, 2022

Assignee: International Business Machines Corporation

Inventors: Dheeru Dua, Cicero Nogueira Dos Santos, Bowen Zhou
Systems, apparatuses, and methods to generate synthetic queries from customer data for training of document querying machine learning models

Patent number: 11475067

Abstract: Techniques for generation of synthetic queries from customer data for training of document querying machine learning (ML) models as a service are described. A service may receive one or more documents from a user, generate a set of question and answer pairs from the one or more documents from the user using a machine learning model trained to predict a question from an answer, and store the set of question and answer pairs generated from the one or more documents from the user. The question and answer pairs may be used to train another machine learning model, for example, a document ranking model, a passage ranking model, a question/answer model, or a frequently asked question (FAQ) model.

Type: Grant

Filed: November 27, 2019

Date of Patent: October 18, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Cicero Nogueira Dos Santos, Xiaofei Ma, Peng Xu, Ramesh M. Nallapati, Bing Xiang, Sudipta Sengupta, Zhiguo Wang, Patrick Ng
False detection rate control with null-hypothesis

Patent number: 11373760

Abstract: A machine learning system receives a witness function that is determined based on an initial sample of a dataset comprising multiple pairs of stimuli and responses. Each stimulus includes multiple features. The system receives a holdout sample of the dataset comprising one or more pairs of stimuli and responses that are not used to determine the witness function. The system generates a simulated sample based on the holdout sample. Values of a particular feature of the stimuli of the simulated sample are predicted based on values of features other than the particular feature of the stimuli of the simulated sample. The system applies the holdout sample to the witness function to obtain a first result. The system applies the simulated sample to the witness function to obtain a second result. The system determines whether to select the particular feature based on a comparison between the first result and the second result.

Type: Grant

Filed: October 12, 2019

Date of Patent: June 28, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Youssef Mroueh, Tom Sercu, Mattia Rigotti, Inkit Padhi, Cicero Nogueira Dos Santos
Generative adversarial network based modeling of text for natural language processing

Patent number: 11281976

Abstract: Mechanisms are provided to implement a generative adversarial network (GAN) for natural language processing. With these mechanisms, a generator neural network of the GAN is configured to generate a bag-of-ngrams (BoN) output based on a noise vector input and a discriminator neural network of the GAN is configured to receive a BoN input, where the BoN input is either the BoN output from the generator neural network or a BoN input associated with an actual portion of natural language text. The mechanisms further configure the discriminator neural network of the GAN to output an indication of a probability as to whether the input BoN is from the actual portion of natural language text or is the BoN output of the generator neural network.

Type: Grant

Filed: July 12, 2018

Date of Patent: March 22, 2022

Assignee: International Business Machines Corporation

Inventors: Dheeru Dua, Cicero Nogueira Dos Santos, Bowen Zhou
FILTERING ARTIFICIAL INTELLIGENCE DESIGNED MOLECULES FOR LABORATORY TESTING

Publication number: 20210366580

Abstract: Techniques for filtering artificial intelligence (AI)-designed molecules for laboratory testing provided. According to an embodiment, computer implemented method can comprise selecting, by a system operatively coupled to a processor, a first subset of AI-designed molecules from a set of AI-designed molecules as candidate pharmaceutical agents based on classification of the AI-designed molecules using one or more classifiers. The method further comprises selecting, by the system, a second subset of the candidate pharmaceutical agents for wet laboratory testing based on evaluation of molecular interactions between the candidate pharmaceutical agents and one or more biological targets using one or more computer simulations.

Type: Application

Filed: May 21, 2020

Publication date: November 25, 2021

Inventors: Payel Das, Flaviu Cipcigan, Kahini Wadhawan, Inkit Padhi, Enara C Vijil, Pin-Yu Chen, Aleksandra Mojsilovic, Tom D.J. Sercu, Cicero Nogueira dos Santos
Automatic generation of content using multimedia

Patent number: 11170270

Abstract: Techniques for content generation are provided. A plurality of discriminative terms is determined based at least in part on a first plurality of documents that are related to a first concept, and a plurality of positive exemplars and a plurality of negative exemplars are identified using the plurality of discriminative terms. A first machine learning (ML) model is trained to classify images into concepts, based on the plurality of positive exemplars and the plurality of negative exemplars. A second concept related to the first concept is then determined, based on the first ML model. A second ML model is trained to generate images based on the second concept, and a first image is generated using the second ML model. The first image is then refined using a style transfer ML model that was trained using a plurality of style images.

Type: Grant

Filed: October 17, 2019

Date of Patent: November 9, 2021

Assignee: International Business Machines Corporation

Inventors: Michele Merler, Mauro Martino, Cicero Nogueira Dos Santos, Alfio Massimiliano Gliozzo, John R. Smith
SYSTEMS, APPARATUSES, AND METHODS TO GENERATE SYNTHETIC QUERIES FROM CUSTOMER DATA FOR TRAINING OF DOCUMENT QUERYING MACHINE LEARNING MODELS

Publication number: 20210157857

Abstract: Techniques for generation of synthetic queries from customer data for training of document querying machine learning (ML) models as a service are described. A service may receive one or more documents from a user, generate a set of question and answer pairs from the one or more documents from the user using a machine learning model trained to predict a question from an answer, and store the set of question and answer pairs generated from the one or more documents from the user. The question and answer pairs may be used to train another machine learning model, for example, a document ranking model, a passage ranking model, a question/answer model, or a frequently asked question (FAQ) model.

Type: Application

Filed: November 27, 2019

Publication date: May 27, 2021

Inventors: Cicero NOGUEIRA DOS SANTOS, Xiaofei MA, Peng XU, Ramesh M. NALLAPATI, Bing XIANG, Sudipta SENGUPTA, Zhiguo WANG, Patrick NG
AUTOMATIC GENERATION OF CONTENT USING MULTIMEDIA

Publication number: 20210117736

Abstract: Techniques for content generation are provided. A plurality of discriminative terms is determined based at least in part on a first plurality of documents that are related to a first concept, and a plurality of positive exemplars and a plurality of negative exemplars are identified using the plurality of discriminative terms. A first machine learning (ML) model is trained to classify images into concepts, based on the plurality of positive exemplars and the plurality of negative exemplars. A second concept related to the first concept is then determined, based on the first ML model. A second ML model is trained to generate images based on the second concept, and a first image is generated using the second ML model. The first image is then refined using a style transfer ML model that was trained using a plurality of style images.

Type: Application

Filed: October 17, 2019

Publication date: April 22, 2021

Inventors: Michele Merler, Mauro Martino, Cicero NOGUEIRA DOS SANTOS, Alfio Massimiliano Gliozzo, John R. Smith
FEATURE SELECTION USING SOBOLEV INDEPENDENCE CRITERION

Publication number: 20210110285

Abstract: A machine learning system that implements Sobolev Independence Criterion (SIC) for feature selection is provided. The system receives a dataset including pairings of stimuli and responses. Each stimulus includes multiple features. The system generates a correctly paired sample of stimuli and responses from the dataset by pairing stimuli and responses according to the pairings of stimuli and responses in the dataset. The system generates an alternatively paired sample of stimuli and responses from the dataset by pairing stimuli and responses differently than the pairings of stimuli and responses in the dataset. The system determines a witness function and a feature importance distribution across the features that optimizes a cost function that is evaluated based on the correctly paired and alternatively paired samples of the dataset. The system selects one or more features based on the computed feature importance distribution.

Type: Application

Filed: October 12, 2019

Publication date: April 15, 2021

Inventors: Youssef Mroueh, Tom Sercu, Mattia Rigotti, Inkit Padhi, Cicero Nogueira Dos Santos
FALSE DETECTION RATE CONTROL WITH NULL-HYPOTHESIS

Publication number: 20210110409

Abstract: A machine learning system receives a witness function that is determined based on an initial sample of a dataset comprising multiple pairs of stimuli and responses. Each stimulus includes multiple features. The system receives a holdout sample of the dataset comprising one or more pairs of stimuli and responses that are not used to determine the witness function. The system generates a simulated sample based on the holdout sample. Values of a particular feature of the stimuli of the simulated sample are predicted based on values of features other than the particular feature of the stimuli of the simulated sample. The system applies the holdout sample to the witness function to obtain a first result. The system applies the simulated sample to the witness function to obtain a second result. The system determines whether to select the particular feature based on a comparison between the first result and the second result.

Type: Application

Filed: October 12, 2019

Publication date: April 15, 2021

Inventors: Youssef Mroueh, Tom Sercu, Mattia Rigotti, Inkit Padhi, Cicero Nogueira Dos Santos
GENERATING ATTRIBUTE-BASED SAMPLES

Publication number: 20210110255

Abstract: A computer-implemented method according to one aspect includes training a latent variable model (LVM), utilizing labeled data and unlabeled data within a data set; training a classifier, utilizing the labeled data and associated labels within the data set; and generating new data having a predetermined set of labels, utilizing the trained LVM and the trained classifier.

Type: Application

Filed: October 15, 2019

Publication date: April 15, 2021

Inventors: Payel Das, Tom D. J. Sercu, Kahini Wadhawan, Cicero Nogueira Dos Santos, Inkit Padhi, Sebastian Gehrmann
Predicting user posting behavior in social media applications

Patent number: 10902345

Abstract: A computer-implemented method includes extracting a plurality of topics from a plurality of unlabeled social media posts, mapping the plurality of topics to a plurality of frequencies, each frequency in the plurality of frequencies indicating how often a corresponding topic in the plurality of topics occurs in the plurality of unlabeled social media posts, and predicting, based in part on the plurality of frequencies, a future social media posting behavior of a specific social media user, wherein the future social media posting behavior includes a specific topic about which the specific social media user is likely to post at a time in the future and a frequency with which the specific topic is likely to occur in posts of the specific social media user that are created at the time in the future.

Type: Grant

Filed: January 19, 2017

Date of Patent: January 26, 2021

Assignee: International Business Machines Corporation

Inventors: Paulo Rodrigo Cavalin, Maira Gatti de Bayser, Alexandre Rademaker, Cicero Nogueira Dos Santos
WASSERSTEIN BARYCENTER MODEL ENSEMBLING

Publication number: 20200342361

Abstract: A method, system and apparatus of ensembling, including inputting a set of models that predict different sets of attributes, determining a source set of attributes and a target set of attributes using a barycenter with an optimal transport metric, and determining a consensus among the set of models whose predictions are defined on the source set of attributes.

Type: Application

Filed: April 29, 2019

Publication date: October 29, 2020

Inventors: Youssef Mroueh, Pierre L. Dognin, Igor Melnyk, Jarret Ross, Tom Sercu, Cicero Nogueira Dos Santos
UNSUPERVISED TEXT STYLE TRANSFER SYSTEM FOR IMPROVED ONLINE SOCIAL MEDIA EXPERIENCE

Publication number: 20200110797

Abstract: An unsupervised text style transfer method, system, and computer program product include classifying a style of an input message, translating the input message into a second style, re-writing the input message into a second message having the second style, and distributing the second message in the second style.

Type: Application

Filed: October 4, 2018

Publication date: April 9, 2020

Inventors: Igor Melnyk, Cicero Nogueira Dos Santos, Inkit Padhi, Kahini Wadhawan, Abhishek Kumar
Generative Adversarial Network Based Modeling of Text for Natural Language Processing

Publication number: 20200019863

Abstract: Mechanisms are provided to implement a generative adversarial network (GAN) for natural language processing. With these mechanisms, a generator neural network of the GAN is configured to generate a bag-of-ngrams (BoN) output based on a noise vector input and a discriminator neural network of the GAN is configured to receive a BoN input, where the BoN input is either the BoN output from the generator neural network or a BoN input associated with an actual portion of natural language text. The mechanisms further configure the discriminator neural network of the GAN to output an indication of a probability as to whether the input BoN is from the actual portion of natural language text or is the BoN output of the generator neural network.

Type: Application

Filed: July 12, 2018

Publication date: January 16, 2020

Inventors: Dheeru Dua, Cicero Nogueira Dos Santos, Bowen Zhou
Question Answering Using Trained Generative Adversarial Network Based Modeling of Text

Publication number: 20200019642

Abstract: Mechanisms are provided for implementing a Question Answering (QA) system utilizing a trained generator of a generative adversarial network (GAN) that generates a bag-of-ngrams (BoN) output representing unlabeled data for performing a natural language processing operation. The QA system obtains a plurality of candidate answers to a natural language question, where each candidate answer comprises one or more ngrams. For each candidate answer, a confidence score is generated based on a comparison of the one or more ngrams in the candidate answer to ngrams in the BoN output of the generator neural network of the GAN. A final answer to the input natural language question is selected from the plurality of candidate answers based on the confidence scores associated with the candidate answers, and is output.

Type: Application

Filed: July 12, 2018

Publication date: January 16, 2020

Inventors: Dheeru Dua, Cicero Nogueira Dos Santos, Bowen Zhou
PREDICTING USER POSTING BEHAVIOR IN SOCIAL MEDIA APPLICATIONS

Publication number: 20180204125

Abstract: A computer-implemented method includes extracting a plurality of topics from a plurality of unlabeled social media posts, mapping the plurality of topics to a plurality of frequencies, each frequency in the plurality of frequencies indicating how often a corresponding topic in the plurality of topics occurs in the plurality of unlabeled social media posts, and predicting, based in part on the plurality of frequencies, a future social media posting behavior of a specific social media user, wherein the future social media posting behavior includes a specific topic about which the specific social media user is likely to post at a time in the future and a frequency with which the specific topic is likely to occur in posts of the specific social media user that are created at the time in the future.

Type: Application

Filed: January 19, 2017

Publication date: July 19, 2018

Inventors: Paulo Rodrigo Cavalin, Maira Gatti de Bayser, Alexandre Rademaker, Cicero Nogueira Dos Santos
TEXT CLASSIFICATION BY RANKING WITH CONVOLUTIONAL NEURAL NETWORKS

Publication number: 20170308790

Abstract: According to an aspect a method includes configuring a convolutional neural network (CNN) for classifying text based on word embedding features into a predefined set of classes identified by class labels. The predefined set of classes includes a class labeled none-of-the-above for text that does not fit into any of the other classes in the predefined set of classes. The CNN is trained based on a set of training data. The training includes learning parameters of class distributed vector representations (DVRs) of each of the predefined set of classes. The learning includes minimizing a pair-wise ranking loss function over the set of training data. A class embedding matrix of the class DVRs of the predefined set of classes that excludes a class embedding for the none-of-the-above class is generated. Each column in the class embedding matrix corresponds to one of the predefined classes.

Type: Application

Filed: April 21, 2016

Publication date: October 26, 2017

Inventors: Cicero Nogueira dos Santos, Bing Xiang, Bowen Zhou

1 2 next