Patents by Inventor Ariel Gedaliah Kobren

Ariel Gedaliah Kobren has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Extracted model adversaries for improved black box attacks

Patent number: 12265890

Abstract: Techniques are described for identifying successful adversarial attacks for a black box reading comprehension model using an extracted white box reading comprehension model. The system trains a white box reading comprehension model that behaves similar to the black box reading comprehension model using the set of queries and corresponding responses from the black box reading comprehension model as training data. The system tests adversarial attacks, involving modified informational content for execution of queries, against the trained white box reading comprehension model. Queries used for successful attacks on the white box model may be applied to the black box model itself as part of a black box improvement process.

Type: Grant

Filed: December 9, 2020

Date of Patent: April 1, 2025

Assignee: Oracle International Corporation

Inventors: Naveen Jafer Nizar, Ariel Gedaliah Kobren
Guided augmentation of data sets for machine learning models

Patent number: 12242568

Abstract: Techniques are disclosed for augmenting data sets used for training machine learning models and for generating predictions by trained machine learning models. These techniques may increase a number and diversity of examples within an initial training dataset of sentences by extracting a subset of words from the existing training dataset of sentences. The techniques may conserve scarce sample data in few-shot situations by training a data generation model using general data obtained from a general data source.

Type: Grant

Filed: September 6, 2022

Date of Patent: March 4, 2025

Assignee: Oracle International Corporation

Inventors: Ariel Gedaliah Kobren, Swetasudha Panda, Michael Louis Wick, Qinlan Shen, Jason Anthony Peck
Augmented Training Set Or Test Set For Improved Classification Model Robustness

Publication number: 20240184998

Abstract: A target set of texts, for training and/or evaluating a text classification model, is augmented using insertions into a base text within the original target set. In an embodiment, an expanded text, including the base text and an insertion word, must satisfy one or more inclusion criteria in order to be added to the target set. The inclusion criteria may require that the expanded text constitutes a successful attack on the classification model, the expanded text has a satisfactory perplexity score, and/or the expanded text is verified as being valid. In an embodiment, if a number of expanded texts added into the target set is below a threshold number, insertions are made into an expanded text (which was generated based on the base text). Inclusion criteria are evaluated against the doubly-expanded text to determine whether to add the doubly-expanded text to the target set.

Type: Application

Filed: February 8, 2024

Publication date: June 6, 2024

Applicant: Oracle International Corporation

Inventors: Naveen Jafer Nizar, Ariel Gedaliah Kobren
Augmented training set or test set for improved classification model robustness

Patent number: 11934795

Abstract: A target set of texts, for training and/or evaluating a text classification model, is augmented using insertions into a base text within the original target set. In an embodiment, an expanded text, including the base text and an insertion word, must satisfy one or more inclusion criteria in order to be added to the target set. The inclusion criteria may require that the expanded text constitutes a successful attack on the classification model, the expanded text has a satisfactory perplexity score, and/or the expanded text is verified as being valid. In an embodiment, if a number of expanded texts added into the target set is below a threshold number, insertions are made into an expanded text (which was generated based on the base text). Inclusion criteria are evaluated against the doubly-expanded text to determine whether to add the doubly-expanded text to the target set.

Type: Grant

Filed: August 3, 2021

Date of Patent: March 19, 2024

Assignee: Oracle International Corporation

Inventors: Naveen Jafer Nizar, Ariel Gedaliah Kobren
GUIDED AUGMENTION OF DATA SETS FOR MACHINE LEARNING MODELS

Publication number: 20230401286

Abstract: Techniques are disclosed for augmenting data sets used for training machine learning models and for generating predictions by trained machine learning models. These techniques may increase a number and diversity of examples within an initial training dataset of sentences by extracting a subset of words from the existing training dataset of sentences. The techniques may conserve scarce sample data in few-shot situations by training a data generation model using general data obtained from a general data source.

Type: Application

Filed: September 6, 2022

Publication date: December 14, 2023

Applicant: Oracle International Corporation

Inventors: Ariel Gedaliah Kobren, Swetasudha Panda, Michael Louis Wick, Qinlan Shen, Jason Anthony Peck
AUGMENTING DATA SETS FOR SELECTING MACHINE LEARNING MODELS

Publication number: 20230401285

Abstract: Techniques are disclosed for augmenting data sets used for training machine learning models and for generating predictions by trained machine learning models. The techniques generate synthesized data from sample data and train a machine learning model using the synthesized data to augment a sample data set. Embodiments selectively partition the sample data set and synthesized data into a training data and a validation data, which are used to generate and select machine learning models.

Type: Application

Filed: September 6, 2022

Publication date: December 14, 2023

Applicant: Oracle International Corporation

Inventors: Ariel Gedaliah Kobren, Swetasudha Panda, Michael Louis Wick, Qinlan Shen, Jason Anthony Peck
ENTROPY-BASED ANTI-MODELING FOR MACHINE LEARNING APPLICATIONS

Publication number: 20230368015

Abstract: Techniques are described herein for training and applying machine learning models. The techniques include implementing an entropy-based loss function for training high-capacity machine learning models, such as deep neural networks, with anti-modeling. The entropy-based loss function may cause the model to have high entropy on negative data, helping prevent the model from becoming confidently wrong about the negative data while reducing the likelihood of generalizing from disfavored signals.

Type: Application

Filed: September 8, 2022

Publication date: November 16, 2023

Applicant: Oracle International Corporation

Inventors: Michael Louis Wick, Ariel Gedaliah Kobren, Swetasudha Panda
AUGMENTING DATA SETS FOR MACHINE LEARNING MODELS

Publication number: 20230032208

Abstract: Techniques are disclosed for augmenting data sets used for training machine learning models and for generating predictions by trained machine learning models. These techniques may increase a number (and diversity) of examples within an initial training dataset of sentences by extracting a subset of words from the existing training dataset of sentences. The extracted subset includes no stopwords and fewer content words than found in the initial training dataset. The remaining words may be re-ordered. Using the extracted and re-ordered subset of words, the dataset generation model produces a second set of sentences that are different from the first set. The second set of sentences may be used to increase a number of examples in classes with few examples.

Type: Application

Filed: July 30, 2021

Publication date: February 2, 2023

Applicant: Oracle International Corporation

Inventors: Ariel Gedaliah Kobren, Naveen Jafer Nizar, Michael Louis Wick, Swetasudha Panda
AUGMENTED TRAINING SET OR TEST SET FOR IMPROVED CLASSIFICATION MODEL ROBUSTNESS

Publication number: 20220245362

Abstract: A target set of texts, for training and/or evaluating a text classification model, is augmented using insertions into a base text within the original target set. In an embodiment, an expanded text, including the base text and an insertion word, must satisfy one or more inclusion criteria in order to be added to the target set. The inclusion criteria may require that the expanded text constitutes a successful attack on the classification model, the expanded text has a satisfactory perplexity score, and/or the expanded text is verified as being valid. In an embodiment, if a number of expanded texts added into the target set is below a threshold number, insertions are made into an expanded text (which was generated based on the base text). Inclusion criteria are evaluated against the doubly-expanded text to determine whether to add the doubly-expanded text to the target set.

Type: Application

Filed: August 3, 2021

Publication date: August 4, 2022

Applicant: Oracle International Corporation

Inventors: Naveen Jafer Nizar, Ariel Gedaliah Kobren
EXTRACTED MODEL ADVERSARIES FOR IMPROVED BLACK BOX ATTACKS

Publication number: 20220051134

Abstract: Techniques are described for identifying successful adversarial attacks for a black box reading comprehension model using an extracted white box reading comprehension model. The system trains a white box reading comprehension model that behaves similar to the black box reading comprehension model using the set of queries and corresponding responses from the black box reading comprehension model as training data. The system tests adversarial attacks, involving modified informational content for execution of queries, against the trained white box reading comprehension model. Queries used for successful attacks on the white box model may be applied to the black box model itself as part of a black box improvement process.

Type: Application

Filed: December 9, 2020

Publication date: February 17, 2022

Applicant: Oracle International Corporation

Inventors: Naveen Jafer Nizar, Ariel Gedaliah Kobren