Patents by Inventor Robert James Oberbreckling

Robert James Oberbreckling has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHODS FOR TRANSFORMING CATEGORICAL SOURCE DATA DISTRIBUTIONS INTO TARGET DATA DISTRIBUTIONS FOR USE BY MACHINE LEARNING MODEL TRAINING

Publication number: 20230409958

Abstract: Techniques for optimizing the sampling of both the highest resource source category and lower resource source categories of a given categorical source data set with an initial distribution such that a target distribution of the given categorical source data set may be reached are described. Data items of the respective categories may be indexed and sampled such that the number of data items that are sampled more than once are tracked. Sampling, according to the target distribution, may continue until stop criteria are satisfied. In some cases, the respective indexes are used to determine the moment at which the highest resource source category is fully sampled, therefore minimizing the number of duplicate data items and optimizing the use of the respective categories of data items. The target distribution may then be used to train a machine learning model.

Type: Application

Filed: June 17, 2022

Publication date: December 21, 2023

Inventor: Robert James Oberbreckling
Techniques for relationship discovery between datasets

Patent number: 11704321

Abstract: The present disclosure related to techniques for analyzing data from multiple different data sources to determine a relationship between the data (also referred to herein a “data relationship discovery”). The relationships between any two compared datasets may be used to determine one or more recommendations for merging (e.g., joining), or “blending,” the data sets together. Relationship discovery may include determining a relationship between a subset of data, such as a relationship between a pair of columns, or column pair, each column in a different dataset of the datasets that are compared. Given two datasets to process for relationship discovery, relationship discovery may identify and recommends a ranked subset of column pairs between two compared datasets. The ranked column pairs identified as a relationship may be useful for blending the datasets with respect to those column pairs.

Type: Grant

Filed: March 23, 2020

Date of Patent: July 18, 2023

Assignee: Oracle International Corporation

Inventors: Robert James Oberbreckling, Luis E. Rivas, Michael Malak, Glenn Allen Murray
Techniques for dataset similarity discovery

Patent number: 11163527

Abstract: The present disclosure relates to techniques for analysis of data from multiple different data sources to determine similarity amongst the datasets. Determining a similarity between datasets may be useful for downstream processing of those datasets for different uses. A graphical interface may be provided to display detailed results including: a similarity prediction, data similarity prediction, column order similarity prediction, document type similarity prediction, prediction of overlapping or related columns, orphaned column prediction (e.g., a left orphaned column or a right orphaned column).

Type: Grant

Filed: August 27, 2019

Date of Patent: November 2, 2021

Assignee: Oracle International Corporation

Inventors: Robert James Oberbreckling, Luis E. Rivas
Bias parameters for topic modeling

Patent number: 10990763

Abstract: Systems and methods are disclosed to improve a topic modeling system that tunes a topic model for a set of topics from a corpus of documents, by allowing users to pre-inform the tuning process with bias parameters for desired associations in the topic model. In embodiments, the topic model may be a Latent Dirichlet Allocation (LDA) model. In embodiments, the bias parameter may indicate a fixed association where a particular word in a particular document is associated with a particular topic. In embodiments, the bias parameter may specify a weight value that biases the inference process with regard to a particular association. Advantageously, the disclosed features allow users to specify a small number of parameters to steer the tuning process towards a set of desired topics. As a result, the topic model may be generated more quickly and with more useful topics.

Type: Grant

Filed: May 9, 2019

Date of Patent: April 27, 2021

Assignee: Oracle International Corporation

Inventors: Daniel Peterson, Jean-Baptiste Frederic George Tristan, Robert James Oberbreckling
Data standardization techniques

Patent number: 10885056

Abstract: Techniques are disclosed for standardization of data. According to a first technique, standard representation terms are determined for to-be-standardized data using the to-be-standardized data itself and without using any external reference data. According to a second technique, a combination of the to-be-standardized data and an external reference is used to determine standard representation terms for the to-be-standardized data.

Type: Grant

Filed: September 25, 2018

Date of Patent: January 5, 2021

Assignee: ORACLE INTERNATIONAL CORPORATION

Inventors: Michael Malak, Luis E. Rivas, Mark L. Kreider, Philip Ogren, Robert James Oberbreckling
BIAS PARAMETERS FOR TOPIC MODELING

Publication number: 20200279019

Abstract: Systems and methods are disclosed to improve a topic modeling system that tunes a topic model for a set of topics from a corpus of documents, by allowing users to pre-inform the tuning process with bias parameters for desired associations in the topic model. In embodiments, the topic model may be a Latent Dirichlet Allocation (LDA) model. In embodiments, the bias parameter may indicate a fixed association where a particular word in a particular document is associated with a particular topic. In embodiments, the bias parameter may specify a weight value that biases the inference process with regard to a particular association. Advantageously, the disclosed features allow users to specify a small number of parameters to steer the tuning process towards a set of desired topics. As a result, the topic model may be generated more quickly and with more useful topics.

Type: Application

Filed: May 9, 2019

Publication date: September 3, 2020

Inventors: Daniel Peterson, Jean-Baptiste Frederic George Tristan, Robert James Oberbreckling
TECHNIQUES FOR RELATIONSHIP DISCOVERY BETWEEN DATASETS

Publication number: 20200242111

Abstract: The present disclosure related to techniques for analyzing data from multiple different data sources to determine a relationship between the data (also referred to herein a “data relationship discovery”). The relationships between any two compared datasets may be used to determine one or more recommendations for merging (e.g., joining), or “blending,” the data sets together. Relationship discovery may include determining a relationship between a subset of data, such as a relationship between a pair of columns, or column pair, each column in a different dataset of the datasets that are compared. Given two datasets to process for relationship discovery, relationship discovery may identify and recommends a ranked subset of column pairs between two compared datasets. The ranked column pairs identified as a relationship may be useful for blending the datasets with respect to those column pairs.

Type: Application

Filed: March 23, 2020

Publication date: July 30, 2020

Applicant: Oracle International Corporation

Inventors: Robert James Oberbreckling, Luis E. Rivas, Michael Malak, Glenn Allen Murray
Techniques for relationship discovery between datasets

Patent number: 10650000

Abstract: The present disclosure related to techniques for analyzing data from multiple different data sources to determine a relationship between the data (also referred to herein a “data relationship discovery”). The relationships between any two compared datasets may be used to determine one or more recommendations for merging (e.g., joining), or “blending,” the data sets together. Relationship discovery may include determining a relationship between a subset of data, such as a relationship between a pair of columns, or column pair, each column in a different dataset of the datasets that are compared. Given two datasets to process for relationship discovery, relationship discovery may identify and recommends a ranked subset of column pairs between two compared datasets. The ranked column pairs identified as a relationship may be useful for blending the datasets with respect to those column pairs.

Type: Grant

Filed: September 14, 2017

Date of Patent: May 12, 2020

Assignee: Oracle International Corporation

Inventors: Robert James Oberbreckling, Luis E. Rivas, Michael Malak, Glenn Allen Murray
TECHNIQUES FOR DATASET SIMILARITY DISCOVERY

Publication number: 20190384571

Abstract: The present disclosure relates to techniques for analysis of data from multiple different data sources to determine similarity amongst the datasets. Determining a similarity between datasets may be useful for downstream processing of those datasets for different uses. A graphical interface may be provided to display detailed results including: a similarity prediction, data similarity prediction, column order similarity prediction, document type similarity prediction, prediction of overlapping or related columns, orphaned column prediction (e.g., a left orphaned column or a right orphaned column).

Type: Application

Filed: August 27, 2019

Publication date: December 19, 2019

Applicant: Oracle International Corporation

Inventors: Robert James Oberbreckling, Luis E. Rivas
Techniques for dataset similarity discovery

Patent number: 10445062

Abstract: The present disclosure relates to techniques for analysis of data from multiple different data sources to determine similarity amongst the datasets. Determining a similarity between datasets may be useful for downstream processing of those datasets for different uses. A graphical interface may be provided to display detailed results including: a similarity prediction, data similarity prediction, column order similarity prediction, document type similarity prediction, prediction of overlapping or related columns, orphaned column prediction (e.g., a left orphaned column or a right orphaned column). Detecting similarities may be useful for leveraging prior data transformations generated for the datasets that are analyzed.

Type: Grant

Filed: September 15, 2017

Date of Patent: October 15, 2019

Assignee: Oracle International Corporation

Inventors: Robert James Oberbreckling, Luis E. Rivas
DATA STANDARDIZATION TECHNIQUES

Publication number: 20190102441

Abstract: Techniques are disclosed for standardization of data. According to a first technique, standard representation terms are determined for to-be-standardized data using the to-be-standardized data itself and without using any external reference data. According to a second technique, a combination of the to-be-standardized data and an external reference is used to determine standard representation terms for the to-be-standardized data.

Type: Application

Filed: September 25, 2018

Publication date: April 4, 2019

Applicant: Oracle International Corporation

Inventors: Michael Malak, Luis E. Rivas, Mark L. Kreider, Philip Ogren, Robert James Oberbreckling
TECHNIQUES FOR DATASET SIMILARITY DISCOVERY

Publication number: 20180074786

Abstract: The present disclosure relates to techniques for analysis of data from multiple different data sources to determine similarity amongst the datasets. Determining a similarity between datasets may be useful for downstream processing of those datasets for different uses. A graphical interface may be provided to display detailed results including: a similarity prediction, data similarity prediction, column order similarity prediction, document type similarity prediction, prediction of overlapping or related columns, orphaned column prediction (e.g., a left orphaned column or a right orphaned column). Detecting similarities may be useful for leveraging prior data transformations generated for the datasets that are analyzed.

Type: Application

Filed: September 15, 2017

Publication date: March 15, 2018

Applicant: Oracle International Corporation

Inventors: Robert James Oberbreckling, Luis E. Rivas
TECHNIQUES FOR RELATIONSHIP DISCOVERY BETWEEN DATASETS

Publication number: 20180075104

Abstract: The present disclosure related to techniques for analyzing data from multiple different data sources to determine a relationship between the data (also referred to herein a “data relationship discovery”). The relationships between any two compared datasets may be used to determine one or more recommendations for merging (e.g., joining), or “blending,” the data sets together. Relationship discovery may include determining a relationship between a subset of data, such as a relationship between a pair of columns, or column pair, each column in a different dataset of the datasets that are compared. Given two datasets to process for relationship discovery, relationship discovery may identify and recommends a ranked subset of column pairs between two compared datasets. The ranked column pairs identified as a relationship may be useful for blending the datasets with respect to those column pairs.

Type: Application

Filed: September 14, 2017

Publication date: March 15, 2018

Applicant: Oracle International Corporation

Inventors: Robert James Oberbreckling, Luis E. Rivas, Michael Malak, Glenn Allen Murray