Patents by Inventor Shafi Ur Rahman

Shafi Ur Rahman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SYNTHETIC DATA GENERATION FOR MACHINE LEARNING MODELS

Publication number: 20240112045

Abstract: A method may include generating synthetic data based on input data and training a machine learning model based on the synthetic data. The synthetic data may be generated by determining a plurality of data points representing an archetype probability distribution of a plurality of archetypes, clustering the plurality of data points into one or more clusters associated with transactional behavior patterns, generating a threshold metric representing a peak distribution density of the plurality of data points associated with a corresponding cluster, removing, from the plurality of data points, one or more non-representative data points to define a reduced set of the plurality of data points, generating an updated archetype probability distribution based at least on the reduced set of the plurality of data points, and generating representative transaction data based on the updated archetype probability distribution and threshold metric. Related methods and articles of manufacture are al so disclosed.

Type: Application

Filed: September 30, 2022

Publication date: April 4, 2024

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
OVERLY OPTIMISTIC DATA PATTERNS AND LEARNED ADVERSARIAL LATENT FEATURES

Publication number: 20240039934

Abstract: Systems for improving security of a computer-implemented artificial intelligence by monitoring one or more transactions received by the machine learning decision model; receiving a first score generated by the machine learning decision model in association with a first transaction; identifying the first transaction as belonging to a first class, in response to the first score being lower than a certain score threshold and the first transaction having a low occurrence likelihood; receiving a second score in association with the first transaction based on one or more adversarial latent features associated with the first transaction as detectable by an adversary detection model; and determining at least one adversarial latent transaction feature being exploited by the first transaction, in response to determining that the second score falls above the certain score threshold.

Type: Application

Filed: October 11, 2023

Publication date: February 1, 2024

Applicant: FICO

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
Overly optimistic data patterns and learned adversarial latent features

Patent number: 11818147

Abstract: Systems, methods and computer program products for improving security of artificial intelligence systems. The system comprising processors for monitoring one or more transactions received by a machine learning decision model to determine a first score associated with a first transaction. The first transaction may be identified as likely adversarial, in response to the first score being lower than a certain score threshold and the first transaction having a low occurrence likelihood. A second score may be generated in association with the first transaction based on one or more adversarial latent features associated with the first transaction. At least one adversarial latent feature may be detected as being exploited by the first transaction, in response to determining that the second score falls above the certain score threshold. Accordingly, an abnormal volume of activations of adversarial latent features spanning across a plurality of transactions scored may be detected and blocked.

Type: Grant

Filed: November 23, 2020

Date of Patent: November 14, 2023

Assignee: Fair Isaac Corporation

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
Building resilient models to address dynamic customer data use rights

Patent number: 11709918

Abstract: A system and method for constructing an improved computing model that preserves use rights for data utilized by the model. A first dataset is accessed to build a computing model. The first data set is subject to terminable usage rights provisions. A portion of the first dataset is sampled to generate a second dataset. Vectors present in the first dataset and the second dataset are discretized. In response to determine that the usage rights associated with the primary dataset have been terminated, a coverage depletion for the second dataset is computed based on the usage rights termination associated with the first dataset. An estimated mean time to coverage failure for the first model based on the depletion coverage is determined for the second dataset. One or more data points are removed from the first dataset due to the termination of usage rights.

Type: Grant

Filed: April 17, 2020

Date of Patent: July 25, 2023

Assignee: FAIR ISAAC CORPORATION

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
Latent feature dimensionality bounds for robust machine learning on high dimensional datasets

Patent number: 11687804

Abstract: Computer-implemented methods and systems for quantifying appropriate machine learning model complexity corresponding to training dataset are provided. The method comprises monitoring, using one or more processors, N observed variables, v1 through vN, of a training dataset for a machine learning model; translating the N observed variables into m equisized bin indexes which generate mN possible equisized hypercells to estimate a fundamental dimensionality for the dataset; generating one or more samples by assigning a record in the dataset with numbers j through k as set id; generating a merged sample Si, for one or more values of the set id i, where i goes from j to k; and computing a fractal dimension of the equisized hypercube phase space based on count of cells with data coverage of at least one data point.

Type: Grant

Filed: June 30, 2020

Date of Patent: June 27, 2023

Assignee: Fair Isaac Corporation

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
LATENT FEATURE BASED MODEL BIAS MITIGATION IN ARTIFICIAL INTELLIGENCE SYSTEMS

Publication number: 20230085575

Abstract: To eliminating bias from artificial intelligent (AI) systems, a list of class identifiers and features derived from class identifiers represented in training data fed to an AI system are identified for purpose of training a predictive model. Correlation analysis of input features is conducted from a list of raw variables, r, in a dataset and a plurality of derived features, x, with one or more class identifiers in the list of class identifiers and features derived from these class identifiers. A first list of input features is identified, one or more input features are in the first list belonging to and correlated with the one or more class identifiers or features derived from class identifiers. A second list of sets of input features is created to identify a set of combinations of input features that are not allowed to interact based on identifying biased latent features.

Type: Application

Filed: September 13, 2021

Publication date: March 16, 2023

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
TEMPORAL EXPLANATIONS OF MACHINE LEARNING MODEL OUTCOMES

Publication number: 20220374744

Abstract: In transactional systems where past transactions can have impact on the current score of a machine learning based decision model, the transactions that are most responsible for the score and the associated reasons are determined by the transactional system. A system and method identifies such past transactions that maximally impact the current score and allow for a more effective understanding of the scores generated by a model in a transactional system and explanation of specific transactions for automated decisioning, to explain the scores in terms of past transactions. Further an existing instance-based explanation system is used to identify the reasons for the score, and how the identified transactions influence these reasons. A combination of impact on score and impact on reasons determines the most impactful past transaction with respect to the most recent score being explained.

Type: Application

Filed: May 17, 2022

Publication date: November 24, 2022

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
METHOD AND APPARATUS FOR ANALYZING COVERAGE, BIAS, AND MODEL EXPLANATIONS IN LARGE DIMENSIONAL MODELING DATA

Publication number: 20220358111

Abstract: A system and method for analyzing coverage, bias and model explanations in large dimensional modeling data includes discretizing three or more variables of a dataset to generate a discretized phase space represented as a grid of a plurality of cells, the dataset comprising a plurality of records, each record of the plurality of records having a value and a unique identifier (ID). A grid transformation is applied to each record in the dataset to assign each record to a cell of the plurality of cells of the grid according to the grid transformation. A grid index is generated to reference each cell using a discretized feature vector. A grid storage for storing the records assigned to each cell of the grid is then created. The grid storage using the ID of each record as a reference to each record and the discretized feature vector as a key to each cell.

Type: Application

Filed: May 10, 2022

Publication date: November 10, 2022

Applicant: Fair Isaac Corporation

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
Temporal explanations of machine learning model outcomes

Patent number: 11373109

Abstract: In transactional systems where past transactions can have impact on the current score of a machine learning based decision model, the transactions that are most responsible for the score and the associated reasons are determined by the transactional system. A system and method identifies such past transactions that maximally impact the current score and allow for a more effective understanding of the scores generated by a model in a transactional system and explanation of specific transactions for automated decisioning, to explain the scores in terms of past transactions. Further an existing instance-based explanation system is used to identify the reasons for the score, and how the identified transactions influence these reasons. A combination of impact on score and impact on reasons determines the most impactful past transaction with respect to the most recent score being explained.

Type: Grant

Filed: July 2, 2019

Date of Patent: June 28, 2022

Assignee: FAIR ISAAC CORPORATION

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
OVERLY OPTIMISTIC DATA PATTERNS AND LEARNED ADVERSARIAL LATENT FEATURES

Publication number: 20220166782

Abstract: Systems, methods and computer program products for improving security of artificial intelligence systems. The system comprising processors for monitoring one or more transactions received by a machine learning decision model to determine a first score associated with a first transaction. The first transaction may be identified as likely adversarial, in response to the first score being lower than a certain score threshold and the first transaction having a low occurrence likelihood. A second score may be generated in association with the first transaction based on one or more adversarial latent features associated with the first transaction. At least one adversarial latent feature may be detected as being exploited by the first transaction, in response to determining that the second score falls above the certain score threshold. Accordingly, an abnormal volume of activations of adversarial latent features spanning across a plurality of transactions scored may be detected and blocked.

Type: Application

Filed: November 23, 2020

Publication date: May 26, 2022

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
LATENT FEATURE DIMENSIONALITY BOUNDS FOR ROBUST MACHINE LEARNING ON HIGH DIMENSIONAL DATASETS

Publication number: 20210406724

Abstract: Computer-implemented methods and systems for quantifying appropriate machine learning model complexity corresponding to training dataset are provided. The method comprises monitoring, using one or more processors, N observed variables, v1 through vN, of a training dataset for a machine learning model; translating the N observed variables into m equisized bin indexes which generate mN possible equisized hypercells to estimate a fundamental dimensionality for the dataset; generating one or more samples by assigning a record in the dataset with numbers j through k as set id; generating a merged sample Si, for one or more values of the set id i, where i goes from j to k; and computing a fractal dimension of the equisized hypercube phase space based on count of cells with data coverage of at least one data point.

Type: Application

Filed: June 30, 2020

Publication date: December 30, 2021

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
Data-driven product grouping

Patent number: 11087339

Abstract: Data for a plurality of entities that can be offered a plurality of products can be obtained. The data can include categorical data and numeric data. Based on business constraints, some of all of the data can be selected. The selected data can be converted to another set of numeric data, wherein the categorical values are converted to numeric values. Dimensions of the converted data can be reduced to generate another set of data. Based on this another set of data, clusters of entities can be formed. The products can be grouped by assigning a unique product identifier of each product to a corresponding cluster. This grouping of products can be used by a predictive model to predict a likelihood of an entity to purchase a particular product in a future time period. Related methods, apparatus, systems, techniques and articles are also described.

Type: Grant

Filed: October 9, 2017

Date of Patent: August 10, 2021

Assignee: FAIR ISAAC CORPORATION

Inventors: Amit Kiran Sowani, Eeshan Malhotra, Shafi Ur Rahman
TEMPORAL EXPLANATIONS OF MACHINE LEARNING MODEL OUTCOMES

Publication number: 20210004703

Abstract: In transactional systems where past transactions can have impact on the current score of a machine learning based decision model, the transactions that are most responsible for the score and the associated reasons are determined by the transactional system. A system and method identifies such past transactions that maximally impact the current score and allow for a more effective understanding of the scores generated by a model in a transactional system and explanation of specific transactions for automated decisioning, to explain the scores in terms of past transactions. Further an existing instance-based explanation system is used to identify the reasons for the score, and how the identified transactions influence these reasons. A combination of impact on score and impact on reasons determines the most impactful past transaction with respect to the most recent score being explained.

Type: Application

Filed: July 2, 2019

Publication date: January 7, 2021

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
BUILDING RESILIENT MODELS TO ADDRESS DYNAMIC CUSTOMER DATA USE RIGHTS

Publication number: 20200242216

Abstract: A system and method for constructing an improved computing model that preserves use rights for data utilized by the model. A first dataset is accessed to build a computing model. The first data set is subject to terminable usage rights provisions. A portion of the first dataset is sampled to generate a second dataset. Vectors present in the first dataset and the second dataset are discretized. In response to determine that the usage rights associated with the primary dataset have been terminated, a coverage depletion for the second dataset is computed based on the usage rights termination associated with the first dataset. An estimated mean time to coverage failure for the first model based on the depletion coverage is determined for the second dataset. One or more data points are removed from the first dataset due to the termination of usage rights.

Type: Application

Filed: April 17, 2020

Publication date: July 30, 2020

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
Identifying latent states of machines based on machine logs

Patent number: 10713140

Abstract: The state of a system is determined in which data sets are generated that include a plurality of data instances representing states of one or more components of a computer system. The data instances generated by one or more data set sources that are configured to output a data instance in response to a trigger associated with the one or more components. The data instances are normalized by the application of one or more rules. The data instances from individual data set sources are separately collated to generate groups of time-specific collated data instances. State types may be assigned to each of the collated data instance groups. Distributions of state-types across the groups may be determined and a list of infrequent state-types may be generated based on the determined distributions of state-types across the groups.

Type: Grant

Filed: June 10, 2015

Date of Patent: July 14, 2020

Assignee: FAIR ISAAC CORPORATION

Inventors: Ashish Gupta, Shafi Ur Rahman, Sambandan Murugan
Building resilient models to address dynamic customer data use rights

Patent number: 10657229

Abstract: A system and method of building a decision or prediction model used for analyzing and scoring behavioral transactions is disclosed. A customer dataset in a model development store is used to build an original model is subject to a data right usage withdrawal, the original model having coverage over the customer dataset extract, using data sampling, a portion of the customer dataset to generate a model surrogate dataset. The system and method discretize vectors present in both the model surrogate dataset and the customer dataset, and receive data representing the data right usage withdrawal from the customer dataset. The system and method determine a depletion of the model surrogate dataset according to the data right usage withdrawal, and compute an estimated mean time to coverage failure of the original model based on the depletion of the model surrogate dataset according to the data right usage withdrawal.

Type: Grant

Filed: November 21, 2017

Date of Patent: May 19, 2020

Assignee: Fair Isaac Corporation

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
Detecting anomalous states of machines

Patent number: 10360093

Abstract: The state of a system is determined in which data sets are generated that include a plurality of data instances representing states of one or more components of a computer system. The data instances generated by one or more data set sources that are configured to output a data instance in response to a trigger associated with the one or more components. The data instances are normalized by the application of one or more rules. The data instances from individual data set sources are separately collated to generate groups of time-specific collated data instances. State types may be assigned to each of the collated data instance groups. Distributions of state-types across the groups may be determined and a list of infrequent state-types may be generated based on the determined distributions of state-types across the groups.

Type: Grant

Filed: November 18, 2015

Date of Patent: July 23, 2019

Assignee: Fair Isaac Corporation

Inventors: Shafi Ur Rahman, Ashish Gupta
Building Resilient Models to Address Dynamic Customer Data Use Rights

Publication number: 20190155996

Abstract: A system and method of building a decision or prediction model used for analyzing and scoring behavioral transactions is disclosed. A customer dataset in a model development store is used to build an original model is subject to a data right usage withdrawal, the original model having coverage over the customer dataset extract, using data sampling, a portion of the customer dataset to generate a model surrogate dataset. The system and method discretize vectors present in both the model surrogate dataset and the customer dataset, and receive data representing the data right usage withdrawal from the customer dataset. The system and method determine a depletion of the model surrogate dataset according to the data right usage withdrawal, and compute an estimated mean time to coverage failure of the original model based on the depletion of the model surrogate dataset according to the data right usage withdrawal.

Type: Application

Filed: November 21, 2017

Publication date: May 23, 2019

Inventors: Scott Michael Zoldi, Shafi Ur Rahman
Data-Driven Product Grouping

Publication number: 20180032916

Abstract: Data for a plurality of entities that can be offered a plurality of products can be obtained. The data can include categorical data and numeric data. Based on business constraints, some of all of the data can be selected. The selected data can be converted to another set of numeric data, wherein the categorical values are converted to numeric values. Dimensions of the converted data can be reduced to generate another set of data. Based on this another set of data, clusters of entities can be formed. The products can be grouped by assigning a unique product identifier of each product to a corresponding cluster. This grouping of products can be used by a predictive model to predict a likelihood of an entity to purchase a particular product in a future time period. Related methods, apparatus, systems, techniques and articles are also described.

Type: Application

Filed: October 9, 2017

Publication date: February 1, 2018

Inventors: Amit Kiran Sowani, Eeshan Malhotra, Shafi Ur Rahman
Data-driven product grouping

Patent number: 9785890

Abstract: Data for a plurality of entities that can be offered a plurality of products can be obtained. The data can include categorical data and numeric data. Based on business constraints, some of all of the data can be selected. The selected data can be converted to another set of numeric data, wherein the categorical values are converted to numeric values. Dimensions of the converted data can be reduced to generate another set of data. Based on this another set of data, clusters of entities can be formed. The products can be grouped by assigning a unique product identifier of each product to a corresponding cluster. This grouping of products can be used by a predictive model to predict a likelihood of an entity to purchase a particular product in a future time period. Related methods, apparatus, systems, techniques and articles are also described.

Type: Grant

Filed: August 10, 2012

Date of Patent: October 10, 2017

Assignee: FAIR ISAAC CORPORATION

Inventors: Amit Sowani, Eeshan Malhotra, Shafi Ur Rahman

1 2 next