Patents by Inventor Shafi Ur Rahman
Shafi Ur Rahman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240112045Abstract: A method may include generating synthetic data based on input data and training a machine learning model based on the synthetic data. The synthetic data may be generated by determining a plurality of data points representing an archetype probability distribution of a plurality of archetypes, clustering the plurality of data points into one or more clusters associated with transactional behavior patterns, generating a threshold metric representing a peak distribution density of the plurality of data points associated with a corresponding cluster, removing, from the plurality of data points, one or more non-representative data points to define a reduced set of the plurality of data points, generating an updated archetype probability distribution based at least on the reduced set of the plurality of data points, and generating representative transaction data based on the updated archetype probability distribution and threshold metric. Related methods and articles of manufacture are al so disclosed.Type: ApplicationFiled: September 30, 2022Publication date: April 4, 2024Inventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Publication number: 20240039934Abstract: Systems for improving security of a computer-implemented artificial intelligence by monitoring one or more transactions received by the machine learning decision model; receiving a first score generated by the machine learning decision model in association with a first transaction; identifying the first transaction as belonging to a first class, in response to the first score being lower than a certain score threshold and the first transaction having a low occurrence likelihood; receiving a second score in association with the first transaction based on one or more adversarial latent features associated with the first transaction as detectable by an adversary detection model; and determining at least one adversarial latent transaction feature being exploited by the first transaction, in response to determining that the second score falls above the certain score threshold.Type: ApplicationFiled: October 11, 2023Publication date: February 1, 2024Applicant: FICOInventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Patent number: 11818147Abstract: Systems, methods and computer program products for improving security of artificial intelligence systems. The system comprising processors for monitoring one or more transactions received by a machine learning decision model to determine a first score associated with a first transaction. The first transaction may be identified as likely adversarial, in response to the first score being lower than a certain score threshold and the first transaction having a low occurrence likelihood. A second score may be generated in association with the first transaction based on one or more adversarial latent features associated with the first transaction. At least one adversarial latent feature may be detected as being exploited by the first transaction, in response to determining that the second score falls above the certain score threshold. Accordingly, an abnormal volume of activations of adversarial latent features spanning across a plurality of transactions scored may be detected and blocked.Type: GrantFiled: November 23, 2020Date of Patent: November 14, 2023Assignee: Fair Isaac CorporationInventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Patent number: 11709918Abstract: A system and method for constructing an improved computing model that preserves use rights for data utilized by the model. A first dataset is accessed to build a computing model. The first data set is subject to terminable usage rights provisions. A portion of the first dataset is sampled to generate a second dataset. Vectors present in the first dataset and the second dataset are discretized. In response to determine that the usage rights associated with the primary dataset have been terminated, a coverage depletion for the second dataset is computed based on the usage rights termination associated with the first dataset. An estimated mean time to coverage failure for the first model based on the depletion coverage is determined for the second dataset. One or more data points are removed from the first dataset due to the termination of usage rights.Type: GrantFiled: April 17, 2020Date of Patent: July 25, 2023Assignee: FAIR ISAAC CORPORATIONInventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Patent number: 11687804Abstract: Computer-implemented methods and systems for quantifying appropriate machine learning model complexity corresponding to training dataset are provided. The method comprises monitoring, using one or more processors, N observed variables, v1 through vN, of a training dataset for a machine learning model; translating the N observed variables into m equisized bin indexes which generate mN possible equisized hypercells to estimate a fundamental dimensionality for the dataset; generating one or more samples by assigning a record in the dataset with numbers j through k as set id; generating a merged sample Si, for one or more values of the set id i, where i goes from j to k; and computing a fractal dimension of the equisized hypercube phase space based on count of cells with data coverage of at least one data point.Type: GrantFiled: June 30, 2020Date of Patent: June 27, 2023Assignee: Fair Isaac CorporationInventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Publication number: 20230085575Abstract: To eliminating bias from artificial intelligent (AI) systems, a list of class identifiers and features derived from class identifiers represented in training data fed to an AI system are identified for purpose of training a predictive model. Correlation analysis of input features is conducted from a list of raw variables, r, in a dataset and a plurality of derived features, x, with one or more class identifiers in the list of class identifiers and features derived from these class identifiers. A first list of input features is identified, one or more input features are in the first list belonging to and correlated with the one or more class identifiers or features derived from class identifiers. A second list of sets of input features is created to identify a set of combinations of input features that are not allowed to interact based on identifying biased latent features.Type: ApplicationFiled: September 13, 2021Publication date: March 16, 2023Inventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Publication number: 20220374744Abstract: In transactional systems where past transactions can have impact on the current score of a machine learning based decision model, the transactions that are most responsible for the score and the associated reasons are determined by the transactional system. A system and method identifies such past transactions that maximally impact the current score and allow for a more effective understanding of the scores generated by a model in a transactional system and explanation of specific transactions for automated decisioning, to explain the scores in terms of past transactions. Further an existing instance-based explanation system is used to identify the reasons for the score, and how the identified transactions influence these reasons. A combination of impact on score and impact on reasons determines the most impactful past transaction with respect to the most recent score being explained.Type: ApplicationFiled: May 17, 2022Publication date: November 24, 2022Inventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Publication number: 20220358111Abstract: A system and method for analyzing coverage, bias and model explanations in large dimensional modeling data includes discretizing three or more variables of a dataset to generate a discretized phase space represented as a grid of a plurality of cells, the dataset comprising a plurality of records, each record of the plurality of records having a value and a unique identifier (ID). A grid transformation is applied to each record in the dataset to assign each record to a cell of the plurality of cells of the grid according to the grid transformation. A grid index is generated to reference each cell using a discretized feature vector. A grid storage for storing the records assigned to each cell of the grid is then created. The grid storage using the ID of each record as a reference to each record and the discretized feature vector as a key to each cell.Type: ApplicationFiled: May 10, 2022Publication date: November 10, 2022Applicant: Fair Isaac CorporationInventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Patent number: 11373109Abstract: In transactional systems where past transactions can have impact on the current score of a machine learning based decision model, the transactions that are most responsible for the score and the associated reasons are determined by the transactional system. A system and method identifies such past transactions that maximally impact the current score and allow for a more effective understanding of the scores generated by a model in a transactional system and explanation of specific transactions for automated decisioning, to explain the scores in terms of past transactions. Further an existing instance-based explanation system is used to identify the reasons for the score, and how the identified transactions influence these reasons. A combination of impact on score and impact on reasons determines the most impactful past transaction with respect to the most recent score being explained.Type: GrantFiled: July 2, 2019Date of Patent: June 28, 2022Assignee: FAIR ISAAC CORPORATIONInventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Publication number: 20220166782Abstract: Systems, methods and computer program products for improving security of artificial intelligence systems. The system comprising processors for monitoring one or more transactions received by a machine learning decision model to determine a first score associated with a first transaction. The first transaction may be identified as likely adversarial, in response to the first score being lower than a certain score threshold and the first transaction having a low occurrence likelihood. A second score may be generated in association with the first transaction based on one or more adversarial latent features associated with the first transaction. At least one adversarial latent feature may be detected as being exploited by the first transaction, in response to determining that the second score falls above the certain score threshold. Accordingly, an abnormal volume of activations of adversarial latent features spanning across a plurality of transactions scored may be detected and blocked.Type: ApplicationFiled: November 23, 2020Publication date: May 26, 2022Inventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Publication number: 20210406724Abstract: Computer-implemented methods and systems for quantifying appropriate machine learning model complexity corresponding to training dataset are provided. The method comprises monitoring, using one or more processors, N observed variables, v1 through vN, of a training dataset for a machine learning model; translating the N observed variables into m equisized bin indexes which generate mN possible equisized hypercells to estimate a fundamental dimensionality for the dataset; generating one or more samples by assigning a record in the dataset with numbers j through k as set id; generating a merged sample Si, for one or more values of the set id i, where i goes from j to k; and computing a fractal dimension of the equisized hypercube phase space based on count of cells with data coverage of at least one data point.Type: ApplicationFiled: June 30, 2020Publication date: December 30, 2021Inventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Patent number: 11087339Abstract: Data for a plurality of entities that can be offered a plurality of products can be obtained. The data can include categorical data and numeric data. Based on business constraints, some of all of the data can be selected. The selected data can be converted to another set of numeric data, wherein the categorical values are converted to numeric values. Dimensions of the converted data can be reduced to generate another set of data. Based on this another set of data, clusters of entities can be formed. The products can be grouped by assigning a unique product identifier of each product to a corresponding cluster. This grouping of products can be used by a predictive model to predict a likelihood of an entity to purchase a particular product in a future time period. Related methods, apparatus, systems, techniques and articles are also described.Type: GrantFiled: October 9, 2017Date of Patent: August 10, 2021Assignee: FAIR ISAAC CORPORATIONInventors: Amit Kiran Sowani, Eeshan Malhotra, Shafi Ur Rahman
-
Publication number: 20210004703Abstract: In transactional systems where past transactions can have impact on the current score of a machine learning based decision model, the transactions that are most responsible for the score and the associated reasons are determined by the transactional system. A system and method identifies such past transactions that maximally impact the current score and allow for a more effective understanding of the scores generated by a model in a transactional system and explanation of specific transactions for automated decisioning, to explain the scores in terms of past transactions. Further an existing instance-based explanation system is used to identify the reasons for the score, and how the identified transactions influence these reasons. A combination of impact on score and impact on reasons determines the most impactful past transaction with respect to the most recent score being explained.Type: ApplicationFiled: July 2, 2019Publication date: January 7, 2021Inventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Publication number: 20200242216Abstract: A system and method for constructing an improved computing model that preserves use rights for data utilized by the model. A first dataset is accessed to build a computing model. The first data set is subject to terminable usage rights provisions. A portion of the first dataset is sampled to generate a second dataset. Vectors present in the first dataset and the second dataset are discretized. In response to determine that the usage rights associated with the primary dataset have been terminated, a coverage depletion for the second dataset is computed based on the usage rights termination associated with the first dataset. An estimated mean time to coverage failure for the first model based on the depletion coverage is determined for the second dataset. One or more data points are removed from the first dataset due to the termination of usage rights.Type: ApplicationFiled: April 17, 2020Publication date: July 30, 2020Inventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Patent number: 10713140Abstract: The state of a system is determined in which data sets are generated that include a plurality of data instances representing states of one or more components of a computer system. The data instances generated by one or more data set sources that are configured to output a data instance in response to a trigger associated with the one or more components. The data instances are normalized by the application of one or more rules. The data instances from individual data set sources are separately collated to generate groups of time-specific collated data instances. State types may be assigned to each of the collated data instance groups. Distributions of state-types across the groups may be determined and a list of infrequent state-types may be generated based on the determined distributions of state-types across the groups.Type: GrantFiled: June 10, 2015Date of Patent: July 14, 2020Assignee: FAIR ISAAC CORPORATIONInventors: Ashish Gupta, Shafi Ur Rahman, Sambandan Murugan
-
Patent number: 10657229Abstract: A system and method of building a decision or prediction model used for analyzing and scoring behavioral transactions is disclosed. A customer dataset in a model development store is used to build an original model is subject to a data right usage withdrawal, the original model having coverage over the customer dataset extract, using data sampling, a portion of the customer dataset to generate a model surrogate dataset. The system and method discretize vectors present in both the model surrogate dataset and the customer dataset, and receive data representing the data right usage withdrawal from the customer dataset. The system and method determine a depletion of the model surrogate dataset according to the data right usage withdrawal, and compute an estimated mean time to coverage failure of the original model based on the depletion of the model surrogate dataset according to the data right usage withdrawal.Type: GrantFiled: November 21, 2017Date of Patent: May 19, 2020Assignee: Fair Isaac CorporationInventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Patent number: 10360093Abstract: The state of a system is determined in which data sets are generated that include a plurality of data instances representing states of one or more components of a computer system. The data instances generated by one or more data set sources that are configured to output a data instance in response to a trigger associated with the one or more components. The data instances are normalized by the application of one or more rules. The data instances from individual data set sources are separately collated to generate groups of time-specific collated data instances. State types may be assigned to each of the collated data instance groups. Distributions of state-types across the groups may be determined and a list of infrequent state-types may be generated based on the determined distributions of state-types across the groups.Type: GrantFiled: November 18, 2015Date of Patent: July 23, 2019Assignee: Fair Isaac CorporationInventors: Shafi Ur Rahman, Ashish Gupta
-
Publication number: 20190155996Abstract: A system and method of building a decision or prediction model used for analyzing and scoring behavioral transactions is disclosed. A customer dataset in a model development store is used to build an original model is subject to a data right usage withdrawal, the original model having coverage over the customer dataset extract, using data sampling, a portion of the customer dataset to generate a model surrogate dataset. The system and method discretize vectors present in both the model surrogate dataset and the customer dataset, and receive data representing the data right usage withdrawal from the customer dataset. The system and method determine a depletion of the model surrogate dataset according to the data right usage withdrawal, and compute an estimated mean time to coverage failure of the original model based on the depletion of the model surrogate dataset according to the data right usage withdrawal.Type: ApplicationFiled: November 21, 2017Publication date: May 23, 2019Inventors: Scott Michael Zoldi, Shafi Ur Rahman
-
Publication number: 20180032916Abstract: Data for a plurality of entities that can be offered a plurality of products can be obtained. The data can include categorical data and numeric data. Based on business constraints, some of all of the data can be selected. The selected data can be converted to another set of numeric data, wherein the categorical values are converted to numeric values. Dimensions of the converted data can be reduced to generate another set of data. Based on this another set of data, clusters of entities can be formed. The products can be grouped by assigning a unique product identifier of each product to a corresponding cluster. This grouping of products can be used by a predictive model to predict a likelihood of an entity to purchase a particular product in a future time period. Related methods, apparatus, systems, techniques and articles are also described.Type: ApplicationFiled: October 9, 2017Publication date: February 1, 2018Inventors: Amit Kiran Sowani, Eeshan Malhotra, Shafi Ur Rahman
-
Patent number: 9785890Abstract: Data for a plurality of entities that can be offered a plurality of products can be obtained. The data can include categorical data and numeric data. Based on business constraints, some of all of the data can be selected. The selected data can be converted to another set of numeric data, wherein the categorical values are converted to numeric values. Dimensions of the converted data can be reduced to generate another set of data. Based on this another set of data, clusters of entities can be formed. The products can be grouped by assigning a unique product identifier of each product to a corresponding cluster. This grouping of products can be used by a predictive model to predict a likelihood of an entity to purchase a particular product in a future time period. Related methods, apparatus, systems, techniques and articles are also described.Type: GrantFiled: August 10, 2012Date of Patent: October 10, 2017Assignee: FAIR ISAAC CORPORATIONInventors: Amit Sowani, Eeshan Malhotra, Shafi Ur Rahman