Patents by Inventor Shafi Ur Rahman

Shafi Ur Rahman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240112045
    Abstract: A method may include generating synthetic data based on input data and training a machine learning model based on the synthetic data. The synthetic data may be generated by determining a plurality of data points representing an archetype probability distribution of a plurality of archetypes, clustering the plurality of data points into one or more clusters associated with transactional behavior patterns, generating a threshold metric representing a peak distribution density of the plurality of data points associated with a corresponding cluster, removing, from the plurality of data points, one or more non-representative data points to define a reduced set of the plurality of data points, generating an updated archetype probability distribution based at least on the reduced set of the plurality of data points, and generating representative transaction data based on the updated archetype probability distribution and threshold metric. Related methods and articles of manufacture are al so disclosed.
    Type: Application
    Filed: September 30, 2022
    Publication date: April 4, 2024
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Publication number: 20240039934
    Abstract: Systems for improving security of a computer-implemented artificial intelligence by monitoring one or more transactions received by the machine learning decision model; receiving a first score generated by the machine learning decision model in association with a first transaction; identifying the first transaction as belonging to a first class, in response to the first score being lower than a certain score threshold and the first transaction having a low occurrence likelihood; receiving a second score in association with the first transaction based on one or more adversarial latent features associated with the first transaction as detectable by an adversary detection model; and determining at least one adversarial latent transaction feature being exploited by the first transaction, in response to determining that the second score falls above the certain score threshold.
    Type: Application
    Filed: October 11, 2023
    Publication date: February 1, 2024
    Applicant: FICO
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Patent number: 11818147
    Abstract: Systems, methods and computer program products for improving security of artificial intelligence systems. The system comprising processors for monitoring one or more transactions received by a machine learning decision model to determine a first score associated with a first transaction. The first transaction may be identified as likely adversarial, in response to the first score being lower than a certain score threshold and the first transaction having a low occurrence likelihood. A second score may be generated in association with the first transaction based on one or more adversarial latent features associated with the first transaction. At least one adversarial latent feature may be detected as being exploited by the first transaction, in response to determining that the second score falls above the certain score threshold. Accordingly, an abnormal volume of activations of adversarial latent features spanning across a plurality of transactions scored may be detected and blocked.
    Type: Grant
    Filed: November 23, 2020
    Date of Patent: November 14, 2023
    Assignee: Fair Isaac Corporation
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Patent number: 11709918
    Abstract: A system and method for constructing an improved computing model that preserves use rights for data utilized by the model. A first dataset is accessed to build a computing model. The first data set is subject to terminable usage rights provisions. A portion of the first dataset is sampled to generate a second dataset. Vectors present in the first dataset and the second dataset are discretized. In response to determine that the usage rights associated with the primary dataset have been terminated, a coverage depletion for the second dataset is computed based on the usage rights termination associated with the first dataset. An estimated mean time to coverage failure for the first model based on the depletion coverage is determined for the second dataset. One or more data points are removed from the first dataset due to the termination of usage rights.
    Type: Grant
    Filed: April 17, 2020
    Date of Patent: July 25, 2023
    Assignee: FAIR ISAAC CORPORATION
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Patent number: 11687804
    Abstract: Computer-implemented methods and systems for quantifying appropriate machine learning model complexity corresponding to training dataset are provided. The method comprises monitoring, using one or more processors, N observed variables, v1 through vN, of a training dataset for a machine learning model; translating the N observed variables into m equisized bin indexes which generate mN possible equisized hypercells to estimate a fundamental dimensionality for the dataset; generating one or more samples by assigning a record in the dataset with numbers j through k as set id; generating a merged sample Si, for one or more values of the set id i, where i goes from j to k; and computing a fractal dimension of the equisized hypercube phase space based on count of cells with data coverage of at least one data point.
    Type: Grant
    Filed: June 30, 2020
    Date of Patent: June 27, 2023
    Assignee: Fair Isaac Corporation
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Publication number: 20230085575
    Abstract: To eliminating bias from artificial intelligent (AI) systems, a list of class identifiers and features derived from class identifiers represented in training data fed to an AI system are identified for purpose of training a predictive model. Correlation analysis of input features is conducted from a list of raw variables, r, in a dataset and a plurality of derived features, x, with one or more class identifiers in the list of class identifiers and features derived from these class identifiers. A first list of input features is identified, one or more input features are in the first list belonging to and correlated with the one or more class identifiers or features derived from class identifiers. A second list of sets of input features is created to identify a set of combinations of input features that are not allowed to interact based on identifying biased latent features.
    Type: Application
    Filed: September 13, 2021
    Publication date: March 16, 2023
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Publication number: 20220374744
    Abstract: In transactional systems where past transactions can have impact on the current score of a machine learning based decision model, the transactions that are most responsible for the score and the associated reasons are determined by the transactional system. A system and method identifies such past transactions that maximally impact the current score and allow for a more effective understanding of the scores generated by a model in a transactional system and explanation of specific transactions for automated decisioning, to explain the scores in terms of past transactions. Further an existing instance-based explanation system is used to identify the reasons for the score, and how the identified transactions influence these reasons. A combination of impact on score and impact on reasons determines the most impactful past transaction with respect to the most recent score being explained.
    Type: Application
    Filed: May 17, 2022
    Publication date: November 24, 2022
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Publication number: 20220358111
    Abstract: A system and method for analyzing coverage, bias and model explanations in large dimensional modeling data includes discretizing three or more variables of a dataset to generate a discretized phase space represented as a grid of a plurality of cells, the dataset comprising a plurality of records, each record of the plurality of records having a value and a unique identifier (ID). A grid transformation is applied to each record in the dataset to assign each record to a cell of the plurality of cells of the grid according to the grid transformation. A grid index is generated to reference each cell using a discretized feature vector. A grid storage for storing the records assigned to each cell of the grid is then created. The grid storage using the ID of each record as a reference to each record and the discretized feature vector as a key to each cell.
    Type: Application
    Filed: May 10, 2022
    Publication date: November 10, 2022
    Applicant: Fair Isaac Corporation
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Patent number: 11373109
    Abstract: In transactional systems where past transactions can have impact on the current score of a machine learning based decision model, the transactions that are most responsible for the score and the associated reasons are determined by the transactional system. A system and method identifies such past transactions that maximally impact the current score and allow for a more effective understanding of the scores generated by a model in a transactional system and explanation of specific transactions for automated decisioning, to explain the scores in terms of past transactions. Further an existing instance-based explanation system is used to identify the reasons for the score, and how the identified transactions influence these reasons. A combination of impact on score and impact on reasons determines the most impactful past transaction with respect to the most recent score being explained.
    Type: Grant
    Filed: July 2, 2019
    Date of Patent: June 28, 2022
    Assignee: FAIR ISAAC CORPORATION
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Publication number: 20220166782
    Abstract: Systems, methods and computer program products for improving security of artificial intelligence systems. The system comprising processors for monitoring one or more transactions received by a machine learning decision model to determine a first score associated with a first transaction. The first transaction may be identified as likely adversarial, in response to the first score being lower than a certain score threshold and the first transaction having a low occurrence likelihood. A second score may be generated in association with the first transaction based on one or more adversarial latent features associated with the first transaction. At least one adversarial latent feature may be detected as being exploited by the first transaction, in response to determining that the second score falls above the certain score threshold. Accordingly, an abnormal volume of activations of adversarial latent features spanning across a plurality of transactions scored may be detected and blocked.
    Type: Application
    Filed: November 23, 2020
    Publication date: May 26, 2022
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Publication number: 20210406724
    Abstract: Computer-implemented methods and systems for quantifying appropriate machine learning model complexity corresponding to training dataset are provided. The method comprises monitoring, using one or more processors, N observed variables, v1 through vN, of a training dataset for a machine learning model; translating the N observed variables into m equisized bin indexes which generate mN possible equisized hypercells to estimate a fundamental dimensionality for the dataset; generating one or more samples by assigning a record in the dataset with numbers j through k as set id; generating a merged sample Si, for one or more values of the set id i, where i goes from j to k; and computing a fractal dimension of the equisized hypercube phase space based on count of cells with data coverage of at least one data point.
    Type: Application
    Filed: June 30, 2020
    Publication date: December 30, 2021
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Patent number: 11087339
    Abstract: Data for a plurality of entities that can be offered a plurality of products can be obtained. The data can include categorical data and numeric data. Based on business constraints, some of all of the data can be selected. The selected data can be converted to another set of numeric data, wherein the categorical values are converted to numeric values. Dimensions of the converted data can be reduced to generate another set of data. Based on this another set of data, clusters of entities can be formed. The products can be grouped by assigning a unique product identifier of each product to a corresponding cluster. This grouping of products can be used by a predictive model to predict a likelihood of an entity to purchase a particular product in a future time period. Related methods, apparatus, systems, techniques and articles are also described.
    Type: Grant
    Filed: October 9, 2017
    Date of Patent: August 10, 2021
    Assignee: FAIR ISAAC CORPORATION
    Inventors: Amit Kiran Sowani, Eeshan Malhotra, Shafi Ur Rahman
  • Publication number: 20210004703
    Abstract: In transactional systems where past transactions can have impact on the current score of a machine learning based decision model, the transactions that are most responsible for the score and the associated reasons are determined by the transactional system. A system and method identifies such past transactions that maximally impact the current score and allow for a more effective understanding of the scores generated by a model in a transactional system and explanation of specific transactions for automated decisioning, to explain the scores in terms of past transactions. Further an existing instance-based explanation system is used to identify the reasons for the score, and how the identified transactions influence these reasons. A combination of impact on score and impact on reasons determines the most impactful past transaction with respect to the most recent score being explained.
    Type: Application
    Filed: July 2, 2019
    Publication date: January 7, 2021
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Publication number: 20200242216
    Abstract: A system and method for constructing an improved computing model that preserves use rights for data utilized by the model. A first dataset is accessed to build a computing model. The first data set is subject to terminable usage rights provisions. A portion of the first dataset is sampled to generate a second dataset. Vectors present in the first dataset and the second dataset are discretized. In response to determine that the usage rights associated with the primary dataset have been terminated, a coverage depletion for the second dataset is computed based on the usage rights termination associated with the first dataset. An estimated mean time to coverage failure for the first model based on the depletion coverage is determined for the second dataset. One or more data points are removed from the first dataset due to the termination of usage rights.
    Type: Application
    Filed: April 17, 2020
    Publication date: July 30, 2020
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Patent number: 10713140
    Abstract: The state of a system is determined in which data sets are generated that include a plurality of data instances representing states of one or more components of a computer system. The data instances generated by one or more data set sources that are configured to output a data instance in response to a trigger associated with the one or more components. The data instances are normalized by the application of one or more rules. The data instances from individual data set sources are separately collated to generate groups of time-specific collated data instances. State types may be assigned to each of the collated data instance groups. Distributions of state-types across the groups may be determined and a list of infrequent state-types may be generated based on the determined distributions of state-types across the groups.
    Type: Grant
    Filed: June 10, 2015
    Date of Patent: July 14, 2020
    Assignee: FAIR ISAAC CORPORATION
    Inventors: Ashish Gupta, Shafi Ur Rahman, Sambandan Murugan
  • Patent number: 10657229
    Abstract: A system and method of building a decision or prediction model used for analyzing and scoring behavioral transactions is disclosed. A customer dataset in a model development store is used to build an original model is subject to a data right usage withdrawal, the original model having coverage over the customer dataset extract, using data sampling, a portion of the customer dataset to generate a model surrogate dataset. The system and method discretize vectors present in both the model surrogate dataset and the customer dataset, and receive data representing the data right usage withdrawal from the customer dataset. The system and method determine a depletion of the model surrogate dataset according to the data right usage withdrawal, and compute an estimated mean time to coverage failure of the original model based on the depletion of the model surrogate dataset according to the data right usage withdrawal.
    Type: Grant
    Filed: November 21, 2017
    Date of Patent: May 19, 2020
    Assignee: Fair Isaac Corporation
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Patent number: 10360093
    Abstract: The state of a system is determined in which data sets are generated that include a plurality of data instances representing states of one or more components of a computer system. The data instances generated by one or more data set sources that are configured to output a data instance in response to a trigger associated with the one or more components. The data instances are normalized by the application of one or more rules. The data instances from individual data set sources are separately collated to generate groups of time-specific collated data instances. State types may be assigned to each of the collated data instance groups. Distributions of state-types across the groups may be determined and a list of infrequent state-types may be generated based on the determined distributions of state-types across the groups.
    Type: Grant
    Filed: November 18, 2015
    Date of Patent: July 23, 2019
    Assignee: Fair Isaac Corporation
    Inventors: Shafi Ur Rahman, Ashish Gupta
  • Publication number: 20190155996
    Abstract: A system and method of building a decision or prediction model used for analyzing and scoring behavioral transactions is disclosed. A customer dataset in a model development store is used to build an original model is subject to a data right usage withdrawal, the original model having coverage over the customer dataset extract, using data sampling, a portion of the customer dataset to generate a model surrogate dataset. The system and method discretize vectors present in both the model surrogate dataset and the customer dataset, and receive data representing the data right usage withdrawal from the customer dataset. The system and method determine a depletion of the model surrogate dataset according to the data right usage withdrawal, and compute an estimated mean time to coverage failure of the original model based on the depletion of the model surrogate dataset according to the data right usage withdrawal.
    Type: Application
    Filed: November 21, 2017
    Publication date: May 23, 2019
    Inventors: Scott Michael Zoldi, Shafi Ur Rahman
  • Publication number: 20180032916
    Abstract: Data for a plurality of entities that can be offered a plurality of products can be obtained. The data can include categorical data and numeric data. Based on business constraints, some of all of the data can be selected. The selected data can be converted to another set of numeric data, wherein the categorical values are converted to numeric values. Dimensions of the converted data can be reduced to generate another set of data. Based on this another set of data, clusters of entities can be formed. The products can be grouped by assigning a unique product identifier of each product to a corresponding cluster. This grouping of products can be used by a predictive model to predict a likelihood of an entity to purchase a particular product in a future time period. Related methods, apparatus, systems, techniques and articles are also described.
    Type: Application
    Filed: October 9, 2017
    Publication date: February 1, 2018
    Inventors: Amit Kiran Sowani, Eeshan Malhotra, Shafi Ur Rahman
  • Patent number: 9785890
    Abstract: Data for a plurality of entities that can be offered a plurality of products can be obtained. The data can include categorical data and numeric data. Based on business constraints, some of all of the data can be selected. The selected data can be converted to another set of numeric data, wherein the categorical values are converted to numeric values. Dimensions of the converted data can be reduced to generate another set of data. Based on this another set of data, clusters of entities can be formed. The products can be grouped by assigning a unique product identifier of each product to a corresponding cluster. This grouping of products can be used by a predictive model to predict a likelihood of an entity to purchase a particular product in a future time period. Related methods, apparatus, systems, techniques and articles are also described.
    Type: Grant
    Filed: August 10, 2012
    Date of Patent: October 10, 2017
    Assignee: FAIR ISAAC CORPORATION
    Inventors: Amit Sowani, Eeshan Malhotra, Shafi Ur Rahman