Patents by Inventor Shayak Sen

Shayak Sen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240394574
    Abstract: A computing machine receives a representation of a machine learning model, a representation of a first data segment, and a representation of a second data segment. The computing machine computes an output difference between an output of the machine learning model applied to the first data segment and an output of the machine learning model applied to the second data segment. The computing machine determines a set of reasons for the computed output difference based on a set of metrics defining distance between feature importance distributions, the set of reasons identifying a set of features from a feature vector of the machine learning model along with a relative contribution of each feature to the computed output difference. The computing machine provides an output representing the set of reasons.
    Type: Application
    Filed: July 31, 2024
    Publication date: November 28, 2024
    Inventors: Anupam Datta, Shayak Sen, Apoorv Gupta, David Sandai Kurokawa
  • Publication number: 20230097940
    Abstract: In some implementations, a computing machine accesses an artificial intelligence model and a dataset for the artificial intelligence model, the dataset comprising at least one datapoint. The computing machine identifies a feature group used by the artificial intelligence model, the feature group comprising at least two features having a similarity with one another exceeding a similarity threshold, wherein the feature group comprises a subset of the features used by the artificial intelligence model. The computing machine determines an overall influence value for the feature group on an output of the artificial intelligence model applied to the dataset. The computing machine provides an output representing the overall influence value.
    Type: Application
    Filed: August 29, 2022
    Publication date: March 30, 2023
    Inventors: David Sandai Kurokawa, Shayak Sen, Anupam Datta, Divya Gopinath, Apoorv Gupta
  • Publication number: 20230092949
    Abstract: A computer accesses an artificial intelligence (AI) model, a labeled in-sample (IS) dataset, and an unlabeled out-of-sample (OOS) dataset, the labeled IS dataset storing IS input values and corresponding IS output values, the unlabeled OOS dataset storing OOS input values but not corresponding OOS output values. The computer modifies, via importance sampling and based on a likelihood that a given datapoint from the IS dataset is associated with the OOS dataset, weights of multiple datapoints in the labeled IS dataset to generate a weighted IS dataset. The computer calculates an estimated performance metric of the AI model on the OOS dataset using at least a subset of datapoints in the weighted IS dataset. The computer provides an output representing the estimated performance metric of the AI model on the OOS dataset.
    Type: Application
    Filed: August 29, 2022
    Publication date: March 23, 2023
    Inventors: Divya Gopinath, David Sandai Kurokawa, Shayak Sen, Anupam Datta
  • Publication number: 20220269991
    Abstract: Computer accesses training dataset with plurality of datapoints, each datapoint having input vector of feature values and output value. Training dataset is for training machine learning engine to predict the output value based on the input vector of feature values. The computer stores the training dataset as a two-dimensional vector with rows representing datapoints and columns representing features. The computer computes, for each feature value, a QII (quantitative input influence) value measuring a degree of influence that the feature exerts on the output value. For each datapoint from at least a subset of the plurality of datapoints, the computer (i) determines whether the QII value for each feature value in the input vector is within a predefined range, and (ii) upon determining that the QII value for a given feature value in the input vector is not within the predefined range: adjusts the training dataset or the machine learning engine.
    Type: Application
    Filed: February 16, 2022
    Publication date: August 25, 2022
    Inventors: David Sandai Kurokawa, Shayak Sen, Anupam Datta
  • Publication number: 20220012613
    Abstract: A computing machine receives a representation of a machine learning model, a representation of a first data segment, and a representation of a second data segment. The computing machine computes an output difference between an output of the machine learning model applied to the first data segment and an output of the machine learning model applied to the second data segment. The computing machine determines a set of reasons for the computed output difference based on a set of metrics defining distance between feature importance distributions, the set of reasons identifying a set of features from a feature vector of the machine learning model along with a relative contribution of each feature to the computed output difference. The computing machine provides an output representing the set of reasons.
    Type: Application
    Filed: July 9, 2021
    Publication date: January 13, 2022
    Inventors: Anupam Datta, Shayak Sen, Apoorv Gupta, David Sandai Kurokawa
  • Publication number: 20210357729
    Abstract: A computing machine accesses a set of intermediate artificial neurons in a deep neural network. The deep neural network is fully or partially trained. The computing machine computes, for each artificial neuron in the set of intermediate artificial neurons, an influence score based on an average gradient of an output quantity of interest with respect to the artificial neuron across a plurality of inputs weighted by a probability of each input. The computing machine provides an output associated with the computed influence scores.
    Type: Application
    Filed: September 26, 2019
    Publication date: November 18, 2021
    Inventors: Klas Leino, Shayak Sen, Anupam Datta, Matthew Fredrikson
  • Publication number: 20180121817
    Abstract: The subject disclosure relates to devices, systems, and methods for algorithmic transparency into algorithmic decision-making systems. In non-limiting aspects, the disclosed subject matter facilitates generating a set of intervention inputs for an algorithmic decision-making system, observing the outcomes of the algorithmic decision-making system, and determining Quantitative Input Influence (QII) measures for the algorithmic decision-making system, wherein the at least one QII measure describes degree of influence of inputs on outcomes of the algorithmic decision-making system. In further non-limiting aspects, the disclosed subject matter facilitates generating transparency reports related to the QII measures, including transparency reports, regarding inputs, regarding individuals, and regarding groups of individuals, while maintaining privacy. Further non-limiting embodiments are provided that illustrate the advantages and flexibility of the disclosed subject matter.
    Type: Application
    Filed: October 27, 2017
    Publication date: May 3, 2018
    Inventors: Anupam Datta, Shayak Sen, Yair Zick