Patents by Inventor Itay Margolin

Itay Margolin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250077382
    Abstract: Certain aspects of the disclosure provide a method for detecting data collection errors by processing error data with a plurality of regression models to generate a plurality of predicted error rates over a plurality of time intervals. The method includes determining an error mode by applying a set of policy rules optimized for determining the error mode to the plurality of predicted error rates.
    Type: Application
    Filed: August 30, 2023
    Publication date: March 6, 2025
    Inventors: Itay MARGOLIN, Aleksandr KIM, Yair HORESH
  • Publication number: 20250045596
    Abstract: At least one processor may receive a query response generated by a query machine learning (ML) model, wherein the query response is generated in response to a query from a client device. The at least one processor may generate an evaluated likelihood of the query response being found in a training data set comprising known valid data, wherein the generating is performed using an evaluation ML model. The at least one processor may determine that the evaluated likelihood indicates the query response is likely to include valid data. In response to the determining, the at least one processor may return the query response to the client device.
    Type: Application
    Filed: July 31, 2023
    Publication date: February 6, 2025
    Applicant: INTUIT INC.
    Inventors: Liran DREVAL, Itay MARGOLIN
  • Patent number: 12216717
    Abstract: A Large Language Model (LLM) for classifying documents by identifying indicators within the documents. A smart caching mechanism stores document classifications and associated indicators output from the LLM. The database contains document details, classifications, and associated indicators. A classification module classifies a new document by analyzing it for indicators, checking the cache for a match, and querying the database for the indicators if no match is found. The module applies a majority vote based on the classifications associated with the indicators.
    Type: Grant
    Filed: March 20, 2024
    Date of Patent: February 4, 2025
    Assignee: INTUIT INC.
    Inventors: Itay Margolin, Eilon Sheetrit, Ido Joseph Farhi
  • Patent number: 12189723
    Abstract: Methods and systems are presented for imputing missing data items within a first dataset based on data associated with a second dataset that is the nearest neighbor of the first dataset. A first mapping model is configured to map data subsets corresponding to a first data source to first positions in a multi-dimensional space. A second mapping model is configured to map data subsets corresponding to a second data source to second positions in the multi-dimensional space. The first and second mapping models are trained together to reduce a distance between positions mapped by the first and second mapping models based on corresponding data subsets that belong to the same entity. A nearest neighbor dataset to the first dataset is identified based on the first and second mapping models. Data associated with the nearest neighbor dataset is used to impute the missing data items of the first dataset.
    Type: Grant
    Filed: September 15, 2023
    Date of Patent: January 7, 2025
    Assignee: PAYPAL, INC.
    Inventors: Itay Margolin, Tomer Handelman
  • Publication number: 20240362528
    Abstract: A system is configured to train a machine learning tree network using path based features, such as leaf nodes or connections between nodes. A first machine learning tree network model, for example, may be trained using a first set of training data, and used to generate predictions for a second set of training data. The path based features are determined from the first machine learning tree network model when generating the predictions for the second set of training data. The path based features may then be used to train a second machine learning tree network model, e.g., using logistic regression.
    Type: Application
    Filed: April 26, 2023
    Publication date: October 31, 2024
    Applicant: Intuit Inc.
    Inventor: Itay Margolin
  • Patent number: 12118621
    Abstract: A first schema accessed is associated with a plurality of entities that are participants of an electronic system. Each entity has a corresponding attribute. Each of a first subset of the entities has a respective attribute value below a threshold. Each of a second subset of the entities has a respective attribute value above the threshold. According to the first schema, it is determined that data and/or transactions associated with the first and second subsets of the entities are processed using a first and a second model, respectively. A second schema is generated by softening the predefined threshold such that according to the second schema, data and/or transactions associated with the first subset of the entities and data and/or transactions associated with the second subset of the entities are each processed using both the first model and the second model. The second schema is implemented in the electronic system.
    Type: Grant
    Filed: August 23, 2021
    Date of Patent: October 15, 2024
    Assignee: PAYPAL, INC.
    Inventor: Itay Margolin
  • Patent number: 12067068
    Abstract: The present disclosure provides techniques for data retrieval using machine learning. One example method includes receiving a plurality of training episodes associated with different environments, wherein each training episode of the plurality of training episodes includes a sequence of states, computing, based on the plurality of training episodes, total counts of a plurality of values in the states, initializing, for each state of the sequence of states in each training episode of the plurality of training episodes, a reward based on the total counts of the plurality of values, and training a reinforcement learning agent using the rewards.
    Type: Grant
    Filed: April 28, 2023
    Date of Patent: August 20, 2024
    Assignee: INTUIT INC.
    Inventor: Itay Margolin
  • Publication number: 20240177051
    Abstract: There are provided systems and methods for adjustment of training data sets for fairness-aware artificial intelligence models. A service provider, such as an electronic transaction processor for digital transactions, may utilize different decision services that implement rules and/or artificial intelligence models for decision-making of data including data in production computing environment. Decision services may be used for data processing and decision-making, where multiple decision services may be invoked during run-time in order to complete a data processing request. When processing data, machine learning and other artificial intelligence models may be utilized by such decision services. These may be trained using a sampled training data set that takes into account data records' diversity and model attribution scores as providing valuable data points or observations for training and/or retraining the ML model. The sampled training data may be analyzed to determine these scores and generated for training.
    Type: Application
    Filed: November 28, 2022
    Publication date: May 30, 2024
    Inventors: Roy Lothan, Itay Margolin, Matan Marudi, Yarden Raiskin
  • Patent number: 11995410
    Abstract: Systems and methods use hierarchical models to process conversations. A set of word vectors is processed using a sentence model, resulting in a sentence vector for the conversation message. The sentence vector is modified to include a time value. A set of sentence vectors, representing a time window, is processed using a window model, to generate a window vector for that time window. The window vector is updated to include a count value. A set of window vectors, corresponding to a set of time windows within the conversation, is processed using a conversation model to generate a conversation vector. A sentiment value indicating the sentiment of the conversation is presented, using a processor that generates the sentiment value from the conversation vector.
    Type: Grant
    Filed: June 30, 2023
    Date of Patent: May 28, 2024
    Assignee: Intuit Inc.
    Inventor: Itay Margolin
  • Patent number: 11983629
    Abstract: Systems and methods for training machine learning models are disclosed. An example method includes receiving a semi-labeled set of training samples including a first set of training samples, where each training sample in the first set is assigned a known label, and a second set of training samples, where each training sample in the second set has an unknown label, determining a first loss component, the first loss component providing a loss associated with the first set, determining a second loss component, the second loss component having a value which increases based on a difference between a distribution of individually predicted values of at least the second set and an expected overall distribution of at least the second set, and training the machine learning model, based on the first loss component and the second loss component, to predict labels for unlabeled input data.
    Type: Grant
    Filed: October 31, 2022
    Date of Patent: May 14, 2024
    Assignee: Intuit Inc.
    Inventor: Itay Margolin
  • Publication number: 20240143996
    Abstract: Systems and methods for training machine learning models are disclosed. An example method includes receiving a semi-labeled set of training samples including a first set of training samples, where each training sample in the first set is assigned a known label, and a second set of training samples, where each training sample in the second set has an unknown label, determining a first loss component, the first loss component providing a loss associated with the first set, determining a second loss component, the second loss component having a value which increases based on a difference between a distribution of individually predicted values of at least the second set and an expected overall distribution of at least the second set, and training the machine learning model, based on the first loss component and the second loss component, to predict labels for unlabeled input data.
    Type: Application
    Filed: October 31, 2022
    Publication date: May 2, 2024
    Applicant: Intuit Inc.
    Inventor: Itay MARGOLIN
  • Patent number: 11947521
    Abstract: A processor may identify a plurality of data sets subject to upcoming update processing in a next update cycle. For each of the plurality of data sets, the processor may determine a probability that data included in the data set has changed since a most recent update processing. The processor may exclude a first subset of the plurality of data sets having respective probabilities below a threshold value from the upcoming update processing until the respective probabilities are determined again in a subsequent update cycle. The processor may perform the upcoming update processing on the plurality of the data sets not included in the first subset, where the upcoming update processing may include obtaining updated data from at least one external data source.
    Type: Grant
    Filed: December 6, 2022
    Date of Patent: April 2, 2024
    Assignee: Intuit Inc.
    Inventors: Aleksandr Kim, Itay Margolin, Yair Horesh
  • Patent number: 11941072
    Abstract: A method and system that proactively generate alerts for updating a scraping script to avoid scraping script errors. A predetermined number of webpages targeted by the scraping script are randomly sampled. The scraping script is appended to each webpage in the sample. A structured list of text fragments across the webpages with the appended script is generated. At predetermined time intervals, a fresh set of webpages is sampled, the scraping script is appended to the webpages, and a new structured list is generated. If the new structured list and the previous structured list do not match, the webpages may have been changed and the scraping script may have to be updated. An alert is generated indicating that such update is required and may include a location of the mismatch. Therefore, scraping script errors are proactively detected and can be rectified before an actual error occurs and propagates.
    Type: Grant
    Filed: June 29, 2023
    Date of Patent: March 26, 2024
    Assignee: INTUIT INC.
    Inventors: Itay Margolin, Aleksandr Kim, Yair Horesh
  • Patent number: 11928134
    Abstract: Certain aspects of the present disclosure provide techniques for medoid-based data compression. One example method generally includes receiving item data indicative of one or more items, determining one or more medoids based on the item data, determining, for each item of the one or more items, a corresponding medoid based on the one or more medoids, identifying, for each item of the one or more items, a difference between the item and the corresponding medoid for the item, storing the one or more medoids, and storing, for each item of the one or more items, the identified difference between the item and the corresponding medoid.
    Type: Grant
    Filed: August 31, 2022
    Date of Patent: March 12, 2024
    Assignee: Intuit, Inc.
    Inventor: Itay Margolin
  • Publication number: 20240078292
    Abstract: Methods and systems are presented for imputing missing data items within a first dataset based on data associated with a second dataset that is the nearest neighbor of the first dataset. A first mapping model is configured to map data subsets corresponding to a first data source to first positions in a multi-dimensional space. A second mapping model is configured to map data subsets corresponding to a second data source to second positions in the multi-dimensional space. The first and second mapping models are trained together to reduce a distance between positions mapped by the first and second mapping models based on corresponding data subsets that belong to the same entity. A nearest neighbor dataset to the first dataset is identified based on the first and second mapping models. Data associated with the nearest neighbor dataset is used to impute the missing data items of the first dataset.
    Type: Application
    Filed: September 15, 2023
    Publication date: March 7, 2024
    Inventors: Itay Margolin, Torner Handeiman
  • Publication number: 20240070169
    Abstract: Certain aspects of the present disclosure provide techniques for medoid-based data compression. One example method generally includes receiving item data indicative of one or more items, determining one or more medoids based on the item data, determining, for each item of the one or more items, a corresponding medoid based on the one or more medoids, identifying, for each item of the one or more items, a difference between the item and the corresponding medoid for the item, storing the one or more medoids, and storing, for each item of the one or more items, the identified difference between the item and the corresponding medoid.
    Type: Application
    Filed: August 31, 2022
    Publication date: February 29, 2024
    Inventor: Itay MARGOLIN
  • Patent number: 11900384
    Abstract: Computer system security and efficiency of processing operations may be improved using techniques that are described relating to analyzing user actions based on time of day of occurrence, and using time of a day as a factor in determining whether a particular action should be allowed or disallowed. Past action times can be transformed to a two-dimensional representation using a radial time schema that avoids discontinuity. A probability distribution can indicate a likelihood of whether a new action fits a previous pattern. If a new user action is relatively unlikely due to time of day, the new user action might be denied/prevented from completing, thus enhancing computer system security and avoiding unnecessary computational processing costs.
    Type: Grant
    Filed: April 26, 2022
    Date of Patent: February 13, 2024
    Assignee: PayPal, Inc.
    Inventors: Itay Margolin, Shlomit Plavner, Ofri Raviv
  • Patent number: 11886827
    Abstract: Systems and methods for generating a contextually adaptable classifier model are disclosed. An example method is performed by one or more processors of a system and includes obtaining a dataset, feature values, and labels, transforming each datapoint into a natural language statement (NLS) associating the datapoint's feature values and label with feature identifiers and a label identifier, generating a feature matrix for each NLS, transforming the feature matrix into a global feature vector, generating a target vector for each NLS, transforming the target vector into a global target vector having a same shape, and generating, using the vectors, a similarity measurement operation, and a loss function, a classifier model trained to generate a compatibility score predictive of an accuracy at which the classifier model can classify given data based on at least one of a different feature characterizing the given data or a different label for classifying the given data.
    Type: Grant
    Filed: July 31, 2023
    Date of Patent: January 30, 2024
    Assignee: Intuit Inc.
    Inventor: Itay Margolin
  • Publication number: 20240005099
    Abstract: Techniques are disclosed relating to weakly supervised machine learning, which may be employed when there is a limited amount of labeled data available. A computer system may generate respective sets of synthetic labels for unlabeled data for a classification problem, where a given set of synthetic labels is produced by a corresponding one of a plurality of different label models. The computer system may then fit a set of supervised models, where each supervised model is fitted with one of the respective sets of synthetic labels to produce a respective set of predictions. The computer system may then evaluate the set of supervised models based on their respective set of predictions and using a set of labeled data for the classification problem. The evaluation may be used to select a particular supervised model and its corresponding label model.
    Type: Application
    Filed: June 30, 2022
    Publication date: January 4, 2024
    Inventors: Alon Dourban, Roy Lothan, Myriam Lesmy, Maya Cohen, Itay Margolin
  • Patent number: 11861335
    Abstract: A system deploying a machine learning technique that utilizes known code graph and abstract syntax tree pairs for known JSON objects to learn a function for predicting a corresponding abstract syntax tree from a new JSON object. The predicted abstract syntax tree is used to generate code for formatting the new JSON object into a standardized data structure.
    Type: Grant
    Filed: July 28, 2023
    Date of Patent: January 2, 2024
    Assignee: INTUIT INC.
    Inventors: Itay Margolin, Yair Horesh