Patents by Inventor Arun Kumar Jagota

Arun Kumar Jagota has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11886461
    Abstract: A system tokenizes raw values and corresponding standardized values into raw token sequences and corresponding standardized token sequences. A machine-learning model learns standardization from token insertions and token substitutions that modify the raw token sequences to match the corresponding standardized token sequences. The system tokenizes an input value into an input token sequence. The machine-learning model determines a probability of inserting an insertion token after an insertion markable token in the input token sequence. If the probability of inserting the insertion token satisfies a threshold, the system inserts the insertion token after the insertion markable token in the input token sequence. The machine-learning model determines a probability of substituting a substitution token for a substitutable token in the input token sequence.
    Type: Grant
    Filed: July 31, 2019
    Date of Patent: January 30, 2024
    Assignee: Salesforce, Inc.
    Inventors: Arun Kumar Jagota, Stanislav Georgiev
  • Publication number: 20240020479
    Abstract: A cloud platform trains a machine-learned entity matching model that generates predictions on whether a pair of electronic records refer to a same entity. In one embodiment, the entity matching model is configured as a transformer architecture. In one instance, the entity matching model is trained using a combination of a first loss and a second loss. The first loss indicates a difference between an entity matching prediction for a training instance and a respective match label for the training instance. The second loss indicates a difference between a set of named-entity recognition (NER) predictions for the training instance and the set of NER labels for the tokens of the training instance.
    Type: Application
    Filed: September 23, 2022
    Publication date: January 18, 2024
    Inventors: Akash Singh, Rajdeep Dua, Arun Kumar Jagota
  • Patent number: 11790278
    Abstract: An online system performs predictions for real-time tasks and near real-time tasks that need to be performed by a deadline. A client device receives a real-time machine learning based model associated with a measure of accuracy. If the client device determines that a task can be performed using predictions having less than the specified measure of accuracy, the client device uses the real-time machine learning based model. If the client device determines that a higher level of accuracy of results is required, the client device sends a request to an online system. The online system provides a prediction along with a string representing a rationale for the prediction.
    Type: Grant
    Filed: January 31, 2020
    Date of Patent: October 17, 2023
    Inventors: Rakesh Ganapathi Karanth, Arun Kumar Jagota, Kaushal Bansal, Amrita Dasgupta
  • Patent number: 11755680
    Abstract: A system receives a record which includes a string and separates the string into a number of tokens, including a token and another token. The system identifies a pattern that includes an entity, another entity, and a number of entities that equals the number of tokens, and another pattern that includes the same number of entities as the number of tokens. The system determines a combined probability that combines a probability based on the number of entries in the entity's dictionary which stores the token, and another probability based on a number of character types in the other entity that match characters in the other token. If the combined probability associated with the pattern is greater than another combined probability associated with the other pattern, the system matches the record to a system record based on recognizing the token as the entity and the other token as the other entity.
    Type: Grant
    Filed: January 22, 2020
    Date of Patent: September 12, 2023
    Assignee: Salesforce, Inc.
    Inventors: Arun Kumar Jagota, Ajitesh Jain
  • Patent number: 11755582
    Abstract: Adaptive field-level matching is described. A system identifies first elements in a field of a prospective record for a database, and second elements in the field of a candidate record, in the database, for matching the prospective record. The system identifies features corresponding to any of the first elements that are identical to any of the second elements, any of the first elements that are absent from the second elements, and any of the second elements that are absent from the first elements. A machine-learning model uses the features to determine a field match score for the candidate record's field. Another machine-learning model weighs the field match score and weighs another field match score for another field of the candidate record to determine a record match score for the candidate record. If the record match score satisfies a threshold, the system identifies the candidate record as matching the prospective record.
    Type: Grant
    Filed: April 30, 2020
    Date of Patent: September 12, 2023
    Assignee: Salesforce, Inc.
    Inventors: Arun Kumar Jagota, Ajitesh Jain, Rahul Mathias Madan, Shravani Madhavaram
  • Patent number: 11755914
    Abstract: System determines first and second scores based on applying function to features of first and second values in fields in first and second records, respectively. System determines first priority based on first score and second priority based on second score for displaying first and second values in fields in first profile. System revises, based on feedback associated with first value and second value, parameter associated with function and determines third score based on applying function, associated with revised parameter, to feature of third value in field in third record. System determines fourth score based on applying function, associated with revised parameter, to feature of fourth value in field in fourth record and determines third priority, based on third score, for displaying third value in field in second profile and fourth priority, based on fourth score, for displaying fourth value in field in second profile.
    Type: Grant
    Filed: March 21, 2019
    Date of Patent: September 12, 2023
    Assignee: Salesforce, Inc.
    Inventors: Arun Kumar Jagota, Piranavan Selvanandan
  • Publication number: 20230259831
    Abstract: An online system performs predictions for real-time tasks and near real-time tasks based on available network bandwidth. A client device receives a regression based machine learning model. Responsive to receiving a task, the client device determines an available network bandwidth for the client device. If the available network bandwidth is below a threshold, the client device uses the regression based machine learning model to perform the task. If the client device determines that the network bandwidth is above the threshold, the client device extracts features of the task, serializes the extracted features, and transmits the serialized features to an online system, causing the online system to use a different machine learning model to perform the task based on the serialized features.
    Type: Application
    Filed: April 20, 2023
    Publication date: August 17, 2023
    Inventors: Rakesh Ganapathi Karanth, Arun Kumar Jagota, Kaushal Bansal, Amrita Dasgupta
  • Patent number: 11651291
    Abstract: An online system performs predictions for real-time tasks and near real-time tasks that need to be performed by a deadline. A client device receives a real-time machine learning based model associated with a measure of accuracy. If the client device determines that a task can be performed using predictions having less than the specified measure of accuracy, the client device uses the real-time machine learning based model. If the client device determines that a higher level of accuracy of results is required, the client device sends a request to an online system. The online system provides a prediction along with a string representing a rationale for the prediction.
    Type: Grant
    Filed: January 30, 2020
    Date of Patent: May 16, 2023
    Assignee: Salesforce, Inc.
    Inventors: Rakesh Ganapathi Karanth, Arun Kumar Jagota, Kaushal Bansal, Amrita Dasgupta
  • Patent number: 11620483
    Abstract: A model is trained to create a probability distribution of counts based on counts of distinct values stored by person profiles in a field. The model is trained to create another probability distribution of counts based on other counts of other distinct values stored by the person profiles in another field. The count of distinct values stored by a person profile in the field is identified. Another count of distinct values stored by the person profile in the other field is identified. A score is determined based on a cumulative distribution function of the count under the probability distribution of counts. Another score is determined based on the cumulative distribution function of the other count under the other probability distribution of counts. If the score and the other score combine in an overall score that satisfies a threshold, a message is output about the person profile being suspected of corruption.
    Type: Grant
    Filed: January 17, 2020
    Date of Patent: April 4, 2023
    Assignee: Salesforce, Inc.
    Inventor: Arun Kumar Jagota
  • Patent number: 11436233
    Abstract: A system creates graph of nodes connected by edges. Each node represents corresponding value of corresponding attribute and is associated with count of corresponding value. Each edge is associated with count of instances that values represented by corresponding connected nodes are associated with each other. The system identifies each node associated with first count as first set of keys, and deletes each node associated with first count. The system identifies each edge associated with second count as second set of keys, and deletes each edge associated with second count. The system identifies each node associated with third count as third set of keys, and deletes each node associated with third count. The system identifies each edge associated with fourth count as fourth set of keys, and deletes each edge associated with fourth count. The system uses each set of keys to search and match records.
    Type: Grant
    Filed: January 25, 2019
    Date of Patent: September 6, 2022
    Assignee: Salesforce, Inc.
    Inventor: Arun Kumar Jagota
  • Publication number: 20220222547
    Abstract: System determines, for first value at first time in time series, first estimate based on value and velocity for time series, first lag, and first time. System determines, for first value, second estimate based on value and velocity for time series, second lag, and first time. System determines first weight based on difference between second value, at second time in time series, and first estimate and second weight based on difference between second value and second estimate. System determines, for second value, first forecast based on value and velocity for time series, first lag, and second time. System determines, for second value, second forecast based on value and estimated velocity for time series, second lag, and second time. System determines, for second value, combined forecast based on first forecast weighed by first weight and second forecast weighed by second weight. If combined forecast satisfies threshold, system outputs alert.
    Type: Application
    Filed: January 8, 2021
    Publication date: July 14, 2022
    Inventor: Arun Kumar Jagota
  • Patent number: 11372928
    Abstract: Determine first count of first records storing first value in first field, second count of second records storing second value in second field, third count of third records storing third value in third field. Determine count threshold using first, second and third counts, dispersion measure based on dispersion of values stored in second field by first records and other dispersion measure based on other dispersion of values stored in third field by first records. Train machine-learning model to determine dispersion measure threshold based on dispersion and other dispersion measures. If first count is greater than count threshold, and dispersion measure is greater than dispersion measure threshold, create match index based on first and second fields. Receive prospective record storing first value in first field, second value in second field. Use match index to identify record storing first value in first field, second value in second field as matching prospective record.
    Type: Grant
    Filed: January 29, 2020
    Date of Patent: June 28, 2022
    Assignee: salesforce.com, inc.
    Inventors: Arun Kumar Jagota, Ajitesh Jain, Rahul Mathias Madan, Shravani Madhavaram
  • Patent number: 11360990
    Abstract: A method and system of matching field values of a field type are described. Blurring operations are applied on a first and second values to obtain blurred values. A first maximum score is determined from first scores for blurred values, where each one of the first scores is indicative of a confidence that a match of the first and the second values occurs with knowledge of a first blurred value. A second maximum score is determined from second scores for the blurred values, where each one of the second scores is indicative of a confidence that a non-match of the first and the second values occurs with knowledge of the first blurred value. Responsive to determining that the first maximum score is greater than the second maximum score, an indication that the first value matches the second value is output.
    Type: Grant
    Filed: June 21, 2019
    Date of Patent: June 14, 2022
    Assignee: salesforce.com, inc.
    Inventor: Arun Kumar Jagota
  • Publication number: 20220121983
    Abstract: System receives input value in time series and determines first difference between input value at input time, and first value in time series at input time minus first lag. System determines first score based on first difference and both first average and first dispersion for first lag and time series values. System determines second difference between input value at input time, and second value in timeseries at input time minus second lag. System determines second score based on second difference and both second average and second dispersion for second lag and time series values. System transforms first and second scores into normalized anomaly score in normalized anomaly score time series. Time series database system stores normalized anomaly score time series and input value's time series into time series database. If normalized anomaly score satisfies threshold, system outputs alert including normalized anomaly score and input value retrieved from time series database.
    Type: Application
    Filed: October 20, 2020
    Publication date: April 21, 2022
    Inventor: Arun Kumar Jagota
  • Patent number: 11244238
    Abstract: Search query result set count estimation is described. A system parses data set query that includes first query attribute and second query attribute. The system identifies first hierarchy of connected nodes including a first node representing a first query attribute, and a second hierarchy of other connected nodes including a second node representing a second query attribute. The system identifies a directed arc connecting first correlated node in first hierarchy to second correlated node in second hierarchy. The system identifies cross-hierarchy probabilities of correlations between values of a first attribute represented by the first correlated node and values of a second attribute represented by the second correlated node.
    Type: Grant
    Filed: January 29, 2018
    Date of Patent: February 8, 2022
    Assignee: salesforce.com, inc.
    Inventors: Arun Kumar Jagota, Kevin Han
  • Patent number: 11244004
    Abstract: A system creates a graph of nodes connected by edges, the nodes including: i) a first node associated with a first value and a count of the first value, and ii) a second node associated with a second value and a count of the second value, the edges including an edge that connects the first and second nodes and is associated with a count of instances of the first value being stored with the second value. The system includes each node and each associated with clique count less than clique threshold in keys sets and deletes each node and each edge associated with clique count less than clique threshold. The system identifies triplet nodes connected by triplet edges. If estimated clique count for triplet values represented by triplet nodes is less than clique threshold, the system includes triplet values in keys set and identify triplet of nodes as analyzed.
    Type: Grant
    Filed: October 23, 2019
    Date of Patent: February 8, 2022
    Assignee: salesforce.com, inc.
    Inventor: Arun Kumar Jagota
  • Patent number: 11176156
    Abstract: A system determines a name probability based on a first name dataset frequency of a first name value stored by a first name field in a personal record and a last name dataset frequency of a last name value stored by a last name field in a personal record. The system determines at least one other probability based on another dataset frequency of another value stored by another field in the personal record and an additional dataset frequency of an additional value stored by an additional field in the personal record. The system determines a combined probability based on the name probability and the at least one other probability. The system increments a count of identifiable personal records for each personal record that has a corresponding combined probability that satisfies an identifiability threshold. The system outputs a message based on the count of identifiable personal records.
    Type: Grant
    Filed: May 10, 2019
    Date of Patent: November 16, 2021
    Assignee: salesforce.com, inc.
    Inventors: Arun Kumar Jagota, Stanislav Georgiev
  • Publication number: 20210342353
    Abstract: Adaptive field-level matching is described. A system identifies first elements in a field of a prospective record for a database, and second elements in the field of a candidate record, in the database, for matching the prospective record. The system identifies features corresponding to any of the first elements that are identical to any of the second elements, any of the first elements that are absent from the second elements, and any of the second elements that are absent from the first elements. A machine-learning model uses the features to determine a field match score for the candidate record's field. Another machine-learning model weighs the field match score and weighs another field match score for another field of the candidate record to determine a record match score for the candidate record. If the record match score satisfies a threshold, the system identifies the candidate record as matching the prospective record.
    Type: Application
    Filed: April 30, 2020
    Publication date: November 4, 2021
    Inventors: Arun Kumar Jagota, Ajitesh Jain, Rahul Mathias Madan, Shravani Madhavaram
  • Patent number: 11163740
    Abstract: A training set is created via creating adjacent classified substrings by using character classes to replace corresponding characters in adjacent substrings in each training character string, and associating each pair of adjacent classified substrings and each pair of adjacent substrings with corresponding labels indicating whether corresponding pairs include any token boundary. The system splits input character string into beginning and ending parts and creates classified beginning part by replacing beginning part character with corresponding class and classified ending part by replacing ending part character with corresponding class. The machine-learning model determines probability of token identification, based on training set to determine count of instances that classified beginning part is paired with classified ending part and count of corresponding labels that indicate inclusion of any token boundary.
    Type: Grant
    Filed: July 30, 2019
    Date of Patent: November 2, 2021
    Assignee: salesforce.com, inc.
    Inventor: Arun Kumar Jagota
  • Patent number: 11157508
    Abstract: A method and system for estimating a number of distinct entities in a set of records are described. For each one of a subset of records, a set of match rule keys are generated based on a set of match rules. Each match rule from the set of match rules defines a match between records, and each match rule key from the set of match rule keys includes at least a key field value. A high order key for the record is determined based on the match rule keys, and a counter associated with the high order key is incremented. When each record from the subset of records has been processed by determining the match rule keys, and incrementing the counter(s) of the high order keys, a sum of a number of counters that have a non-zero value is performed to estimate the distinct entities in the records.
    Type: Grant
    Filed: June 21, 2019
    Date of Patent: October 26, 2021
    Assignee: salesforce.com, inc.
    Inventor: Arun Kumar Jagota