Patents by Inventor Micha Gideon Moffie

Micha Gideon Moffie has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240152519
    Abstract: An example system includes a processor to receive a value formatted using a composite subtraction format. The processor can determine an interval of a number of intervals that corresponds to the received value using minuend rank. The processor can then calculate the rank of the value based on the position of the value within the determined interval.
    Type: Application
    Filed: November 9, 2022
    Publication date: May 9, 2024
    Inventors: Ariel Farkash, Micha Gideon Moffie
  • Publication number: 20240152698
    Abstract: An enhanced system and method are provided for data-driven named entity type disambiguation according to one or more disclosed embodiments. A system and a non-limiting computer-implemented method provide named-entity type disambiguation by receiving an unstructured document and analyzing the document using a set of Named Entity Recognition (NER) annotators, each generating annotated entities. For each respective annotated entity, an Entity Disambiguation Module resolves a target entity type when a mention was assigned multiple entity types by different NER annotators, leveraging domain knowledge to form a set of first resolved entities. An Annotation Ranker associates a computed score with each entity in the set of first resolved entities using information in a knowledge base.
    Type: Application
    Filed: November 9, 2022
    Publication date: May 9, 2024
    Inventors: Mohammed Fahd Alhamid, Stefano Braghin, Jing Xin Duan, Mokhtar Kandil, Youngja Park, Micha Gideon Moffie
  • Patent number: 11949667
    Abstract: An example system includes a processor to receive a graph-based masking policy and a composite payload containing a data object to be masked. The processor is to instantiate a masking engine based on the graph-based masking policy. The processor is to execute the masking engine on the composite payload to generate a masked payload comprising a masked data object. The data object to be masked is masked in place such that the resulting composite payload type is maintained. The processor is to output the masked payload.
    Type: Grant
    Filed: June 23, 2021
    Date of Patent: April 2, 2024
    Assignee: International Business Machines Corporation
    Inventors: Micha Gideon Moffie, Ariel Farkash
  • Publication number: 20240062004
    Abstract: A method comprising: obtaining multiple glossary terms each comprising one or more words; generating multiple fuzzy tokens from each word of each of the glossary terms; calculating a similarity score for each of the fuzzy tokens, the similarity score denoting a similarity between the respective fuzzy token and its respective word; obtaining multiple input terms to be matched with the multiple glossary terms; separating each of the input terms into multiple input tokens; generating multiple n-grams from each of the input tokens; comparing the n-grams with the fuzzy tokens, to output a list of matching n-grams and fuzzy tokens; based on the list of matching n-grams and fuzzy tokens, identifying, from the glossary terms, candidate glossary term matches for each of the input terms; and calculating one or more scores that quantify the match between each of the candidate glossary term matches and its respective input term.
    Type: Application
    Filed: August 22, 2022
    Publication date: February 22, 2024
    Inventors: Shlomit Ifergan Shachor, Natalia Razinkov, Micha Gideon Moffie, Omer Yehuda Boehm
  • Patent number: 11893132
    Abstract: A method, computer system, and a computer program product for personal data discovery is provided. The present invention may include determining at least one feature used to train a target machine learning (ML) model. The present invention may also include mapping the determined at least one feature to at least one location of a data store including at least one personal data associated with the determined at least one feature. The present invention may further include retrieving a data record of the at least one personal data associated with the mapped at least one feature from the at least one location of the data store. The present invention may also include determining that the target ML model includes a trace of the retrieved data record. The present invention may further include marking the target ML model as containing the at least one personal data.
    Type: Grant
    Filed: February 23, 2021
    Date of Patent: February 6, 2024
    Assignee: International Business Machines Corporation
    Inventors: Abigail Goldsteen, Micha Gideon Moffie, Ariel Farkash
  • Publication number: 20230418859
    Abstract: A method, computer system, and a computer program product for data processing, comprising obtaining a plurality of files from a data source. The files are analyzed for information about their content and to determine structural information of each file. Once the files have been analyzed, information in each file may be sorted and categorized by common content. Sensitive information may also be extracted and categorized separately. Information may then be merged using the categories to create a single unified file.
    Type: Application
    Filed: June 27, 2022
    Publication date: December 28, 2023
    Inventors: Youngja Park, Mohammed Fahd Alhamid, Stefano Braghin, Jing Xin Duan, Mokhtar Kandil, Michael Vu Le, Killian Levacher, Micha Gideon Moffie, Ian Michael Molloy, Walid Rjaibi, Ariel Farkash
  • Patent number: 11841977
    Abstract: An example system includes a processor to receive training data and predictions on the training data of a trained machine learning model to be anonymized. The processor is to generate generalized data from training data based on the predictions of the trained machine learning model on the training data. The processor is to train an anonymized machine learning model using the generalized data.
    Type: Grant
    Filed: February 11, 2021
    Date of Patent: December 12, 2023
    Assignee: International Business Machines Corporation
    Inventors: Abigail Goldsteen, Ariel Farkash, Micha Gideon Moffie, Gilad Ezov, Ron Shmelkin
  • Patent number: 11675976
    Abstract: Embodiments of the present systems and methods may provide techniques to distinguish between data categories. For example, a method implemented in a computer system may comprise obtaining, at the computer system, a plurality of data strings in different categories, each category having a same string pattern, determining a loose string format and a set of restrictions based on at least one string pattern, classifying the plurality of data strings to respective different categories based on a loose string format of the data strings and on the restrictions on the data strings of the different categories using a classification score indicating utilizing restriction information of other categories when determining the matching of a category, and decreasing the classification score if a mean restriction matching proportion is not part of a category or is a threshold amount above an expected mean restriction matching proportion.
    Type: Grant
    Filed: July 7, 2019
    Date of Patent: June 13, 2023
    Assignee: International Business Machines Corporation
    Inventors: Sigal Asaf, Ariel Farkash, Lev Greenberg, Micha Gideon Moffie
  • Publication number: 20230006983
    Abstract: An example system includes a processor to receive a graph-based masking policy and a composite payload containing a data object to be masked. The processor is to instantiate a masking engine based on the graph-based masking policy. The processor is to execute the masking engine on the composite payload to generate a masked payload comprising a masked data object. The data object to be masked is masked in place such that the resulting composite payload type is maintained. The processor is to output the masked payload.
    Type: Application
    Filed: June 23, 2021
    Publication date: January 5, 2023
    Inventors: Micha Gideon Moffie, Ariel Farkash
  • Publication number: 20220405099
    Abstract: An example system includes a processor to receive an instance of a composite format comprising a masking restriction. The processor can generate a mask for the instance of the composite format based on the masking restriction. The processor can output the generated mask.
    Type: Application
    Filed: June 20, 2021
    Publication date: December 22, 2022
    Inventors: Ariel Farkash, Micha Gideon Moffie
  • Publication number: 20220398107
    Abstract: An example system includes a processor to receive a valid instance of a finite regular expression format. The processor is to generate a state machine corresponding to the finite regular expression format. The processor is to recursively compute a number of matched strings for each state and transition in the generated state machine. The processor is to recursively rank the valid instance of the finite regular expression format using the generated state machine with the computed numbers of matched strings. The processor is to output a number rank for the valid instance of the finite regular expression format.
    Type: Application
    Filed: June 15, 2021
    Publication date: December 15, 2022
    Inventors: Ariel Farkash, Micha Gideon Moffie
  • Publication number: 20220398327
    Abstract: An example system includes a processor to receive an instance of a format and a masking restriction. The processor can rank the instance of the format to generate an integer in an effective domain of the format. The processor can apply noise to the integer based on the masking restriction to generate a perturbed integer. The processor can unrank the perturbed integer to generate a second instance of the format.
    Type: Application
    Filed: June 10, 2021
    Publication date: December 15, 2022
    Inventors: Ariel Farkash, Micha Gideon Moffie
  • Publication number: 20220391529
    Abstract: An example system includes a processor to generate regular expressions representing textual pattern facets of sub-formats of a composite format, and a regular expression representing a composite textual pattern of the composite format based on sub-format and composition type. The processor can search the data using generated regular expression representing composite textual patterns to detect occurrences of candidate matches. The processor can recursively match and validate the detected occurrences with the composite format and hierarchically match and validate sub-formats in the detected occurrence. The processor can mask in place the detected occurrence of the composite format in the data using ranking-based integer format preserving masking.
    Type: Application
    Filed: June 1, 2021
    Publication date: December 8, 2022
    Inventors: Ariel Farkash, Micha Gideon Moffie
  • Publication number: 20220269814
    Abstract: A method, computer system, and a computer program product for personal data discovery is provided. The present invention may include determining at least one feature used to train a target machine learning (ML) model. The present invention may also include mapping the determined at least one feature to at least one location of a data store including at least one personal data associated with the determined at least one feature. The present invention may further include retrieving a data record of the at least one personal data associated with the mapped at least one feature from the at least one location of the data store. The present invention may also include determining that the target ML model includes a trace of the retrieved data record. The present invention may further include marking the target ML model as containing the at least one personal data.
    Type: Application
    Filed: February 23, 2021
    Publication date: August 25, 2022
    Inventors: Abigail Goldsteen, Micha Gideon Moffie, Ariel Farkash
  • Publication number: 20220253554
    Abstract: An example system includes a processor to receive training data and predictions on the training data of a trained machine learning model to be anonymized. The processor is to generate generalized data from training data based on the predictions of the trained machine learning model on the training data. The processor is to train an anonymized machine learning model using the generalized data.
    Type: Application
    Filed: February 11, 2021
    Publication date: August 11, 2022
    Inventors: Abigail Goldsteen, Ariel Farkash, Micha Gideon Moffie, Gilad Ezov, Ron Shmelkin
  • Patent number: 11281728
    Abstract: A method, apparatus and a product for data generalization for predictive models. The method comprising: based on a labeled dataset, determining a plurality of buckets, each of which has an associated label; determining a plurality of clusters, grouping similar instances in the same bucket; based on the plurality of clusters, determining an alternative set of features comprising a set of generalized features, wherein each generalized feature corresponds to a cluster of the plurality of clusters, wherein a generalized feature that corresponds to a cluster is indicative of the instance being mapped to the corresponding cluster; obtaining a second instance; determining a generalized second instance that comprises a valuation of the alternative set of features for the second instance; and based on the generalized second instance, determining a label for the second instance.
    Type: Grant
    Filed: August 6, 2019
    Date of Patent: March 22, 2022
    Assignee: International Business Machines Corporation
    Inventors: Gilad Ezov, Ariel Farkash, Abigail Goldsteen, Ron Shmelkin, Micha Gideon Moffie
  • Patent number: 11240044
    Abstract: Embodiments of the present systems and methods may provide techniques for verifying the correct application purpose for applications that serve multiple purposes and to determine the correct purpose for each requested data access. For example, in an embodiment, a method for controlling application access to data implemented in a computer comprising a processor, memory accessible by the processor, and computer program instructions stored in the memory and executable by the processor may comprise: receiving an application comprising a plurality of application parts, each application part associated with a declared data access purpose and generating a cryptographic certificate for each application part to be certified by determining whether a declared data access purpose for each application part to be certified is correct and the only data access purpose for that part, wherein the declared purpose is included in purpose information associated with each application part to be certified.
    Type: Grant
    Filed: November 22, 2018
    Date of Patent: February 1, 2022
    Assignee: International Business Machines Corporation
    Inventors: Ariel Farkash, Abigail Goldsteen, Micha Gideon Moffie
  • Patent number: 11194841
    Abstract: Automated classification, by: Obtaining an examined document having an examined value appearing therein. Identifying: a location in the examined document at which the examined value appears, and a structure of the examined value. Identifying additional documents of a same type as the examined document, in which values having a same structure as the examined value appear at a same location as in the examined document. Applying a classifier to the examined value and the values in the additional documents, to output a single class to which the examined value and the values in the additional documents belong.
    Type: Grant
    Filed: November 28, 2019
    Date of Patent: December 7, 2021
    Assignee: International Business Machines Corporation
    Inventors: Sigal Asaf, Ariel Farkash, Micha Gideon Moffie
  • Publication number: 20210165807
    Abstract: Automated classification, by: Obtaining an examined document having an examined value appearing therein. Identifying: a location in the examined document at which the examined value appears, and a structure of the examined value. Identifying additional documents of a same type as the examined document, in which values having a same structure as the examined value appear at a same location as in the examined document. Applying a classifier to the examined value and the values in the additional documents, to output a single class to which the examined value and the values in the additional documents belong.
    Type: Application
    Filed: November 28, 2019
    Publication date: June 3, 2021
    Inventors: Sigal Asaf, Ariel Farkash, Micha Gideon Moffie
  • Publication number: 20210042356
    Abstract: A method, apparatus and a product for data generalization for predictive models. The method comprising: based on a labeled dataset, determining a plurality of buckets, each of which has an associated label; determining a plurality of clusters, grouping similar instances in the same bucket; based on the plurality of clusters, determining an alternative set of features comprising a set of generalized features, wherein each generalized feature corresponds to a cluster of the plurality of clusters, wherein a generalized feature that corresponds to a cluster is indicative of the instance being mapped to the corresponding cluster; obtaining a second instance; determining a generalized second instance that comprises a valuation of the alternative set of features for the second instance; and based on the generalized second instance, determining a label for the second instance.
    Type: Application
    Filed: August 6, 2019
    Publication date: February 11, 2021
    Inventors: Gilad Ezov, Ariel Farkash, Abigail Goldsteen, Ron Shmelkin, Micha Gideon Moffie
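
Several abstracts in this listing (for example, publication number 20220398327) describe masking a value by ranking it to an integer, applying noise, and unranking the result. The following is a minimal illustrative sketch of that general idea, not the patented method. The format (two uppercase letters followed by three digits), the uniform noise, the wrap-around at the domain boundary, and the names `ALPHABETS`, `rank`, `unrank`, and `perturb` are all assumptions made for this example.

```python
import random
import string

# Hypothetical format (an assumption for this sketch): two uppercase
# letters followed by three digits, e.g. "AB123".
ALPHABETS = [string.ascii_uppercase] * 2 + [string.digits] * 3


def domain_size(alphabets=ALPHABETS):
    """Number of distinct valid instances of the format."""
    size = 1
    for alphabet in alphabets:
        size *= len(alphabet)
    return size


def rank(value, alphabets=ALPHABETS):
    """Map a valid instance to an integer in [0, domain_size) (mixed radix)."""
    if len(value) != len(alphabets):
        raise ValueError("value does not match the format length")
    r = 0
    for ch, alphabet in zip(value, alphabets):
        # str.index raises ValueError when ch is not allowed at this position.
        r = r * len(alphabet) + alphabet.index(ch)
    return r


def unrank(r, alphabets=ALPHABETS):
    """Inverse of rank: map an integer back to an instance of the format."""
    chars = []
    for alphabet in reversed(alphabets):
        r, digit = divmod(r, len(alphabet))
        chars.append(alphabet[digit])
    return "".join(reversed(chars))


def perturb(value, max_shift, rng, alphabets=ALPHABETS):
    """Rank, add bounded uniform noise, and unrank.

    max_shift stands in for a masking restriction; wrapping with a modulo
    is a choice made for this sketch so the result stays in the domain.
    """
    noise = rng.randint(-max_shift, max_shift)
    shifted = (rank(value, alphabets) + noise) % domain_size(alphabets)
    return unrank(shifted, alphabets)


if __name__ == "__main__":
    rng = random.Random(0)
    print(perturb("AB123", max_shift=50, rng=rng))
```

Because `unrank` is the exact inverse of `rank`, any perturbed integer maps back to a string that is still a valid instance of the format, which is the property the abstract relies on.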