Patents by Inventor Christopher Muffat

Christopher Muffat has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12046067
    Abstract: Methods and systems for extracting personal data from a sensitive document are provided. The system includes a document prediction module, a cropping module, a denoising module, and an optical character recognition (OCR) module. The document prediction module predicts type of document of the sensitive document using a keypoint matching-based approach and the cropping module extracts document shape and extracts one or more fields comprising text or pictures from the sensitive document. The denoising module prepares the one or more fields for optical character recognition, and the OCR module performs optical character recognition on the denoised one or more fields to detect characters in the one or more fields.
    Type: Grant
    Filed: June 15, 2021
    Date of Patent: July 23, 2024
    Assignee: DATHENA SCIENCE PTE. LTD.
    Inventors: Christopher Muffat, Tetiana Kodliuk
  • Patent number: 12039074
    Abstract: Systems and methods for personal data classification, linkage and purpose of processing prediction are provided. The system for personal data classification includes an entity extraction module for extracting personal data from one or more data repositories in a computer network or cloud infrastructure, a linkage module coupled to the entity extraction module, a linkage module coupled to the entity extraction module and a processing prediction module. The entity extraction module performs entity recognition from the structured, semi-structured and unstructured records in the one or more data repositories. The linkage module uses graph-based methodology to link the personal data to one or more individuals.
    Type: Grant
    Filed: December 31, 2019
    Date of Patent: July 16, 2024
    Assignee: Dathena Science Pte Ltd
    Inventors: Christopher Muffat, Tetiana Kodliuk
  • Patent number: 12033040
    Abstract: Systems, methods and computer readable medium are provided for perform a method for content and context aware data classification or a method for content and context aware data security anomaly detection. The method for content and context aware data confidentiality classification includes scanning one or more documents in one or more network data repositories of a computer network and extracting content features and context features of the one or more documents into one or more term frequency-inverse document frequency (TF-IDF) vectors and one or more latent semantic indexing (LSI) vectors. The method further includes classifying the one or more documents into a number of category classifications by machine learning the extracted content features and context features of the one or more documents at a file management platform of the computer network, each of the category classifications being associated with one or more confidentiality classifications.
    Type: Grant
    Filed: August 14, 2018
    Date of Patent: July 9, 2024
    Assignee: Dathena Science Ptd. Ltd.
    Inventor: Christopher Muffat
  • Patent number: 11675926
    Abstract: Methods and systems for data management of documents in one or more data repositories in a computer network or cloud infrastructure are provided. The method includes sampling the documents in the one or more data repositories and formulating representative subsets of the sampled documents. The method further includes generating sampled data sets of the sampled documents and balancing the sampled data sets for further processing of the sampled documents. The formulation of the representative subsets is performed for identification of some of the representative subsets for initial processing.
    Type: Grant
    Filed: December 30, 2019
    Date of Patent: June 13, 2023
    Assignee: DATHENA SCIENCE PTE LTD
    Inventors: Christopher Muffat, Tetiana Kodliuk
  • Publication number: 20220398399
    Abstract: Methods and systems for extracting personal data from a sensitive document are provided. The system includes a document prediction module, a cropping module, a denoising module, and an optical character recognition (OCR) module. The document prediction module predicts type of document of the sensitive document using a keypoint matching-based approach and the cropping module extracts document shape and extracts one or more fields comprising text or pictures from the sensitive document. The denoising module prepares the one or more fields for optical character recognition, and the OCR module performs optical character recognition on the denoised one or more fields to detect characters in the one or more fields.
    Type: Application
    Filed: June 15, 2021
    Publication date: December 15, 2022
    Applicant: Dathena Science Pte. Ltd.
    Inventors: Christopher MUFFAT, Tetiana KODLIUK
  • Patent number: 11461371
    Abstract: Methods and systems for data loss prevention and autolabelling of business categories and confidentiality based on text summarization are provided. The method for data loss prevention includes entering a combination of keywords and/or keyphrases and offline unsupervised mapping of a path of transfer of specific groups of documents. The offline unsupervised mapping includes keyword/keyphrase extraction from the specific groups of documents and normalization of candidates. The method further includes vectorization of the extracted keywords/keyphrases from the specific groups of documents and quantitative performance measurement of the keyword/keyphrase extraction to derive keywords and/or keyphrases suitable for data loss prevention.
    Type: Grant
    Filed: December 31, 2019
    Date of Patent: October 4, 2022
    Assignee: DATHENA SCIENCE PTE LTD.
    Inventors: Christopher Muffat, Tetiana Kodliuk
  • Publication number: 20210374533
    Abstract: Methods, systems and computer readable medium for explainable artificial intelligence are provided. The method for explainable artificial intelligence includes receiving a document and pre-processing the document to prepare information in the document for processing. The method further includes processing the information by an artificial neural network for one or more tasks. In addition, the method includes providing explanations and visualization of the processing by the artificial neural network to a user during processing of the information by the artificial neural network.
    Type: Application
    Filed: May 27, 2021
    Publication date: December 2, 2021
    Applicant: Dathena Science Pte. Ltd.
    Inventors: Christopher MUFFAT, Tetiana KODLIUK, Adel RAHIMI
  • Publication number: 20210319179
    Abstract: Systems, methods and computer readable medium are provided for perform a method for content and context aware data classification or a method for content and context aware data security anomaly detection. The method for content and context aware data confidentiality classification includes scanning one or more documents in one or more network data repositories of a computer network and extracting content features and context features of the one or more documents into one or more term frequency-inverse document frequency (TF-IDF) vectors and one or more latent semantic indexing (LSI) vectors. The method further includes classifying the one or more documents into a number of category classifications by machine learning the extracted content features and context features of the one or more documents at a file management platform of the computer network, each of the category classifications being associated with one or more confidentiality classifications.
    Type: Application
    Filed: August 14, 2018
    Publication date: October 14, 2021
    Applicant: Dathena Science Pte. Ltd.
    Inventor: Christopher MUFFAT
  • Publication number: 20200279105
    Abstract: Methods, systems and deep learning engines for content and context aware data classification by business category and confidentiality level are provided. The deep learning engine includes a feature extraction module and a classification and labelling module. The feature extraction module extracts both context features and document features from documents and the classification and labelling module is configured for content and context aware data classification of the documents by business category and confidentiality level using neural networks.
    Type: Application
    Filed: December 31, 2019
    Publication date: September 3, 2020
    Inventors: Christopher MUFFAT, Tetiana KODLIUK
  • Publication number: 20200250241
    Abstract: Methods and systems for data management of documents in one or more data repositories in a computer network or cloud infrastructure are provided. The method includes sampling the documents in the one or more data repositories and formulating representative subsets of the sampled documents. The method further includes generating sampled data sets of the sampled documents and balancing the sampled data sets for further processing of the sampled documents. The formulation of the representative subsets is performed for identification of some of the representative subsets for initial processing.
    Type: Application
    Filed: December 30, 2019
    Publication date: August 6, 2020
    Inventors: Christopher Muffat, Tetiana Kodliuk
  • Publication number: 20200250139
    Abstract: Systems and methods for personal data classification, linkage and purpose of processing prediction are provided. The system for personal data classification includes an entity extraction module for extracting personal data from one or more data repositories in a computer network or cloud infrastructure, a linkage module coupled to the entity extraction module, a linkage module coupled to the entity extraction module and a processing prediction module. The entity extraction module performs entity recognition from the structured, semi-structured and unstructured records in the one or more data repositories. The linkage module uses graph-based methodology to link the personal data to one or more individuals.
    Type: Application
    Filed: December 31, 2019
    Publication date: August 6, 2020
    Inventors: Christopher MUFFAT, Tetiana KODLIUK
  • Publication number: 20200226154
    Abstract: Methods and systems for data loss prevention and autolabelling of business categories and confidentiality based on text summarization are provided. The method for data loss prevention includes entering a combination of keywords and/or keyphrases and offline unsupervised mapping of a path of transfer of specific groups of documents. The offline unsupervised mapping includes keyword/keyphrase extraction from the specific groups of documents and normalization of candidates. The method further includes vectorization of the extracted keywords/keyphrases from the specific groups of documents and quantitative performance measurement of the keyword/keyphrase extraction to derive keywords and/or keyphrases suitable for data loss prevention.
    Type: Application
    Filed: December 31, 2019
    Publication date: July 16, 2020
    Inventors: Christopher Muffat, Tetiana Kodliuk