Patents by Inventor Sourabh Dixit

Sourabh Dixit has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11790678
    Abstract: A data processing system receives a plurality of electronic documents in image format, and extracts text data using an optical character recognition processor. The system determines a plurality of candidate entity data and candidate context data based on the extracted text data using a trained natural language processing closed-domain question answering model. The system accesses n-gram words stored in a knowledge base, and determines similarity scores between each candidate context data and each of the n-gram words. The system determines a weighted average of the similarity scores, and selects an optimum entity data from the plurality of candidate entity data based on the weighted average of the similarity scores.
    Type: Grant
    Filed: March 30, 2022
    Date of Patent: October 17, 2023
    Assignee: COMETGAZE LIMITED
    Inventors: Allan Beechinor, Sourabh Dixit, Anurag Banerjee, Chavvi Nihal Chandani, Shivansh Bhandari
  • Publication number: 20230316791
    Abstract: A data processing system receives a plurality of electronic documents in image format, and extracts text data using an optical character recognition processor. The system determines a plurality of candidate entity data and candidate context data based on the extracted text data using a trained natural language processing closed-domain question answering model. The system accesses n-gram words stored in a knowledge base, and determines similarity scores between each candidate context data and each of the n-gram words. The system determines a weighted average of the similarity scores, and selects an optimum entity data from the plurality of candidate entity data based on the weighted average of the similarity scores.
    Type: Application
    Filed: March 30, 2022
    Publication date: October 5, 2023
    Inventors: Allan Beechinor, Sourabh Dixit, Anurag Banerjee, Chavvi Nihal Chandani, Shivansh Bhandari
  • Patent number: 11651256
    Abstract: A data processing system receives a plurality of electronic documents in image format. The system extracts text from the electronic documents using an optical character recognition processor, and determines a plurality of entity data based on the extracted text. The system receives pre-defined question data from a user, and determines pre-annotated answer data based on the entity data and the question data using an open-domain question answering model. The system determines context data based on the entity data and the question data. The system provides the pre-annotated answer data to the user, and receives corrected entity data from the user. The system trains a closed-domain question answering model based on the corrected entity data and re-aligned context data. The system determines a plurality of n-gram words based on the corrected entity data and the context data using a context phrase model. The n-gram words are stored in a knowledge base.
    Type: Grant
    Filed: March 30, 2022
    Date of Patent: May 16, 2023
    Assignee: ALTADA TECHNOLOGY SOLUTIONS LTD.
    Inventors: Allan Beechinor, Sourabh Dixit, Anurag Banerjee, Chavvi Nihal Chandani, Shivansh Bhandari
  • Patent number: 11487798
    Abstract: A data processing system receives a plurality of electronic documents in image format. For each signature segment, the system determines associated surrounding text using an optical character recognition processor. The system accesses first data stored in a signature knowledge base. The system determines first similarity scores based on the associated surrounding text and the first data using a statistical measure technique. The system selects an optimum signature segment based on a distance metric between each signature segment and each associated surrounding text. For each stamp segment, the system extracts text from the stamp segment using an optical character recognition processor. The system accesses second data stored in a stamp knowledge base. The system determines second similarity scores based on the extracted text and the second data using the statistical measure technique. The system selects an optimum stamp segment based on the second similarity scores.
    Type: Grant
    Filed: March 30, 2022
    Date of Patent: November 1, 2022
    Assignee: ALTADA TECHNOLOGY SOLUTIONS LTD.
    Inventors: Allan Beechinor, Sourabh Dixit, Anurag Banerjee, Chavvi Nihal Chandani, Shivansh Bhandari
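
The selection step described in the abstracts for patent 11790678 and publication 20230316791 (score each candidate's context against the knowledge-base n-grams, take a weighted average, pick the best candidate) can be illustrated with a minimal sketch. This is not the patented implementation. The abstracts do not name the similarity measure or say how the weights are chosen, so the use of `difflib.SequenceMatcher`, the per-n-gram weights, and the function names `similarity`, `weighted_average_score`, and `select_entity` are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1].

    Assumption: a stand-in for the similarity measure, which the
    abstracts leave unspecified.
    """
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def weighted_average_score(context: str, ngram_weights: dict[str, float]) -> float:
    """Weighted average of the similarity between one candidate context
    and every n-gram in the knowledge base.

    Assumption: each n-gram carries its own weight.
    """
    total_weight = sum(ngram_weights.values())
    if total_weight == 0:
        return 0.0
    weighted_sum = sum(
        weight * similarity(context, ngram)
        for ngram, weight in ngram_weights.items()
    )
    return weighted_sum / total_weight


def select_entity(candidates: list[tuple[str, str]],
                  ngram_weights: dict[str, float]) -> str:
    """Return the candidate entity whose context scores highest.

    `candidates` holds hypothetical (entity, context) pairs, such as a
    question answering model might emit.
    """
    if not candidates:
        raise ValueError("no candidate entities to choose from")
    best_entity, _ = max(
        candidates,
        key=lambda pair: weighted_average_score(pair[1], ngram_weights),
    )
    return best_entity


if __name__ == "__main__":
    knowledge_base = {"governing law": 3.0, "jurisdiction": 1.0}
    candidates = [
        ("Ireland", "governing law"),
        ("30 March 2022", "effective date"),
    ]
    print(select_entity(candidates, knowledge_base))
```

In this sketch a candidate wins when its surrounding context resembles the heavily weighted n-grams, which is one plausible reading of "selects an optimum entity data ... based on the weighted average of the similarity scores".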