Patents by Inventor Vartika SINGH

Vartika SINGH has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for automatically processing electronic documents

Patent number: 8897563

Abstract: In a document analysis system that receives and processes jobs from a plurality of users, in which each job may contain multiple electronic documents, to extract data from the electronic documents, a method of automatically pre-processing each received electronic document using a plurality of image transformation algorithms to improve subsequent data extraction from said document is provided. The method includes: electronically partitioning each received electronic document page into pieces; automatically processing each piece of the received electronic document page using each of a plurality of image pre-processing algorithms to produce a plurality of image variations of each piece; and analyzing the outputs of subsequent processing and data extraction, on each of the image variations of the pieces to determine which output is best, from the plurality of outputs for each piece.
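The partition, transform, and select steps in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions, not the patented implementation: the list-of-rows image representation, the `identity` and `binarize` transforms, the horizontal-strip partitioning, and the `toy_extract` scoring function are all hypothetical stand-ins for real image pre-processing and data extraction.

```python
from typing import Callable, List, Tuple

Image = List[List[int]]         # grayscale rows, values 0-255 (assumed representation)
Extraction = Tuple[str, float]  # (extracted text, quality score)


def partition(page: Image, rows_per_piece: int) -> List[Image]:
    """Split a page into horizontal strips of at most rows_per_piece rows."""
    return [page[i:i + rows_per_piece] for i in range(0, len(page), rows_per_piece)]


def identity(piece: Image) -> Image:
    """Hypothetical transform: return an unchanged copy of the piece."""
    return [row[:] for row in piece]


def binarize(piece: Image) -> Image:
    """Hypothetical transform: threshold each pixel to black (0) or white (255)."""
    return [[255 if p >= 128 else 0 for p in row] for row in piece]


def toy_extract(piece: Image) -> Extraction:
    """Stand-in for data extraction: draw dark pixels as '#', and score the
    piece by the share of pixels that are pure black or pure white."""
    pixels = [p for row in piece for p in row]
    text = "\n".join("".join("#" if p < 128 else "." for p in row) for row in piece)
    score = sum(p in (0, 255) for p in pixels) / len(pixels)
    return text, score


def best_output_per_piece(
    page: Image,
    transforms: List[Callable[[Image], Image]],
    extract: Callable[[Image], Extraction],
    rows_per_piece: int = 2,
) -> List[Extraction]:
    """Run every transform on every piece and keep the best-scoring output."""
    results = []
    for piece in partition(page, rows_per_piece):
        outputs = [extract(transform(piece)) for transform in transforms]
        results.append(max(outputs, key=lambda out: out[1]))
    return results


page = [[100, 200], [50, 250], [0, 255], [255, 0]]
results = best_output_per_piece(page, [identity, binarize], toy_extract)
```

In this toy run the first strip scores higher after binarization, while the second strip is already pure black and white, so the untransformed variant is kept.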

Type: Grant

Filed: October 28, 2013

Date of Patent: November 25, 2014

Assignee: Gruntworx, LLC

Inventors: Girish Welling, Nirupam Sarkar, Tushar Mahata, Vartika Singh, Depankar Neogi, Steven K. Ladd

Systems and methods for automatically processing electronic documents using multiple image transformation algorithms

Patent number: 8571317

Abstract: In a document analysis system that receives and processes jobs from a plurality of users, in which each job may contain multiple electronic documents, to extract data from the electronic documents, a method of automatically pre-processing each received electronic document using a plurality of image transformation algorithms to improve subsequent data extraction from said document is provided. The method includes: electronically partitioning each received electronic document page into pieces; automatically processing each piece of the received electronic document page using each of a plurality of image pre-processing algorithms to produce a plurality of image variations of each piece; and analyzing the outputs of subsequent processing and data extraction, on each of the image variations of the pieces to determine which output is best, from the plurality of outputs for each piece.

Type: Grant

Filed: January 14, 2011

Date of Patent: October 29, 2013

Assignee: Gruntworx, LLC

Inventors: Girish Welling, Nirupam Sarkar, Tushar Mahata, Vartika Singh, Depankar Neogi, Steven K. Ladd

SYSTEMS AND METHODS FOR AUTOMATICALLY EXTRACTING DATA FROM ELECTRONIC DOCUMENTS USING MULTIPLE CHARACTER RECOGNITION ENGINES

Publication number: 20110255784

Abstract: In a document analysis system that receives and processes jobs from a plurality of users, in which each job may contain multiple electronic documents, to extract data from the electronic documents, a method of automatically extracting data from each received electronic document using a plurality of character recognition engines is provided. The method includes: automatically processing each received electronic document page using each of a plurality of recognition engines to extract data; comparing quality of data extracted from each of the recognition engines to assign a confidence score to the extracted data; and selecting extracted data having highest confidence score as the correct extracted data.
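One simple way to picture the multi-engine selection step is majority agreement. The sketch below is an assumption, not the method claimed in the application: it treats the share of engines that produce an identical output as the confidence score, and the three stub engines are hypothetical placeholders for real character recognition engines.

```python
from collections import Counter
from typing import Callable, List, Tuple


def select_by_agreement(
    page: str, engines: List[Callable[[str], str]]
) -> Tuple[str, float]:
    """Run every engine on the page and return the most common output,
    with the fraction of engines that agreed on it as the confidence."""
    outputs = [engine(page) for engine in engines]
    text, votes = Counter(outputs).most_common(1)[0]
    return text, votes / len(outputs)


# Hypothetical stub engines: two read the field correctly, one confuses 0 with O.
def engine_a(page: str) -> str:
    return "1040"


def engine_b(page: str) -> str:
    return "1O40"


def engine_c(page: str) -> str:
    return "1040"


text, confidence = select_by_agreement("scanned-page", [engine_a, engine_b, engine_c])
```

A production system would more likely combine per-engine confidence values than count exact matches; agreement counting is used here only because it needs no engine internals.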

Type: Application

Filed: January 14, 2011

Publication date: October 20, 2011

Applicant: COPANION, INC.

Inventors: Girish WELLING, Vartika SINGH, Gopal KRISHNA, Tushar MAHATA, Nirupam SARKAR, Depankar NEOGI, Steven K. LADD

SYSTEMS AND METHODS FOR AUTOMATICALLY EXTRACTING DATA BY NARROWING DATA SEARCH SCOPE USING CONTOUR MATCHING

Publication number: 20110255794

Abstract: A method of extracting data by narrowing a scope of data search using contour matching of select elements in a document is provided. The method includes: analyzing each document to automatically extract images and text features wherein said analyzing compares extracted features with a first search space of candidate features to try and recognize the extracted features; automatically processing each unrecognized feature using a contour recognition engine to generate a contour of the unrecognized feature; automatically selecting a second search space of candidate features through contour matching using the contour of the unrecognized feature, wherein the second search space of candidate features is narrower than the first search space of candidate features; and comparing the unrecognized feature with said second search space to identify the previously unrecognized feature.

Type: Application

Filed: January 14, 2011

Publication date: October 20, 2011

Applicant: Copanion, Inc.

Inventors: Depankar Neogi, Vartika Singh, Girish Welling, Steven K. Ladd, Xujun Peng

SYSTEMS AND METHODS FOR AUTOMATICALLY PROCESSING ELECTRONIC DOCUMENTS USING MULTIPLE IMAGE TRANSFORMATION ALGORITHMS

Publication number: 20110255782

Abstract: In a document analysis system that receives and processes jobs from a plurality of users, in which each job may contain multiple electronic documents, to extract data from the electronic documents, a method of automatically pre-processing each received electronic document using a plurality of image transformation algorithms to improve subsequent data extraction from said document is provided. The method includes: electronically partitioning each received electronic document page into pieces; automatically processing each piece of the received electronic document page using each of a plurality of image pre-processing algorithms to produce a plurality of image variations of each piece; and analyzing the outputs of subsequent processing and data extraction, on each of the image variations of the pieces to determine which output is best, from the plurality of outputs for each piece.

Type: Application

Filed: January 14, 2011

Publication date: October 20, 2011

Applicant: Copanion, Inc.

Inventors: Girish Welling, Nirupam Sarkar, Tushar Mahata, Vartika Singh, Depankar Neogi, Steven K. Ladd

SYSTEMS AND METHODS FOR AUTOMATICALLY EXTRACTING DATA FROM ELECTRONIC DOCUMENTS CONTAINING MULTIPLE LAYOUT FEATURES

Publication number: 20110255789

Abstract: A method of automatically extracting data from an electronic document containing a plurality of layout features through progressive refinement is provided. The method includes: analyzing each document to automatically extract images and text features wherein each document includes at least two features that are related to each other, and wherein said analyzing compares extracted features with a first search space of candidate features to try and recognize the extracted features; if one of the at least two related features is not recognized and at least one feature is recognized, selecting a second search space of candidate features in response thereto and in response to predefined rules about the relationship between the two features; and comparing the unrecognized feature with said selected second search space.
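The progressive refinement described here can be illustrated with a two-pass lookup. Everything specific in this sketch is an assumption: the form titles and labels, the `RELATED_LABELS` rule table, exact matching for the first pass, and fuzzy matching with the standard library's `difflib.get_close_matches` for the second pass are illustrative choices, not details taken from the application.

```python
import difflib
from typing import Dict, List, Optional

# Hypothetical rules: a recognized form title implies which labels can appear with it.
RELATED_LABELS: Dict[str, List[str]] = {
    "Form W-2": ["Wages, tips", "Federal income tax withheld"],
    "Form 1099-INT": ["Interest income", "Early withdrawal penalty"],
}

# First search space: every known title and label.
FIRST_SPACE: List[str] = list(RELATED_LABELS) + [
    label for labels in RELATED_LABELS.values() for label in labels
]


def recognize_pair(title: str, label: str) -> Optional[str]:
    """Return the recognized label, or None if it cannot be resolved."""
    # Pass 1: exact lookup in the full (first) search space.
    if label in FIRST_SPACE:
        return label
    # Pass 2: if the related title was recognized, search only its labels,
    # where a tolerant fuzzy match is less likely to pick the wrong candidate.
    if title in RELATED_LABELS:
        matches = difflib.get_close_matches(label, RELATED_LABELS[title], n=1, cutoff=0.6)
        return matches[0] if matches else None
    return None


resolved = recognize_pair("Form W-2", "Wagas, tps")
```

Restricting the second pass to labels that the rules allow for the recognized title is what makes a looser match threshold tolerable.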

Type: Application

Filed: January 14, 2011

Publication date: October 20, 2011

Applicant: COPANION, INC.

Inventors: Depankar NEOGI, Steven K. LADD, Girish WELLING, Arjun KUMAR, Vartika SINGH, Matthew DUGGAN, Tushar MAHATA, Xiaobin YANG, Jian-Wu XU, Janice O'NEIL, Nirupam SARKAR, Gopal KRISHNA

Systems and methods for automatically reducing data search space and improving data extraction accuracy using known constraints in a layout of extracted data elements

Publication number: 20110258195

Abstract: A method of automatically narrowing data search space and improving accuracy of data extraction using known constraints in a layout of extracted data elements for classified documents is provided.

Type: Application

Filed: January 14, 2011

Publication date: October 20, 2011

Inventors: Girish WELLING, Vartika SINGH, Janice O'NEIL, Depankar NEOGI, Steven K. LADD

SYSTEMS AND METHODS FOR TRAINING DOCUMENT ANALYSIS SYSTEM FOR AUTOMATICALLY EXTRACTING DATA FROM DOCUMENTS

Publication number: 20110258150

Abstract: A method of training a document analysis system to extract data from documents is provided. The method includes: automatically analyzing images and text features extracted from a document to associate the document with a corresponding document category; comparing the extracted text features with a set of text features associated with corresponding category of the document, in which the set of text features includes a set of characters, words, and phrases; if the extracted features are found to consist of the characters, words, and phrases belonging to the set of text features associated with the corresponding document category, storing the extracted text features as the data contained in the corresponding document; and, if the extracted text features are found to include at least one text feature that does not belong to the set of text features associated with the corresponding document category, submitting the unrecognized text features to a training phase.
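The store-or-train decision in this abstract amounts to a set-membership check. The following is a minimal sketch under assumptions: the function name, the `"store"` and `"train"` action strings, and the sample vocabulary are hypothetical, and the document is taken to be already classified into its category.

```python
from typing import List, Set, Tuple


def route_features(extracted: List[str], known: Set[str]) -> Tuple[str, List[str]]:
    """Store the features if all of them belong to the category's known set;
    otherwise return only the unrecognized ones for the training phase."""
    unknown = [feature for feature in extracted if feature not in known]
    if not unknown:
        return "store", list(extracted)
    return "train", unknown


# Hypothetical vocabulary for one document category.
w2_vocabulary = {"Wages", "Employer", "Federal income tax withheld"}

stored = route_features(["Wages", "Employer"], w2_vocabulary)
queued = route_features(["Wages", "Emp1oyer"], w2_vocabulary)
```

In the second call only the unrecognized feature is queued, which matches the abstract's statement that the unrecognized text features are submitted to training.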

Type: Application

Filed: January 14, 2011

Publication date: October 20, 2011

Applicant: COPANION, INC.

Inventors: Depankar NEOGI, Steven K. LADD, Girish WELLING, Arjun KUMAR, Vartika SINGH, Matthew DUGGAN, Tushar MAHATA, Xiaobin YANG, Jian-Wu XU, Janice O'NEIL, Nirupam SARKAR, Gopal KRISHNA

SYSTEMS AND METHODS FOR AUTOMATICALLY EXTRACTING DATA FROM ELECTRONIC DOCUMENT PAGE INCLUDING MULTIPLE COPIES OF A FORM

Publication number: 20110258182

Abstract: In a document analysis system that receives and processes jobs from a plurality of users, in which each job may contain multiple electronic documents, to extract data from the electronic documents, a method of extracting data from a received electronic document page that includes multiple copies of a form is provided. The method comprises: automatically processing a received electronic document page that includes multiple copies of a form to group the multiple copies into a corresponding number of records; automatically extracting data from each of the multiple copies of the form and saving the extracted data into the corresponding record; automatically comparing the extracted data in the records to determine which copy of the extracted data to select; if all extracted data instances are identical, assigning a high confidence score to the extracted data; and, if all extracted data instances are not identical, flagging the extracted data for further processing.
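The comparison step across copies of a form can be sketched as a per-field agreement check. This is an illustration only: comparing field by field rather than record by record, the dictionary-based records, and the `"high"` and `"review"` status strings are assumptions that do not come from the application.

```python
from typing import Dict, List, Optional, Tuple


def reconcile(records: List[Dict[str, str]]) -> Dict[str, Tuple[Optional[str], str]]:
    """Compare each field across all copies of the form. Identical values get
    high confidence; any disagreement (or a missing value) flags the field."""
    fields = sorted({name for record in records for name in record})
    result: Dict[str, Tuple[Optional[str], str]] = {}
    for name in fields:
        values = {record.get(name) for record in records}
        if len(values) == 1:
            result[name] = (values.pop(), "high")
        else:
            result[name] = (None, "review")
    return result


# Two copies of the same form on one page; one digit was misread in copy 2.
copies = [
    {"wages": "52000", "employer": "Acme"},
    {"wages": "52OOO", "employer": "Acme"},
]
outcome = reconcile(copies)
```

Here the employer field agrees across both copies and is accepted, while the wages field is flagged for further processing.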

Type: Application

Filed: January 14, 2011

Publication date: October 20, 2011

Inventors: Vartika Singh, Matthew Duggan, Girish Welling, Depankar Neogi, Steven K. Ladd

SYSTEMS AND METHODS FOR AUTOMATICALLY EXTRACTING DATA FROM ELECTRONIC DOCUMENTS INCLUDING TABLES

Publication number: 20110249905

Abstract: A method of automatically extracting data from an electronic document including tables is provided. The method includes: automatically identifying rows of the table using gaps in horizontal projections of the plurality of image sections, wherein at least some of the identified rows in close proximity are collected to form table formations; and automatically identifying columns of the table using at least some of the plurality of image sections that are vertically aligned, wherein the identified columns are grown in each of the table formations using gaps in vertical projections of the plurality of image sections until an obstruction is reached. The method further includes automatically identifying labels in the plurality of corresponding image sections to associate the identified labels with at least one of the identified columns and the identified rows; and automatically extracting data from cells of the table formed by the identified rows and columns.
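Row detection from gaps in a horizontal projection can be shown on a tiny binary image. The sketch below covers only that first step and makes several assumptions: the image is a list of rows with 1 for ink and 0 for background, a gap is any run of rows with zero ink, and the `min_gap` parameter for merging nearby bands is a hypothetical simplification of the abstract's "close proximity" grouping.

```python
from typing import List, Tuple

BinaryImage = List[List[int]]  # 1 = ink, 0 = background (assumed representation)


def horizontal_projection(image: BinaryImage) -> List[int]:
    """Count the ink pixels in each pixel row."""
    return [sum(row) for row in image]


def find_rows(image: BinaryImage, min_gap: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) bands of consecutive inked rows, end exclusive.
    Bands separated by fewer than min_gap blank rows are merged."""
    bands: List[Tuple[int, int]] = []
    start = None
    for y, ink in enumerate(horizontal_projection(image)):
        if ink > 0 and start is None:
            start = y                      # a band of text begins
        elif ink == 0 and start is not None:
            bands.append((start, y))       # a gap closes the band
            start = None
    if start is not None:
        bands.append((start, len(image)))  # band runs to the bottom edge

    merged: List[Tuple[int, int]] = []
    for band in bands:
        if merged and band[0] - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], band[1])
        else:
            merged.append(band)
    return merged


image = [
    [0, 0, 0],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
    [0, 1, 1],
    [0, 0, 0],
]
rows = find_rows(image)
```

The same idea applied to column sums gives a vertical projection, which the abstract uses when growing columns within each table formation.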

Type: Application

Filed: June 23, 2011

Publication date: October 13, 2011

Applicant: Copanion, Inc.

Inventors: Vartika SINGH, Girish Welling, Depankar Neogi, Steven K. Ladd