Patents by Inventor Punitha Chandrasekar

Punitha Chandrasekar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SYSTEMS AND METHODS FOR AUTOMATED END-TO-END TEXT EXTRACTION OF ELECTRONIC DOCUMENTS

Publication number: 20230419711

Abstract: Systems and methods for extracting data from electronic documents using optical character recognition (OCR) and non-OCR based text extraction. A server computing device initiates non-OCR based text extraction for each page of an electronic document. The server calculates a document text coverage percentage corresponding to the non-OCR based text extraction for the whole document and, in response to determining that the document text coverage percentage is below a first threshold, initiates OCR for the document. The server calculates a page text coverage percentage corresponding to the non-OCR based text extraction for one or more pages of the electronic document and, in response to determining that the page text coverage percentage is below a second threshold, initiates OCR for the pages. The server combines first text extracted from the electronic document using non-OCR based text extraction and second text extracted from the electronic document using OCR.

Type: Application

Filed: June 27, 2022

Publication date: December 28, 2023

Inventors: Keerthan Ramnath, Punitha Chandrasekar, Hui Su, Shyam Subramanian, Rachna Saxena, Mohamed Mahdi Alouane, Vinay Iyengar
Automatic identification of document sections to generate a searchable data structure

Patent number: 11657078

Abstract: Methods and apparatuses are described for automatically identifying text sections of a document to generate a searchable hierarchical data structure. A computing device receives a document comprising text entities and converts the document from a first format to a second format, including generating metadata associated with text alignment, text position, text spacing, or fonts. The computing device extracts the text blocks, including determining coordinates associated with each text block using the metadata. The computing device determines document sections using the document metadata by identifying strings in the extracted text blocks that indicate a presence of a bullet point in the document, assigns a hierarchical category to each identified document section, and inserts text of each document section into a hierarchical data structure based upon the assigned hierarchical category.

Type: Grant

Filed: October 14, 2021

Date of Patent: May 23, 2023

Assignee: FMR LLC

Inventors: Ananya Bal, Punitha Chandrasekar, Vinay Iyengar, Bidhan Roy
AUTOMATIC IDENTIFICATION OF DOCUMENT SECTIONS TO GENERATE A SEARCHABLE DATA STRUCTURE

Publication number: 20230119590

Abstract: Methods and apparatuses are described for automatically identifying text sections of a document to generate a searchable hierarchical data structure. A computing device receives a document comprising text entities and converts the document from a first format to a second format, including generating metadata associated with text alignment, text position, text spacing, or fonts. The computing device extracts the text blocks, including determining coordinates associated with each text block using the metadata. The computing device determines document sections using the document metadata by identifying strings in the extracted text blocks that indicate a presence of a bullet point in the document, assigns a hierarchical category to each identified document section, and inserts text of each document section into a hierarchical data structure based upon the assigned hierarchical category.

Type: Application

Filed: October 14, 2021

Publication date: April 20, 2023

Inventors: Ananya Bal, Punitha Chandrasekar, Vinay Iyengar, Bidhan Roy
Systems and methods for data extraction from electronic documents using data patterns

Patent number: 11625419

Abstract: Systems and methods for extracting data from electronic documents based on data patterns. The method includes receiving electronic template documents. Each template document corresponds to a type of electronic document. The method further includes, for each template document, processing the template document using a text extraction and data processing application. The method also includes, for each template document, determining a data extraction formula corresponding to the type of electronic document. The method further includes, storing the data extraction formula in a first database. The method also includes, receiving an electronic document including user data and a Unicode corresponding to the type of document. The method also includes, processing and classifying the electronic document into the type of document corresponding to the Unicode.

Type: Grant

Filed: October 6, 2020

Date of Patent: April 11, 2023

Assignee: FMR LLC

Inventors: Punitha Chandrasekar, Sourav Karmakar, Amol Vinayak Jadhav, Bidhan Roy, Victor S. Y. Lo, Varun Vivek Aher, Ankit Garg
Systems and Methods for Data Extraction from Electronic Documents Using Data Patterns

Publication number: 20220107964

Abstract: Systems and methods for extracting data from electronic documents based on data patterns. The method includes receiving electronic template documents. Each template document corresponds to a type of electronic document. The method further includes, for each template document, processing the template document using a text extraction and data processing application. The method also includes, for each template document, determining a data extraction formula corresponding to the type of electronic document. The method further includes, storing the data extraction formula in a first database. The method also includes, receiving an electronic document including user data and a Unicode corresponding to the type of document. The method also includes, processing and classifying the electronic document into the type of document corresponding to the Unicode.

Type: Application

Filed: October 6, 2020

Publication date: April 7, 2022

Inventors: Punitha Chandrasekar, Sourav Karmakar, Amol Vinayak Jadhav, Bidhan Roy, Victor S. Y. Lo, Varun Vivek Aher, Ankit Garg

SYSTEMS AND METHODS FOR AUTOMATED END-TO-END TEXT EXTRACTION OF ELECTRONIC DOCUMENTS

Automatic identification of document sections to generate a searchable data structure

AUTOMATIC IDENTIFICATION OF DOCUMENT SECTIONS TO GENERATE A SEARCHABLE DATA STRUCTURE

Systems and methods for data extraction from electronic documents using data patterns

Systems and Methods for Data Extraction from Electronic Documents Using Data Patterns