Patents by Inventor Punitha Chandrasekar

Punitha Chandrasekar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230419711
    Abstract: Systems and methods for extracting data from electronic documents using optical character recognition (OCR) and non-OCR based text extraction. A server computing device initiates non-OCR based text extraction for each page of an electronic document. The server calculates a document text coverage percentage corresponding to the non-OCR based text extraction for the whole document and, in response to determining that the document text coverage percentage is below a first threshold, initiates OCR for the document. The server calculates a page text coverage percentage corresponding to the non-OCR based text extraction for one or more pages of the electronic document and, in response to determining that the page text coverage percentage is below a second threshold, initiates OCR for the pages. The server combines first text extracted from the electronic document using non-OCR based text extraction and second text extracted from the electronic document using OCR.
    Type: Application
    Filed: June 27, 2022
    Publication date: December 28, 2023
    Inventors: Keerthan Ramnath, Punitha Chandrasekar, Hui Su, Shyam Subramanian, Rachna Saxena, Mohamed Mahdi Alouane, Vinay Iyengar
  • Patent number: 11657078
    Abstract: Methods and apparatuses are described for automatically identifying text sections of a document to generate a searchable hierarchical data structure. A computing device receives a document comprising text entities and converts the document from a first format to a second format, including generating metadata associated with text alignment, text position, text spacing, or fonts. The computing device extracts the text blocks, including determining coordinates associated with each text block using the metadata. The computing device determines document sections using the document metadata by identifying strings in the extracted text blocks that indicate a presence of a bullet point in the document, assigns a hierarchical category to each identified document section, and inserts text of each document section into a hierarchical data structure based upon the assigned hierarchical category.
    Type: Grant
    Filed: October 14, 2021
    Date of Patent: May 23, 2023
    Assignee: FMR LLC
    Inventors: Ananya Bal, Punitha Chandrasekar, Vinay Iyengar, Bidhan Roy
  • Publication number: 20230119590
    Abstract: Methods and apparatuses are described for automatically identifying text sections of a document to generate a searchable hierarchical data structure. A computing device receives a document comprising text entities and converts the document from a first format to a second format, including generating metadata associated with text alignment, text position, text spacing, or fonts. The computing device extracts the text blocks, including determining coordinates associated with each text block using the metadata. The computing device determines document sections using the document metadata by identifying strings in the extracted text blocks that indicate a presence of a bullet point in the document, assigns a hierarchical category to each identified document section, and inserts text of each document section into a hierarchical data structure based upon the assigned hierarchical category.
    Type: Application
    Filed: October 14, 2021
    Publication date: April 20, 2023
    Inventors: Ananya Bal, Punitha Chandrasekar, Vinay Iyengar, Bidhan Roy
  • Patent number: 11625419
    Abstract: Systems and methods for extracting data from electronic documents based on data patterns. The method includes receiving electronic template documents. Each template document corresponds to a type of electronic document. The method further includes, for each template document, processing the template document using a text extraction and data processing application. The method also includes, for each template document, determining a data extraction formula corresponding to the type of electronic document. The method further includes storing the data extraction formula in a first database. The method also includes receiving an electronic document including user data and a Unicode corresponding to the type of document. The method also includes processing and classifying the electronic document into the type of document corresponding to the Unicode.
    Type: Grant
    Filed: October 6, 2020
    Date of Patent: April 11, 2023
    Assignee: FMR LLC
    Inventors: Punitha Chandrasekar, Sourav Karmakar, Amol Vinayak Jadhav, Bidhan Roy, Victor S. Y. Lo, Varun Vivek Aher, Ankit Garg
  • Publication number: 20220107964
    Abstract: Systems and methods for extracting data from electronic documents based on data patterns. The method includes receiving electronic template documents. Each template document corresponds to a type of electronic document. The method further includes, for each template document, processing the template document using a text extraction and data processing application. The method also includes, for each template document, determining a data extraction formula corresponding to the type of electronic document. The method further includes storing the data extraction formula in a first database. The method also includes receiving an electronic document including user data and a Unicode corresponding to the type of document. The method also includes processing and classifying the electronic document into the type of document corresponding to the Unicode.
    Type: Application
    Filed: October 6, 2020
    Publication date: April 7, 2022
    Inventors: Punitha Chandrasekar, Sourav Karmakar, Amol Vinayak Jadhav, Bidhan Roy, Victor S. Y. Lo, Varun Vivek Aher, Ankit Garg
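
The two-threshold fallback described in the abstract of publication 20230419711 can be illustrated with a short sketch. This is not the patented implementation. The abstract does not say how a text coverage percentage is computed, so the sketch assumes it is the ratio of characters recovered by non-OCR extraction to a per-page estimate of expected characters. The names `Page`, `coverage`, `extract_document`, and `run_ocr`, and the default threshold values, are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Page:
    native_text: str     # text recovered by non-OCR extraction
    expected_chars: int  # assumed estimate of how much text the page holds


def coverage(extracted_chars: int, expected_chars: int) -> float:
    """Return text coverage as a percentage (assumed definition)."""
    if expected_chars <= 0:
        return 100.0  # nothing expected, so treat the page as fully covered
    return min(100.0, 100.0 * extracted_chars / expected_chars)


def extract_document(
    pages: List[Page],
    run_ocr: Callable[[int], str],  # hypothetical OCR hook, takes a page index
    doc_threshold: float = 50.0,    # "first threshold" (illustrative value)
    page_threshold: float = 30.0,   # "second threshold" (illustrative value)
) -> List[str]:
    """Combine non-OCR and OCR text, one string per page."""
    doc_cov = coverage(
        sum(len(p.native_text) for p in pages),
        sum(p.expected_chars for p in pages),
    )
    # Document-level check: low overall coverage sends every page to OCR.
    if doc_cov < doc_threshold:
        return [run_ocr(i) for i in range(len(pages))]

    # Page-level check: only pages with low coverage are sent to OCR.
    combined = []
    for i, page in enumerate(pages):
        page_cov = coverage(len(page.native_text), page.expected_chars)
        combined.append(run_ocr(i) if page_cov < page_threshold else page.native_text)
    return combined
```

In this sketch a mostly digital document with one scanned page would trigger OCR for that page alone, while a fully scanned document would fall below the document-level threshold and be sent to OCR as a whole.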