Patents by Inventor Shreyas Bettadapura Guruprasad

Shreyas Bettadapura Guruprasad has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11216425
    Abstract: A system and method of recognizing data in a table area from unstructured data includes a computer network, one or more processors communicatively coupled with the computer network, a storage location, and a graph-theoretic engine that receives an input stream of unstructured data associated. A table area is recognized from unstructured data, through one or more computer processors, from an input stream of unstructured data received over a computer network. One or more table headers associated with the detected one or more table areas are recognized. Further, one or more column delimiters associated with each column of the detected one or more table areas are determined. One or more tabular data associated with the detected one or more table areas are extracted. The extracted tabular data is mapped to one or more target schema to store onto a relational database.
    Type: Grant
    Filed: September 26, 2019
    Date of Patent: January 4, 2022
    Assignee: INFOSYS LIMITED
    Inventors: Radha Krishna Pisipati, Jianlin Zhang, Shreyas Bettadapura Guruprasad, Uma Devi Ganugula, Krishnamurty Sai Deepak
  • Patent number: 11080563
    Abstract: A computer implemented a method and system for enrichment of OCR extracted data is disclosed comprising of accepting a set of extraction criteria and a set of configuration parameters by a data extraction engine. The data extraction engine captures data satisfying an extraction criteria using the configuration parameters and adapts the captured data using a set of domain specific rules and a set of OCR error patterns. A learning engine generates learning data models using the adapted data and the configuration parameters and the system dynamically updates the extraction criteria using the generated learning data models. The extraction criteria comprise one or more extraction templates wherein an extraction template includes one of a regular expression, geometric markers, anchor text markers and a combination thereof.
    Type: Grant
    Filed: June 19, 2019
    Date of Patent: August 3, 2021
    Assignee: INFOSYS LIMITED
    Inventors: Shreyas Bettadapura Guruprasad, Radha Krishna Pisipati
  • Publication number: 20200097451
    Abstract: A system and method of recognizing data in a table area from unstructured data includes a computer network, one or more processors communicatively coupled with the computer network, a storage location, and a graph-theoretic engine that receives an input stream of unstructured data associated. A table area is recognized from unstructured data, through one or more computer processors, from an input stream of unstructured data received over a computer network. One or more table headers associated with the detected one or more table areas are recognized. Further, one or more column delimiters associated with each column of the detected one or more table areas are determined. One or more tabular data associated with the detected one or more table areas are extracted. The extracted tabular data is mapped to one or more target schema to store onto a relational database.
    Type: Application
    Filed: September 26, 2019
    Publication date: March 26, 2020
    Applicant: Infosys Limited
    Inventors: Radha Krishna Pisipati, Jianlin Zhang, Shreyas Bettadapura Guruprasad, Uma Devi Ganugula, Krishnamurty Sai Deepak
  • Publication number: 20200005089
    Abstract: A computer implemented a method and system for enrichment of OCR extracted data is disclosed comprising of accepting a set of extraction criteria and a set of configuration parameters by a data extraction engine. The data extraction engine captures data satisfying an extraction criteria using the configuration parameters and adapts the captured data using a set of domain specific rules and a set of OCR error patterns. A learning engine generates learning data models using the adapted data and the configuration parameters and the system dynamically updates the extraction criteria using the generated learning data models. The extraction criteria comprise one or more extraction templates wherein an extraction template includes one of a regular expression, geometric markers, anchor text markers and a combination thereof.
    Type: Application
    Filed: June 19, 2019
    Publication date: January 2, 2020
    Applicant: Infosys Limited
    Inventors: Shreyas Bettadapura Guruprasad, Radha Krishna Pisipati
  • Patent number: 10049096
    Abstract: Methods and systems for template creation for a data extraction tool. A first template is selected from a plurality of documents provided by a user. An OCR engine annotates the first template and at least one data region in the first template corresponding to a set of parameters required in a target template is identified by selecting a geometrical region on the first template. At least one interim template is created based on the identification, and the plurality of documents are analyzed using the interim template to extract data values in the data region. The documents are converted to a format compliant with the target template based on the analysis.
    Type: Grant
    Filed: May 26, 2016
    Date of Patent: August 14, 2018
    Assignee: Infosys Limited
    Inventors: Krishnamurty Sai Deepak, Ganesh Kumar Nunnagoppula, Ann Matthew, Harikrishna G. N Rai, P. Radha Krishna, Rajesh Balakrishnan, Shreyas Bettadapura Guruprasad, Bintu G. Vasudevan
  • Publication number: 20160371246
    Abstract: Methods and systems for template creation for a data extraction tool. A first template is selected from a plurality of documents provided by a user. An OCR engine annotates the first template and at least one data region in the first template corresponding to a set of parameters required in a target template is identified by selecting a geometrical region on the first template. At least one interim template is created based on the identification, and the plurality of documents are analyzed using the interim template to extract data values in the data region. The documents are converted to a format compliant with the target template based on the analysis.
    Type: Application
    Filed: May 26, 2016
    Publication date: December 22, 2016
    Inventors: Krishnamurty Sai Deepak, Ganesh Kumar Nunnagoppula, Ann Matthew, Harikrishna G. N Rai, P. Radha Krishna, Rajesh Balakrishnan, Shreyas Bettadapura Guruprasad, Bintu G. Vasudevan