Patents by Inventor Vijay S. YELLAPRAGADA
Vijay S. YELLAPRAGADA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11140290Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, segmenting at least a first digital image of the plurality of images into pixels associated with a foreground of the first digital image and pixels associated with a background of the first digital image, detecting a plurality of contours in the segmented first digital image, deciding, for each detected contour of the plurality of contours, whether that contour is an open contour or a closed contour, and determining that one or more sides of the document is out of bounds based, at least in part, on the decisions.Type: GrantFiled: April 16, 2020Date of Patent: October 5, 2021Assignee: INTUIT, INC.Inventors: Vijay S. Yellapragada, Peijun Chiang, Daniel Lee, Jason Hall, Shailesh Soliwal
-
Publication number: 20200244831Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, segmenting at least a first digital image of the plurality of images into pixels associated with a foreground of the first digital image and pixels associated with a background of the first digital image, detecting a plurality of contours in the segmented first digital image, deciding, for each detected contour of the plurality of contours, whether that contour is an open contour or a closed contour, and determining that one or more sides of the document is out of bounds based, at least in part, on the decisions.Type: ApplicationFiled: April 16, 2020Publication date: July 30, 2020Inventors: Vijay S. YELLAPRAGADA, Daniel LEE, Jason HALL, Shailesh SOLIWAL
-
Patent number: 10659643Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, segmenting at least a first digital image of the plurality of images into pixels associated with a foreground of the first digital image and pixels associated with a background of the first digital image, detecting a plurality of contours in the segmented first digital image, deciding, for each detected contour of the plurality of contours, whether that contour is an open contour or a closed contour, and determining that one or more sides of the document is out-of-bounds based, at least in part, on the decisions.Type: GrantFiled: November 15, 2018Date of Patent: May 19, 2020Assignee: INTUIT, INC.Inventors: Vijay S. Yellapragada, Peijun Chiang, Daniel Lee, Jason Hall, Shailesh Soliwal
-
Patent number: 10558856Abstract: The present disclosure relates to optical character recognition using captured video. According to one embodiment, using a first image in stream of images depicting a document, the device extracts text data in a portion of the document depicted in the first image and determines a first confidence level regarding an accuracy of the extracted text data. If the first confidence level satisfies a threshold value, the device saves the extracted text data as recognized content of the source document. Otherwise, the device extracts the text data from the portion of the document as depicted in one or more second images in the stream and determines a second confidence level for the text data extracted from each second image until identifying one of the second images where the second confidence level associated with the text data extracted from the identified second image satisfies the threshold value.Type: GrantFiled: September 4, 2018Date of Patent: February 11, 2020Assignee: INTUIT INC.Inventors: Vijay S. Yellapragada, Peijun Chiang, Sreeneel K. Maddika
-
Patent number: 10339373Abstract: Techniques are disclosed for performing optical character recognition (OCR) by identifying a template based on a hash of a document. One embodiment includes a method for identifying a template associated with an image. The method includes receiving a digital image, a portion of the image depicting a first document, and extracting the portion of the image. The method further includes scaling the portion of the image and generating a first hash from the scaled image. The method further includes comparing the first hash to a set of hashes, each corresponding to a template. The method further includes selecting a first template as corresponding to the first document based on comparing the first hash to the set of hashes and extracting one or more sections of the portion of the image based on the selected first template. The method further includes performing OCR on the extracted one or more sections.Type: GrantFiled: August 24, 2018Date of Patent: July 2, 2019Assignee: INTUIT INC.Inventors: Vijay S. Yellapragada, Peijun Chiang, Sreeneel K. Maddika
-
Patent number: 10289905Abstract: Systems of the present disclosure generate accurate training data for optical character recognition (OCR). Systems disclosed herein generates images of a text passage as displayed piecemeal in a user interface (UI) element rendered in a selected font type and size, determine accurate dimensions and locations of bounding boxes for each character pictured in the images, stitch together a training image by concatenating the images, and associate the training image, the bounding box dimensions and locations, and the text passage together in a collection of training data. The collection of training data also includes a computer-readable master copy of the text passage with newline characters inserted therein.Type: GrantFiled: August 24, 2018Date of Patent: May 14, 2019Assignee: Intuit Inc.Inventors: Eugene Krivopaltsev, Sreeneel K. Maddika, Vijay S. Yellapragada
-
Patent number: 10282604Abstract: Systems of the present disclosure generate accurate training data for optical character recognition (OCR). Systems disclosed herein generates images of a text passage as displayed piecemeal in a user interface (UI) element rendered in a selected font type and size, determine accurate dimensions and locations of bounding boxes for each character pictured in the images, stitch together a training image by concatenating the images, and associate the training image, the bounding box dimensions and locations, and the text passage together in a collection of training data. The collection of training data also includes a computer-readable master copy of the text passage with newline characters inserted therein.Type: GrantFiled: August 23, 2018Date of Patent: May 7, 2019Assignee: Intuit, Inc.Inventors: Eugene Krivopaltsev, Sreeneel K. Maddika, Vijay S. Yellapragada
-
Publication number: 20190089856Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, segmenting at least a first digital image of the plurality of images into pixels associated with a foreground of the first digital image and pixels associated with a background of the first digital image, detecting a plurality of contours in the segmented first digital image, deciding, for each detected contour of the plurality of contours, whether that contour is an open contour or a closed contour, and determining that one or more sides of the document is out-of-bounds based, at least in part, on the decisions.Type: ApplicationFiled: November 15, 2018Publication date: March 21, 2019Inventors: Vijay S. YELLAPRAGADA, Peijun CHIANG, Daniel LEE, Jason HALL, Shailesh SOLIWAL
-
Publication number: 20180365487Abstract: Systems of the present disclosure generate accurate training data for optical character recognition (OCR). Systems disclosed herein generates images of a text passage as displayed piecemeal in a user interface (UI) element rendered in a selected font type and size, determine accurate dimensions and locations of bounding boxes for each character pictured in the images, stitch together a training image by concatenating the images, and associate the training image, the bounding box dimensions and locations, and the text passage together in a collection of training data. The collection of training data also includes a computer-readable master copy of the text passage with newline characters inserted therein.Type: ApplicationFiled: August 23, 2018Publication date: December 20, 2018Inventors: Eugene KRIVOPALTSEV, Sreeneel K. MADDIKA, Vijay S. YELLAPRAGADA
-
Publication number: 20180365488Abstract: Systems of the present disclosure generate accurate training data for optical character recognition (OCR). Systems disclosed herein generates images of a text passage as displayed piecemeal in a user interface (UI) element rendered in a selected font type and size, determine accurate dimensions and locations of bounding boxes for each character pictured in the images, stitch together a training image by concatenating the images, and associate the training image, the bounding box dimensions and locations, and the text passage together in a collection of training data. The collection of training data also includes a computer-readable master copy of the text passage with newline characters inserted therein.Type: ApplicationFiled: August 24, 2018Publication date: December 20, 2018Inventors: Eugene KRIVOPALTSEV, Sreeneel K. MADDIKA, Vijay S. YELLAPRAGADA
-
Patent number: 10108879Abstract: The present disclosure includes techniques for selecting a candidate presentation style for individual documents for inclusion in an aggregate training data set for a document type that may be used to train an OCR processing engine prior to identifying text in an image of a document of the document type. In one embodiment, text input corresponding to a text sample in a document is received, and an image of the text sample in the document is received. For each of a plurality of candidate presentation styles, an OCR processing engine is trained using a training data set corresponding to the given candidate presentation style, and the OCR processing engine is used, as trained, to identify text in the received image. The OCR processing results for each candidate presentation style are compared to the received text input. A candidate presentation style for the document is selected based on the comparisons.Type: GrantFiled: September 21, 2016Date of Patent: October 23, 2018Assignee: Intuit inc.Inventors: Eugene Krivopaltsev, Sreeneel K. Maddika, Vijay S. Yellapragada
-
Patent number: 10095920Abstract: Techniques are disclosed for performing optical character recognition (OCR) by identifying a template based on a hash of a document. One embodiment includes a method for identifying a template associated with an image. The method includes receiving a digital image, a portion of the image depicting a first document, and extracting the portion of the image. The method further includes scaling the portion of the image and generating a first hash from the scaled image. The method further includes comparing the first hash to a set of hashes, each corresponding to a template. The method further includes selecting a first template as corresponding to the first document based on comparing the first hash to the set of hashes and extracting one or more sections of the portion of the image based on the selected first template. The method further includes performing OCR on the extracted one or more sections.Type: GrantFiled: July 28, 2016Date of Patent: October 9, 2018Assignee: INTUIT INCInventors: Vijay S. Yellapragada, Peijun Chiang, Sreeneel Maddika
-
Patent number: 10089523Abstract: Systems of the present disclosure generate accurate training data for optical character recognition (OCR). Systems disclosed herein generates images of a text passage as displayed piecemeal in a user interface (UI) element rendered in a selected font type and size, determine accurate dimensions and locations of bounding boxes for each character pictured in the images, stitch together a training image by concatenating the images, and associate the training image, the bounding box dimensions and locations, and the text passage together in a collection of training data. The collection of training data also includes a computer-readable master copy of the text passage with newline characters inserted therein.Type: GrantFiled: October 5, 2016Date of Patent: October 2, 2018Assignee: INTUIT INC.Inventors: Eugene Krivopaltsev, Sreeneel K. Maddika, Vijay S. Yellapragada
-
Publication number: 20180096200Abstract: Systems of the present disclosure generate accurate training data for optical character recognition (OCR). Systems disclosed herein generates images of a text passage as displayed piecemeal in a user interface (UI) element rendered in a selected font type and size, determine accurate dimensions and locations of bounding boxes for each character pictured in the images, stitch together a training image by concatenating the images, and associate the training image, the bounding box dimensions and locations, and the text passage together in a collection of training data. The collection of training data also includes a computer-readable master copy of the text passage with newline characters inserted therein.Type: ApplicationFiled: October 5, 2016Publication date: April 5, 2018Inventors: Eugene KRIVOPALTSEV, Sreeneel K. MADDIKA, Vijay S. YELLAPRAGADA
-
Publication number: 20180082146Abstract: The present disclosure includes techniques for selecting a candidate presentation style for individual documents for inclusion in an aggregate training data set for a document type that may be used to train an OCR processing engine prior to identifying text in an image of a document of the document type. In one embodiment, text input corresponding to a text sample in a document is received, and an image of the text sample in the document is received. For each of a plurality of candidate presentation styles, an OCR processing engine is trained using a training data set corresponding to the given candidate presentation style, and the OCR processing engine is used, as trained, to identify text in the received image. The OCR processing results for each candidate presentation style are compared to the received text input. A candidate presentation style for the document is selected based on the comparisons.Type: ApplicationFiled: September 21, 2016Publication date: March 22, 2018Inventors: Eugene KRIVOPALTSEV, Sreeneel K. MADDIKA, Vijay S. YELLAPRAGADA
-
Publication number: 20180032804Abstract: Techniques are disclosed for performing optical character recognition (OCR) by identifying a template based on a hash of a document. One embodiment includes a method for identifying a template associated with an image. The method includes receiving a digital image, a portion of the image depicting a first document, and extracting the portion of the image. The method further includes scaling the portion of the image and generating a first hash from the scaled image. The method further includes comparing the first hash to a set of hashes, each corresponding to a template. The method further includes selecting a first template as corresponding to the first document based on comparing the first hash to the set of hashes and extracting one or more sections of the portion of the image based on the selected first template. The method further includes performing OCR on the extracted one or more sections.Type: ApplicationFiled: July 28, 2016Publication date: February 1, 2018Inventors: Vijay S. YELLAPRAGADA, Peijun CHIANG, Sreeneel MADDIKA