Patents by Inventor Peijun Chiang

Peijun Chiang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Out-of-bounds detection for a document in a live camera feed

Patent number: 11140290

Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, segmenting at least a first digital image of the plurality of images into pixels associated with a foreground of the first digital image and pixels associated with a background of the first digital image, detecting a plurality of contours in the segmented first digital image, deciding, for each detected contour of the plurality of contours, whether that contour is an open contour or a closed contour, and determining that one or more sides of the document is out of bounds based, at least in part, on the decisions.

Type: Grant

Filed: April 16, 2020

Date of Patent: October 5, 2021

Assignee: INTUIT, INC.

Inventors: Vijay S. Yellapragada, Peijun Chiang, Daniel Lee, Jason Hall, Shailesh Soliwal
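
The contour test described in the abstract above (patent 11140290) can be illustrated with a small sketch. It assumes, purely for illustration, that a contour counts as "open" when it touches the edge of the frame, and that the sides it touches are the sides reported as out of bounds. The abstract does not state this rule; the function names, the point-list representation of a contour, and the border test are all hypothetical.

```python
from typing import List, Set, Tuple

Point = Tuple[int, int]  # (x, y) pixel coordinate


def touched_sides(contour: List[Point], width: int, height: int) -> Set[str]:
    """Return the image sides that at least one contour point lies on."""
    sides: Set[str] = set()
    for x, y in contour:
        if x <= 0:
            sides.add("left")
        if x >= width - 1:
            sides.add("right")
        if y <= 0:
            sides.add("top")
        if y >= height - 1:
            sides.add("bottom")
    return sides


def is_open(contour: List[Point], width: int, height: int) -> bool:
    # Assumed rule (not from the abstract): a contour cut off by the
    # frame edge is treated as open, any other contour as closed.
    return bool(touched_sides(contour, width, height))


def out_of_bounds_sides(contours: List[List[Point]], width: int, height: int) -> Set[str]:
    """Collect the sides touched by every open contour."""
    result: Set[str] = set()
    for contour in contours:
        if is_open(contour, width, height):
            result |= touched_sides(contour, width, height)
    return result


# A document fully inside a 100x80 frame, and one cut off at the top edge.
inside = [(10, 10), (90, 10), (90, 60), (10, 60)]
cut_off = [(20, 0), (20, 40), (70, 40), (70, 0)]
print(out_of_bounds_sides([inside], 100, 80))
print(out_of_bounds_sides([cut_off], 100, 80))
```

In a real pipeline the contours would come from an image-processing library after foreground/background segmentation; here they are hand-written point lists so the sketch stays self-contained.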

Out-of-bounds detection of a document in a live camera feed

Patent number: 10659643

Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, segmenting at least a first digital image of the plurality of images into pixels associated with a foreground of the first digital image and pixels associated with a background of the first digital image, detecting a plurality of contours in the segmented first digital image, deciding, for each detected contour of the plurality of contours, whether that contour is an open contour or a closed contour, and determining that one or more sides of the document is out-of-bounds based, at least in part, on the decisions.

Type: Grant

Filed: November 15, 2018

Date of Patent: May 19, 2020

Assignee: INTUIT, INC.

Inventors: Vijay S. Yellapragada, Peijun Chiang, Daniel Lee, Jason Hall, Shailesh Soliwal

Optical character recognition (OCR) accuracy by combining results across video frames

Patent number: 10558856

Abstract: The present disclosure relates to optical character recognition using captured video. According to one embodiment, using a first image in a stream of images depicting a document, the device extracts text data in a portion of the document depicted in the first image and determines a first confidence level regarding an accuracy of the extracted text data. If the first confidence level satisfies a threshold value, the device saves the extracted text data as recognized content of the source document. Otherwise, the device extracts the text data from the portion of the document as depicted in one or more second images in the stream and determines a second confidence level for the text data extracted from each second image until identifying one of the second images where the second confidence level associated with the text data extracted from the identified second image satisfies the threshold value.

Type: Grant

Filed: September 4, 2018

Date of Patent: February 11, 2020

Assignee: INTUIT INC.

Inventors: Vijay S. Yellapragada, Peijun Chiang, Sreeneel K. Maddika
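
The frame-by-frame fallback in the abstract above (patent 10558856) amounts to a loop that stops at the first frame whose OCR confidence meets the threshold. The sketch below is a minimal illustration under stated assumptions: the `ocr` callable, its `(text, confidence)` return shape, and the 0.9 default threshold are hypothetical and do not come from the patent.

```python
from typing import Callable, Iterable, Optional, Tuple

# Hypothetical OCR callable: takes a frame and returns (text, confidence in [0, 1]).
OcrFn = Callable[[object], Tuple[str, float]]


def recognize_from_stream(
    frames: Iterable[object], ocr: OcrFn, threshold: float = 0.9
) -> Optional[str]:
    """Return text from the first frame whose confidence meets the threshold."""
    for frame in frames:
        text, confidence = ocr(frame)
        if confidence >= threshold:
            return text  # keep this as the recognized content
    return None  # no frame in the stream was good enough


def fake_ocr(frame):
    # Stand-in for a real OCR engine: each "frame" is already a
    # (text, confidence) pair, so it is returned unchanged.
    return frame


frames = [("T0TAL 12.OO", 0.55), ("TOTAL 12.00", 0.95), ("TOTAL 12.00", 0.99)]
result = recognize_from_stream(frames, fake_ocr)
print(result)
```

Because the loop returns early, later frames are never processed once a sufficiently confident result has been found.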

Detecting font size in a digital image

Patent number: 10354161

Abstract: The present disclosure relates to optical character recognition, and more specifically techniques for detecting font size in a digital image. According to one embodiment, a client device receives a digital image of a document having one or more textual components. The client device finds one or more contours bounding the one or more textual components in the digital image of the document. The client device detects a font size for text contained in the digital image using the one or more contours. The client device extracts the text from the digital image upon detecting that the detected font size is above a defined threshold value.

Type: Grant

Filed: June 5, 2017

Date of Patent: July 16, 2019

Assignee: INTUIT, INC.

Inventors: Peijun Chiang, Vijay Yellapragada
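
One way to picture the font-size check in the abstract above (patent 10354161) is to take the bounding boxes of the text contours and use their typical height as a font-size estimate. This is a rough sketch, not the patented technique: using the median height as the estimate, the 12-pixel threshold, and the names `estimate_font_height` and `should_extract_text` are assumptions made for illustration.

```python
from statistics import median
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) of a contour's bounding box


def estimate_font_height(boxes: List[Box]) -> float:
    """Estimate font size as the median bounding-box height, in pixels."""
    if not boxes:
        return 0.0
    return float(median(h for _, _, _, h in boxes))


def should_extract_text(boxes: List[Box], min_height_px: float = 12.0) -> bool:
    """Extract text only when the estimated font size is above the threshold."""
    return estimate_font_height(boxes) > min_height_px


large_text = [(0, 0, 8, 20), (10, 0, 8, 22), (20, 0, 8, 18)]
small_text = [(0, 0, 4, 6), (5, 0, 4, 8)]
print(should_extract_text(large_text), should_extract_text(small_text))
```

The median is used here so that a few outlier contours, such as a logo or a ruled line, do not dominate the estimate.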

Optical character recognition utilizing hashed templates

Patent number: 10339373

Abstract: Techniques are disclosed for performing optical character recognition (OCR) by identifying a template based on a hash of a document. One embodiment includes a method for identifying a template associated with an image. The method includes receiving a digital image, a portion of the image depicting a first document, and extracting the portion of the image. The method further includes scaling the portion of the image and generating a first hash from the scaled image. The method further includes comparing the first hash to a set of hashes, each corresponding to a template. The method further includes selecting a first template as corresponding to the first document based on comparing the first hash to the set of hashes and extracting one or more sections of the portion of the image based on the selected first template. The method further includes performing OCR on the extracted one or more sections.

Type: Grant

Filed: August 24, 2018

Date of Patent: July 2, 2019

Assignee: INTUIT INC.

Inventors: Vijay S. Yellapragada, Peijun Chiang, Sreeneel K. Maddika
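
The scale, hash, and compare steps in the abstract above (patent 10339373) can be sketched with a simple perceptual hash. The abstract does not say which hash function is used, so the average hash and the Hamming-distance comparison below are swapped-in assumptions, as are all function names and the 8x8 hash size.

```python
from typing import Dict, List

Image = List[List[int]]  # grayscale pixel rows, values 0..255


def scale(image: Image, size: int = 8) -> Image:
    """Nearest-neighbour resize to size x size."""
    h, w = len(image), len(image[0])
    return [[image[r * h // size][c * w // size] for c in range(size)]
            for r in range(size)]


def average_hash(image: Image, size: int = 8) -> int:
    """One bit per scaled pixel: 1 if brighter than the mean, else 0."""
    pixels = [p for row in scale(image, size) for p in row]
    mean = sum(pixels) / len(pixels)
    bits = 0
    for p in pixels:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two hashes."""
    return bin(a ^ b).count("1")


def select_template(image_hash: int, template_hashes: Dict[str, int]) -> str:
    """Pick the template whose hash is closest to the image hash."""
    return min(template_hashes,
               key=lambda name: hamming(image_hash, template_hashes[name]))


# Two toy 16x16 "templates": dark top half versus dark left half.
top = [[0 if r < 8 else 255 for _ in range(16)] for r in range(16)]
left = [[0 if c < 8 else 255 for c in range(16)] for _ in range(16)]
templates = {"top": average_hash(top), "left": average_hash(left)}

# A larger, slightly greyer photo of the "top" layout.
photo = [[10 if r < 16 else 240 for _ in range(32)] for r in range(32)]
print(select_template(average_hash(photo), templates))
```

Once a template is selected, its known field positions would tell the OCR step which sections of the image to read; that part is omitted here.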

Detecting long documents in a live camera feed

Patent number: 10257375

Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, determining a type of the document, loading one or more pre-defined metrics associated with the document based on the determined type of the document, determining one or more characteristics of the document based on one or more analyses performed on the plurality of digital images of the document, comparing the one or more characteristics of the document with the one or more pre-defined metrics, and determining the document to be a long document based, at least in part, on the comparison.

Type: Grant

Filed: June 14, 2017

Date of Patent: April 9, 2019

Assignee: INTUIT, INC.

Inventors: Vijay Yellapragada, Peijun Chiang, Daniel Lee, Jason Hall, Shailesh Soliwal

OUT-OF-BOUNDS DETECTION OF A DOCUMENT IN A LIVE CAMERA FEED

Publication number: 20190089856

Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, segmenting at least a first digital image of the plurality of images into pixels associated with a foreground of the first digital image and pixels associated with a background of the first digital image, detecting a plurality of contours in the segmented first digital image, deciding, for each detected contour of the plurality of contours, whether that contour is an open contour or a closed contour, and determining that one or more sides of the document is out-of-bounds based, at least in part, on the decisions.

Type: Application

Filed: November 15, 2018

Publication date: March 21, 2019

Inventors: Vijay S. YELLAPRAGADA, Peijun CHIANG, Daniel LEE, Jason HALL, Shailesh SOLIWAL

Identification of duplicate copies of a form in a document

Patent number: 10229315

Abstract: Aspects of the present disclosure provide methods and apparatuses for detecting duplicate copies of a form in an image of a document. An exemplary method generally includes obtaining a first digital image of a document, performing one or more transformations on the first digital image, determining one or more rectangles in the transformed first digital image, identifying at least a first duplicate copy of the form being depicted in the first digital image based, at least in part, on the detected one or more rectangles, and generating, based on the identified duplicate copy of the form, a notification that the first digital image includes at least the first duplicate copy of the form.

Type: Grant

Filed: July 27, 2016

Date of Patent: March 12, 2019

Assignee: INTUIT, INC.

Inventors: Vijay Yellapragada, Peijun Chiang, Sreeneel K. Maddika

Optical character recognition (OCR) accuracy by combining results across video frames

Patent number: 10210384

Abstract: The present disclosure relates to optical character recognition using captured video. According to one embodiment, using a first image in a stream of images depicting a document, the device extracts text data in a portion of the document depicted in the first image and determines a first confidence level regarding an accuracy of the extracted text data. If the first confidence level satisfies a threshold value, the device saves the extracted text data as recognized content of the source document. Otherwise, the device extracts the text data from the portion of the document as depicted in one or more second images in the stream and determines a second confidence level for the text data extracted from each second image until identifying one of the second images where the second confidence level associated with the text data extracted from the identified second image satisfies the threshold value.

Type: Grant

Filed: July 25, 2016

Date of Patent: February 19, 2019

Assignee: INTUIT INC.

Inventors: Vijay Yellapragada, Peijun Chiang, Sreeneel K. Maddika

Out-of-bounds detection of a document in a live camera feed

Patent number: 10171695

Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, segmenting at least a first digital image of the plurality of images into pixels associated with a foreground of the first digital image and pixels associated with a background of the first digital image, detecting a plurality of contours in the segmented first digital image, deciding, for each detected contour of the plurality of contours, whether that contour is an open contour or a closed contour, and determining that one or more sides of the document is out-of-bounds based, at least in part, on the decisions.

Type: Grant

Filed: June 14, 2017

Date of Patent: January 1, 2019

Assignee: Intuit Inc.

Inventors: Vijay Yellapragada, Peijun Chiang, Daniel Lee, Jason Hall, Shailesh Soliwal

Detecting orientation of textual documents on a live camera feed

Patent number: 10163007

Abstract: The present disclosure relates to the extraction of text from an image including a depiction of a document. According to one embodiment, a mobile device receives an image depicting a document. The mobile device identifies a plurality of text areas in the document and identifies a midpoint of each of the plurality of text areas in the document. The mobile device detects one or more lines of text in the document including a plurality of text areas, where the plurality of text areas included in a line of text are associated with a midpoint having a coordinate within a threshold number of pixels on one axis in a two-dimensional space. Based on an orientation of the detected one or more lines of text, the mobile device determines a probable orientation of the document and extracts text from the image based on the determined probable orientation of the document.

Type: Grant

Filed: April 27, 2017

Date of Patent: December 25, 2018

Assignee: INTUIT INC.

Inventors: Daniel Lee, Vijay Yellapragada, Shailesh Soliwal, Peijun Chiang
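
The midpoint grouping in the abstract above (patent 10163007) can be illustrated as follows: text areas whose midpoints share nearly the same coordinate on one axis are treated as a line of text, and the axis with more aligned midpoints suggests the orientation. This is a sketch under assumptions; the grouping rule, the 5-pixel threshold, the two orientation labels, and every function name are hypothetical.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x, y, width, height) of a text area


def midpoint(box: Box) -> Tuple[float, float]:
    x, y, w, h = box
    return (x + w / 2, y + h / 2)


def group_by_axis(values: List[float], threshold: float) -> List[List[float]]:
    """Group sorted coordinates; a value joins the current group when it is
    within `threshold` pixels of that group's first value."""
    groups: List[List[float]] = []
    for v in sorted(values):
        if groups and v - groups[-1][0] <= threshold:
            groups[-1].append(v)
        else:
            groups.append([v])
    return groups


def aligned_count(values: List[float], threshold: float) -> int:
    """Number of midpoints that share a line with at least one other midpoint."""
    return sum(len(g) for g in group_by_axis(values, threshold) if len(g) > 1)


def probable_orientation(boxes: List[Box], threshold: float = 5.0) -> str:
    mids = [midpoint(b) for b in boxes]
    # Similar y values suggest horizontal lines of text; similar x values
    # suggest lines running vertically (a document turned on its side).
    horizontal = aligned_count([y for _, y in mids], threshold)
    vertical = aligned_count([x for x, _ in mids], threshold)
    return "upright" if horizontal >= vertical else "rotated"


# Five word boxes laid out as two horizontal lines of text.
words = [(0, 0, 40, 10), (50, 1, 30, 10), (90, 0, 20, 10),
         (10, 20, 40, 10), (70, 21, 36, 10)]
# The same layout turned 90 degrees (x/y and width/height swapped).
turned = [(y, x, h, w) for x, y, w, h in words]
print(probable_orientation(words), probable_orientation(turned))
```

A fuller version would also have to distinguish upside-down text from upright text, which midpoint alignment alone cannot do.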

DETECTING LONG DOCUMENTS IN A LIVE CAMERA FEED

Publication number: 20180367688

Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, determining a type of the document, loading one or more pre-defined metrics associated with the document based on the determined type of the document, determining one or more characteristics of the document based on one or more analyses performed on the plurality of digital images of the document, comparing the one or more characteristics of the document with the one or more pre-defined metrics, and determining the document to be a long document based, at least in part, on the comparison.

Type: Application

Filed: June 14, 2017

Publication date: December 20, 2018

Inventors: Vijay YELLAPRAGADA, Peijun CHIANG, Daniel LEE, Jason HALL, Shailesh SOLIWAL

OUT-OF-BOUNDS DETECTION OF A DOCUMENT IN A LIVE CAMERA FEED

Publication number: 20180367689

Abstract: Aspects of the present disclosure provide methods and apparatuses for processing a digital image of a document, for example, to determine whether the document is a long document. An exemplary method generally includes obtaining a plurality of digital images of the document, segmenting at least a first digital image of the plurality of images into pixels associated with a foreground of the first digital image and pixels associated with a background of the first digital image, detecting a plurality of contours in the segmented first digital image, deciding, for each detected contour of the plurality of contours, whether that contour is an open contour or a closed contour, and determining that one or more sides of the document is out-of-bounds based, at least in part, on the decisions.

Type: Application

Filed: June 14, 2017

Publication date: December 20, 2018

Inventors: Vijay YELLAPRAGADA, Peijun CHIANG, Daniel LEE, Jason HALL, Shailesh SOLIWAL

DETECTING FONT SIZE IN A DIGITAL IMAGE

Publication number: 20180349722

Abstract: The present disclosure relates to optical character recognition, and more specifically techniques for detecting font size in a digital image. According to one embodiment, a client device receives a digital image of a document having one or more textual components. The client device finds one or more contours bounding the one or more textual components in the digital image of the document. The client device detects a font size for text contained in the digital image using the one or more contours. The client device extracts the text from the digital image upon detecting that the detected font size is above a defined threshold value.

Type: Application

Filed: June 5, 2017

Publication date: December 6, 2018

Inventors: Peijun CHIANG, Vijay YELLAPRAGADA

DETECTING ORIENTATION OF TEXTUAL DOCUMENTS ON A LIVE CAMERA FEED

Publication number: 20180314884

Abstract: The present disclosure relates to the extraction of text from an image including a depiction of a document. According to one embodiment, a mobile device receives an image depicting a document. The mobile device identifies a plurality of text areas in the document and identifies a midpoint of each of the plurality of text areas in the document. The mobile device detects one or more lines of text in the document including a plurality of text areas, where the plurality of text areas included in a line of text are associated with a midpoint having a coordinate within a threshold number of pixels on one axis in a two-dimensional space. Based on an orientation of the detected one or more lines of text, the mobile device determines a probable orientation of the document and extracts text from the image based on the determined probable orientation of the document.

Type: Application

Filed: April 27, 2017

Publication date: November 1, 2018

Inventors: Daniel LEE, Vijay YELLAPRAGADA, Shailesh SOLIWAL, Peijun CHIANG

Optical character recognition utilizing hashed templates

Patent number: 10095920

Abstract: Techniques are disclosed for performing optical character recognition (OCR) by identifying a template based on a hash of a document. One embodiment includes a method for identifying a template associated with an image. The method includes receiving a digital image, a portion of the image depicting a first document, and extracting the portion of the image. The method further includes scaling the portion of the image and generating a first hash from the scaled image. The method further includes comparing the first hash to a set of hashes, each corresponding to a template. The method further includes selecting a first template as corresponding to the first document based on comparing the first hash to the set of hashes and extracting one or more sections of the portion of the image based on the selected first template. The method further includes performing OCR on the extracted one or more sections.

Type: Grant

Filed: July 28, 2016

Date of Patent: October 9, 2018

Assignee: INTUIT INC.

Inventors: Vijay S. Yellapragada, Peijun Chiang, Sreeneel Maddika

Performing optical character recognition using spatial information of regions within a structured document

Patent number: 10013643

Abstract: Techniques are disclosed for facilitating optical character recognition (OCR) by identifying one or more regions in an electronic document to perform the OCR. For example, a method for identifying information in an electronic document includes obtaining a set of training documents for each template of a plurality of templates for the electronic document, extracting spatial attributes for at least a first label region and at least a first corresponding value region from the set, and training a classifier model based on the extracted spatial attributes, wherein the classifier model is used to identify the information in the electronic document. The spatial attributes represent a position of at least the first label region and at least the first value region within the electronic document.

Type: Grant

Filed: July 26, 2016

Date of Patent: July 3, 2018

Assignee: INTUIT INC.

Inventors: Vijay Yellapragada, Peijun Chiang, Sreeneel K. Maddika

IDENTIFICATION OF DUPLICATE COPIES OF A FORM IN A DOCUMENT

Publication number: 20180032811

Abstract: Aspects of the present disclosure provide methods and apparatuses for detecting duplicate copies of a form in an image of a document. An exemplary method generally includes obtaining a first digital image of a document, performing one or more transformations on the first digital image, determining one or more rectangles in the transformed first digital image, identifying at least a first duplicate copy of the form being depicted in the first digital image based, at least in part, on the detected one or more rectangles, and generating, based on the identified duplicate copy of the form, a notification that the first digital image includes at least the first duplicate copy of the form.

Type: Application

Filed: July 27, 2016

Publication date: February 1, 2018

Applicant: INTUIT INC.

Inventors: Vijay YELLAPRAGADA, Peijun CHIANG, Sreeneel K. MADDIKA

PERFORMING OPTICAL CHARACTER RECOGNITION USING SPATIAL INFORMATION OF REGIONS WITHIN A STRUCTURED DOCUMENT

Publication number: 20180032842

Abstract: Techniques are disclosed for facilitating optical character recognition (OCR) by identifying one or more regions in an electronic document to perform the OCR. For example, a method for identifying information in an electronic document includes obtaining a set of training documents for each template of a plurality of templates for the electronic document, extracting spatial attributes for at least a first label region and at least a first corresponding value region from the set, and training a classifier model based on the extracted spatial attributes, wherein the classifier model is used to identify the information in the electronic document. The spatial attributes represent a position of at least the first label region and at least the first value region within the electronic document.

Type: Application

Filed: July 26, 2016

Publication date: February 1, 2018

Inventors: Vijay YELLAPRAGADA, Peijun CHIANG, Sreeneel K. MADDIKA

OPTICAL CHARACTER RECOGNITION UTILIZING HASHED TEMPLATES

Publication number: 20180032804

Abstract: Techniques are disclosed for performing optical character recognition (OCR) by identifying a template based on a hash of a document. One embodiment includes a method for identifying a template associated with an image. The method includes receiving a digital image, a portion of the image depicting a first document, and extracting the portion of the image. The method further includes scaling the portion of the image and generating a first hash from the scaled image. The method further includes comparing the first hash to a set of hashes, each corresponding to a template. The method further includes selecting a first template as corresponding to the first document based on comparing the first hash to the set of hashes and extracting one or more sections of the portion of the image based on the selected first template. The method further includes performing OCR on the extracted one or more sections.

Type: Application

Filed: July 28, 2016

Publication date: February 1, 2018

Inventors: Vijay S. YELLAPRAGADA, Peijun CHIANG, Sreeneel MADDIKA
