Patents by Inventor Douglas Ryan Gray

Douglas Ryan Gray has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Collaborative text detection and recognition

Patent number: 9436883

Abstract: Various embodiments provide methods and systems for identifying text in an image by applying suitable text detection parameters in text detection. The suitable text detection parameters can be determined based on parameter metric feedback from one or more text identification subtasks, such as text detection, text recognition, preprocessing, character set mapping, pattern matching and validation. In some embodiments, the image can be defined into one or more image regions by performing glyph detection on the image. Text detection parameters applying to each of the one or more image regions can be adjusted based on measured one or more parameter metrics in the respective image region.

Type: Grant

Filed: August 3, 2015

Date of Patent: September 6, 2016

Inventors: Xiaofan Lin, Adam Wiggen Kraft, Yu Lou, Douglas Ryan Gray, Colin Jon Taylor
Image-based character recognition

Patent number: 9390340

Abstract: Various embodiments enable a device to perform tasks such as processing an image to recognize and locate text in the image, and providing the recognized text an application executing on the device for performing a function (e.g., calling a number, opening an internet browser, etc.) associated with the recognized text. In at least one embodiment, processing the image includes substantially simultaneously or concurrently processing the image with at least two recognition engines, such as at least two optical character recognition (OCR) engines, running in a multithreaded mode. In at least one embodiment, the recognition engines can be tuned so that their respective processing speeds are roughly the same. Utilizing multiple recognition engines enables processing latency to be close to that of using only one recognition engine.

Type: Grant

Filed: May 26, 2015

Date of Patent: July 12, 2016

Assignee: A9.com

Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Yu Lou
VIDEO CONTENT ALIGNMENT

Publication number: 20160133299

Abstract: Various embodiments identify differences between frame sequences of a video. For example, to determine a difference between two versions of a video, a fingerprint of each frame of the two versions is generated. From the fingerprints, a run-length encoded representation of each version is generated. The fingerprints which appear only once (i.e., unique fingerprints) in the entire video are identified from each version and compared to identify matching unique fingerprints across versions. The matching unique fingerprints are sorted and filtered to determine split points, which are used to align the two versions of the video. Accordingly, each version is segmented into smaller frame sequences using the split points. Once segmented, the individual frames of each segment are aligned across versions using a dynamic programming algorithm. After aligning the segments at a frame level, the segments are reassembled to generate a global alignment output.

Type: Application

Filed: January 15, 2016

Publication date: May 12, 2016

Inventors: Ismet Zeki Yalniz, Adam Carlson, Douglas Ryan Gray, Colin Jon Taylor
SELF-PORTRAIT ENHANCEMENT TECHNIQUES

Publication number: 20160110848

Abstract: Systems and approaches are provided for optimizing self-portraiture. The background of the self-portrait can be enhanced by image registration or stitching techniques of images captured using one or more conventional cameras. Multiple standard resolution images can be stitched together to generate a panoramic or a composite image of a higher resolution. Foreground elements, such as one or more representations of users, can also be enhanced in various ways. The representations of the users can be composited to exclude undesirable elements, such as image data of one of the users extending her arm to capture the self-portrait. An ideal pose of the users can automatically be selected and other image enhancements, such as histogram optimization, brightness and contrast optimization, color-cast correction, or reduction or removal of noise, can automatically be performed to minimize user effort in capturing self-portraits.

Type: Application

Filed: December 30, 2015

Publication date: April 21, 2016

Inventors: Douglas Ryan Gray, Colin Jon Taylor, Xiaofan Lin
TEXT ENTITY RECOGNITION

Publication number: 20160098611

Abstract: Various embodiments enable the identification of semi-structured text entities in an imager. The identification of the text entities is a relatively simple problem when the text is stored in a computer and free of errors, but much more challenging if the source is the output of an optical character recognition (OCR) engine from a natural scene image. Accordingly, output from an OCR engine is analyzed to isolate a character string indicative of a text entity. Each character of the string is then assigned to a character class to produce a character class string and the text entity of the string is identified based in part on a pattern of the character class string.

Type: Application

Filed: December 16, 2015

Publication date: April 7, 2016

Inventors: DOUGLAS RYAN GRAY, XIAOFAN LIN, ARNAB SANAT KUMAR DHUA, YU LOU
Automated recognition of text utilizing multiple images

Patent number: 9292739

Abstract: Various embodiments enable text aggregation from multiple image frames of text. Accordingly, in order to stitch newly scanned areas of a document together, text in a respective image is recognized and analyzed using an algorithm to identify pairs of corresponding words in other images. Upon identifying a minimum number of matching pairs between two respective images, a mapping between the same can be determined based at least in part on a geometric correspondence between respective identified pairs. Based on this mapping, the recognized text of the two images can be merged by adding words of one image to the other using the matching word pairs as alignment data points.

Type: Grant

Filed: December 12, 2013

Date of Patent: March 22, 2016

Assignee: A9.com, Inc.

Inventors: Douglas Ryan Gray, Colin Jay Taylor, Xiaofan Lin, Adam Wiggen Kraft, Yu Lou, Arnab Sanat Kumar Dhua
Video content alignment

Patent number: 9275682

Abstract: Various embodiments identify differences between frame sequences of a video. For example, to determine a difference between two versions of a video, a fingerprint of each frame of the two versions is generated. From the fingerprints, a run-length encoded representation of each version is generated. The fingerprints which appear only once (i.e., unique fingerprints) in the entire video are identified from each version and compared to identify matching unique fingerprints across versions. The matching unique fingerprints are sorted and filtered to determine split points, which are used to align the two versions of the video. Accordingly, each version is segmented into smaller frame sequences using the split points. Once segmented, the individual frames of each segment are aligned across versions using a dynamic programming algorithm. After aligning the segments at a frame level, the segments are reassembled to generate a global alignment output.

Type: Grant

Filed: September 26, 2014

Date of Patent: March 1, 2016

Assignee: A9.com, Inc.

Inventors: Ismet Zeki Yalniz, Adam Carlson, Douglas Ryan Gray, Colin Jon Taylor
Text entity recognition

Patent number: 9256795

Abstract: Various embodiments enable the identification of semi-structured text entities in an imager. The identification of the text entities is a relatively simple problem when the text is stored in a computer and free of errors, but much more challenging if the source is the output of an optical character recognition (OCR) engine from a natural scene image. Accordingly, output from an OCR engine is analyzed to isolate a character string indicative of a text entity. Each character of the string is then assigned to a character class to produce a character class string and the text entity of the string is identified based in part on a pattern of the character class string.

Type: Grant

Filed: March 15, 2013

Date of Patent: February 9, 2016

Assignee: A9.com, Inc.

Inventors: Douglas Ryan Gray, Xiaofan Lin, Arnab Sanat Kumar Dhua, Yu Lou
Self-portrait enhancement techniques

Patent number: 9247129

Abstract: Systems and approaches are provided for optimizing self-portraiture. The background of the self-portrait can be enhanced by image registration or stitching techniques of images captured using one or more conventional cameras. Multiple standard resolution images can be stitched together to generate a panoramic or a composite image of a higher resolution. Foreground elements, such as one or more representations of users, can also be enhanced in various ways. The representations of the users can be composited to exclude undesirable elements, such as image data of one of the users extending her arm to capture the self-portrait. An ideal pose of the users can automatically be selected and other image enhancements, such as histogram optimization, brightness and contrast optimization, color-cast correction, or reduction or removal of noise, can automatically be performed to minimize user effort in capturing self-portraits.

Type: Grant

Filed: August 30, 2013

Date of Patent: January 26, 2016

Assignee: A9.com, Inc.

Inventors: Douglas Ryan Gray, Colin Jon Taylor, Xiaofan Lin
AUGMENTED REALITY RECOMMENDATIONS

Publication number: 20160019618

Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.

Type: Application

Filed: September 23, 2015

Publication date: January 21, 2016

Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Atul Kumar, Yu Lou
COLLABORATIVE TEXT DETECTION AND RECOGNITION

Publication number: 20150339536

Abstract: Various embodiments provide methods and systems for identifying text in an image by applying suitable text detection parameters in text detection. The suitable text detection parameters can be determined based on parameter metric feedback from one or more text identification subtasks, such as text detection, text recognition, preprocessing, character set mapping, pattern matching and validation. In some embodiments, the image can be defined into one or more image regions by performing glyph detection on the image. Text detection parameters applying to each of the one or more image regions can be adjusted based on measured one or more parameter metrics in the respective image region.

Type: Application

Filed: August 3, 2015

Publication date: November 26, 2015

Inventors: Xiaofan LIN, Adam Wiggen KRAFT, Yu LOU, Douglas Ryan GRAY, Colin Jon TAYLOR
Augmented reality recommendations

Patent number: 9183583

Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.

Type: Grant

Filed: May 13, 2013

Date of Patent: November 10, 2015

Assignee: A9.com, Inc.

Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Atul Kumar, Yu Lou
OBJECT TRACKING IN ZOOMED VIDEO

Publication number: 20150268822

Abstract: A user can select an object represented in video content in order to set a magnification level with respect to that object. A portion of the video frames containing a representation of the object is selected to maintain a presentation size of the representation corresponding to the magnification level. The selection provides for a “smart zoom” feature enabling an object of interest, such as a face of an actor, to be used in selecting an appropriate portion of each frame to magnify, such that the magnification results in a portion of the frame being selected that includes the one or more objects of interest to the user. Pre-generated tracking data can be provided for some objects, which can enable a user to select an object and then have predetermined portion selections and magnifications applied that can provide for a smoother user experience than for dynamically-determined data.

Type: Application

Filed: May 21, 2014

Publication date: September 24, 2015

Applicant: Amazon Technologies, Inc.

Inventors: Charles Benjamin Franklin Waggoner, Colin Jon Taylor, Jeffrey P. Bezos, Douglas Ryan Gray
Image-Based Character Recognition

Publication number: 20150254507

Abstract: Various embodiments enable a device to perform tasks such as processing an image to recognize and locate text in the image, and providing the recognized text an application executing on the device for performing a function (e.g., calling a number, opening an internet browser, etc.) associated with the recognized text. In at least one embodiment, processing the image includes substantially simultaneously or concurrently processing the image with at least two recognition engines, such as at least two optical character recognition (OCR) engines, running in a multithreaded mode. In at least one embodiment, the recognition engines can be tuned so that their respective processing speeds are roughly the same. Utilizing multiple recognition engines enables processing latency to be close to that of using only one recognition engine.

Type: Application

Filed: May 26, 2015

Publication date: September 10, 2015

Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Yu Lou
Collaborative text detection and recognition

Patent number: 9098888

Abstract: Various embodiments provide methods and systems for identifying text in an image by applying suitable text detection parameters in text detection. The suitable text detection parameters can be determined based on parameter metric feedback from one or more text identification subtasks, such as text detection, text recognition, preprocessing, character set mapping, pattern matching and validation. In some embodiments, the image can be defined into one or more image regions by performing glyph detection on the image. Text detection parameters applying to each of the one or more image regions can be adjusted based on measured one or more parameter metrics in the respective image region.

Type: Grant

Filed: December 12, 2013

Date of Patent: August 4, 2015

Assignee: A9.com, Inc.

Inventors: Xiaofan Lin, Adam Wiggen Kraft, Yu Lou, Douglas Ryan Gray, Colin Jon Taylor
Image-based character recognition

Patent number: 9043349

Abstract: Various embodiments enable a device to perform tasks such as processing an image to recognize and locate text in the image, and providing the recognized text an application executing on the device for performing a function (e.g., calling a number, opening an internet browser, etc.) associated with the recognized text. In at least one embodiment, processing the image includes substantially simultaneously or concurrently processing the image with at least two recognition engines, such as at least two optical character recognition (OCR) engines, running in a multithreaded mode. In at least one embodiment, the recognition engines can be tuned so that their respective processing speeds are roughly the same. Utilizing multiple recognition engines enables processing latency to be close to that of using only one recognition engine.

Type: Grant

Filed: November 29, 2012

Date of Patent: May 26, 2015

Assignee: A9.com, Inc.

Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Yu Lou
AUGMENTED REALITY RECOMENDATIONS

Publication number: 20140337174

Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.

Type: Application

Filed: May 13, 2013

Publication date: November 13, 2014

Applicant: A9.com, Inc.

Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Atul Kumar, Yu Lou

prev 1 2 3