Patents by Inventor Douglas Ryan Gray

Douglas Ryan Gray has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9436883
    Abstract: Various embodiments provide methods and systems for identifying text in an image by applying suitable text detection parameters in text detection. The suitable text detection parameters can be determined based on parameter metric feedback from one or more text identification subtasks, such as text detection, text recognition, preprocessing, character set mapping, pattern matching and validation. In some embodiments, the image can be defined into one or more image regions by performing glyph detection on the image. Text detection parameters applying to each of the one or more image regions can be adjusted based on measured one or more parameter metrics in the respective image region.
    Type: Grant
    Filed: August 3, 2015
    Date of Patent: September 6, 2016
    Inventors: Xiaofan Lin, Adam Wiggen Kraft, Yu Lou, Douglas Ryan Gray, Colin Jon Taylor
  • Patent number: 9390340
    Abstract: Various embodiments enable a device to perform tasks such as processing an image to recognize and locate text in the image, and providing the recognized text an application executing on the device for performing a function (e.g., calling a number, opening an internet browser, etc.) associated with the recognized text. In at least one embodiment, processing the image includes substantially simultaneously or concurrently processing the image with at least two recognition engines, such as at least two optical character recognition (OCR) engines, running in a multithreaded mode. In at least one embodiment, the recognition engines can be tuned so that their respective processing speeds are roughly the same. Utilizing multiple recognition engines enables processing latency to be close to that of using only one recognition engine.
    Type: Grant
    Filed: May 26, 2015
    Date of Patent: July 12, 2016
    Assignee: A9.com
    Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Yu Lou
  • Publication number: 20160133299
    Abstract: Various embodiments identify differences between frame sequences of a video. For example, to determine a difference between two versions of a video, a fingerprint of each frame of the two versions is generated. From the fingerprints, a run-length encoded representation of each version is generated. The fingerprints which appear only once (i.e., unique fingerprints) in the entire video are identified from each version and compared to identify matching unique fingerprints across versions. The matching unique fingerprints are sorted and filtered to determine split points, which are used to align the two versions of the video. Accordingly, each version is segmented into smaller frame sequences using the split points. Once segmented, the individual frames of each segment are aligned across versions using a dynamic programming algorithm. After aligning the segments at a frame level, the segments are reassembled to generate a global alignment output.
    Type: Application
    Filed: January 15, 2016
    Publication date: May 12, 2016
    Inventors: Ismet Zeki Yalniz, Adam Carlson, Douglas Ryan Gray, Colin Jon Taylor
  • Publication number: 20160110848
    Abstract: Systems and approaches are provided for optimizing self-portraiture. The background of the self-portrait can be enhanced by image registration or stitching techniques of images captured using one or more conventional cameras. Multiple standard resolution images can be stitched together to generate a panoramic or a composite image of a higher resolution. Foreground elements, such as one or more representations of users, can also be enhanced in various ways. The representations of the users can be composited to exclude undesirable elements, such as image data of one of the users extending her arm to capture the self-portrait. An ideal pose of the users can automatically be selected and other image enhancements, such as histogram optimization, brightness and contrast optimization, color-cast correction, or reduction or removal of noise, can automatically be performed to minimize user effort in capturing self-portraits.
    Type: Application
    Filed: December 30, 2015
    Publication date: April 21, 2016
    Inventors: Douglas Ryan Gray, Colin Jon Taylor, Xiaofan Lin
  • Publication number: 20160098611
    Abstract: Various embodiments enable the identification of semi-structured text entities in an imager. The identification of the text entities is a relatively simple problem when the text is stored in a computer and free of errors, but much more challenging if the source is the output of an optical character recognition (OCR) engine from a natural scene image. Accordingly, output from an OCR engine is analyzed to isolate a character string indicative of a text entity. Each character of the string is then assigned to a character class to produce a character class string and the text entity of the string is identified based in part on a pattern of the character class string.
    Type: Application
    Filed: December 16, 2015
    Publication date: April 7, 2016
    Inventors: DOUGLAS RYAN GRAY, XIAOFAN LIN, ARNAB SANAT KUMAR DHUA, YU LOU
  • Patent number: 9292739
    Abstract: Various embodiments enable text aggregation from multiple image frames of text. Accordingly, in order to stitch newly scanned areas of a document together, text in a respective image is recognized and analyzed using an algorithm to identify pairs of corresponding words in other images. Upon identifying a minimum number of matching pairs between two respective images, a mapping between the same can be determined based at least in part on a geometric correspondence between respective identified pairs. Based on this mapping, the recognized text of the two images can be merged by adding words of one image to the other using the matching word pairs as alignment data points.
    Type: Grant
    Filed: December 12, 2013
    Date of Patent: March 22, 2016
    Assignee: A9.com, Inc.
    Inventors: Douglas Ryan Gray, Colin Jay Taylor, Xiaofan Lin, Adam Wiggen Kraft, Yu Lou, Arnab Sanat Kumar Dhua
  • Patent number: 9275682
    Abstract: Various embodiments identify differences between frame sequences of a video. For example, to determine a difference between two versions of a video, a fingerprint of each frame of the two versions is generated. From the fingerprints, a run-length encoded representation of each version is generated. The fingerprints which appear only once (i.e., unique fingerprints) in the entire video are identified from each version and compared to identify matching unique fingerprints across versions. The matching unique fingerprints are sorted and filtered to determine split points, which are used to align the two versions of the video. Accordingly, each version is segmented into smaller frame sequences using the split points. Once segmented, the individual frames of each segment are aligned across versions using a dynamic programming algorithm. After aligning the segments at a frame level, the segments are reassembled to generate a global alignment output.
    Type: Grant
    Filed: September 26, 2014
    Date of Patent: March 1, 2016
    Assignee: A9.com, Inc.
    Inventors: Ismet Zeki Yalniz, Adam Carlson, Douglas Ryan Gray, Colin Jon Taylor
  • Patent number: 9256795
    Abstract: Various embodiments enable the identification of semi-structured text entities in an imager. The identification of the text entities is a relatively simple problem when the text is stored in a computer and free of errors, but much more challenging if the source is the output of an optical character recognition (OCR) engine from a natural scene image. Accordingly, output from an OCR engine is analyzed to isolate a character string indicative of a text entity. Each character of the string is then assigned to a character class to produce a character class string and the text entity of the string is identified based in part on a pattern of the character class string.
    Type: Grant
    Filed: March 15, 2013
    Date of Patent: February 9, 2016
    Assignee: A9.com, Inc.
    Inventors: Douglas Ryan Gray, Xiaofan Lin, Arnab Sanat Kumar Dhua, Yu Lou
  • Patent number: 9247129
    Abstract: Systems and approaches are provided for optimizing self-portraiture. The background of the self-portrait can be enhanced by image registration or stitching techniques of images captured using one or more conventional cameras. Multiple standard resolution images can be stitched together to generate a panoramic or a composite image of a higher resolution. Foreground elements, such as one or more representations of users, can also be enhanced in various ways. The representations of the users can be composited to exclude undesirable elements, such as image data of one of the users extending her arm to capture the self-portrait. An ideal pose of the users can automatically be selected and other image enhancements, such as histogram optimization, brightness and contrast optimization, color-cast correction, or reduction or removal of noise, can automatically be performed to minimize user effort in capturing self-portraits.
    Type: Grant
    Filed: August 30, 2013
    Date of Patent: January 26, 2016
    Assignee: A9.com, Inc.
    Inventors: Douglas Ryan Gray, Colin Jon Taylor, Xiaofan Lin
  • Publication number: 20160019618
    Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.
    Type: Application
    Filed: September 23, 2015
    Publication date: January 21, 2016
    Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Atul Kumar, Yu Lou
  • Publication number: 20150339536
    Abstract: Various embodiments provide methods and systems for identifying text in an image by applying suitable text detection parameters in text detection. The suitable text detection parameters can be determined based on parameter metric feedback from one or more text identification subtasks, such as text detection, text recognition, preprocessing, character set mapping, pattern matching and validation. In some embodiments, the image can be defined into one or more image regions by performing glyph detection on the image. Text detection parameters applying to each of the one or more image regions can be adjusted based on measured one or more parameter metrics in the respective image region.
    Type: Application
    Filed: August 3, 2015
    Publication date: November 26, 2015
    Inventors: Xiaofan LIN, Adam Wiggen KRAFT, Yu LOU, Douglas Ryan GRAY, Colin Jon TAYLOR
  • Patent number: 9183583
    Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.
    Type: Grant
    Filed: May 13, 2013
    Date of Patent: November 10, 2015
    Assignee: A9.com, Inc.
    Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Atul Kumar, Yu Lou
  • Publication number: 20150268822
    Abstract: A user can select an object represented in video content in order to set a magnification level with respect to that object. A portion of the video frames containing a representation of the object is selected to maintain a presentation size of the representation corresponding to the magnification level. The selection provides for a “smart zoom” feature enabling an object of interest, such as a face of an actor, to be used in selecting an appropriate portion of each frame to magnify, such that the magnification results in a portion of the frame being selected that includes the one or more objects of interest to the user. Pre-generated tracking data can be provided for some objects, which can enable a user to select an object and then have predetermined portion selections and magnifications applied that can provide for a smoother user experience than for dynamically-determined data.
    Type: Application
    Filed: May 21, 2014
    Publication date: September 24, 2015
    Applicant: Amazon Technologies, Inc.
    Inventors: Charles Benjamin Franklin Waggoner, Colin Jon Taylor, Jeffrey P. Bezos, Douglas Ryan Gray
  • Publication number: 20150254507
    Abstract: Various embodiments enable a device to perform tasks such as processing an image to recognize and locate text in the image, and providing the recognized text an application executing on the device for performing a function (e.g., calling a number, opening an internet browser, etc.) associated with the recognized text. In at least one embodiment, processing the image includes substantially simultaneously or concurrently processing the image with at least two recognition engines, such as at least two optical character recognition (OCR) engines, running in a multithreaded mode. In at least one embodiment, the recognition engines can be tuned so that their respective processing speeds are roughly the same. Utilizing multiple recognition engines enables processing latency to be close to that of using only one recognition engine.
    Type: Application
    Filed: May 26, 2015
    Publication date: September 10, 2015
    Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Yu Lou
  • Patent number: 9098888
    Abstract: Various embodiments provide methods and systems for identifying text in an image by applying suitable text detection parameters in text detection. The suitable text detection parameters can be determined based on parameter metric feedback from one or more text identification subtasks, such as text detection, text recognition, preprocessing, character set mapping, pattern matching and validation. In some embodiments, the image can be defined into one or more image regions by performing glyph detection on the image. Text detection parameters applying to each of the one or more image regions can be adjusted based on measured one or more parameter metrics in the respective image region.
    Type: Grant
    Filed: December 12, 2013
    Date of Patent: August 4, 2015
    Assignee: A9.com, Inc.
    Inventors: Xiaofan Lin, Adam Wiggen Kraft, Yu Lou, Douglas Ryan Gray, Colin Jon Taylor
  • Patent number: 9043349
    Abstract: Various embodiments enable a device to perform tasks such as processing an image to recognize and locate text in the image, and providing the recognized text an application executing on the device for performing a function (e.g., calling a number, opening an internet browser, etc.) associated with the recognized text. In at least one embodiment, processing the image includes substantially simultaneously or concurrently processing the image with at least two recognition engines, such as at least two optical character recognition (OCR) engines, running in a multithreaded mode. In at least one embodiment, the recognition engines can be tuned so that their respective processing speeds are roughly the same. Utilizing multiple recognition engines enables processing latency to be close to that of using only one recognition engine.
    Type: Grant
    Filed: November 29, 2012
    Date of Patent: May 26, 2015
    Assignee: A9.com, Inc.
    Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Yu Lou
  • Publication number: 20140337174
    Abstract: Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.
    Type: Application
    Filed: May 13, 2013
    Publication date: November 13, 2014
    Applicant: A9.com, Inc.
    Inventors: Xiaofan Lin, Arnab Sanat Kumar Dhua, Douglas Ryan Gray, Atul Kumar, Yu Lou