Patents by Inventor David Paul Ramos

David Paul Ramos has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11947913
    Abstract: Techniques for performing multi-stage entity resolution (ER) processing are described. A system may determine a portion of a user input corresponding to an entity name, and may request an entity provider component to perform a search to determine one or more entities corresponding to the entity name. The preliminary search results may be sent to a skill selection component for processing, while the entity provider component performs a complete search to determine entities corresponding to the entity name. A selected skill component may request the complete search results to perform its processing, including determining an output responsive to the user input.
    Type: Grant
    Filed: June 24, 2021
    Date of Patent: April 2, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: David Paul Ramos, Tonytip Ketudat, Vikas Chawla, Lukas Leon Brower
  • Patent number: 11157696
    Abstract: Techniques for performing entity resolution as part of natural language understanding processing are described. During offline operations, a system may convert text (representing entities known to the system) into audio of various languages. The languages into which the text is converted may depend on the location where the entity is likely to be spoken by users of the system. At runtime, the system processes a user input using text-based entity resolution. If text-based entity resolution fails, the system may identify user speech corresponding to an entity to be resolved, and attempt to phonetically match the user speech to the audio of the known entities. Results of the phonetic entity resolution may then used by downstream components, such as skills.
    Type: Grant
    Filed: June 25, 2018
    Date of Patent: October 26, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: David Paul Ramos, Ameya Ashok Limaye
  • Patent number: 10216989
    Abstract: Disclosed are techniques for providing additional information for text in an image. In some implementations, a computing device receives an image including text. Optical character recognition (OCR) is performed on the image to produce recognized text. A word or a phrase is selected from the recognized text for providing additional information. One or more potential meanings of the selected word or phrase are determined. One of the potential meanings is selected based on other text in the image. A source of additional information corresponding to the selected meaning is selected for providing the additional information to a user's device.
    Type: Grant
    Filed: October 15, 2015
    Date of Patent: February 26, 2019
    Assignee: Amazon Technologies, Inc.
    Inventors: David Paul Ramos, Matthew Joseph Cole, Matthew Daniel Hart
  • Patent number: 9659224
    Abstract: Disclosed are techniques for merging optical character recognized (OCR'd) text from frames of image data. In some implementations, a device sends frames of image data to a server, where each frame includes at least a portion of a captured textual item. The server performs optical character recognition (OCR) on the image data of each frame. When OCR'd text from respective frames is returned to the device from the server, the device can perform matching operations on the text, for instance, using bounding boxes and/or edit distance processing. The device can merge any identified matches of OCR'd text from different frames. The device can then display the merged text with any corrections.
    Type: Grant
    Filed: March 31, 2014
    Date of Patent: May 23, 2017
    Assignee: Amazon Technologies, Inc.
    Inventors: Matthew Joseph Cole, Sonjeev Jahagirdar, Matthew Daniel Hart, David Paul Ramos, Ankur Datta, Utkarsh Prateek, Emilie Noelle McConville, Prashant Hegde, Avnish Sikka
  • Patent number: 9367736
    Abstract: A multi-orientation text detection method and associated system is disclosed that utilizes orientation-variant glyph features to determine a text line in an image regardless of an orientation of the text line. Glyph features are determined for each glyph in an image with respect to a neighboring glyph. The glyph features are provided to a learned classifier that outputs a glyph pair score for each neighboring glyph pair. Each glyph pair score indicates a likelihood that the corresponding pair of neighboring glyphs form part of a same text line. The glyph pair scores are used to identify candidate text lines, which are then ranked to select a final set of text lines in the image.
    Type: Grant
    Filed: September 1, 2015
    Date of Patent: June 14, 2016
    Assignee: Amazon Technologies, Inc.
    Inventors: Thibaud Senechal, Quan Wang, Daniel Makoto Willenson, Shuang Wu, Yue Liu, Shiv Naga Prasad Vitaladevuni, David Paul Ramos, Qingfeng Yu
  • Patent number: 9355336
    Abstract: Disclosed are techniques for recognizing text from one or more frames of image data using contextual information. In some implementations, image data including a captured textual item is processed to identify an entity in the image data. A context can be selected using the entity, where the context corresponds to a dictionary. Text in the captured textual item can be identified using the dictionary. The identified text can be output to a display device.
    Type: Grant
    Filed: April 23, 2014
    Date of Patent: May 31, 2016
    Assignee: Amazon Technologies, Inc.
    Inventors: Sonjeev Jahagirdar, Matthew Joseph Cole, David Paul Ramos, Utkarsh Prateek, Emilie Noelle McConville, Ankur Datta, Laura Varnum Finney, Yue Liu, Bhavesh Anil Doshi, Avnish Sikka, Michael Vanne
  • Patent number: 9305227
    Abstract: Embodiments of the subject technology provide for a hybrid OCR approach which combines server and device side processing that can offset disadvantages of performing OCR solely on the server side or the device side. More specifically, the subject technology utilizes image characteristics such as glyph details and image quality measurements to opportunistically schedule OCR processing on the mobile device and/or server. In this regard, text extracted by a “faster” OCR engine (e.g., one with less latency) is displayed to a user, which is then updated by the result of a more accurate OCR engine (e.g., an OCR engine provided by the server). This approach allows factoring in additional parameters such as network latency and user preference for making scheduling decisions. Thus, the subject technology may provide significant gains in terms of reduced latency and increased accuracy by implementing one or more techniques associated with this hybrid OCR approach.
    Type: Grant
    Filed: December 23, 2013
    Date of Patent: April 5, 2016
    Assignee: Amazon Technologies, Inc.
    Inventors: Rakesh Madhavan Nambiar, Sonjeev Jahagirdar, Matthew Joseph Cole, Matias Omar Gregorio Benitez, Junxiong Jia, David Paul Ramos
  • Patent number: 9286683
    Abstract: Approaches to enable a computing device, such as a phone or tablet computer, to detect when text contained in an image captured by the camera is sufficiently close to the edge of the screen and to infer whether the text is likely to be cut off by the edge of the screen such that the text contained in the image is incomplete. If the incomplete text corresponds to actionable text associated with a function that can be invoked on the computing device, the computing device may wait until the remaining portion of the actionable text is captured by the camera and made available for processing before invoking the corresponding function on the computing device.
    Type: Grant
    Filed: April 17, 2013
    Date of Patent: March 15, 2016
    Assignee: Amazon Technologies, Inc.
    Inventor: David Paul Ramos
  • Patent number: 9262689
    Abstract: Embodiments of the subject technology provide for determining a region of a first acquired image based at least on a viewing mode and a set of respective positions of graphical elements to decrease the pre-processing time and perceived latency for the first image. One or more regions of text in the first image are detected, and a set of regions of text that overlap with the region of the image is determined and pre-processed. The subject technology may then pre-process an entirety of a subsequent image (e.g., to pick up missing text from the region of the first image). Thus, additional OCR results may be provided to the user by using the subsequent image(s) and merging subsequent results with previous results from the first image.
    Type: Grant
    Filed: December 18, 2013
    Date of Patent: February 16, 2016
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Avnish Sikka, David Paul Ramos, Matthew Daniel Hart, Yue Liu, Emilie Noelle McConville
  • Patent number: 9239961
    Abstract: The recognition of text in an acquired image is improved by using general and type-specific heuristics that can determine the likelihood that a portion of the text is truncated at an edge of an image, frame, or screen. Truncated text can be filtered such that the user is not provided with an option to perform an undesirable task, such as to dial an incorrect number or connect to an incorrect Web address, based on recognizing an incomplete text string. The general and type-specific heuristics can be combined to improve confidence, and the image data can be pre-processed on the device before processing with an optical character recognition (OCR) engine. Multiple frames can be analyzed to attempt to recognize words or characters that might have been truncated in one or more of the frames.
    Type: Grant
    Filed: September 24, 2014
    Date of Patent: January 19, 2016
    Assignee: Amazon Technologies, Inc.
    Inventors: Matthew Joseph Cole, Yue Liu, David Paul Ramos, Avnish Sikka
  • Patent number: 9165186
    Abstract: Disclosed are techniques for providing additional information for text in an image. In some implementations, a computing device receives an image including text. Optical character recognition (OCR) is performed on the image to produce recognized text. One or more topics corresponding to the recognized text is determined. A word or a phrase is selected from the recognized text for providing additional information. One or more potential meanings of the selected word or phrase are determined. One of the potential meanings is selected using the one or more topics. A source of additional information corresponding to the selected meaning is selected for providing the additional information to a user's device.
    Type: Grant
    Filed: May 30, 2014
    Date of Patent: October 20, 2015
    Assignee: Amazon Technologies, Inc.
    Inventors: David Paul Ramos, Matthew Joseph Cole, Matthew Daniel Hart
  • Patent number: 9058644
    Abstract: Various embodiments enable regions of text to be identified in an image captured by a camera of a computing device for preprocessing before being analyzed by a visual recognition engine. For example, each of the identified regions can be analyzed or tested to determine whether a respective region contains a quality associated with poor text recognition results, such as poor contrast, blur, noise, and the like, which can be measured by one or more algorithms. Upon identifying a region with such a quality, an image quality enhancement can be automatically applied to the respective region without user instruction or intervention. Accordingly, once each region has been cleared of the quality associated with poor recognition, the regions of text can be processed with a visual recognition algorithm or engine.
    Type: Grant
    Filed: March 13, 2013
    Date of Patent: June 16, 2015
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: David Paul Ramos, Chang Yuan, Keith Harrison Goodman, Avnish Sikka