Patents by Inventor Dmitriy Genzel

Dmitriy Genzel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9536180
    Abstract: Methods and systems for grapheme splitting of text input for recognition are provided. A method may include receiving a text input in a script and segmenting the text input into one or more graphemes. Each of the one or more graphemes may be split into one or more recognition units based on one or more recognition unit identification criteria associated with the script. Next, a text recognition system may be trained using the recognition units. Text input may be handwritten text input received from a user or a scanned image of text.
    Type: Grant
    Filed: December 30, 2013
    Date of Patent: January 3, 2017
    Assignee: Google Inc.
    Inventors: Thomas Deselaers, Daniel Martin Keysers, Dmitriy Genzel, Ashok Chhabedia Popat
  • Publication number: 20150186738
    Abstract: Methods and systems for grapheme splitting of text input for recognition are provided. A method may include receiving a text input in a script and segmenting the text input into one or more graphemes. Each of the one or more graphemes may be split into one or more recognition units based on one or more recognition unit identification criteria associated with the script. Next, a text recognition system may be trained using the recognition units. Text input may be handwritten text input received from a user or a scanned image of text.
    Type: Application
    Filed: December 30, 2013
    Publication date: July 2, 2015
    Applicant: Google Inc.
    Inventors: Thomas Deselaers, Daniel Martin Keysers, Dmitriy Genzel, Ashok Chhabedia Popat
  • Patent number: 8953885
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing optical character recognition. In one aspect, a method includes receiving a text image I. A set of feature functions are evaluated for a log linear model to determine respective feature values for the text image I, wherein each feature function h_i maps the text image I to a feature value, and wherein each feature function h_i is associated with a respective feature weight λ_i. A transcription T̂ is determined that minimizes a cost of the log linear model.
    Type: Grant
    Filed: September 14, 2012
    Date of Patent: February 10, 2015
    Assignee: Google Inc.
    Inventors: Franz Josef Och, Ashok Chhabedia Popat, Dmitriy Genzel, Michael E. Jahr
  • Patent number: 8626486
    Abstract: Methods, systems, and apparatus, including computer program products, for correcting spelling in text. A text input is received for translation. One or more suspect words in the text input are identified. For each suspect word, one or more candidate words are identified. A score for the text input and scores for each of one or more candidate inputs are determined, where each candidate input is the text input with one or more of the suspect words each replaced by a respective candidate word. If any, a candidate input whose score is highest among the scores for the candidate inputs and is greater than the text input score by at least a threshold is selected. Otherwise, the text input is selected. A translation of a selected candidate input or the selected text input is provided as the translation of the text input.
    Type: Grant
    Filed: September 5, 2007
    Date of Patent: January 7, 2014
    Assignee: Google Inc.
    Inventors: Franz J. Och, Dmitriy Genzel
  • Patent number: 8521516
    Abstract: Systems, methods, and apparatuses including computer program products are provided for training machine learning systems. In some implementations, a method is provided. The method includes receiving a collection of phrases, normalizing a plurality of phrases of the collection of phrases, the normalizing being based at least in part on lexicographic normalizing rules, and generating a normalized phrase table including a plurality of key-value pairs, each key value pair includes a key corresponding to a normalized phrase and a value corresponding to one or more un-normalized phrases associated with the normalized key, each un-normalized phrase having one or more parameters.
    Type: Grant
    Filed: March 25, 2009
    Date of Patent: August 27, 2013
    Assignee: Google Inc.
    Inventors: Franz Josef Och, Ignacio E. Thayer, Ioannis Tsochandaridis, Dmitriy Genzel
  • Publication number: 20130151235
    Abstract: Systems, methods, and apparatuses including computer program products are provided for training machine learning systems. In some implementations, a method is provided. The method includes receiving a collection of phrases, normalizing a plurality of phrases of the collection of phrases, the normalizing being based at least in part on lexicographic normalizing rules, and generating a normalized phrase table including a plurality of key-value pairs, each key value pair includes a key corresponding to a normalized phrase and a value corresponding to one or more un-normalized phrases associated with the normalized key, each un-normalized phrase having one or more parameters.
    Type: Application
    Filed: March 25, 2009
    Publication date: June 13, 2013
    Applicant: Google Inc.
    Inventors: Franz Josef Och, Ignacio E. Thayer, Ioannis Tsochandaridis, Dmitriy Genzel
  • Publication number: 20130144592
    Abstract: Methods, systems, and apparatus, including computer program products, for correcting spelling in text. A text input is received for translation. One or more suspect words in the text input are identified. For each suspect word, one or more candidate words are identified. A score for the text input and scores for each of one or more candidate inputs are determined, where each candidate input is the text input with one or more of the suspect words each replaced by a respective candidate word. If any, a candidate input whose score is highest among the scores for the candidate inputs and is greater than the text input score by at least a threshold is selected. Otherwise, the text input is selected. A translation of a selected candidate input or the selected text input is provided as the translation of the text input.
    Type: Application
    Filed: September 5, 2007
    Publication date: June 6, 2013
    Applicant: Google Inc.
    Inventors: Franz J. Och, Dmitriy Genzel
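
The selection rule in the spelling-correction abstract (patent 8626486 and publication 20130144592) can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the exhaustive enumeration of candidate inputs, and the toy vocabulary-count score are all assumptions made for illustration, since the abstract does not say how suspect words, candidate words, or scores are produced.

```python
from itertools import product
from typing import Callable, Dict, List, Sequence

# Hypothetical toy vocabulary; a real system would use a far richer model.
VOCAB = {"the", "cat", "sat"}


def toy_score(words: Sequence[str]) -> float:
    """Assumed scoring function: number of words found in VOCAB."""
    return float(sum(w in VOCAB for w in words))


def select_input(
    words: Sequence[str],
    candidates: Dict[int, List[str]],
    score: Callable[[Sequence[str]], float],
    threshold: float,
) -> List[str]:
    """Pick the input to translate.

    `candidates` maps the position of each suspect word to its candidate
    replacements. The best-scoring candidate input is returned only if it
    beats the original score by at least `threshold`; otherwise the
    original text input is returned.
    """
    original = list(words)
    base = score(original)
    positions = sorted(candidates)

    # Each suspect position either keeps its word or takes one candidate.
    options = [[original[i]] + candidates[i] for i in positions]

    best, best_score = None, float("-inf")
    for choice in product(*options):
        variant = list(original)
        for pos, word in zip(positions, choice):
            variant[pos] = word
        if variant == original:
            continue  # the unmodified input is not a candidate input
        s = score(variant)
        if s > best_score:
            best, best_score = variant, s

    if best is not None and best_score - base >= threshold:
        return best
    return original
```

With this toy score, select_input(["teh", "cat", "sat"], {0: ["the", "ten"]}, toy_score, 1.0) picks the corrected input, while raising the threshold to 2.0 keeps the original. The selected input would then be handed to the translation step, which the sketch leaves out.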