Patents by Inventor Gustavo Abrego

Gustavo Abrego has filed for patents to protect the following inventions. This listing includes pending patent applications as well as patents already granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7529668
    Abstract: A system and method for implementing a refined dictionary for speech recognition includes a database analyzer that initially identifies first vocabulary words that are present in a training database and second vocabulary words that are not present in the training database. A relevance module then performs refinement procedures upon the first vocabulary words to produce refined short word pronunciations and refined long word pronunciations that are added to a refined dictionary. A consensus module compares the second pronunciations with calculated plurality pronunciations to identify final consensus pronunciations that are then included in the refined dictionary.
    Type: Grant
    Filed: August 3, 2004
    Date of Patent: May 5, 2009
    Assignees: Sony Corporation, Sony Electronics Inc.
    Inventors: Gustavo Abrego, Lex S. Olorenshaw
  • Patent number: 7392186
    Abstract: A system and method for effectively implementing an optimized language model for speech recognition includes initial language models each created by combining source models according to selectable interpolation coefficients that define proportional relationships for combining the source models. A rescoring module iteratively utilizes the initial language models to process input development data for calculating word-error rates that each correspond to a different one of the initial language models. An optimized language model is then selected from the initial language models by identifying an optimal word-error rate from among the foregoing word-error rates. The speech recognizer may then utilize the optimized language model for effectively performing various speech recognition procedures.
    Type: Grant
    Filed: March 30, 2004
    Date of Patent: June 24, 2008
    Assignees: Sony Corporation, Sony Electronics Inc.
    Inventors: Lei Duan, Gustavo Abrego, Xavier Menendez-Pidal, Lex Olorenshaw
  • Publication number: 20060136209
    Abstract: A system and method for effectively performing speech recognition procedures includes enhanced demiphone acoustic models that a speech recognition engine utilizes to perform the speech recognition procedures. The enhanced demiphone acoustic models each have three states that are collectively arranged to form a preceding demiphone and a succeeding demiphone. An acoustic model generator may utilize a decision tree for analyzing speech context information from a training database. The acoustic model generator then effectively configures each of the enhanced demiphone acoustic models as either a succeeding-dominant enhanced demiphone acoustic model or a preceding-dominant enhanced demiphone acoustic model to accurately model speech characteristics.
    Type: Application
    Filed: December 16, 2004
    Publication date: June 22, 2006
    Inventors: Xavier Menendez-Pidal, Lex Olorenshaw, Gustavo Abrego
  • Publication number: 20060031069
    Abstract: A system and method for performing a grapheme-to-phoneme conversion procedure includes a graphone model generator that performs a graphone model training procedure to produce an N-gram graphone model based upon dictionary entries in a training dictionary. A grapheme-to-phoneme decoder then references the N-gram graphone model to perform grapheme-to-phoneme decoding procedures to convert input text into corresponding output phonemes.
    Type: Application
    Filed: August 3, 2004
    Publication date: February 9, 2006
    Inventors: Jun Huang, Gustavo Abrego, Lex Olorenshaw
  • Publication number: 20060031070
    Abstract: A system and method for implementing a refined dictionary for speech recognition includes a database analyzer that initially identifies first vocabulary words that are present in a training database and second vocabulary words that are not present in the training database. A relevance module then performs refinement procedures upon the first vocabulary words to produce refined short word pronunciations and refined long word pronunciations that are added to a refined dictionary. A consensus module compares the second pronunciations with calculated plurality pronunciations to identify final consensus pronunciations that are then included in the refined dictionary.
    Type: Application
    Filed: August 3, 2004
    Publication date: February 9, 2006
    Inventors: Gustavo Abrego, Lex Olorenshaw
  • Publication number: 20060031071
    Abstract: A system and method for automatically implementing a finite state automaton for speech recognition includes a finite state automaton generator that analyzes one or more input text sequences and automatically creates a node table and a link table to define the finite state automaton. The node table includes N-tuples from the input text sequences. Each N-tuple includes a current word and a corresponding history of one or more prior words from the input text sequences. The node table also includes unique node identifiers that each correspond to a different respective one of the current words. The link table includes specific links between successive words from the input text sequences. The links identified in the link table are defined by utilizing start node identifiers and end node identifiers from the unique node identifiers of the node table.
    Type: Application
    Filed: August 3, 2004
    Publication date: February 9, 2006
    Inventors: Gustavo Abrego, Atsuo Hiroe, Eugene Koontz
  • Publication number: 20050228671
    Abstract: A system and method for utilizing speech recognition to efficiently perform data indexing procedures includes an authoring module that coordinates an authoring procedure for creating an index file that has pattern word sets corresponding to data objects stored in a memory of a host electronic device. The pattern word sets are generated with a speech recognition engine that transforms spoken data descriptions into text data descriptions for creating the pattern word sets. The pattern word sets are associated in the index file with data object identifiers that uniquely identify the corresponding data objects. A retrieval module manages a retrieval procedure in which the speech recognition engine converts a spoken data request into a text data request. The retrieval module compares the text data request and the pattern word sets to identify a requested object identifier for locating a requested data object from among the data objects stored in the memory of the host electronic device.
    Type: Application
    Filed: March 30, 2004
    Publication date: October 13, 2005
    Inventors: Lex Olorenshaw, Gustavo Abrego, Eugene Koontz
  • Publication number: 20050228667
    Abstract: A system and method for effectively implementing an optimized language model for speech recognition includes initial language models each created by combining source models according to selectable interpolation coefficients that define proportional relationships for combining the source models. A rescoring module iteratively utilizes the initial language models to process input development data for calculating word-error rates that each correspond to a different one of the initial language models. An optimized language model is then selected from the initial language models by identifying an optimal word-error rate from among the foregoing word-error rates. The speech recognizer may then utilize the optimized language model for effectively performing various speech recognition procedures.
    Type: Application
    Filed: March 30, 2004
    Publication date: October 13, 2005
    Inventors: Lei Duan, Gustavo Abrego, Xavier Menendez-Pidal, Lex Olorenshaw
  • Publication number: 20050209854
    Abstract: A system and method for performing a refinement procedure to effectively implement a speech recognition dictionary for spontaneous speech recognition may include a problematic word identifier configured to divide vocabulary words from an initial speech recognition dictionary into problematic words and non-problematic words according to pre-defined identification criteria. A candidate generator may analyze the problematic words to produce one or more pronunciation candidates for each of the problematic words. An optimization module may then perform an optimization process for refining one or more pronunciation candidates according to certain optimization criteria to thereby generate optimized problematic pronunciations. A dictionary refinement manager may finally combine the optimized problematic pronunciations with non-problematic pronunciations of the non-problematic words to produce a refined speech recognition dictionary for use by the speech recognition system.
    Type: Application
    Filed: March 22, 2004
    Publication date: September 22, 2005
    Inventors: Gustavo Abrego, Xavier Menendez-Pidal, Lex Olorenshaw
  • Publication number: 20050209849
    Abstract: A system and method for automatically cataloguing data by utilizing speech recognition procedures includes an electronic device that captures audio/video data and corresponding verbal narration. A speech recognition engine coupled to the electronic device automatically performs a speech recognition process upon the audio/video data and verbal narration to generate labels that correspond to respective subject matter locations in the audio/video data. A label manager of the electronic device manages a label mode for generating and storing the foregoing labels. The label manager also controls a label search mode during which a system user utilizes the labels to automatically locate corresponding subject matter locations in the captured audio/video data.
    Type: Application
    Filed: March 22, 2004
    Publication date: September 22, 2005
    Inventors: Gustavo Abrego, Lex Olorenshaw, Lei Duan, Xavier Menendez-Pidal
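
As an illustration of the abstract for patent 7392186 (also publication 20050228667), here is a minimal Python sketch of selecting an interpolated language model by word-error rate. It is not the patented implementation. The unigram source models, the N-best lists with acoustic scores, the candidate coefficients, and every function name are assumptions made for illustration only.

```python
import math

def interpolate(models, weights):
    """Linearly combine unigram source models: P(w) = sum_i weight_i * P_i(w)."""
    vocab = set().union(*models)
    return {w: sum(wt * m.get(w, 0.0) for m, wt in zip(models, weights))
            for w in vocab}

def lm_logprob(lm, words, floor=1e-6):
    """Log probability of a word sequence under a unigram model (floored)."""
    return sum(math.log(max(lm.get(w, 0.0), floor)) for w in words)

def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between reference and hypothesis."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (r != h)))
        prev = cur
    return prev[-1]

def word_error_rate(lm, dev_data):
    """Rescore each N-best list with the model, then measure WER."""
    errors = total = 0
    for reference, nbest in dev_data:
        # Each hypothesis is (words, acoustic_score); pick the best combined score.
        best_words, _ = max(nbest, key=lambda h: h[1] + lm_logprob(lm, h[0]))
        errors += edit_distance(reference, best_words)
        total += len(reference)
    return errors / total

def select_optimized_model(models, candidate_weights, dev_data):
    """Return (lowest WER, interpolation coefficients that produced it)."""
    return min((word_error_rate(interpolate(models, w), dev_data), w)
               for w in candidate_weights)

# Toy source models and development data (hypothetical values).
news = {"stocks": 0.5, "rose": 0.4, "rows": 0.1}
garden = {"stocks": 0.1, "rose": 0.2, "rows": 0.7}
dev_data = [
    (["stocks", "rose"],
     [(["stocks", "rows"], -1.0), (["stocks", "rose"], -1.2)]),
]
candidates = [(1.0, 0.0), (0.5, 0.5), (0.0, 1.0)]

best_wer, best_weights = select_optimized_model([news, garden], candidates, dev_data)
print(best_wer, best_weights)
```

In this toy setup only the first candidate weights the in-domain model heavily enough to recover the reference transcript, so it is the one selected. A real system would use N-gram models and a much larger development set.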
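
The node table and link table described in the abstract for publication 20060031071 can be pictured with a small Python sketch. This is only one plausible reading of the abstract. The `<s>` start node, the choice to key node identifiers on the whole N-tuple, the whitespace tokenization, and the function name `build_automaton` are all assumptions, not details taken from the application.

```python
def build_automaton(sequences, history_size=1):
    """Build a node table and a link table from input text sequences."""
    node_table = {}     # N-tuple (history..., current word) -> node identifier
    link_table = set()  # (start node identifier, end node identifier)

    def node_id(ntuple):
        # Assign the next unused identifier the first time an N-tuple is seen.
        return node_table.setdefault(ntuple, len(node_table))

    for text in sequences:
        words = text.split()
        previous = node_id(("<s>",))  # assumed shared start node
        for i, word in enumerate(words):
            history = tuple(words[max(0, i - history_size):i])
            current = node_id(history + (word,))
            link_table.add((previous, current))
            previous = current
    return node_table, sorted(link_table)

nodes, links = build_automaton(["play the music", "stop the music"])
for ntuple, identifier in nodes.items():
    print(identifier, ntuple)
print(links)
```

Because both sequences end in the same word with the same one-word history, they share a single final node, which keeps the automaton smaller than a plain list of sentences.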
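
The authoring and retrieval procedures in the abstract for publication 20050228671 can be sketched in Python as follows. The speech recognition engine is left out: the sketch starts from text that is assumed to have been transcribed already. The word-overlap matching rule, the object identifiers, and the function names are hypothetical, since the abstract does not say how the comparison is performed.

```python
def author_index(descriptions):
    """Map each data object identifier to a pattern word set.

    `descriptions` maps an object identifier to the text of its spoken
    description (assumed to be transcribed by a recognizer beforehand).
    """
    return {obj_id: set(text.lower().split())
            for obj_id, text in descriptions.items()}

def retrieve(index, request_text):
    """Return the identifier whose pattern word set best overlaps the request.

    Returns None when no pattern word set shares a word with the request.
    """
    request = set(request_text.lower().split())
    best_id, best_overlap = None, 0
    for obj_id, pattern in index.items():
        overlap = len(request & pattern)
        if overlap > best_overlap:
            best_id, best_overlap = obj_id, overlap
    return best_id

index = author_index({
    "IMG_001": "birthday party at the beach",
    "IMG_002": "graduation ceremony",
})
match = retrieve(index, "show the beach party")
no_match = retrieve(index, "mountain hike")
print(match, no_match)
```

Counting shared words is the simplest possible comparison. A practical system would likely weight words, ignore common ones such as "the", and account for recognition errors.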