Patents by Inventor Jerome R. Bellegarda

Jerome R. Bellegarda has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7165032
    Abstract: Pronunciation for an input word is modeled by generating a set of candidate phoneme strings having pronunciations close to the input word in an orthographic space. Phoneme sub-strings in the set are selected as the pronunciation. In one aspect, a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word is used to determine the candidate phoneme strings. The words are chosen from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary. In another aspect, the phoneme sub-strings are selected by aligning the candidate phoneme strings on common phoneme sub-strings to produce an occurrence count, which is used to choose the phoneme sub-strings for the pronunciation.
    Type: Grant
    Filed: November 22, 2002
    Date of Patent: January 16, 2007
    Assignee: Apple Computer, Inc.
    Inventor: Jerome R. Bellegarda
  • Patent number: 7149695
    Abstract: A method and apparatus for command recognition using semantic inference and word agglomeration is described herein. According to one aspect of the present invention, a method for recognizing a voice command comprises recognizing a sequence of words received as the voice command. The sequence of words is further agglomerated into a sequence of word n-tuples. Semantic inference is applied to the sequence of word n-tuples to recognize the voice command.
    Type: Grant
    Filed: October 13, 2000
    Date of Patent: December 12, 2006
    Assignee: Apple Computer, Inc.
    Inventor: Jerome R. Bellegarda
  • Patent number: 7124081
    Abstract: A method and apparatus for speech recognition using latent semantic adaptation is described herein. According to one aspect of the present invention, a method for recognizing speech comprises using latent semantic analysis (LSA) to generate an LSA space for a collection of documents and to continually adapt the LSA space with new documents as they become available. Adaptation of the LSA space is optimally two-sided, taking into account the new words in the new documents. Alternatively, adaptation is one-sided, taking into account the new documents but discarding any new words appearing in those documents.
    Type: Grant
    Filed: September 28, 2001
    Date of Patent: October 17, 2006
    Assignee: Apple Computer, Inc.
    Inventor: Jerome R. Bellegarda
  • Patent number: 7076527
    Abstract: A method and apparatus for filtering messages, comprising: determining a first semantic anchor corresponding to a first group of messages (for example, legitimate messages) and a second semantic anchor corresponding to a second group of messages (for example, unsolicited messages); determining a vector corresponding to an incoming message; comparing the vector corresponding to the incoming message with at least one of the first semantic anchor and the second semantic anchor to obtain a first comparison value and a second comparison value; and filtering the incoming message based on the first comparison value and the second comparison value.
    Type: Grant
    Filed: June 14, 2001
    Date of Patent: July 11, 2006
    Assignee: Apple Computer, Inc.
    Inventors: Jerome R. Bellegarda, Devang Naik, Kim E. A. Silverman
  • Patent number: 7047193
    Abstract: Pronunciation for an input word is modeled by generating a set of candidate phoneme strings having pronunciations close to the input word in an orthographic space. Phoneme sub-strings in the set are selected as the pronunciation. In one aspect, a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word is used to determine the candidate phoneme strings. The words are chosen from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary. In another aspect, the phoneme sub-strings are selected by aligning the candidate phoneme strings on common phoneme sub-strings to produce an occurrence count, which is used to choose the phoneme sub-strings for the pronunciation.
    Type: Grant
    Filed: September 13, 2002
    Date of Patent: May 16, 2006
    Assignee: Apple Computer, Inc.
    Inventor: Jerome R. Bellegarda
  • Patent number: 6836760
    Abstract: A method and apparatus to use semantic inference with speech recognition systems includes recognizing at least one spoken word, processing the spoken word using a context-free grammar, deriving an output from the context-free grammar, and translating the output to a predetermined command.
    Type: Grant
    Filed: September 29, 2000
    Date of Patent: December 28, 2004
    Assignee: Apple Computer, Inc.
    Inventors: Jerome R. Bellegarda, Kim E. A. Silverman
  • Patent number: 6785652
    Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.
    Type: Grant
    Filed: December 19, 2002
    Date of Patent: August 31, 2004
    Assignee: Apple Computer, Inc.
    Inventors: Jerome R. Bellegarda, Kim Silverman
  • Patent number: 6778952
    Abstract: A method and system for dynamic language modeling of a document are described. In one embodiment, a number of local probabilities of a current document are computed and a vector representation of the current document in a latent semantic analysis (LSA) space is determined. In addition, a number of global probabilities based upon the vector representation of the current document in an LSA space is computed. Further, the local probabilities and the global probabilities are combined to produce the language modeling.
    Type: Grant
    Filed: September 12, 2002
    Date of Patent: August 17, 2004
    Assignee: Apple Computer, Inc.
    Inventor: Jerome R. Bellegarda
  • Publication number: 20040054533
    Abstract: Pronunciation for an input word is modeled by generating a set of candidate phoneme strings having pronunciations close to the input word in an orthographic space. Phoneme sub-strings in the set are selected as the pronunciation. In one aspect, a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word is used to determine the candidate phoneme strings. The words are chosen from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary. In another aspect, the phoneme sub-strings are selected by aligning the candidate phoneme strings on common phoneme sub-strings to produce an occurrence count, which is used to choose the phoneme sub-strings for the pronunciation.
    Type: Application
    Filed: November 22, 2002
    Publication date: March 18, 2004
    Inventor: Jerome R. Bellegarda
  • Publication number: 20030093277
    Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.
    Type: Application
    Filed: December 19, 2002
    Publication date: May 15, 2003
    Inventors: Jerome R. Bellegarda, Kim Silverman
  • Patent number: 6553344
    Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.
    Type: Grant
    Filed: February 22, 2002
    Date of Patent: April 22, 2003
    Assignee: Apple Computer, Inc.
    Inventors: Jerome R. Bellegarda, Kim Silverman
  • Publication number: 20030069909
    Abstract: A method and system for dynamic language modeling of a document are described. In one embodiment, a number of local probabilities of a current document are computed and a vector representation of the current document in a latent semantic analysis (LSA) space is determined. In addition, a number of global probabilities based upon the vector representation of the current document in an LSA space is computed. Further, the local probabilities and the global probabilities are combined to produce the language modeling.
    Type: Application
    Filed: September 12, 2002
    Publication date: April 10, 2003
    Inventor: Jerome R. Bellegarda
  • Publication number: 20030009526
    Abstract: A method and apparatus for filtering messages, comprising: determining a first semantic anchor corresponding to a first group of messages (for example, legitimate messages) and a second semantic anchor corresponding to a second group of messages (for example, unsolicited messages); determining a vector corresponding to an incoming message; comparing the vector corresponding to the incoming message with at least one of the first semantic anchor and the second semantic anchor to obtain a first comparison value and a second comparison value; and filtering the incoming message based on the first comparison value and the second comparison value.
    Type: Application
    Filed: June 14, 2001
    Publication date: January 9, 2003
    Inventors: Jerome R. Bellegarda, Devang Naik, Kim E. A. Silverman
  • Patent number: 6477488
    Abstract: A method and system for dynamic language modeling of a document are described. In one embodiment, a number of local probabilities of a current document are computed and a vector representation of the current document in a latent semantic analysis (LSA) space is determined. In addition, a number of global probabilities based upon the vector representation of the current document in an LSA space is computed. Further, the local probabilities and the global probabilities are combined to produce the language modeling.
    Type: Grant
    Filed: March 10, 2000
    Date of Patent: November 5, 2002
    Assignee: Apple Computer, Inc.
    Inventor: Jerome R. Bellegarda
  • Publication number: 20020138270
    Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.
    Type: Application
    Filed: February 22, 2002
    Publication date: September 26, 2002
    Applicant: Apple Computer, Inc.
    Inventors: Jerome R. Bellegarda, Kim Silverman
  • Patent number: 6374217
    Abstract: Speech or acoustic signals are processed directly using a hybrid stochastic language model produced by integrating a latent semantic analysis language model into an n-gram probability language model. The latent semantic analysis language model probability is computed using a first pseudo-document vector that is derived from a second pseudo-document vector with the pseudo-document vectors representing pseudo-documents created from the signals received at different times. The first pseudo-document vector is derived from the second pseudo-document vector by updating the second pseudo-document vector directly in latent semantic analysis space in response to at least one addition of a candidate word of the received speech signals to the pseudo-document represented by the second pseudo-document vector. Updating precludes mapping a sparse representation for a pseudo-document into the latent semantic space to produce the first pseudo-document vector.
    Type: Grant
    Filed: March 12, 1999
    Date of Patent: April 16, 2002
    Assignee: Apple Computer, Inc.
    Inventor: Jerome R. Bellegarda
  • Patent number: 6366884
    Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.
    Type: Grant
    Filed: November 8, 1999
    Date of Patent: April 2, 2002
    Assignee: Apple Computer, Inc.
    Inventors: Jerome R. Bellegarda, Kim Silverman
  • Patent number: 6208971
    Abstract: A method and apparatus for command recognition using data-driven semantic inference includes recognizing a sequence of words received as the voice command. Data-driven semantic inference is then used with the recognized sequence of words to recognize the voice command. Thus, the command is identified on the basis of the semantics of words of the spoken command rather than the particular grammar of each of predetermined different ways the command could be worded.
    Type: Grant
    Filed: October 30, 1998
    Date of Patent: March 27, 2001
    Assignee: Apple Computer, Inc.
    Inventors: Jerome R. Bellegarda, Kim E. A. Silverman
  • Patent number: 6154722
    Abstract: A method and an apparatus for a speech recognition system that uses a language model based on an integrated finite state grammar probability and an n-gram probability are provided. According to one aspect of the invention, speech signals are received into a processor of a speech recognition system. The speech signals are processed using a speech recognition system hosting a language model. The language model is produced by integrating a finite state grammar probability and an n-gram probability. In the integration, the n-gram probability is modified based on information provided by the finite state grammar probability; thus, the finite state grammar probability is subordinate to the n-gram probability. The language model is used by a decoder along with at least one acoustic model to perform a hypothesis search on an acoustic sequence to provide a word sequence output. The word sequence generated is representative of the received speech signals.
    Type: Grant
    Filed: December 18, 1997
    Date of Patent: November 28, 2000
    Assignee: Apple Computer, Inc.
    Inventor: Jerome R. Bellegarda
  • Patent number: 6064960
    Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.
    Type: Grant
    Filed: December 18, 1997
    Date of Patent: May 16, 2000
    Assignee: Apple Computer, Inc.
    Inventors: Jerome R. Bellegarda, Kim Silverman
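
Illustrative sketch (not part of the patent records): the message-filtering abstract listed above (patent 7076527, publication 20030009526) describes comparing an incoming message's vector against two semantic anchors and filtering on the two comparison values. The Python below is a minimal sketch of that idea under loud assumptions: plain bag-of-words counts stand in for whatever semantic vector space the patent actually uses, cosine similarity is an assumed comparison measure, and every function name (vectorize, build_anchor, cosine, filter_message) is hypothetical.

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words count vector (assumed stand-in for a semantic-space vector)."""
    return Counter(text.lower().split())


def build_anchor(messages):
    """Sum the vectors of a group of messages to form that group's anchor."""
    anchor = Counter()
    for message in messages:
        anchor.update(vectorize(message))
    return anchor


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def filter_message(message, legit_anchor, junk_anchor):
    """Label a message by whichever anchor it is closer to (ties go to legitimate)."""
    vector = vectorize(message)
    legit_score = cosine(vector, legit_anchor)  # first comparison value
    junk_score = cosine(vector, junk_anchor)    # second comparison value
    return "unsolicited" if junk_score > legit_score else "legitimate"


# Hypothetical training groups, invented for illustration only.
legit_anchor = build_anchor(["meeting agenda for monday",
                             "project status report attached"])
junk_anchor = build_anchor(["win free money now",
                            "free prize claim now"])

print(filter_message("claim your free money", legit_anchor, junk_anchor))
print(filter_message("status of the project meeting", legit_anchor, junk_anchor))
```

In this sketch a message that shares no words with either anchor scores zero against both and is passed through as legitimate; a real filter would need a deliberate policy for that case.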