Patents by Inventor Jerome R. Bellegarda

Jerome R. Bellegarda has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Unsupervised data-driven pronunciation modeling

Patent number: 7165032

Abstract: Pronunciation for an input word is modeled by generating a set of candidate phoneme strings having pronunciations close to the input word in an orthographic space. Phoneme sub-strings in the set are selected as the pronunciation. In one aspect, a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word is used to determine the candidate phoneme strings. The words are chosen from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary. In another aspect, the phoneme sub-strings are selected by aligning the candidate phoneme strings on common phoneme sub-strings to produce an occurrence count, which is used to choose the phoneme sub-strings for the pronunciation.

Type: Grant

Filed: November 22, 2002

Date of Patent: January 16, 2007

Assignee: Apple Computer, Inc.

Inventor: Jerome R. Bellegarda
Method and apparatus for speech recognition using semantic inference and word agglomeration

Patent number: 7149695

Abstract: A method and apparatus for command recognition using semantic inference and word agglomeration is described herein. According to one aspect of the present invention, a method for recognizing a voice command comprises recognizing a sequence of words received as the voice command. The sequence of words is further agglomerated into a sequence of word n-tuples. Semantic inference is applied to the sequence of word n-tuples to recognize the voice command.

Type: Grant

Filed: October 13, 2000

Date of Patent: December 12, 2006

Assignee: Apple Computer, Inc.

Inventor: Jerome R. Bellegarda
Method and apparatus for speech recognition using latent semantic adaptation

Patent number: 7124081

Abstract: A method and apparatus for speech recognition using latent semantic adaptation is described herein. According to one aspect of the present invention, a method for recognizing speech comprises using latent semantic analysis (LSA) to generate an LSA space for a collection of documents and to continually adapt the LSA space with new documents as they become available. Adaptation of the LSA space is optimally two-sided, taking into account the new words in the new documents. Alternatively, adaptation is one-sided, taking into account the new documents but discarding any new words appearing in those documents.

Type: Grant

Filed: September 28, 2001

Date of Patent: October 17, 2006

Assignee: Apple Computer, Inc.

Inventor: Jerome R. Bellegarda
Method and apparatus for filtering email

Patent number: 7076527

Abstract: A method and apparatus for filtering messages comprising determining a first semantic anchor corresponding to a first group of messages, for example, legitimate messages and a second semantic anchor corresponding to a second group of messages, for example, unsolicited messages. Determining a vector corresponding to an incoming message; comparing the vector corresponding to the incoming message with at least one of the first semantic anchor and the second semantic anchor to obtain a first comparison value and a second comparison value; and filtering the incoming message based on the first comparison value and the second comparison value.

Type: Grant

Filed: June 14, 2001

Date of Patent: July 11, 2006

Assignee: Apple Computer, Inc.

Inventors: Jerome R. Bellegarda, Devang Naik, Kim E. A. Silverman
Unsupervised data-driven pronunciation modeling

Patent number: 7047193

Abstract: Pronunciation for an input word is modeled by generating a set of candidate phoneme strings having pronunciations close to the input word in an orthographic space. Phoneme sub-strings in the set are selected as the pronunciation. In one aspect, a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word is used to determine the candidate phoneme strings. The words are chosen from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary. In another aspect, the phoneme sub-strings are selected by aligning the candidate phoneme strings on common phoneme sub-strings to produce an occurrence count, which is used to choose the phoneme sub-strings for the pronunciation.

Type: Grant

Filed: September 13, 2002

Date of Patent: May 16, 2006

Assignee: Apple Computer, Inc.

Inventor: Jerome R. Bellegarda
Use of semantic inference and context-free grammar with speech recognition system

Patent number: 6836760

Abstract: A method and apparatus to use semantic inference with speech recognition systems includes recognizing at least one spoken word, processing the spoken word using a context-free grammar, deriving an output from the context-free grammar, and translating the output to a predetermined command.

Type: Grant

Filed: September 29, 2000

Date of Patent: December 28, 2004

Assignee: Apple Computer, Inc.

Inventors: Jerome R. Bellegarda, Kim E. A. Silverman
Method and apparatus for improved duration modeling of phonemes

Patent number: 6785652

Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.

Type: Grant

Filed: December 19, 2002

Date of Patent: August 31, 2004

Assignee: Apple Computer, Inc.

Inventors: Jerome R. Bellegarda, Kim Silverman
Method for dynamic context scope selection in hybrid N-gram+LSA language modeling

Patent number: 6778952

Abstract: A method and system for dynamic language modeling of a document are described. In one embodiment, a number of local probabilities of a current document are computed and a vector representation of the current document in a latent semantic analysis (LSA) space is determined. In addition, a number of global probabilities based upon the vector representation of the current document in an LSA space is computed. Further, the local probabilities and the global probabilities are combined to produce the language modeling.

Type: Grant

Filed: September 12, 2002

Date of Patent: August 17, 2004

Assignee: Apple Computer, Inc.

Inventor: Jerome R. Bellegarda
Unsupervised data-driven pronunciation modeling

Publication number: 20040054533

Abstract: Pronunciation for an input word is modeled by generating a set of candidate phoneme strings having pronunciations close to the input word in an orthographic space. Phoneme sub-strings in the set are selected as the pronunciation. In one aspect, a first closeness measure between phoneme strings for words chosen from a dictionary and contexts within the input word is used to determine the candidate phoneme strings. The words are chosen from the dictionary based on a second closeness measure between a representation of the input word in the orthographic space and orthographic anchors corresponding to the words in the dictionary. In another aspect, the phoneme sub-strings are selected by aligning the candidate phoneme strings on common phoneme sub-strings to produce an occurrence count, which is used to choose the phoneme sub-strings for the pronunciation.

Type: Application

Filed: November 22, 2002

Publication date: March 18, 2004

Inventor: Jerome R. Bellegarda
Method and apparatus for improved duration modeling of phonemes

Publication number: 20030093277

Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.

Type: Application

Filed: December 19, 2002

Publication date: May 15, 2003

Inventors: Jerome R. Bellegarda, Kim Silverman
Method and apparatus for improved duration modeling of phonemes

Patent number: 6553344

Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.

Type: Grant

Filed: February 22, 2002

Date of Patent: April 22, 2003

Assignee: Apple Computer, Inc.

Inventors: Jerome R. Bellegarda, Kim Silverman
Method for dynamic context scope selection in hybrid N-gramlanguage modeling

Publication number: 20030069909

Abstract: A method and system for dynamic language modeling of a document are described. In one embodiment, a number of local probabilities of a current document are computed and a vector representation of the current document in a latent semantic analysis (LSA) space is determined. In addition, a number of global probabilities based upon the vector representation of the current document in an LSA space is computed. Further, the local probabilities and the global probabilities are combined to produce the language modeling.

Type: Application

Filed: September 12, 2002

Publication date: April 10, 2003

Inventor: Jerome R. Bellegarda
Method and apparatus for filtering email

Publication number: 20030009526

Abstract: A method and apparatus for filtering messages comprising determining a first semantic anchor corresponding to a first group of messages, for example, legitimate messages and a second semantic anchor corresponding to a second group of messages, for example, unsolicited messages. Determining a vector corresponding to an incoming message; comparing the vector corresponding to the incoming message with at least one of the first semantic anchor and the second semantic anchor to obtain a first comparison value and a second comparison value; and filtering the incoming message based on the first comparison value and the second comparison value.

Type: Application

Filed: June 14, 2001

Publication date: January 9, 2003

Inventors: Jerome R. Bellegarda, Devang Naik, Kim E.A. Silverman
Method for dynamic context scope selection in hybrid n-gram+LSA language modeling

Patent number: 6477488

Abstract: A method and system for dynamic language modeling of a document are described. In one embodiment, a number of local probabilities of a current document are computed and a vector representation of the current document in a latent semantic analysis (LSA) space is determined. In addition, a number of global probabilities based upon the vector representation of the current document in an LSA space is computed. Further, the local probabilities and the global probabilities are combined to produce the language modeling.

Type: Grant

Filed: March 10, 2000

Date of Patent: November 5, 2002

Assignee: Apple Computer, Inc.

Inventor: Jerome R. Bellegarda
Method and apparatus for improved duration modeling of phonemes

Publication number: 20020138270

Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.

Type: Application

Filed: February 22, 2002

Publication date: September 26, 2002

Applicant: Apple Computer, Inc.

Inventors: Jerome R. Bellegarda, Kim Silverman
Fast update implementation for efficient latent semantic language modeling

Patent number: 6374217

Abstract: Speech or acoustic signals are processed directly using a hybrid stochastic language model produced by integrating a latent semantic analysis language model into an n-gram probability language model. The latent semantic analysis language model probability is computed using a first pseudo-document vector that is derived from a second pseudo-document vector with the pseudo-document vectors representing pseudo-documents created from the signals received at different times. The first pseudo-document vector is derived from the second pseudo-document vector by updating the second pseudo-document vector directly in latent semantic analysis space in response to at least one addition of a candidate word of the received speech signals to the pseudo-document represented by the second pseudo-document vector. Updating precludes mapping a sparse representation for a pseudo-document into the latent semantic space to produce the first pseudo-document vector.

Type: Grant

Filed: March 12, 1999

Date of Patent: April 16, 2002

Assignee: Apple Computer, Inc.

Inventor: Jerome R. Bellegarda
Method and apparatus for improved duration modeling of phonemes

Patent number: 6366884

Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.

Type: Grant

Filed: November 8, 1999

Date of Patent: April 2, 2002

Assignee: Apple Computer, Inc.

Inventors: Jerome R. Bellegarda, Kim Silverman
Method and apparatus for command recognition using data-driven semantic inference

Patent number: 6208971

Abstract: A method and apparatus for command recognition using data-driven semantic inference includes recognizing a sequence of words received as the voice command. Data-driven semantic inference is then used with the recognized sequence of words to recognize the voice command. Thus, the command is identified on the basis of the semantics of words of the spoken command rather than the particular grammar of each of predetermined different ways the command could be worded.

Type: Grant

Filed: October 30, 1998

Date of Patent: March 27, 2001

Assignee: Apple Computer, Inc.

Inventors: Jerome R. Bellegarda, Kim E. A. Silverman
Method and apparatus for a speech recognition system language model that integrates a finite state grammar probability and an N-gram probability

Patent number: 6154722

Abstract: A method and an apparatus for a speech recognition system that uses a language model based on an integrated finite state grammar probability and an n-gram probability are provided. According to one aspect of the invention, speech signals are received into a processor of a speech recognition system. The speech signals are processed using a speech recognition system hosting a language model. The language model is produced by integrating a finite state grammar probability and an n-gram probability. In the integration, the n-gram probability is modified based on information provided by the finite state grammar probability; thus, the finite state grammar probability is subordinate to the n-gram probability. The language model is used by a decoder along with at least one acoustic model to perform a hypothesis search on an acoustic sequence to provide a word sequence output. The word sequence generated is representative of the received speech signals.

Type: Grant

Filed: December 18, 1997

Date of Patent: November 28, 2000

Assignee: Apple Computer, Inc.

Inventor: Jerome R. Bellegarda
Method and apparatus for improved duration modeling of phonemes

Patent number: 6064960

Abstract: A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis system. The received text is processed using a sum-of-products phoneme duration model that is used in either the formant method or the concatenative method of speech generation. The phoneme duration model, which is used along with a phoneme pitch model, is produced by developing a non-exponential functional transformation form for use with a generalized additive model. The non-exponential functional transformation form comprises a root sinusoidal transformation that is controlled in response to a minimum phoneme duration and a maximum phoneme duration. The minimum and maximum phoneme durations are observed in training data. The received text is processed by specifying at least one of a number of contextual factors for the generalized additive model.

Type: Grant

Filed: December 18, 1997

Date of Patent: May 16, 2000

Assignee: Apple Computer, Inc.

Inventors: Jerome R. Bellegarda, Kim Silverman

prev … 4 5 6 7 8 9 next