Patents by Inventor Laurence S. Gillick
Laurence S. Gillick has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 7133827Abstract: A new word model is trained from synthetic word samples derived by Monte Carlo techniques from one or more prior word models. The prior word model can be a phonetic word model and the new word model can be a non-phonetic, whole-word, word model. The prior word model can be trained from data that has undergone a first channel normalization and the synthesized word samples from which the new word model is trained can undergo a different channel normalization similar to that to be used in a given speech recognition context. The prior word model can have a first model structure and the new word model can have a second, different, model structure. These differences in model structure can include, for example, differences of model topology; differences of model complexity; and differences in the type of basis function used in a description of such probability distributions.Type: GrantFiled: February 6, 2003Date of Patent: November 7, 2006Assignee: Voice Signal Technologies, Inc.Inventors: Laurence S. Gillick, Donald R. McAllaster, Daniel L. Roth
-
Patent number: 7120582Abstract: The invention provides techniques for creating and using fragmented word models to increase the effective size of an active vocabulary of a speech recognition system. The active vocabulary represents all words and word fragments that the speech recognition system is able to recognize. Each word may be represented by a combination of acoustic models. As such, the active vocabulary represents the combinations of acoustic models that the speech recognition system may compare to a user's speech to identify acoustic models that best match the user's speech. The effective size of the active vocabulary may be increased by dividing words into constituent components or fragments (for example, prefixes, suffixes, separators, infixes, and roots) and including each component as a separate entry in the active vocabulary.Type: GrantFiled: September 7, 1999Date of Patent: October 10, 2006Assignee: Dragon Systems, Inc.Inventors: Jonathan H. Young, Haakon L. Chevalier, Laurence S. Gillick, Toffee A. Albina, Marlboro B. Moore, III, Paul E. Rensing, Jonathan P. Yamron
-
Patent number: 7003456Abstract: A computer-based method of routing a message to a system includes receiving a message, and processing the message using large-vocabulary continuous speech recognition to generate a string of text corresponding to the message. The method includes generating a confidence estimate of the string of text corresponding to the message and comparing the confidence estimate to a predetermined threshold. If the confidence estimate satisfies the predetermined threshold, the string of text is forwarded to the system. If the confidence estimate does not satisfy the predetermined threshold, the information relating to the message is forwarded to a transcriptionist. The message may include one or more utterances. Each utterance in the message may be separately or jointly processed. In this way, a confidence estimate may be generated and evaluated for each utterance or for the whole message. Information relating to each utterance may be separately or jointly forwarded based on the results of the generation and evaluation.Type: GrantFiled: June 12, 2001Date of Patent: February 21, 2006Inventors: Laurence S. Gillick, Robert Roth, Linda Manganaro, Barbara R. Peskin, David C. Petty, Ashwin Rao
-
Publication number: 20040260547Abstract: A method of processing speech in a noisy environment includes determining, upon a wake-up command, when the environment is too noisy to yield reliable recognition of a user's spoken words, and alerting the user that the environment is too noisy. Determining when the environment is too noisy includes calculating a ratio of signal to noise. The signal corresponds to of an amount of energy in the spoken utterance, and the noise corresponds to an amount of energy in the background noise. The method further includes comparing the signal to noise to a threshold.Type: ApplicationFiled: May 10, 2004Publication date: December 23, 2004Applicant: Voice Signal TechnologiesInventors: Jordan Cohen, Daniel L, Roth, Laurence S. Gillick
-
Publication number: 20040210438Abstract: A method for speech recognition. The method uses a single pronunciation estimator to train acoustic phoneme models and recognize utterances from multiple languages. The method includes accepting text spellings of training words in a plurality of sets of training words, each set corresponding to a different one of a plurality of languages. The method also includes, for each of the sets of training words in the plurality, receiving pronunciations for the training words in the set, the pronunciations being characteristic of native speakers of the language of the set, the pronunciations also being in terms of subword units at least some of which are common to two or more of the languages. The method also includes training a single pronunciation estimator using data comprising the text spellings and the pronunciations of the training words.Type: ApplicationFiled: November 17, 2003Publication date: October 21, 2004Inventors: Laurence S. Gillick, Thomas E. Lynch, Michael J. Newman, Daniel L. Roth, Steven A. Wegmann, Jonathan P. Yamron
-
Publication number: 20040073428Abstract: Text-to-speech synthesis modifies the pitch of the sounds it concatenates to generate speech, when such sounds are in compressed, coded form, so as to make them sound better together. The pitch, duration, and energy of such concatenated sounds can be altered to better match, respectively, pitch, duration, and/or energy contours generated from phonetic spelling of the speech to be synthesized, which can, in turn, be derived from the text to be synthesized. The synthesized speech can be generated from the encoded sound of sub-word snippets as well as of one or more whole words. The duration of concatenated sounds can be changed by inserting or deleting sound frames associated with individual snippets. Such text-to-speech can be used to say words recognized by speech recognition, such as to provide feedback on the recognition. Such text-to-speech synthesis can be used in portable devices such as cellphones, PDAs, and/or wrist phones.Type: ApplicationFiled: October 10, 2002Publication date: April 15, 2004Inventors: Igor Zlokarnik, Laurence S. Gillick, Jordan R. Cohen
-
Publication number: 20020133341Abstract: A computer-based method of routing a message to a system includes receiving a message, and processing the message using large-vocabulary continuous speech recognition to generate a string of text corresponding to the message. The method includes generating a confidence estimate of the string of text corresponding to the message and comparing the confidence estimate to a predetermined threshold. If the confidence estimate satisfies the predetermined threshold, the string of text is forwarded to the system. If the confidence estimate does not satisfy the predetermined threshold, the information relating to the message is forwarded to a transcriptionist. The message may include one or more utterances. Each utterance in the message may be separately or jointly processed. In this way, a confidence estimate may be generated and evaluated for each utterance or for the whole message. Information relating to each utterance may be separately or jointly forwarded based on the results of the generation and evaluation.Type: ApplicationFiled: June 12, 2001Publication date: September 19, 2002Inventors: Laurence S. Gillick, Robert Roth, Linda Manganaro, Barbara R. Peskin, David C. Petty, Ashwin Rao
-
Patent number: 6224636Abstract: The content of a speech sample is recognized using a computer system by evaluating the speech sample against a nonparametric set of training observations, for example, utterances from one or more human speakers. The content of the speech sample is recognized based on the evaluation results. The speech recognition process also may rely on a comparison between the speech sample and a parametric model of the training observations.Type: GrantFiled: February 28, 1997Date of Patent: May 1, 2001Assignee: Dragon Systems, Inc.Inventors: Steven A. Wegmann, Laurence S. Gillick
-
Patent number: 6167377Abstract: Language model results are combined according to a combination expression to produce combined language model results for a set of candidates. A candidate is selected and the combination expression is adjusted using language model results associated with the selected candidate.Type: GrantFiled: March 28, 1997Date of Patent: December 26, 2000Assignee: Dragon Systems, Inc.Inventors: Laurence S. Gillick, Joel M. Gould, Robert Roth, Paul A. van Mulbregt, Michael D. Bibeault
-
Patent number: 6151575Abstract: A source-adapted model for use in speech recognition is generated by defining a linear relationship between a first element of an initial model and a first element of the source-adapted model. Thereafter, speech data that corresponds to the first element of the initial model is assembled from a set of speech data for a particular source associated with the source-adapted model. A linear transform that maps between the assembled speech data and the first element of the initial model is then determined. Finally, a first element of the source-adapted model is produced from the first element of the initial model using the linear transform.Type: GrantFiled: October 28, 1997Date of Patent: November 21, 2000Assignee: Dragon Systems, Inc.Inventors: Michael Jack Newman, Laurence S. Gillick, Venkatesh Nagesha
-
Patent number: 6052657Abstract: System for segmenting text and identifying segment topics that match a user-specified topic. Topic tracking system creates a set of topic models from training text containing topic boundaries using a clustering algorithm. User supplies topic text. System creates a topic model of the topic text and adds the topic model to the set of topic models. User-supplied test text is segmented according to the set of topic models. Segments relating to the same topic as the topic text are selected.Type: GrantFiled: November 25, 1997Date of Patent: April 18, 2000Assignee: Dragon Systems, Inc.Inventors: Jonathan P. Yamron, Paul G. Bamberg, James Barnett, Laurence S. Gillick, Paul A. van Mulbregt
-
Patent number: 6029124Abstract: A speech sample is evaluated using a computer. Training data that include samples of speech are received and stored along with identification of speech elements to which portions of the training data are related. A speech sample is received and speech recognition is performed on the speech sample to produce recognition results. Finally, the recognition results are evaluated in view of the training data and the identification of the speech elements to which the portions of the training data are related. The technique may be used to perform tasks such as speech recognition, speaker identification, and language identification.Type: GrantFiled: March 31, 1998Date of Patent: February 22, 2000Assignee: Dragon Systems, Inc.Inventors: Laurence S. Gillick, Andres Corrada-Emmanuel, Michael J. Newman, Barbara R. Peskin
-
Patent number: 5946654Abstract: A speech model is produced for use in determining whether a speaker associated with the speech model produced an unidentified speech sample. First a sample of speech of a particular speaker is obtained. Next, the contents of the sample of speech are identified using speech recognition. Finally, a speech model associated with the particular speaker is produced using the sample of speech and the identified contents thereof. The speech model is produced without using an external mechanism to monitor the accuracy with which the contents were identified.Type: GrantFiled: February 21, 1997Date of Patent: August 31, 1999Assignee: Dragon Systems, Inc.Inventors: Michael Jack Newman, Laurence S. Gillick, Yoshiko Ito
-
Patent number: 5822730Abstract: A speech recognition technique uses lexical tree pre-filtering to obtain lists of words for use in performing speech recognition. The lexical tree pre-filtering includes representing a vocabulary of words using a lexical tree and identifying a first subset of the vocabulary that may correspond to speech spoken beginning at a first time by propagating through the lexical tree information about the speech spoken beginning at the first time. A second subset of the vocabulary that may correspond to speech spoken beginning at a second time is identified by propagating through the lexical tree information about the speech spoken beginning at the second time. Words included in the speech are recognized by comparing speech spoken beginning at the first time with words from the first subset of the vocabulary and speech spoken beginning at the second time with words from the second subset of the vocabulary. The state of the lexical tree is not reset between identifying the first and second subsets.Type: GrantFiled: August 22, 1996Date of Patent: October 13, 1998Assignee: Dragon Systems, Inc.Inventors: Robert Roth, James K. Baker, Laurence S. Gillick, Alan Walsh
-
Patent number: 5715367Abstract: A computerized system time aligns frames of spoken training data against models of the speech sounds; automatically selects different sets of phonetic context classifications which divide the speech sound models into speech sound groups aligned against acoustically similar frames; creates model components from the frames aligned against speech sound groups with related classifications; and uses these model components to build a separate model for each related speech sound group. A decision tree classifies speech sounds into such groups, and related speech sound groups descend from common tree nodes. New speech samples time aligned against a given speech sound group's model update models of related speech sound groups, decreasing the training data required to adapt the system. The phonetic context classifications can be based on knowledge of which contextual features are associated with acoustic similarity.Type: GrantFiled: January 23, 1995Date of Patent: February 3, 1998Assignee: Dragon Systems, Inc.Inventors: Laurence S. Gillick, Francesco Scattone
-
Patent number: 5680511Abstract: In one aspect, the invention provides word recognition systems that operate to recognize an unrecognized or ambiguous word that occurs within a passage of words. The system can offer several words as choice words for inserting into the passage to replace the unrecognized word. The system can select the best choice word by using the choice word to extract from a reference source, sample passages of text that relate to the choice word. For example, the system can select the dictionary passage that defines the choice word. The system then compares the selected passage to the current passage, and generates a score that indicates the likelihood that the choice word would occur within that passage of text. The system can select the choice word with the best score to substitute into the passage. The passage of words being analyzed can be any word sequence including an utterance, a portion of handwritten text, a portion of typewritten text or other such sequence of words, numbers and characters.Type: GrantFiled: June 7, 1995Date of Patent: October 21, 1997Assignee: Dragon Systems, Inc.Inventors: Janet M. Baker, Laurence S. Gillick, James K. Baker, Jonathan P. Yamron
-
Patent number: 5526463Abstract: The system of the invention relates to continuous speech pre-filtering systems for use in discrete and continuous speech recognition computer systems. The speech to be recognized is converted from utterances to frame data sets, which frame data sets are smoothed to generate a smooth frame model over a predetermined number of frames. A resident vocabulary is stored within the computer as clusters of word models which are acoustically similar over a succession of frame periods. A cluster score is generated by the system, which score includes the likelihood of the smooth frames evaluated using a probability model for the cluster against which the smooth frame model is being compared. Cluster sets having cluster scores below a predetermined acoustic threshold are removed from further consideration. The remaining cluster sets are unpacked for determination of a word score for each unpacked word.Type: GrantFiled: April 9, 1993Date of Patent: June 11, 1996Assignee: Dragon Systems, Inc.Inventors: Laurence S. Gillick, Robert S. Roth
-
Patent number: 5202952Abstract: A continuous speech prefiltering system for use in continuous speech recognition computer systems. The speech to be recognized is converted from utterances to frame data sets, which frame data sets are smoothed to generate a smooth frame model over a predetermined number of frames. A resident vocabulary is stored within the computer as clusters of word models which are acoustically similar over a succession of frame periods. A cluster score is generated by the system, which score includes the likelihood of the smooth frames evaluated using a probability model for the cluster against which the smooth frame model is being compared. Cluster sets having cluster scores below a predetermined acoustic threshold are removed from further consideration. The remaining cluster sets are unpacked for determination of a word score for each unpacked word.Type: GrantFiled: June 22, 1990Date of Patent: April 13, 1993Assignee: Dragon Systems, Inc.Inventors: Laurence S. Gillick, Robert S. Roth