Patents by Inventor Michael D. Edgington
Michael D. Edgington has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10049669Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.Type: GrantFiled: January 6, 2012Date of Patent: August 14, 2018Assignee: Nuance Communications, Inc.Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Patent number: 10032455Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.Type: GrantFiled: January 6, 2012Date of Patent: July 24, 2018Assignee: Nuance Communications, Inc.Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Patent number: 9953653Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.Type: GrantFiled: January 6, 2012Date of Patent: April 24, 2018Assignee: Nuance Communications, Inc.Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Patent number: 9183843Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture. An indication of the availability of the remote speech recognition to perform speech recognition at a point in time may be provided to a user of the client device via a user interface of the client device.Type: GrantFiled: January 22, 2013Date of Patent: November 10, 2015Assignee: Nuance Communications, Inc.Inventors: Mark Fanty, Timothy Lynch, Michael J. Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Patent number: 8930194Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.Type: GrantFiled: January 6, 2012Date of Patent: January 6, 2015Assignee: Nuance Communications, Inc.Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Patent number: 8898065Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.Type: GrantFiled: January 6, 2012Date of Patent: November 25, 2014Assignee: Nuance Communications, Inc.Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Patent number: 8762150Abstract: Systems, methods and apparatus for determining an estimated endpoint of human speech in a sound wave received by a mobile device having a speech encoder for encoding the sound wave to produce an encoded representation of the sound wave. The estimated endpoint may be determined by analyzing information available from the speech encoder, without analyzing the sound wave directly and without producing a decoded representation of the sound wave. The encoded representation of the sound wave may be transmitted to a remote server for speech recognition processing, along with an indication of the estimated endpoint.Type: GrantFiled: September 16, 2010Date of Patent: June 24, 2014Assignee: Nuance Communications, Inc.Inventors: Michael D. Edgington, Stephen W. Laverty, Gunnar Evermann
-
Patent number: 8706488Abstract: In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.Type: GrantFiled: February 27, 2013Date of Patent: April 22, 2014Assignee: Nuance Communications, Inc.Inventors: Michael D. Edgington, Laurence Gillick, Jordan R. Cohen
-
Patent number: 8447592Abstract: In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.Type: GrantFiled: September 13, 2005Date of Patent: May 21, 2013Assignee: Nuance Communications, Inc.Inventors: Michael D. Edgington, Laurence Gillick, Jordan R. Cohen
-
Publication number: 20120179469Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.Type: ApplicationFiled: January 6, 2012Publication date: July 12, 2012Applicant: Nuance Communication, Inc.Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Publication number: 20120179464Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.Type: ApplicationFiled: January 6, 2012Publication date: July 12, 2012Applicant: Nuance Communications, Inc.Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Publication number: 20120179457Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.Type: ApplicationFiled: January 6, 2012Publication date: July 12, 2012Applicant: Nuance Communications, Inc.Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Publication number: 20120179463Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.Type: ApplicationFiled: January 6, 2012Publication date: July 12, 2012Applicant: Nuance Communications, Inc.Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Publication number: 20120179471Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.Type: ApplicationFiled: January 6, 2012Publication date: July 12, 2012Applicant: Nuance Communications, Inc.Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
-
Publication number: 20120072211Abstract: Systems, methods and apparatus for determining an estimated endpoint of human speech in a sound wave received by a mobile device having a speech encoder for encoding the sound wave to produce an encoded representation of the sound wave. The estimated endpoint may be determined by analyzing information available from the speech encoder, without analyzing the sound wave directly and without producing a decoded representation of the sound wave. The encoded representation of the sound wave may be transmitted to a remote server for speech recognition processing, along with an indication of the estimated endpoint.Type: ApplicationFiled: September 16, 2010Publication date: March 22, 2012Applicant: Nuance Communications, Inc.Inventors: Michael D. Edgington, Stephen W. Laverty, Gunnar Evermann
-
Patent number: 6839671Abstract: In this invention dialogue states for a dialogue model are created using a training corpus of example human—human dialogues. Dialogue states are modelled at the turn level rather than at the move level, and the dialogue states are derived from the training corpus. The range of operator dialogue utterances is actually quite small in many services and therefore may be categorized into a set of predetermined meanings. This is an important assumption which is not true of general conversation, but is often true of conversations between telephone operators and people. Phrases are specified which have specific substitution and deletion penalties, for example the two phrases “I would like to” and “can I” may be specified as a possible substitution with low or zero penalty. Thus allows common equivalent phrases are given low substitution penalties. Insignificant phrases such as ‘erm’ are given low or zero deletion penalties.Type: GrantFiled: December 19, 2000Date of Patent: January 4, 2005Assignee: British Telecommunications public limited companyInventors: David J. Attwater, Michael D. Edgington, Peter J. Durston
-
Publication number: 20030091163Abstract: In this invention dialogue states for a dialogue model are created using a training corpus of example human-human dialogues. Dialogue states are modelled at the turn level rather than at the move level, and the dialogue states are derived from the training corpus.Type: ApplicationFiled: May 20, 2002Publication date: May 15, 2003Inventors: David J Attwater, Michael D Edgington, Peter J Durston