Patents by Inventor Michael D. Edgington

Michael D. Edgington has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Configurable speech recognition system using multiple recognizers

Patent number: 10049669

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

Type: Grant

Filed: January 6, 2012

Date of Patent: August 14, 2018

Assignee: Nuance Communications, Inc.

Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
Configurable speech recognition system using a pronunciation alignment between multiple recognizers

Patent number: 10032455

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

Type: Grant

Filed: January 6, 2012

Date of Patent: July 24, 2018

Assignee: Nuance Communications, Inc.

Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
Configurable speech recognition system using multiple recognizers

Patent number: 9953653

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

Type: Grant

Filed: January 6, 2012

Date of Patent: April 24, 2018

Assignee: Nuance Communications, Inc.

Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
Configurable speech recognition system using multiple recognizers

Patent number: 9183843

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture. An indication of the availability of the remote speech recognition to perform speech recognition at a point in time may be provided to a user of the client device via a user interface of the client device.

Type: Grant

Filed: January 22, 2013

Date of Patent: November 10, 2015

Assignee: Nuance Communications, Inc.

Inventors: Mark Fanty, Timothy Lynch, Michael J. Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
Configurable speech recognition system using multiple recognizers

Patent number: 8930194

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

Type: Grant

Filed: January 6, 2012

Date of Patent: January 6, 2015

Assignee: Nuance Communications, Inc.

Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
Configurable speech recognition system using multiple recognizers

Patent number: 8898065

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

Type: Grant

Filed: January 6, 2012

Date of Patent: November 25, 2014

Assignee: Nuance Communications, Inc.

Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
Using codec parameters for endpoint detection in speech recognition

Patent number: 8762150

Abstract: Systems, methods and apparatus for determining an estimated endpoint of human speech in a sound wave received by a mobile device having a speech encoder for encoding the sound wave to produce an encoded representation of the sound wave. The estimated endpoint may be determined by analyzing information available from the speech encoder, without analyzing the sound wave directly and without producing a decoded representation of the sound wave. The encoded representation of the sound wave may be transmitted to a remote server for speech recognition processing, along with an indication of the estimated endpoint.

Type: Grant

Filed: September 16, 2010

Date of Patent: June 24, 2014

Assignee: Nuance Communications, Inc.

Inventors: Michael D. Edgington, Stephen W. Laverty, Gunnar Evermann
Methods and apparatus for formant-based voice synthesis

Patent number: 8706488

Abstract: In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.

Type: Grant

Filed: February 27, 2013

Date of Patent: April 22, 2014

Assignee: Nuance Communications, Inc.

Inventors: Michael D. Edgington, Laurence Gillick, Jordan R. Cohen
Methods and apparatus for formant-based voice systems

Patent number: 8447592

Abstract: In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.

Type: Grant

Filed: September 13, 2005

Date of Patent: May 21, 2013

Assignee: Nuance Communications, Inc.

Inventors: Michael D. Edgington, Laurence Gillick, Jordan R. Cohen
CONFIGURABLE SPEECH RECOGNITION SYSTEM USING MULTIPLE RECOGNIZERS

Publication number: 20120179471

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

Type: Application

Filed: January 6, 2012

Publication date: July 12, 2012

Applicant: Nuance Communications, Inc.

Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
CONFIGURABLE SPEECH RECOGNITION SYSTEM USING MULTIPLE RECOGNIZERS

Publication number: 20120179469

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

Type: Application

Filed: January 6, 2012

Publication date: July 12, 2012

Applicant: Nuance Communication, Inc.

Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
CONFIGURABLE SPEECH RECOGNITION SYSTEM USING MULTIPLE RECOGNIZERS

Publication number: 20120179464

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

Type: Application

Filed: January 6, 2012

Publication date: July 12, 2012

Applicant: Nuance Communications, Inc.

Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
CONFIGURABLE SPEECH RECOGNITION SYSTEM USING MULTIPLE RECOGNIZERS

Publication number: 20120179457

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

Type: Application

Filed: January 6, 2012

Publication date: July 12, 2012

Applicant: Nuance Communications, Inc.

Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
CONFIGURABLE SPEECH RECOGNITION SYSTEM USING MULTIPLE RECOGNIZERS

Publication number: 20120179463

Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

Type: Application

Filed: January 6, 2012

Publication date: July 12, 2012

Applicant: Nuance Communications, Inc.

Inventors: Michael Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
USING CODEC PARAMETERS FOR ENDPOINT DETECTION IN SPEECH RECOGNITION

Publication number: 20120072211

Abstract: Systems, methods and apparatus for determining an estimated endpoint of human speech in a sound wave received by a mobile device having a speech encoder for encoding the sound wave to produce an encoded representation of the sound wave. The estimated endpoint may be determined by analyzing information available from the speech encoder, without analyzing the sound wave directly and without producing a decoded representation of the sound wave. The encoded representation of the sound wave may be transmitted to a remote server for speech recognition processing, along with an indication of the estimated endpoint.

Type: Application

Filed: September 16, 2010

Publication date: March 22, 2012

Applicant: Nuance Communications, Inc.

Inventors: Michael D. Edgington, Stephen W. Laverty, Gunnar Evermann
Learning of dialogue states and language model of spoken information system

Patent number: 6839671

Abstract: In this invention dialogue states for a dialogue model are created using a training corpus of example human—human dialogues. Dialogue states are modelled at the turn level rather than at the move level, and the dialogue states are derived from the training corpus. The range of operator dialogue utterances is actually quite small in many services and therefore may be categorized into a set of predetermined meanings. This is an important assumption which is not true of general conversation, but is often true of conversations between telephone operators and people. Phrases are specified which have specific substitution and deletion penalties, for example the two phrases “I would like to” and “can I” may be specified as a possible substitution with low or zero penalty. Thus allows common equivalent phrases are given low substitution penalties. Insignificant phrases such as ‘erm’ are given low or zero deletion penalties.

Type: Grant

Filed: December 19, 2000

Date of Patent: January 4, 2005

Assignee: British Telecommunications public limited company

Inventors: David J. Attwater, Michael D. Edgington, Peter J. Durston