Patents by Inventor Daniel E. Badt

Daniel E. Badt has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Providing speech recognition data to a speech enabled device when providing a new entry that is selectable via a speech recognition interface of the device

Patent number: 8010345

Abstract: The present invention discloses a solution for providing a phonetic representation for a content item along with a content item delivered to a speech enabled computing device. The phonetic representation can be specified in a manner that enables it to be added to a speech recognition grammar of the speech enabled computing device. Thus, the device can recognize speech commands using the newly added phonetic representation that involve the content item. Current implementations of speech recognition systems of this type rely internal generation of speech recognition data that is added to the speech recognition grammar. Generation of speech recognition data can, however, be resource intensive, which can be particularly problematic when the speech enabled device is resource limited. The disclosed solution offloads the task of providing the speech recognition data to an external device, such as a relatively resource rich server or a desktop device.

Type: Grant

Filed: December 18, 2007

Date of Patent: August 30, 2011

Assignee: International Business Machines Corporation

Inventors: Neal J. Alewine, Daniel E. Badt
Supporting multiple speech enabled user interface consoles within a motor vehicle

Patent number: 7904300

Abstract: An in-vehicle system that shares speech processing resources among multiple applications located within a vehicle. The system can include one or more software applications, each associated with different functionally independent in-vehicle consoles. Each application can have a console specific user interface. The system can also include a single in-vehicle speech processing system implemented separately from the in-vehicle consoles. The speech processing system can execute speech processing tasks responsive to requests received from the applications. That is, the in-vehicle speech processing system can provide speech processing capabilities for the applications. The provided speech processing capabilities can include text-to-speech capabilities and speech recognition capabilities.

Type: Grant

Filed: August 10, 2005

Date of Patent: March 8, 2011

Assignee: Nuance Communications, Inc.

Inventors: Lisa Abbott, Daniel E. Badt, Werayuth T. Charoenruengkit, John W. Eckhart, Michael Florio, Gary R. Hanson, Harvey M. Ruback, William Russell Whitehead, Steven G. Woodward
Overriding default speech processing behavior using a default focus receiver

Patent number: 7848928

Abstract: A method for implementing speech focus in a speech processing system can include the step of establishing a default focus receiver as a first entity to request speech focus of a speech processing system having multiple applications that share speech resources based upon speech focus. An event occurrence can be detected. An event handler of the default speech receiver can previously define behavior for the event occurrence and where default system behavior can be implemented within the speech processing system for the event occurrence. The default system behavior can be utilized when speech focus is not assigned during the event occurrence. Responsive to the event occurrence, at least one programmatic action can be performed in accordance with machine readable instructions of the event handler. The default system behavior is not implemented responsive to the event occurrence.

Type: Grant

Filed: August 10, 2005

Date of Patent: December 7, 2010

Assignee: Nuance Communications, Inc.

Inventors: Lisa Abbott, Daniel E. Badt, John W. Eckhart, Harvey M. Ruback, Steven G. Woodward
Enhancement to Viterbi speech processing algorithm for hybrid speech models that conserves memory

Patent number: 7805305

Abstract: The present invention discloses a method for semantically processing speech for speech recognition purposes. The method can reduce an amount of memory required for a Viterbi search of an N-gram language model having a value of N greater than two and also having at least one embedded grammar that appears in a multiple contexts to a memory size of approximately a bigram model search space with respect to the embedded grammar. The method also reduces needed CPU requirements. Achieved reductions can be accomplished by representing the embedded grammar as a recursive transition network (RTN), where only one instance of the recursive transition network is used for the contexts. Other than the embedded grammars, a Hidden Markov Model (HMM) strategy can be used for the search space.

Type: Grant

Filed: October 12, 2006

Date of Patent: September 28, 2010

Assignee: Nuance Communications, Inc.

Inventors: Daniel E. Badt, Tomas Beran, Radek Hampl, Pavel Krbec, Jan Sedivy
REDUCING A SIZE OF A COMPILED SPEECH RECOGNITION GRAMMAR

Publication number: 20090171663

Abstract: The present invention discloses creating and using speech recognition grammars of reduced size. The reduced speech recognition grammars can include a set of entries, each entry having a unique identifier and a phonetic representation that is used when matching speech input against the entries. Each entry can lack a textual spelling corresponding to the phonetic representation. The reduced speech recognition grammar can be digitally encoded and stored in a computer readable media, such as a hard drive or flash memory of a portable speech enabled device.

Type: Application

Filed: January 2, 2008

Publication date: July 2, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: DANIEL E. BADT, VLADIMIR BERGL, JOHN W. ECKHART, RADEK HAMPL, JONATHAN PALGON, HARVEY M. RUBACK
PROVIDING SPEECH RECOGNITION DATA TO A SPEECH ENABLED DEVICE WHEN PROVIDING A NEW ENTRY THAT IS SELECTABLE VIA A SPEECH RECOGNITION INTERFACE OF THE DEVICE

Publication number: 20090157392

Abstract: The present invention discloses a solution for providing a phonetic representation for a content item along with a content item delivered to a speech enabled computing device. The phonetic representation can be specified in a manner that enables it to be added to a speech recognition grammar of the speech enabled computing device. Thus, the device can recognize speech commands using the newly added phonetic representation that involve the content item. Current implementations of speech recognition systems of this type rely internal generation of speech recognition data that is added to the speech recognition grammar. Generation of speech recognition data can, however, be resource intensive, which can be particularly problematic when the speech enabled device is resource limited. The disclosed solution offloads the task of providing the speech recognition data to an external device, such as a relatively resource rich server or a desktop device.

Type: Application

Filed: December 18, 2007

Publication date: June 18, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Neal J. ALEWINE, Daniel E. BADT
ENHANCEMENT TO VITERBI SPEECH PROCESSING ALGORITHM FOR HYBRID SPEECH MODELS THAT CONSERVES MEMORY

Publication number: 20080091429

Abstract: The present invention discloses a method for semantically processing speech for speech recognition purposes. The method can reduce an amount of memory required for a Viterbi search of an N-gram language model having a value of N greater than two and also having at least one embedded grammar that appears in a multiple contexts to a memory size of approximately a bigram model search space with respect to the embedded grammar. The method also reduces needed CPU requirements. Achieved reductions can be accomplished by representing the embedded grammar as a recursive transition network (RTN), where only one instance of the recursive transition network is used for the contexts. Other than the embedded grammars, a Hidden Markov Model (HMM) strategy can be used for the search space.

Type: Application

Filed: October 12, 2006

Publication date: April 17, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Daniel E. Badt, Tomas Beran, Radek Hampl, Pavel Krbec, Jan Sedivy
Method of managing a speech cache

Patent number: 6741963

Abstract: A method and a system for improving recall of speech data in a computer speech system can include a plurality of speech cache management steps including providing a speech cache, receiving a speech system input and identifying a speech event in the received speech system input, the speech event comprising speech data. Subsequently, the speech data can be compared to pre-determined speech cache entry criteria; and, if the speech data meets one of the pre-determined entry criteria, at least one entry can be added to the speech cache, the at least one entry corresponding to the speech data. Additionally, the speech data can be compared to pre-determined speech cache exit criteria; and, if the speech data meets one of the pre-determined exit criteria, at least one entry can be purged from the speech cache, the at least one entry corresponding to the speech data. The entry criteria can include frequently used speech data, recently used speech data, and important speech data.

Type: Grant

Filed: June 21, 2000

Date of Patent: May 25, 2004

Assignee: International Business Machines Corporation

Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey M. Ruback, Carl A. Smith, Ronald E. VanBuskirk, Huifang Wang, Steven G. Woodward
Audio notification management system

Patent number: 6738742

Abstract: A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.

Type: Grant

Filed: February 11, 2003

Date of Patent: May 18, 2004

Assignee: International Business Machines Corporation

Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey Ruback, Carl A. Smith, Ronald E. Vanbuskirk, Huifang Wang, Steven G. Woodward
Audio notification management system

Publication number: 20030130850

Abstract: A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.

Type: Application

Filed: February 11, 2003

Publication date: July 10, 2003

Applicant: International Business Machines Corporation

Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey Ruback, Carl A. Smith, Ronald E. Vanbuskirk, Huifang Wang, Steven G. Woodward
Audio notification management system

Patent number: 6542868

Abstract: A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.

Type: Grant

Filed: September 23, 1999

Date of Patent: April 1, 2003

Assignee: International Business Machines Corporation

Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey Ruback, Carl A. Smith, Ronald E. Vanbuskirk, Huifang Wang, Steven G. Woodward