Patents by Inventor Harvey M. Ruback

Harvey M. Ruback has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Selective enablement of speech recognition grammars

Patent number: 9196252

Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.
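
The local-versus-remote decision in this abstract can be illustrated with a minimal Python sketch. The `Grammar` class, the `characterize` and `choose_processing_site` functions, and the phrase-count threshold are all hypothetical; the abstract does not say which grammar characteristics drive the decision.

```python
from dataclasses import dataclass, field


@dataclass
class Grammar:
    """Hypothetical stand-in for a speech grammar: a name plus its phrases."""
    name: str
    phrases: list = field(default_factory=list)


def characterize(grammar):
    """Characterize a grammar by its phrase count (an assumed metric)."""
    return len(grammar.phrases)


def choose_processing_site(grammar, max_local_phrases=100):
    """Return "local" for small grammars and "remote" for larger ones."""
    if characterize(grammar) <= max_local_phrases:
        return "local"
    return "remote"


digits = Grammar("digits", [str(n) for n in range(10)])
cities = Grammar("cities", [f"city {n}" for n in range(5000)])
print(digits.name, choose_processing_site(digits))
print(cities.name, choose_processing_site(cities))
```

A real client might weigh other properties as well, such as vocabulary complexity or available memory; the single threshold here only keeps the sketch short.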

Type: Grant

Filed: October 26, 2009

Date of Patent: November 24, 2015

Assignee: Nuance Communications, Inc.

Inventors: Harvey M. Ruback, Steven G. Woodward
Dynamically adjusting speech grammar weights based on usage

Patent number: 8131548

Abstract: A speech processing method can automatically and dynamically adjust speech grammar weights at runtime based upon usage data. Each of the speech grammar weights can be associated with an available speech command contained within a speech grammar to which the speech grammar weights apply. The usage data can indicate a relative frequency with which each of the available speech commands is utilized.
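
One way to picture usage-based weighting is the Python sketch below, which derives a weight for each command from how often it appears in a usage log. The `adjust_weights` function and its additive smoothing are assumptions made for illustration; the abstract does not give a formula.

```python
from collections import Counter


def adjust_weights(commands, usage_log, smoothing=1.0):
    """Return a weight per command proportional to its smoothed usage count.

    The additive smoothing keeps never-used commands recognizable. It is an
    assumption of this sketch, not something the abstract specifies.
    """
    counts = Counter(usage_log)
    total = sum(counts[c] + smoothing for c in commands)
    return {c: (counts[c] + smoothing) / total for c in commands}


commands = ["play", "pause", "stop"]
usage_log = ["play", "play", "pause", "play"]
weights = adjust_weights(commands, usage_log)
for command in commands:
    print(command, round(weights[command], 3))
```

Calling `adjust_weights` again whenever the usage log grows would give the runtime adjustment the abstract describes.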

Type: Grant

Filed: March 6, 2006

Date of Patent: March 6, 2012

Assignee: Nuance Communications, Inc.

Inventors: Oscar J. Blass, Harvey M. Ruback, Roberto Vila
Method and system for improved speech recognition by degrading utterance pronunciations

Patent number: 7983914

Abstract: A speech recognition system or method can include a speech input device and a processor coupled to the speech input device. The processor can be programmed to identify a plurality of words that are members of confusable pairs of words where each pair includes a target word and a substituted word. The processor can degrade a pronunciation of the substituted word to provide a worse pronunciation of the substituted word. The processor can further compare the pronunciation of the target word with the worse pronunciation to the substituted word. The processor can be further programmed to reduce confusion between the substituted word and other words in a recognition grammar of the speech recognition engine and can also narrow the scope within which the substituted word is recognized.

Type: Grant

Filed: August 10, 2005

Date of Patent: July 19, 2011

Assignee: Nuance Communications, Inc.

Inventors: John W. Eckhart, Harvey M. Ruback
Supporting multiple speech enabled user interface consoles within a motor vehicle

Patent number: 7904300

Abstract: An in-vehicle system that shares speech processing resources among multiple applications located within a vehicle. The system can include one or more software applications, each associated with different functionally independent in-vehicle consoles. Each application can have a console specific user interface. The system can also include a single in-vehicle speech processing system implemented separately from the in-vehicle consoles. The speech processing system can execute speech processing tasks responsive to requests received from the applications. That is, the in-vehicle speech processing system can provide speech processing capabilities for the applications. The provided speech processing capabilities can include text-to-speech capabilities and speech recognition capabilities.

Type: Grant

Filed: August 10, 2005

Date of Patent: March 8, 2011

Assignee: Nuance Communications, Inc.

Inventors: Lisa Abbott, Daniel E. Badt, Werayuth T. Charoenruengkit, John W. Eckhart, Michael Florio, Gary R. Hanson, Harvey M. Ruback, William Russell Whitehead, Steven G. Woodward
Overriding default speech processing behavior using a default focus receiver

Patent number: 7848928

Abstract: A method for implementing speech focus in a speech processing system can include the step of establishing a default focus receiver as a first entity to request speech focus of a speech processing system having multiple applications that share speech resources based upon speech focus. An event occurrence can be detected. An event handler of the default focus receiver can previously define behavior for the event occurrence, and default system behavior can be implemented within the speech processing system for the event occurrence. The default system behavior can be utilized when speech focus is not assigned during the event occurrence. Responsive to the event occurrence, at least one programmatic action can be performed in accordance with machine readable instructions of the event handler. The default system behavior is not implemented responsive to the event occurrence.
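
The override behavior in this abstract can be sketched in Python as follows. The `App` and `SpeechFocusManager` classes, the event names, and the fallback order are hypothetical. In particular, falling through to the default receiver when the focus holder has no handler is a design choice of this sketch.

```python
class App:
    """Hypothetical application with per-event handlers."""

    def __init__(self, name, handlers=None):
        self.name = name
        self.handlers = handlers or {}


class SpeechFocusManager:
    """Routes events to the focus holder, then to the default focus receiver."""

    def __init__(self):
        self.default_receiver = None
        self.focus_holder = None

    def request_focus(self, app):
        # The first entity to request focus becomes the default focus receiver.
        if self.default_receiver is None:
            self.default_receiver = app
        else:
            self.focus_holder = app

    def release_focus(self, app):
        if self.focus_holder is app:
            self.focus_holder = None

    def dispatch(self, event):
        for receiver in (self.focus_holder, self.default_receiver):
            if receiver is not None and event in receiver.handlers:
                return receiver.handlers[event](event)
        # No handler applies, so the built-in system behavior runs.
        return f"system default for {event}"


manager = SpeechFocusManager()
shell = App("shell", {"unrecognized": lambda e: f"shell handled {e}"})
manager.request_focus(shell)
print(manager.dispatch("unrecognized"))
print(manager.dispatch("timeout"))
```

Because `shell` requested focus first, its handler replaces the system default for `unrecognized` events whenever no other application holds focus.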

Type: Grant

Filed: August 10, 2005

Date of Patent: December 7, 2010

Assignee: Nuance Communications, Inc.

Inventors: Lisa Abbott, Daniel E. Badt, John W. Eckhart, Harvey M. Ruback, Steven G. Woodward
Reducing a size of a compiled speech recognition grammar

Publication number: 20090171663

Abstract: The present invention discloses creating and using speech recognition grammars of reduced size. The reduced speech recognition grammars can include a set of entries, each entry having a unique identifier and a phonetic representation that is used when matching speech input against the entries. Each entry can lack a textual spelling corresponding to the phonetic representation. The reduced speech recognition grammar can be digitally encoded and stored in a computer readable medium, such as a hard drive or flash memory of a portable speech enabled device.
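
As a rough illustration of the reduced-grammar idea, the Python sketch below splits a full grammar into spelling-free entries keyed by identifier. The `reduce_grammar` and `match` functions, the phoneme tuples, and the exact-match lookup are assumptions; a real recognizer would score acoustic input rather than compare tuples.

```python
def reduce_grammar(full_grammar):
    """Split (spelling, phonemes) entries into a spelling-free grammar.

    Returns the reduced grammar (id -> phonetic representation) and a
    separate id -> spelling table that need not ship with the grammar.
    """
    reduced, spellings = {}, {}
    for entry_id, (spelling, phonemes) in enumerate(full_grammar):
        reduced[entry_id] = phonemes
        spellings[entry_id] = spelling
    return reduced, spellings


def match(reduced, heard_phonemes):
    """Return the id of the entry whose phonemes match exactly, else None."""
    for entry_id, phonemes in reduced.items():
        if phonemes == heard_phonemes:
            return entry_id
    return None


full_grammar = [
    ("play", ("P", "L", "EY")),
    ("stop", ("S", "T", "AA", "P")),
]
reduced, spellings = reduce_grammar(full_grammar)
print(match(reduced, ("S", "T", "AA", "P")))
```

The device only needs `reduced`; the matched identifier is enough to trigger an action, so the spellings can stay off the device.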

Type: Application

Filed: January 2, 2008

Publication date: July 2, 2009

Applicant: International Business Machines Corporation

Inventors: Daniel E. Badt, Vladimir Bergl, John W. Eckhart, Radek Hampl, Jonathan Palgon, Harvey M. Ruback
Using finite state grammars to vary output generated by a text-to-speech system

Publication number: 20080312929

Abstract: The present invention discloses a text-to-speech system that provides output variability. The system can include a finite state grammar, a variability engine and a text-to-speech engine. The finite state grammar can contain a phrase rule consisting of one or more phrase elements. The phrase rule can deterministically generate a variable text phrase based upon at least one random number. The phrase rule can include a definition for each of the phrase elements. Each definition can be associated with at least one defined text string. The variability engine can construct a random text phrase responsive to receiving an action command, wherein said finite state grammar is used to create the text phrase. The variability engine can also rely on user-specified weights to adjust the output probabilities. The text-to-speech engine can convert the text phrase generated by the variability engine into speech output.
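
A phrase rule that yields varied but reproducible text can be sketched in Python as below. The `RULES` table, the `generate_phrase` and `build_phrase` functions, and the use of a seeded `random.Random` are assumptions for illustration. The sketch picks options uniformly and stops at the text phrase, which a real system would hand to a text-to-speech engine.

```python
import random

# Hypothetical phrase rules: each rule is an ordered list of phrase elements,
# and each element is defined by one or more text strings.
RULES = {
    "greet": [
        ["Hello", "Hi", "Good day"],
        ["there", "friend"],
        ["how can I help?", "what can I do for you?"],
    ],
}


def generate_phrase(rule, rng):
    """Pick one defined string per phrase element using the supplied RNG."""
    return " ".join(rng.choice(options) for options in rule)


def build_phrase(action_command, seed):
    """Variability-engine stand-in: the same seed always gives the same text."""
    rng = random.Random(seed)
    return generate_phrase(RULES[action_command], rng)


print(build_phrase("greet", seed=1))
print(build_phrase("greet", seed=2))
```

Seeding the generator is what makes the rule deterministic with respect to its random input: a different seed may change the wording, while repeating a seed repeats the phrase.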

Type: Application

Filed: June 12, 2007

Publication date: December 18, 2008

Applicant: International Business Machines Corporation

Inventors: Oscar J. Blass, Paritosh D. Patel, Harvey M. Ruback, Roberto Vila
Selective enablement of speech recognition grammars

Patent number: 7366673

Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. Selecting can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Selecting can further include registering the speech grammar in the recognition system.

Type: Grant

Filed: June 15, 2001

Date of Patent: April 29, 2008

Assignee: International Business Machines Corporation

Inventors: Harvey M. Ruback, Steven G. Woodward
Audio device characterization for accurate predictable volume control

Patent number: 6999591

Abstract: An automatic gain control method in accordance with the inventive arrangements can include the following steps. Initially, an audio signal can be provided to an audio device which has a range of permissible signal level settings and a signal level controller for establishing a particular signal level setting. In addition, an actual signal level can be measured for the audio signal at an established signal level setting. The measured actual signal level further can be stored in a volume map along with the corresponding established signal level setting. Following the storage of the measured actual signal level in the volume map, a different signal level setting can be established using the signal level controller. Subsequently, the actual signal level can be re-measured and the re-measured actual signal level and corresponding established different signal level setting can be stored in the volume map.
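
The measure-and-store loop in this abstract can be sketched in Python as follows. The `FakeAudioDevice` class, its quadratic response, and the `setting_for` lookup are hypothetical. The abstract covers only building the volume map, so the lookup is an assumed use of it.

```python
def build_volume_map(settings, set_level, measure_level):
    """Step through each setting, measure the actual level, and record it."""
    volume_map = {}
    for setting in settings:
        set_level(setting)
        volume_map[setting] = measure_level()
    return volume_map


class FakeAudioDevice:
    """Hypothetical device whose actual level is nonlinear in the setting."""

    def __init__(self):
        self.setting = 0

    def set_level(self, setting):
        self.setting = setting

    def measure_level(self):
        return round((self.setting / 10) ** 2, 2)


def setting_for(volume_map, target_level):
    """Return the setting whose measured level is closest to the target."""
    return min(volume_map, key=lambda s: abs(volume_map[s] - target_level))


device = FakeAudioDevice()
volume_map = build_volume_map(range(11), device.set_level, device.measure_level)
print(volume_map)
print(setting_for(volume_map, 0.5))
```

With the map in hand, a caller can request an actual signal level and get the nearest setting, rather than assuming the device responds linearly.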

Type: Grant

Filed: February 27, 2001

Date of Patent: February 14, 2006

Assignee: International Business Machines Corporation

Inventors: Francis Fado, Peter J. Guasti, Gary R. Hanson, Keith P. Loring, Harvey M. Ruback, Steven G. Woodward
Method of managing a speech cache

Patent number: 6741963

Abstract: A method and a system for improving recall of speech data in a computer speech system can include a plurality of speech cache management steps including providing a speech cache, receiving a speech system input and identifying a speech event in the received speech system input, the speech event comprising speech data. Subsequently, the speech data can be compared to pre-determined speech cache entry criteria; and, if the speech data meets one of the pre-determined entry criteria, at least one entry can be added to the speech cache, the at least one entry corresponding to the speech data. Additionally, the speech data can be compared to pre-determined speech cache exit criteria; and, if the speech data meets one of the pre-determined exit criteria, at least one entry can be purged from the speech cache, the at least one entry corresponding to the speech data. The entry criteria can include frequently used speech data, recently used speech data, and important speech data.
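
The entry and exit checks in this abstract can be sketched in Python with pluggable criteria. The `SpeechCache` class, the `frequently_used` and `marked_wrong` criteria, and the rule that exit criteria are checked first are all assumptions of this sketch.

```python
from collections import Counter


class SpeechCache:
    """Toy cache driven by pluggable entry and exit criteria."""

    def __init__(self, entry_criteria, exit_criteria):
        self.entry_criteria = entry_criteria
        self.exit_criteria = exit_criteria
        self.entries = set()

    def process(self, speech_data):
        """Apply exit criteria, then entry criteria, to one speech event."""
        if any(test(speech_data) for test in self.exit_criteria):
            self.entries.discard(speech_data)
        elif any(test(speech_data) for test in self.entry_criteria):
            self.entries.add(speech_data)


use_counts = Counter()
rejected = {"teh"}  # hypothetical: data the user has marked as wrong


def frequently_used(speech_data, threshold=2):
    """Entry criterion: count this use and report whether it is frequent."""
    use_counts[speech_data] += 1
    return use_counts[speech_data] >= threshold


def marked_wrong(speech_data):
    """Exit criterion: the data appears in the rejected set."""
    return speech_data in rejected


cache = SpeechCache([frequently_used], [marked_wrong])
for word in ["hello", "world", "hello", "teh", "teh"]:
    cache.process(word)
print(sorted(cache.entries))
```

Criteria for recently used or important speech data could be added to the same lists without changing the cache itself.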

Type: Grant

Filed: June 21, 2000

Date of Patent: May 25, 2004

Assignee: International Business Machines Corporation

Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey M. Ruback, Carl A. Smith, Ronald E. VanBuskirk, Huifang Wang, Steven G. Woodward
Selective enablement of speech recognition grammars

Publication number: 20030046074

Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.

Type: Application

Filed: June 15, 2001

Publication date: March 6, 2003

Applicant: International Business Machines Corporation

Inventors: Harvey M. Ruback, Steven G. Woodward
Filter/controller callbacks in an embedded audio layer

Publication number: 20020161588

Abstract: A system configured to automatically modify audio data and audio device behavior can include an embedded audio layer (EAL), the EAL providing an abstracted interface to an audio device; a filter/controller linked list accessible by the EAL; and, a registration processor for registering filter/controllers (FCs) with the EAL, the registration processor inserting an FC reference into the filter/controller linked list upon registration of a corresponding FC. The system can further include an unregistration processor for unregistering FCs with the EAL, the unregistration processor removing an FC reference from the filter/controller linked list upon unregistration of a corresponding FC. Finally, the system can include an enablement processor for enabling use of a corresponding FC when a specified audio device is active; and, a disablement processor for disabling use of an enabled FC when a specified audio device is active.
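
The registration list in this abstract can be pictured with the Python sketch below. The `FCNode` and `EmbeddedAudioLayer` classes and the sample filters are hypothetical. Enablement is reduced to a per-FC flag, whereas the abstract ties it to a specified audio device being active.

```python
class FCNode:
    """One link in the filter/controller (FC) list."""

    def __init__(self, fc):
        self.fc = fc  # callable: list of samples -> list of samples
        self.enabled = False
        self.next = None


class EmbeddedAudioLayer:
    def __init__(self):
        self.head = None

    def register(self, fc):
        """Append an FC reference to the end of the linked list."""
        node = FCNode(fc)
        if self.head is None:
            self.head = node
            return
        tail = self.head
        while tail.next is not None:
            tail = tail.next
        tail.next = node

    def unregister(self, fc):
        """Remove the FC reference; return True if it was found."""
        prev, node = None, self.head
        while node is not None:
            if node.fc is fc:
                if prev is None:
                    self.head = node.next
                else:
                    prev.next = node.next
                return True
            prev, node = node, node.next
        return False

    def set_enabled(self, fc, enabled):
        """Enable or disable a registered FC."""
        node = self.head
        while node is not None:
            if node.fc is fc:
                node.enabled = enabled
            node = node.next

    def process(self, samples):
        """Pass audio data through every enabled FC in list order."""
        node = self.head
        while node is not None:
            if node.enabled:
                samples = node.fc(samples)
            node = node.next
        return samples


def halve(samples):
    return [s / 2 for s in samples]


def clip(samples):
    return [min(s, 1.0) for s in samples]


eal = EmbeddedAudioLayer()
eal.register(halve)
eal.register(clip)
eal.set_enabled(halve, True)
eal.set_enabled(clip, True)
result = eal.process([4.0, 1.0])
print(result)
```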

Type: Application

Filed: February 23, 2001

Publication date: October 31, 2002

Applicant: International Business Machines Corporation

Inventors: Joseph Camosy, Gary R. Hanson, Brien H. Muschett, Harvey M. Ruback, Steven G. Woodward
Audio device characterization for accurate predictable volume control

Publication number: 20020159608

Abstract: An automatic gain control method in accordance with the inventive arrangements can include the following steps. Initially, an audio signal can be provided to an audio device which has a range of permissible signal level settings and a signal level controller for establishing a particular signal level setting. In addition, an actual signal level can be measured for the audio signal at an established signal level setting. The measured actual signal level further can be stored in a volume map along with the corresponding established signal level setting. Following the storage of the measured actual signal level in the volume map, a different signal level setting can be established using the signal level controller. Subsequently, the actual signal level can be re-measured and the re-measured actual signal level and corresponding established different signal level setting can be stored in the volume map.

Type: Application

Filed: February 27, 2001

Publication date: October 31, 2002

Applicant: International Business Machines Corporation

Inventors: Francis Fado, Peter J. Guasti, Gary R. Hanson, Keith P. Loring, Harvey M. Ruback, Steven G. Woodward
Multi-action voice macro method

Patent number: 5873064

Abstract: Method for implementing a multi-action voice macro (140) for a voice recognition navigator program (102) on a computer system. The method involves analyzing a target application program (22) to determine a plurality of target application states (24). Each of the target application states (24) is comprised of a plurality of window objects. The target application states are arranged in the form of one or more sub-context trees, with each of the sub-context trees comprised of a plurality of sub-context objects (50, 52, 54, 56, 58, 60, 62, 64, 66, 68). A set of user inputs is determined to which each of the window objects will be responsive. Each user input is assigned a corresponding voice macro (140) which simulates the user inputs in response to a spoken utterance. The voice macro (140) includes a link field (148), which identifies at least one linked macro to be executed by the navigator program (102) when a specific vocabulary phrase for the voice macro (140) is spoken by a user.
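
The link field in this abstract can be illustrated with a small Python sketch in which running one macro also runs the macros it links to. The `VoiceMacro` class, the `run_macro` function, the keystroke strings, and the cycle guard are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class VoiceMacro:
    phrase: str      # vocabulary phrase that triggers the macro
    keystrokes: str  # simulated user input
    links: list = field(default_factory=list)  # phrases of linked macros


def run_macro(phrase, macros, _seen=None):
    """Return the simulated inputs for a macro and its linked macros."""
    seen = set() if _seen is None else _seen
    if phrase in seen:
        return []  # guard against cyclic links
    seen.add(phrase)
    macro = macros[phrase]
    actions = [macro.keystrokes]
    for linked in macro.links:
        actions.extend(run_macro(linked, macros, seen))
    return actions


macros = {
    "open file": VoiceMacro("open file", "Alt+F", links=["choose open"]),
    "choose open": VoiceMacro("choose open", "O"),
}
print(run_macro("open file", macros))
```

Speaking "open file" therefore produces two simulated inputs from a single utterance, which is the multi-action behavior the title refers to.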

Type: Grant

Filed: November 8, 1996

Date of Patent: February 16, 1999

Assignee: International Business Machines Corporation

Inventors: Mario E. De Armas, Harvey M. Ruback
Internal window object tree method for representing graphical user interface applications for speech navigation

Patent number: 5864819

Abstract: Method for representing a target software application program to a voice recognition navigator program on a computer system. The method requires analyzing an application program to determine a plurality of application states. Each of the application states is defined as a set of window objects within the application for performing a specific user task. According to the invention, each of the application states is preferably represented by a sub-context tree, comprised of a plurality of sub-context objects. The tree allows the navigator to associate decoded spoken commands to specific window objects.

Type: Grant

Filed: November 8, 1996

Date of Patent: January 26, 1999

Assignee: International Business Machines Corporation

Inventors: Mario E. De Armas, Harvey M. Ruback