Patents by Inventor Harvey Ruback

Harvey Ruback has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Selective enablement of speech recognition grammars

Publication number: 20100049521

Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.

Type: Application

Filed: October 26, 2009

Publication date: February 25, 2010

Applicant: Nuance Communications, Inc.

Inventors: Harvey Ruback, Steven Woodward
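
The decision step described in this abstract (characterize a grammar, then route it to the client or to the speech server) might be sketched as follows. This is only an illustration: the abstract does not say how a grammar is characterized, so the phrase-count measure, the `max_local_phrases` threshold, and every name below are assumptions.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Target(Enum):
    LOCAL = "local"    # process in the network connected client device
    REMOTE = "remote"  # process in a speech server in the network


@dataclass
class Grammar:
    name: str
    phrases: List[str]


def characterize(grammar: Grammar) -> int:
    """Hypothetical characterization: the number of phrases in the grammar."""
    return len(grammar.phrases)


def choose_target(grammar: Grammar, max_local_phrases: int = 100) -> Target:
    """Route small grammars to the client and larger ones to the server.

    The threshold is an assumed tuning parameter, not a value from the patent.
    """
    if characterize(grammar) <= max_local_phrases:
        return Target.LOCAL
    return Target.REMOTE
```

Under these assumptions, a two-phrase yes/no grammar would stay on the client, while a grammar with several hundred names would be sent to the server.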
Selective enablement of speech recognition grammars

Patent number: 7610204

Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.

Type: Grant

Filed: March 5, 2008

Date of Patent: October 27, 2009

Assignee: Nuance Communications, Inc.

Inventors: Harvey Ruback, Steven Woodward
Selective enablement of speech recognition grammars

Publication number: 20080189111

Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.

Type: Application

Filed: March 5, 2008

Publication date: August 7, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Harvey Ruback, Steven Woodward
Dynamically adjusting speech grammar weights based on usage

Publication number: 20070208555

Abstract: A speech processing method can automatically and dynamically adjust speech grammar weights at runtime based upon usage data. Each of the speech grammar weights can be associated with an available speech command contained within a speech grammar to which the speech grammar weights apply. The usage data can indicate a relative frequency with which each of the available speech commands is utilized.

Type: Application

Filed: March 6, 2006

Publication date: September 6, 2007

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Oscar Blass, Harvey Ruback, Roberto Vila
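
One way to picture usage-based weighting is to keep a count per command and derive each weight from its relative frequency. The sketch below assumes additive smoothing so that unused commands keep a nonzero weight. The class name, the `smoothing` parameter, and the normalization to a sum of 1 are illustrative choices, not details taken from the application.

```python
from collections import Counter


class UsageWeightedGrammar:
    """Hypothetical grammar whose command weights track relative usage."""

    def __init__(self, commands, smoothing=1.0):
        self.commands = list(commands)
        self.smoothing = smoothing  # assumed: keeps unused commands recognizable
        self.usage = Counter()

    def record_use(self, command):
        """Update usage data at runtime when a command is recognized."""
        if command not in self.commands:
            raise ValueError("unknown command: %r" % (command,))
        self.usage[command] += 1

    def weights(self):
        """Return weights proportional to smoothed usage counts (summing to 1)."""
        total = sum(self.usage[c] + self.smoothing for c in self.commands)
        return {c: (self.usage[c] + self.smoothing) / total for c in self.commands}
```

With two commands and no usage yet, both weights are equal. After one command is used repeatedly, its weight grows at the expense of the other.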
Overriding default speech processing behavior using a default focus receiver

Publication number: 20070038462

Abstract: A method for implementing speech focus in a speech processing system can include the step of establishing a default focus receiver as a first entity to request speech focus of a speech processing system having multiple applications that share speech resources based upon speech focus. An event occurrence can be detected. An event handler of the default focus receiver can previously define behavior for the event occurrence, and default system behavior can be implemented within the speech processing system for the event occurrence. The default system behavior can be utilized when speech focus is not assigned during the event occurrence. Responsive to the event occurrence, at least one programmatic action can be performed in accordance with machine readable instructions of the event handler. The default system behavior is not implemented responsive to the event occurrence.

Type: Application

Filed: August 10, 2005

Publication date: February 15, 2007

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lisa Abbott, Daniel Badt, John Eckhart, Harvey Ruback, Steven Woodward
Supporting multiple speech enabled user interface consoles within a motor vehicle

Publication number: 20070038461

Abstract: An in-vehicle system that shares speech processing resources among multiple applications located within a vehicle. The system can include one or more software applications, each associated with different functionally independent in-vehicle consoles. Each application can have a console specific user interface. The system can also include a single in-vehicle speech processing system implemented separately from the in-vehicle consoles. The speech processing system can execute speech processing tasks responsive to requests received from the applications. That is, the in-vehicle speech processing system can provide speech processing capabilities for the applications. The provided speech processing capabilities can include text-to-speech capabilities and speech recognition capabilities.

Type: Application

Filed: August 10, 2005

Publication date: February 15, 2007

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lisa Abbott, Daniel Badt, Werayuth Charoenruengkit, John Eckhart, Michael Florio, Gary Hanson, Harvey Ruback, William Whitehead, Steven Woodward
Method and system for improved speech recognition by degrading utterance pronunciations

Publication number: 20070038454

Abstract: A speech recognition system (10) or method (20) can include a speech input device and a processor (14) coupled to the speech input. The processor can be programmed to identify (22) a plurality of words that are members of confusable pairs of words where each pair includes a target word and a substituted word. The processor can degrade (24) a pronunciation of the substituted word to provide a worse pronunciation of the substituted word. The processor can further compare (28) the pronunciation of the target word with the worse pronunciation of the substituted word. The processor can be further programmed to reduce (26) confusion between the substituted word and other words in a recognition grammar of the speech recognition engine and can also narrow the scope within which the substituted word is recognized.

Type: Application

Filed: August 10, 2005

Publication date: February 15, 2007

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: John Eckhart, Harvey Ruback
Integrating conversational speech into Web browsers

Publication number: 20060235694

Abstract: A method of integrating conversational speech into a multimodal, Web-based processing model can include speech recognizing a user spoken utterance directed to a voice-enabled field of a multimodal markup language document presented within a browser. A statistical grammar can be used to determine a recognition result. The method further can include providing the recognition result to the browser, receiving, within a natural language understanding (NLU) system, the recognition result from the browser, and semantically processing the recognition result to determine a meaning. Accordingly, a next programmatic action to be performed can be selected according to the meaning.

Type: Application

Filed: April 14, 2005

Publication date: October 19, 2006

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Charles Cross, Brien Muschett, Harvey Ruback, Leslie Wilson
Influencing automatic speech recognition signal-to-noise levels

Publication number: 20060100866

Abstract: A system for influencing a signal-to-noise ratio (SNR) associated with a signal input to an automatic speech recognition device is provided. The system includes a normalized energy module that determines a normalized energy measurement based upon a spectrum of frequency-domain complex coefficients, the coefficients generated by the automatic speech recognition device. The system also includes an SNR module that generates an SNR measurement. The SNR measurement can be based upon a comparison of speech and non-speech portions of the signal input to the automatic speech recognition device. The system further includes a cue module that provides a cue to a user of the automatic speech recognition device, the cue being based upon the SNR measurement.

Type: Application

Filed: October 28, 2004

Publication date: May 11, 2006

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Neal Alewine, John Eckhart, Harvey Ruback, Josef Vopieka
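
The three modules named in this abstract can be illustrated with a small calculation. The sketch assumes that normalized energy is the mean squared magnitude of a frame's complex coefficients and that SNR is the decibel ratio of average speech energy to average non-speech energy. The cue texts and the `low_db` and `high_db` thresholds are invented for illustration.

```python
import math


def normalized_energy(coefficients):
    """Mean squared magnitude of one frame's frequency-domain complex coefficients."""
    return sum(abs(c) ** 2 for c in coefficients) / len(coefficients)


def snr_db(speech_frames, noise_frames):
    """SNR in decibels from speech and non-speech frames (assumed definition)."""
    speech = sum(normalized_energy(f) for f in speech_frames) / len(speech_frames)
    noise = sum(normalized_energy(f) for f in noise_frames) / len(noise_frames)
    if noise == 0:
        raise ValueError("non-speech energy is zero; SNR is undefined")
    return 10.0 * math.log10(speech / noise)


def cue_for(snr, low_db=10.0, high_db=20.0):
    """Map an SNR measurement to a user cue; thresholds are hypothetical."""
    if snr < low_db:
        return "speak louder or reduce background noise"
    if snr < high_db:
        return "acceptable"
    return "good"
```

For example, speech frames with a normalized energy of 25 against non-speech frames with 0.25 give a ratio of 100, which is 20 dB under this definition.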
Audio notification management system

Patent number: 6738742

Abstract: A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.

Type: Grant

Filed: February 11, 2003

Date of Patent: May 18, 2004

Assignee: International Business Machines Corporation

Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey Ruback, Carl A. Smith, Ronald E. Vanbuskirk, Huifang Wang, Steven G. Woodward
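
The queueing behavior in this abstract (insert by priority, then select the top entry only when it exceeds a gate level) can be sketched as below. The class and method names are hypothetical, and returning the message text stands in for actually playing audio.

```python
class NotificationManager:
    """Hypothetical priority-ordered queue of audio notifications."""

    def __init__(self, gate_level):
        self.gate_level = gate_level  # predetermined gate level
        self._queue = []              # (priority, message), highest priority first

    def add(self, message, priority):
        """Insert a notification at the position given by its priority.

        Entries with equal priority keep their arrival order.
        """
        index = 0
        while index < len(self._queue) and self._queue[index][0] >= priority:
            index += 1
        self._queue.insert(index, (priority, message))

    def next_message(self):
        """Return the top message if its priority exceeds the gate level, else None."""
        if self._queue and self._queue[0][0] > self.gate_level:
            return self._queue.pop(0)[1]
        return None
```

In this sketch a notification at or below the gate level simply waits in the queue. Lowering `gate_level` later allows it to be selected.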
Preventing audio feedback

Patent number: 6674451

Abstract: A method for enabling a user to proactively reduce the likelihood of audio feedback in an application requiring audio input and output, comprising the steps of: generating a graphical user interface (GUI) display screen including a first area for displaying information about preventing audio feedback and a second area for user selections and controls; displaying a list of available audio outputs in the second area; prompting the user to select one of the audio outputs from the list; prompting the user to select one of a plurality of muting options for each selected one of the audio outputs; and, displaying in the GUI display screen an explanation for each one of the plurality of muting options, whereby muting selections for proactively reducing the likelihood of audio feedback can be made based on user experience and knowledge. Only one of the muting option explanations is displayed at a time, responsive to the user selection of one of the muting options.

Type: Grant

Filed: February 25, 1999

Date of Patent: January 6, 2004

Assignee: International Business Machines Corporation

Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Ronald Van Buskirk, Harvey Ruback
Audio notification management system

Publication number: 20030130850

Abstract: A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.

Type: Application

Filed: February 11, 2003

Publication date: July 10, 2003

Applicant: International Business Machines Corporation

Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey Ruback, Carl A. Smith, Ronald E. Vanbuskirk, Huifang Wang, Steven G. Woodward
Audio notification management system

Patent number: 6542868

Abstract: A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.

Type: Grant

Filed: September 23, 1999

Date of Patent: April 1, 2003

Assignee: International Business Machines Corporation

Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey Ruback, Carl A. Smith, Ronald E. Vanbuskirk, Huifang Wang, Steven G. Woodward
Trouble shooting a wrong audio source

Patent number: 6504553

Abstract: A method for guiding a user through trouble shooting a wrong audio source among a plurality of audio sources, comprises the steps of: (a) generating a GUI display screen including a first area for displaying information about testing an audio input device, a second area for displaying instructions and status information and for providing dynamic feedback and a third area for user selections and controls; (b) displaying a list of available audio sources in said third area; (c) prompting said user to select an audio source from said list; (d) prompting said user to test said selected audio source; (e) in the event said test is unsuccessful, prompting said user to select any other audio source identified in said third area; (f) prompting said user to test said any other audio source in said list; and, (g) repeating steps (e) and (f) until one of said audio sources is tested successfully or each of said audio sources is tested unsuccessfully.

Type: Grant

Filed: February 25, 1999

Date of Patent: January 7, 2003

Assignee: International Business Machines Corporation

Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Harvey Ruback, Ronald Van Buskirk
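
Steps (c) through (g) amount to a loop that tests one source after another until a test succeeds or every source has failed. In the minimal sketch below, iteration order stands in for the user's selections, and `test_source` is a hypothetical callback that reports whether a test succeeded. The GUI prompting described in the abstract is not modeled.

```python
def find_working_source(sources, test_source):
    """Test audio sources in turn.

    Returns (working_source, tried), where working_source is None if every
    source was tested unsuccessfully and tried lists the sources tested.
    """
    tried = []
    for source in sources:
        tried.append(source)
        if test_source(source):
            return source, tried
    return None, tried
```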
Connecting and optimizing audio input devices

Patent number: 6492999

Abstract: A method for connecting and optimizing audio input devices, comprises the steps of: determining an audio input type; generating a first GUI display screen for prompting and enabling user selection of an audio input device; generating a second GUI display screen for prompting and enabling user connection of the audio input device; testing the connected audio input device; configuring audio settings of the connected audio input device; and, storing for later retrieval an association of the connected audio input device and the configured audio settings. The audio settings are configured and the association is stored only if the testing step is successful. The second GUI display screen can include a device specific image and device specific instructions.

Type: Grant

Filed: February 25, 1999

Date of Patent: December 10, 2002

Assignee: International Business Machines Corporation

Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Ronald Van Buskirk, Harvey Ruback
Task automation user interface with text-to-speech output

Patent number: 6456973

Abstract: In a computer system adapted for text-to-speech playback, a method for instructing a user in performing a task having a plurality of steps can include retrieving a textual instruction from a location in an electronic storage device of the computer system. The textual instruction can correspond to one or more of the steps in the task. The textual instruction can be displayed in a task automation user interface, and a text-to-speech (TTS) conversion of the textual instruction can be executed. The steps can be repeated until all textual instructions corresponding to each step in the task have been retrieved and TTS converted.

Type: Grant

Filed: October 12, 1999

Date of Patent: September 24, 2002

Assignee: International Business Machines Corp.

Inventors: Frank Fado, Peter J. Guasti, Amado Nassiff, Harvey Ruback, Ronald E. VanBuskirk
User selectable input devices for speech applications

Patent number: 6342903

Abstract: A method for enabling user selectable input devices for dictation or transcription in a speech application, comprising the steps of: establishing a registry of dictation and transcription device descriptions, each of the descriptions including a device specific image, a device specific set of device-connecting instructions and a device specific list of audio configuration parameters; building dynamic tables containing information retrieved from the registry; establishing and storing a plurality of enrollments, each of the enrollments representing a speech file of user specific training data corresponding to at least one of a specific audio input device and a specific audio environment; and, generating a GUI display screen using the information in at least one of the dynamic tables to enable user selection of any input device in the registry for which one of the enrollments is available, for use as a dictation or transcription input to the speech application.

Type: Grant

Filed: February 25, 1999

Date of Patent: January 29, 2002

Assignee: International Business Machines Corp.

Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Ronald Van Buskirk, Harvey Ruback
Maintaining input device identity

Patent number: 6275805

Abstract: A method for maintaining input device identity in a speech application, comprising the steps of: storing a plurality of enrollments, each of the enrollments representing a speech file of training data associated with at least one of a specific audio input device and a specific audio environment for a specific user; generating a graphical user interface (GUI) display screen for prompting and enabling user selection of at least one of an audio input device and an audio environment; and, retrieving one of the enrollments responsive to the user selection, for use in a dictation or transcription session.

Type: Grant

Filed: February 25, 1999

Date of Patent: August 14, 2001

Assignee: International Business Machines Corp.

Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Ronald Van Buskirk, Harvey Ruback