Patents by Inventor Harvey M. Ruback
Harvey M. Ruback has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9196252Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.Type: GrantFiled: October 26, 2009Date of Patent: November 24, 2015Assignee: Nuance Communications, Inc.Inventors: Harvey M. Ruback, Steven G. Woodward
-
Patent number: 8131548Abstract: A speech processing method can automatically and dynamically adjust speech grammar weights at runtime based upon usage data. Each of the speech grammar weights can be associated with an available speech command contained within a speech grammar to which the speech grammar weights apply. The usage data can indicate a relative frequency with which each of the available speech commands is utilized.Type: GrantFiled: March 6, 2006Date of Patent: March 6, 2012Assignee: Nuance Communications, Inc.Inventors: Oscar J. Blass, Harvey M. Ruback, Roberto Vila
-
Patent number: 7983914Abstract: A speech recognition system or method can include a speech input device and a processor coupled to the speech input device. The processor can be programmed to identify a plurality of words that are members of confusable pairs of words where each pair includes a target word and a substituted word. The processor can degrade a pronunciation of the substituted word to provide a worse pronunciation of the substituted word. The processor can further compare the pronunciation of the target word with the worse pronunciation to the substituted word. The processor can be further programmed to reduce confusion between the substituted word and other words in a recognition grammar of the speech recognition engine and can also narrow the scope within which the substituted word is recognized.Type: GrantFiled: August 10, 2005Date of Patent: July 19, 2011Assignee: Nuance Communications, Inc.Inventors: John W. Eckhart, Harvey M. Ruback
-
Patent number: 7904300Abstract: An in-vehicle system that shares speech processing resources among multiple applications located within a vehicle. The system can include one or more software applications, each associated with different functionally independent in-vehicle consoles. Each application can have a console specific user interface. The system can also include a single in-vehicle speech processing system implemented separately from the in-vehicle consoles. The speech processing system can execute speech processing tasks responsive to requests received from the applications. That is, the in-vehicle speech processing system can provide speech processing capabilities for the applications. The provided speech processing capabilities can include text-to-speech capabilities and speech recognition capabilities.Type: GrantFiled: August 10, 2005Date of Patent: March 8, 2011Assignee: Nuance Communications, Inc.Inventors: Lisa Abbott, Daniel E. Badt, Werayuth T. Charoenruengkit, John W. Eckhart, Michael Florio, Gary R. Hanson, Harvey M. Ruback, William Russell Whitehead, Steven G. Woodward
-
Patent number: 7848928Abstract: A method for implementing speech focus in a speech processing system can include the step of establishing a default focus receiver as a first entity to request speech focus of a speech processing system having multiple applications that share speech resources based upon speech focus. An event occurrence can be detected. An event handler of the default speech receiver can previously define behavior for the event occurrence and where default system behavior can be implemented within the speech processing system for the event occurrence. The default system behavior can be utilized when speech focus is not assigned during the event occurrence. Responsive to the event occurrence, at least one programmatic action can be performed in accordance with machine readable instructions of the event handler. The default system behavior is not implemented responsive to the event occurrence.Type: GrantFiled: August 10, 2005Date of Patent: December 7, 2010Assignee: Nuance Communications, Inc.Inventors: Lisa Abbott, Daniel E. Badt, John W. Eckhart, Harvey M. Ruback, Steven G. Woodward
-
Publication number: 20090171663Abstract: The present invention discloses creating and using speech recognition grammars of reduced size. The reduced speech recognition grammars can include a set of entries, each entry having a unique identifier and a phonetic representation that is used when matching speech input against the entries. Each entry can lack a textual spelling corresponding to the phonetic representation. The reduced speech recognition grammar can be digitally encoded and stored in a computer readable media, such as a hard drive or flash memory of a portable speech enabled device.Type: ApplicationFiled: January 2, 2008Publication date: July 2, 2009Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: DANIEL E. BADT, VLADIMIR BERGL, JOHN W. ECKHART, RADEK HAMPL, JONATHAN PALGON, HARVEY M. RUBACK
-
Publication number: 20080312929Abstract: The present invention discloses a text-to-speech system that provides output variability. The system can include a finite state grammar, a variability engine and a text-to-speech engine. The finite state grammar can contain a phrase role consisting of one or more phrase elements. The phrase rule can deterministically generate a variable text phrase based upon at least one random number. The phrase rule can include a definition for each of the phrase elements. Each definition can be associated with at least one defined text string. The variability engine can construct a random text phrase responsive to receiving an action command, wherein said finite state grammar is used to create the text phrase. The variability engine can also rely on user-specified weights to adjust the output probabilities. The speech-to-text engine can convert the text phrase generated by the variability engine into speech output.Type: ApplicationFiled: June 12, 2007Publication date: December 18, 2008Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: OSCAR J. BLASS, PARITOSH D. PATEL, HARVEY M. RUBACK, ROBERTO VILA
-
Patent number: 7366673Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. Selecting can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Selecting can further include registering the speech grammar in the recognition system.Type: GrantFiled: June 15, 2001Date of Patent: April 29, 2008Assignee: International Business Machines CorporationInventors: Harvey M. Ruback, Steven G. Woodward
-
Patent number: 6999591Abstract: An automatic gain control method in accordance with the inventive arrangements can include the following steps. Initially, an audio signal can be provided to an audio device which has a range of permissible signal level settings and a signal level controller for establishing a particular signal level setting. In addition, an actual signal level can be measured for the audio signal at an established signal level setting. The measured actual signal level further can be stored in a volume map along with the corresponding established signal level setting. Following the storage of the measured actual signal level in the volume map, a different signal level setting can be established using the signal level controller. Subsequently, the actual signal level can be re-measured and the re-measured actual signal level and corresponding established different signal level setting can be stored in the volume map.Type: GrantFiled: February 27, 2001Date of Patent: February 14, 2006Assignee: International Business Machines CorporationInventors: Francis Fado, Peter J. Guasti, Gary R. Hanson, Keith P. Loring, Harvey M. Ruback, Steven G. Woodward
-
Patent number: 6741963Abstract: A method and a system for improving recall of speech data in a computer speech system can include a plurality of speech cache management steps including providing a speech cache, receiving a speech system input and identifying a speech event in the received speech system input, the speech event comprising speech data. Subsequently, the speech data can be compared to pre-determined speech cache entry criteria; and, if the speech data meets one of the pre-determined entry criteria, at least one entry can be added to the speech cache, the at least one entry corresponding to the speech data. Additionally, the speech data can be compared to pre-determined speech cache exit criteria; and, if the speech data meets one of the pre-determined exit criteria, at least one entry can be purged from the speech cache, the at least one entry corresponding to the speech data. The entry criteria can include frequently used speech data, recently used speech data, and important speech data.Type: GrantFiled: June 21, 2000Date of Patent: May 25, 2004Assignee: International Business Machines CorporationInventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey M. Ruback, Carl A. Smith, Ronald E. VanBuskirk, Huifang Wang, Steven G. Woodward
-
Publication number: 20030046074Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.Type: ApplicationFiled: June 15, 2001Publication date: March 6, 2003Applicant: International Business Machines CorporationInventors: Harvey M. Ruback, Steven G. Woodward
-
Publication number: 20020161588Abstract: A system configured to automatically modify audio data and audio device behavior can include an embedded audio layer (EAL), the EAL providing an abstracted interface to an audio device; a filter/controller linked list accessible by the EAL; and, a registration processor for registering filter/controllers (FCs) with the EAL, the registration processor inserting an FC reference into the filter/controller linked list upon registration of a corresponding FC. The system can further include an unregistration processor for unregistering FCs with the EAL, the unregistration processor removing an FC reference from the filter/controller linked list upon unregistration of a corresponding FC. Finally, the system can include an enablement processor for enabling use of a corresponding FC when a specified audio device is active; and, a disablement processor for disabling use of an enabled FC when a specified audio device is active.Type: ApplicationFiled: February 23, 2001Publication date: October 31, 2002Applicant: International Business Machines CorporationInventors: Joseph Camosy, Gary R. Hanson, Brien H. Muschett, Harvey M. Ruback, Steven G. Woodward
-
Publication number: 20020159608Abstract: An automatic gain control method in accordance with the inventive arrangements can include the following steps. Initially, an audio signal can be provided to an audio device which has a range of permissible signal level settings and a signal level controller for establishing a particular signal level setting. In addition, an actual signal level can be measured for the audio signal at an established signal level setting. The measured actual signal level further can be stored in a volume map along with the corresponding established signal level setting. Following the storage of the measured actual signal level in the volume map, a different signal level setting can be established using the signal level controller. Subsequently, the actual signal level can be re-measured and the re-measured actual signal level and corresponding established different signal level setting can be stored in the volume map.Type: ApplicationFiled: February 27, 2001Publication date: October 31, 2002Applicant: International Business Machines CorporationInventors: Francis Fado, Peter J. Guasti, Gary R. Hanson, Keith P. Loring, Harvey M. Ruback, Steven G. Woodward
-
Patent number: 5873064Abstract: Method for implementing a multi-action voice macro (140) for a voice recognition navigator program (102) on a computer system. The method involves analyzing a target application program (22) to determine a plurality of target application states (24). Each of the target application states (24) is comprised of a plurality of window objects. The target application states are arranged in the form of one or more sub-context trees, with each of the sub-context trees comprised of a plurality of sub-context objects (50, 52, 54, 56, 58, 60, 62, 64, 66, 68). A set of user inputs is determined to which each of the window objects will be responsive. Each user input is assigned a corresponding voice macro (140) which simulates the user inputs in response to a spoken utterance. The voice macro (140) includes a link field (148), which identifies at least one linked macro to be executed by the navigator program (102) when a specific vocabulary phrase for the voice macro (140) is spoken by a user.Type: GrantFiled: November 8, 1996Date of Patent: February 16, 1999Assignee: International Business Machines CorporationInventors: Mario E. De Armas, Harvey M. Ruback
-
Patent number: 5864819Abstract: Method for representing a target software application program to a voice recognition navigator program on a computer system. The method requires analyzing an application program to determine a plurality of application states. Each of the application states is defined as a set of window objects within the application for performing a specific user task. According to the invention, each of the application states is preferably represented by a sub-context tree, comprised of a plurality of sub-context objects. The tree allows the navigator to associate decoded spoken commands to specific window objects.Type: GrantFiled: November 8, 1996Date of Patent: January 26, 1999Assignee: International Business Machines CorporationInventors: Mario E. De Armas, Harvey M. Ruback