Patents by Inventor Harvey M. Ruback

Harvey M. Ruback has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9196252
    Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.
    Type: Grant
    Filed: October 26, 2009
    Date of Patent: November 24, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Harvey M. Ruback, Steven G. Woodward
  • Patent number: 8131548
    Abstract: A speech processing method can automatically and dynamically adjust speech grammar weights at runtime based upon usage data. Each of the speech grammar weights can be associated with an available speech command contained within a speech grammar to which the speech grammar weights apply. The usage data can indicate a relative frequency with which each of the available speech commands is utilized.
    Type: Grant
    Filed: March 6, 2006
    Date of Patent: March 6, 2012
    Assignee: Nuance Communications, Inc.
    Inventors: Oscar J. Blass, Harvey M. Ruback, Roberto Vila
  • Patent number: 7983914
    Abstract: A speech recognition system or method can include a speech input device and a processor coupled to the speech input device. The processor can be programmed to identify a plurality of words that are members of confusable pairs of words where each pair includes a target word and a substituted word. The processor can degrade a pronunciation of the substituted word to provide a worse pronunciation of the substituted word. The processor can further compare the pronunciation of the target word with the worse pronunciation to the substituted word. The processor can be further programmed to reduce confusion between the substituted word and other words in a recognition grammar of the speech recognition engine and can also narrow the scope within which the substituted word is recognized.
    Type: Grant
    Filed: August 10, 2005
    Date of Patent: July 19, 2011
    Assignee: Nuance Communications, Inc.
    Inventors: John W. Eckhart, Harvey M. Ruback
  • Patent number: 7904300
    Abstract: An in-vehicle system that shares speech processing resources among multiple applications located within a vehicle. The system can include one or more software applications, each associated with different functionally independent in-vehicle consoles. Each application can have a console specific user interface. The system can also include a single in-vehicle speech processing system implemented separately from the in-vehicle consoles. The speech processing system can execute speech processing tasks responsive to requests received from the applications. That is, the in-vehicle speech processing system can provide speech processing capabilities for the applications. The provided speech processing capabilities can include text-to-speech capabilities and speech recognition capabilities.
    Type: Grant
    Filed: August 10, 2005
    Date of Patent: March 8, 2011
    Assignee: Nuance Communications, Inc.
    Inventors: Lisa Abbott, Daniel E. Badt, Werayuth T. Charoenruengkit, John W. Eckhart, Michael Florio, Gary R. Hanson, Harvey M. Ruback, William Russell Whitehead, Steven G. Woodward
  • Patent number: 7848928
    Abstract: A method for implementing speech focus in a speech processing system can include the step of establishing a default focus receiver as a first entity to request speech focus of a speech processing system having multiple applications that share speech resources based upon speech focus. An event occurrence can be detected. An event handler of the default speech receiver can previously define behavior for the event occurrence and where default system behavior can be implemented within the speech processing system for the event occurrence. The default system behavior can be utilized when speech focus is not assigned during the event occurrence. Responsive to the event occurrence, at least one programmatic action can be performed in accordance with machine readable instructions of the event handler. The default system behavior is not implemented responsive to the event occurrence.
    Type: Grant
    Filed: August 10, 2005
    Date of Patent: December 7, 2010
    Assignee: Nuance Communications, Inc.
    Inventors: Lisa Abbott, Daniel E. Badt, John W. Eckhart, Harvey M. Ruback, Steven G. Woodward
  • Publication number: 20090171663
    Abstract: The present invention discloses creating and using speech recognition grammars of reduced size. The reduced speech recognition grammars can include a set of entries, each entry having a unique identifier and a phonetic representation that is used when matching speech input against the entries. Each entry can lack a textual spelling corresponding to the phonetic representation. The reduced speech recognition grammar can be digitally encoded and stored in a computer readable media, such as a hard drive or flash memory of a portable speech enabled device.
    Type: Application
    Filed: January 2, 2008
    Publication date: July 2, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: DANIEL E. BADT, VLADIMIR BERGL, JOHN W. ECKHART, RADEK HAMPL, JONATHAN PALGON, HARVEY M. RUBACK
  • Publication number: 20080312929
    Abstract: The present invention discloses a text-to-speech system that provides output variability. The system can include a finite state grammar, a variability engine and a text-to-speech engine. The finite state grammar can contain a phrase role consisting of one or more phrase elements. The phrase rule can deterministically generate a variable text phrase based upon at least one random number. The phrase rule can include a definition for each of the phrase elements. Each definition can be associated with at least one defined text string. The variability engine can construct a random text phrase responsive to receiving an action command, wherein said finite state grammar is used to create the text phrase. The variability engine can also rely on user-specified weights to adjust the output probabilities. The speech-to-text engine can convert the text phrase generated by the variability engine into speech output.
    Type: Application
    Filed: June 12, 2007
    Publication date: December 18, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: OSCAR J. BLASS, PARITOSH D. PATEL, HARVEY M. RUBACK, ROBERTO VILA
  • Patent number: 7366673
    Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. Selecting can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Selecting can further include registering the speech grammar in the recognition system.
    Type: Grant
    Filed: June 15, 2001
    Date of Patent: April 29, 2008
    Assignee: International Business Machines Corporation
    Inventors: Harvey M. Ruback, Steven G. Woodward
  • Patent number: 6999591
    Abstract: An automatic gain control method in accordance with the inventive arrangements can include the following steps. Initially, an audio signal can be provided to an audio device which has a range of permissible signal level settings and a signal level controller for establishing a particular signal level setting. In addition, an actual signal level can be measured for the audio signal at an established signal level setting. The measured actual signal level further can be stored in a volume map along with the corresponding established signal level setting. Following the storage of the measured actual signal level in the volume map, a different signal level setting can be established using the signal level controller. Subsequently, the actual signal level can be re-measured and the re-measured actual signal level and corresponding established different signal level setting can be stored in the volume map.
    Type: Grant
    Filed: February 27, 2001
    Date of Patent: February 14, 2006
    Assignee: International Business Machines Corporation
    Inventors: Francis Fado, Peter J. Guasti, Gary R. Hanson, Keith P. Loring, Harvey M. Ruback, Steven G. Woodward
  • Patent number: 6741963
    Abstract: A method and a system for improving recall of speech data in a computer speech system can include a plurality of speech cache management steps including providing a speech cache, receiving a speech system input and identifying a speech event in the received speech system input, the speech event comprising speech data. Subsequently, the speech data can be compared to pre-determined speech cache entry criteria; and, if the speech data meets one of the pre-determined entry criteria, at least one entry can be added to the speech cache, the at least one entry corresponding to the speech data. Additionally, the speech data can be compared to pre-determined speech cache exit criteria; and, if the speech data meets one of the pre-determined exit criteria, at least one entry can be purged from the speech cache, the at least one entry corresponding to the speech data. The entry criteria can include frequently used speech data, recently used speech data, and important speech data.
    Type: Grant
    Filed: June 21, 2000
    Date of Patent: May 25, 2004
    Assignee: International Business Machines Corporation
    Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey M. Ruback, Carl A. Smith, Ronald E. VanBuskirk, Huifang Wang, Steven G. Woodward
  • Publication number: 20030046074
    Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.
    Type: Application
    Filed: June 15, 2001
    Publication date: March 6, 2003
    Applicant: International Business Machines Corporation
    Inventors: Harvey M. Ruback, Steven G. Woodward
  • Publication number: 20020161588
    Abstract: A system configured to automatically modify audio data and audio device behavior can include an embedded audio layer (EAL), the EAL providing an abstracted interface to an audio device; a filter/controller linked list accessible by the EAL; and, a registration processor for registering filter/controllers (FCs) with the EAL, the registration processor inserting an FC reference into the filter/controller linked list upon registration of a corresponding FC. The system can further include an unregistration processor for unregistering FCs with the EAL, the unregistration processor removing an FC reference from the filter/controller linked list upon unregistration of a corresponding FC. Finally, the system can include an enablement processor for enabling use of a corresponding FC when a specified audio device is active; and, a disablement processor for disabling use of an enabled FC when a specified audio device is active.
    Type: Application
    Filed: February 23, 2001
    Publication date: October 31, 2002
    Applicant: International Business Machines Corporation
    Inventors: Joseph Camosy, Gary R. Hanson, Brien H. Muschett, Harvey M. Ruback, Steven G. Woodward
  • Publication number: 20020159608
    Abstract: An automatic gain control method in accordance with the inventive arrangements can include the following steps. Initially, an audio signal can be provided to an audio device which has a range of permissible signal level settings and a signal level controller for establishing a particular signal level setting. In addition, an actual signal level can be measured for the audio signal at an established signal level setting. The measured actual signal level further can be stored in a volume map along with the corresponding established signal level setting. Following the storage of the measured actual signal level in the volume map, a different signal level setting can be established using the signal level controller. Subsequently, the actual signal level can be re-measured and the re-measured actual signal level and corresponding established different signal level setting can be stored in the volume map.
    Type: Application
    Filed: February 27, 2001
    Publication date: October 31, 2002
    Applicant: International Business Machines Corporation
    Inventors: Francis Fado, Peter J. Guasti, Gary R. Hanson, Keith P. Loring, Harvey M. Ruback, Steven G. Woodward
  • Patent number: 5873064
    Abstract: Method for implementing a multi-action voice macro (140) for a voice recognition navigator program (102) on a computer system. The method involves analyzing a target application program (22) to determine a plurality of target application states (24). Each of the target application states (24) is comprised of a plurality of window objects. The target application states are arranged in the form of one or more sub-context trees, with each of the sub-context trees comprised of a plurality of sub-context objects (50, 52, 54, 56, 58, 60, 62, 64, 66, 68). A set of user inputs is determined to which each of the window objects will be responsive. Each user input is assigned a corresponding voice macro (140) which simulates the user inputs in response to a spoken utterance. The voice macro (140) includes a link field (148), which identifies at least one linked macro to be executed by the navigator program (102) when a specific vocabulary phrase for the voice macro (140) is spoken by a user.
    Type: Grant
    Filed: November 8, 1996
    Date of Patent: February 16, 1999
    Assignee: International Business Machines Corporation
    Inventors: Mario E. De Armas, Harvey M. Ruback
  • Patent number: 5864819
    Abstract: Method for representing a target software application program to a voice recognition navigator program on a computer system. The method requires analyzing an application program to determine a plurality of application states. Each of the application states is defined as a set of window objects within the application for performing a specific user task. According to the invention, each of the application states is preferably represented by a sub-context tree, comprised of a plurality of sub-context objects. The tree allows the navigator to associate decoded spoken commands to specific window objects.
    Type: Grant
    Filed: November 8, 1996
    Date of Patent: January 26, 1999
    Assignee: International Business Machines Corporation
    Inventors: Mario E. De Armas, Harvey M. Ruback