Patents by Inventor Harvey Ruback

Harvey Ruback has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20100049521
    Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.
    Type: Application
    Filed: October 26, 2009
    Publication date: February 25, 2010
    Applicant: Nuance Communications, Inc.
    Inventors: Harvey Ruback, Steven Woodward
  • Patent number: 7610204
    Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.
    Type: Grant
    Filed: March 5, 2008
    Date of Patent: October 27, 2009
    Assignee: Nuance Communications, Inc.
    Inventors: Harvey Ruback, Steven Woodward
  • Publication number: 20080189111
    Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.
    Type: Application
    Filed: March 5, 2008
    Publication date: August 7, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Harvey Ruback, Steven Woodward
  • Publication number: 20070208555
    Abstract: A speech processing method can automatically and dynamically adjust speech grammar weights at runtime based upon usage data. Each of the speech grammar weights can be associated with an available speech command contained within a speech grammar to which the speech grammar weights apply. The usage data can indicate a relative frequency with which each of the available speech commands is utilized.
    Type: Application
    Filed: March 6, 2006
    Publication date: September 6, 2007
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Oscar Blass, Harvey Ruback, Roberto Vila
  • Publication number: 20070038462
    Abstract: A method for implementing speech focus in a speech processing system can include the step of establishing a default focus receiver as a first entity to request speech focus of a speech processing system having multiple applications that share speech resources based upon speech focus. An event occurrence can be detected. An event handler of the default speech receiver can previously define behavior for the event occurrence and where default system behavior can be implemented within the speech processing system for the event occurrence. The default system behavior can be utilized when speech focus is not assigned during the event occurrence. Responsive to the event occurrence, at least one programmatic action can be performed in accordance with machine readable instructions of the event handler. The default system behavior is not implemented responsive to the event occurrence.
    Type: Application
    Filed: August 10, 2005
    Publication date: February 15, 2007
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lisa Abbott, Daniel Badt, John Eckhart, Harvey Ruback, Steven Woodward
  • Publication number: 20070038461
    Abstract: An in-vehicle system that shares speech processing resources among multiple applications located within a vehicle. The system can include one or more software applications, each associated with different functionally independent in-vehicle consoles. Each application can have a console specific user interface. The system can also include a single in-vehicle speech processing system implemented separately from the in-vehicle consoles. The speech processing system can execute speech processing tasks responsive to requests received from the applications. That is, the in-vehicle speech processing system can provide speech processing capabilities for the applications. The provided speech processing capabilities can include text-to-speech capabilities and speech recognition capabilities.
    Type: Application
    Filed: August 10, 2005
    Publication date: February 15, 2007
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lisa Abbott, Daniel Badt, Werayuth Charoenruengkit, John Eckhart, Michael Florio, Gary Hanson, Harvey Ruback, William Whitehead, Steven Woodward
  • Publication number: 20070038454
    Abstract: A speech recognition system (10) or method (20) can include a speech input device and a processor (14) coupled to the speech input. The processor can be programmed to identify (22) a plurality of words that are members of confusable pairs of words where each pair includes a target word and a substituted word. The processor can degrade (24) a pronunciation of the substituted word to provide a worse pronunciation of the substituted word. The processor can further compare (28) the pronunciation of the target word with the worse pronunciation to the substituted word. The processor can be further programmed to reduce (26) confusion between the substituted word and other words in a recognition grammar of the speech recognition engine and can also narrow the scope within which the substituted word is recognized.
    Type: Application
    Filed: August 10, 2005
    Publication date: February 15, 2007
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: John Eckhart, Harvey Ruback
  • Publication number: 20060235694
    Abstract: A method of integrating conversational speech into a multimodal, Web-based processing model can include speech recognizing a user spoken utterance directed to a voice-enabled field of a multimodal markup language document presented within a browser. A statistical grammar can be used to determine a recognition result. The method further can include providing the recognition result to the browser, receiving, within a natural language understanding (NLU) system, the recognition result from the browser, and semantically processing the recognition result to determine a meaning. Accordingly, a next programmatic action to be performed can be selected according to the meaning.
    Type: Application
    Filed: April 14, 2005
    Publication date: October 19, 2006
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Charles Cross, Brien Muschett, Harvey Ruback, Leslie Wilson
  • Publication number: 20060100866
    Abstract: A system for influencing a signal-to-noise ratio (SNR) associated with a signal input to an automatic speech recognition device is provided. The system includes a normalized energy module that determines a normalized energy measurement based upon a spectrum of frequency-domain complex coefficients, the coefficients generated by the automatic speech recognition device. The system also includes an SNR module that generates an SNR measurement. The SNR measurement can be based upon a comparison of speech and non-speech portions of the signal input to the automatic speech recognition device. The system further includes a cue module that provides a cue to a user of the automatic speech recognition device, the cue being based upon the SNR measurement.
    Type: Application
    Filed: October 28, 2004
    Publication date: May 11, 2006
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Neal Alewine, John Eckhart, Harvey Ruback, Josef Vopieka
  • Patent number: 6738742
    Abstract: A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.
    Type: Grant
    Filed: February 11, 2003
    Date of Patent: May 18, 2004
    Assignee: International Business Machines Corporation
    Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey Ruback, Carl A. Smith, Ronald E. Vanbuskirk, Huifang Wang, Steven G. Woodward
  • Patent number: 6674451
    Abstract: A method for enabling a user to proactively reduce the likelihood of audio feedback in an application requiring audio input and output, comprising the steps of: generating a graphical user interface (GUI) display screen including a first area for displaying information about preventing audio feedback and a second area for user selections and controls; displaying a list of available audio outputs in the second area; prompting the user to select one of the audio outputs from the list; prompting the user to select one of a plurality of muting options for each selected one of the audio outputs; and, displaying in the GUI display screen an explanation for each one of the plurality of muting options, whereby muting selections for proactively reducing the likelihood of audio feedback can be made based on user experience and knowledge. Only one of the muting option explanations is displayed at a time, responsive to the user selection of one of the muting options.
    Type: Grant
    Filed: February 25, 1999
    Date of Patent: January 6, 2004
    Assignee: International Business Machines Corporation
    Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Ronald Van Buskirk, Harvey Ruback
  • Publication number: 20030130850
    Abstract: A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.
    Type: Application
    Filed: February 11, 2003
    Publication date: July 10, 2003
    Applicant: International Business Machines Corporation
    Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey Ruback, Carl A. Smith, Ronald E. Vanbuskirk, Huifang Wang, Steven G. Woodward
  • Patent number: 6542868
    Abstract: A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.
    Type: Grant
    Filed: September 23, 1999
    Date of Patent: April 1, 2003
    Assignee: International Business Machines Corporation
    Inventors: Daniel E. Badt, Peter J. Guasti, Gary R. Hanson, Amado Nassiff, Edwin A. Rodriguez, Harvey Ruback, Carl A. Smith, Ronald E. Vanbuskirk, Huifang Wang, Steven G. Woodward
  • Patent number: 6504553
    Abstract: A method for guiding a user through trouble shooting a wrong audio source among a plurality of audio sources, comprises the steps of: (a) generating a GUI display screen including a first area for displaying information about testing an audio input device, a second area for displaying instructions and status information and for providing dynamic feedback and a third area for user selections and controls; (b) displaying a list of available audio sources in said third area; (c) prompting said user to select an audio source from said list; (d) prompting said user to test said selected audio source; (e) in the event said test is unsuccessful, prompting said user to select any other audio source identified in said third area; (f) prompting said user to test said any other audio source in said list; and, (g) repeating steps (e) and (f) until one of said audio sources is tested successfully or each of said audio sources is tested unsuccessfully.
    Type: Grant
    Filed: February 25, 1999
    Date of Patent: January 7, 2003
    Assignee: International Business Machines Corporation
    Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Harvey Ruback, Ronald Van Buskirk
  • Patent number: 6492999
    Abstract: A method for connecting and optimizing audio input devices, comprises the steps of: determining an audio input type; generating a first GUI display screen for prompting and enabling user selection of an audio input device; generating a second GUI display screen for prompting and enabling user connection of the audio input device; testing the connected audio input device; configuring audio settings of the connected audio input device; and, storing for later retrieval an association of the connected audio input device and the configured audio settings. The audio settings are configured and the association is stored only if the testing step is successful. The second GUI display screen can include a device specific image and device specific instructions.
    Type: Grant
    Filed: February 25, 1999
    Date of Patent: December 10, 2002
    Assignee: International Business Machines Corporation
    Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Ronald Van Buskirk, Harvey Ruback
  • Publication number: 20020180772
    Abstract: A method for guiding a user through trouble shooting a wrong audio source among a plurality of audio sources, comprises the steps of: (a) generating a GUI display screen including a first area for displaying information about testing an audio input device, a second area for displaying instructions and status information and for providing dynamic feedback and a third area for user selections and controls; (b) displaying a list of available audio sources in said third area; (c) prompting said user to select an audio source from said list; (d) prompting said user to test said selected audio source; (e) in the event said test is unsuccessful, prompting said user to select any other audio source identified in said third area; (f) prompting said user to test said any other audio source in said list; and, (g) repeating steps (e) and (f) until one of said audio sources is tested successfully or each of said audio sources is tested unsuccessfully.
    Type: Application
    Filed: February 25, 1999
    Publication date: December 5, 2002
    Inventors: FRANK FADO, PETER GUASTI, AMADO NASSIFF, HARVEY RUBACK, RONALD VAN BUSKIRK
  • Publication number: 20020180775
    Abstract: A method for connecting and optimizing audio input devices, comprises the steps of: determining an audio input type; generating a first GUI display screen for prompting and enabling user selection of an audio input device; generating a second GUI display screen for prompting and enabling user connection of the audio input device; testing the connected audio input device; configuring audio settings of the connected audio input device; and, storing for later retrieval an association of the connected audio input device and the configured audio settings. The audio settings are configured and the association is stored only if the testing step is successful. The second GUI display screen can include a device specific image and device specific instructions.
    Type: Application
    Filed: February 25, 1999
    Publication date: December 5, 2002
    Inventors: FRANK FADO, PETER GUASTI, AMADO NASSIFF, RONALD VAN BUSKIRK, HARVEY RUBACK
  • Patent number: 6456973
    Abstract: In a computer system adapted for text-to-speech playback, a method for instructing a user in performing a task having a plurality of steps can include retrieving a textual instruction from a location in an electronic storage device of the computer system. The textual instruction can correspond to one or more of the steps in the task. The textual instruction can be displayed in a task automation user interface, and a text-to-speech (TTS) conversion of the textual instruction can be executed. The steps can be repeated until all textual instructions corresponding to each step in the task have been retrieved and TTS converted.
    Type: Grant
    Filed: October 12, 1999
    Date of Patent: September 24, 2002
    Assignee: International Business Machines Corp.
    Inventors: Frank Fado, Peter J. Guasti, Amado Nassiff, Harvey Ruback, Ronald E. VanBuskirk
  • Patent number: 6342903
    Abstract: A method for enabling user selectable input devices for dictation or transcription in a speech application, comprising the steps of: establishing a registry of dictation and transcription device descriptions, each of the descriptions including a device specific image, a device specific set of device-connecting instructions and a device specific list of audio configuration parameters; building dynamic tables containing information retrieved from the registry; establishing and storing a plurality of enrollments, each of the enrollments representing a speech file of user specific training data corresponding to at least one of a specific audio input device and a specific audio environment; and, generating GUI display screen using the information in at least one of the dynamic tables to enable user selection any input device in the registry for which one of the enrollments is available, for use as a dictation or transcription input to the speech application.
    Type: Grant
    Filed: February 25, 1999
    Date of Patent: January 29, 2002
    Assignee: International Business Machines Corp.
    Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Ronald Van Buskirk, Harvey Ruback
  • Patent number: 6275805
    Abstract: A method for maintaining input device identity in a speech application, comprising the steps of: storing a plurality of enrollments, each of the enrollments representing a speech file of training data associated with at least one of a specific audio input device and a specific audio environment for a specific user; generating a graphical user interface (GUI) display screen for prompting and enabling user selection of at least one of an audio input device and an audio environment; and, retrieving one of the enrollments responsive to the user selection, for use in a dictation or transcription session.
    Type: Grant
    Filed: February 25, 1999
    Date of Patent: August 14, 2001
    Assignee: International Business Machines Corp.
    Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Ronald Van Buskirk, Harvey Ruback