Patents by Inventor Oscar J. Blass

Oscar J. Blass has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8234120
    Abstract: The present invention discloses a solution for assuring user-defined voice commands are unambiguous. The solution can include a step of identifying a user attempt to enter a user-defined voice command into a voice-enabled system. A safety analysis can be performed on the user-defined voice command to determine a likelihood that the user-defined voice command will be confused with preexisting voice commands recognized by the voice-enabled system. When a high likelihood of confusion is determined by the safety analysis, a notification can be presented that the user-defined voice command is subject to confusion. A user can then define a different voice command or can choose to continue to use the potentially confusing command, possibly subject to a system imposed confusion mitigating condition or action.
    Type: Grant
    Filed: July 26, 2006
    Date of Patent: July 31, 2012
    Assignee: Nuance Communications, Inc.
    Inventors: Ciprian Agapi, Oscar J. Blass, Brennan D. Monteiro, Roberto Vila
  • Patent number: 8170877
    Abstract: A method for producing speech output can include the step of selecting a TTS output device from a plurality of available output devices. The selected output device can be associated with outputting content of an application responsive to a print command. According to the method, the print command can be detected, which results in the content of the application being conveyed to the selected TTS output device. The TTS output device can be associated with at least one text-to-speech engine. Upon content conveyance to the TTS output device, at least a portion of the content can be automatically converted using the text-to-speech engine. The speech converted content can be outputted.
    Type: Grant
    Filed: June 20, 2005
    Date of Patent: May 1, 2012
    Assignee: Nuance Communications, Inc.
    Inventors: Ciprian Agapi, Oscar J. Blass, Charles T. Rutherfoord
  • Patent number: 8131548
    Abstract: A speech processing method can automatically and dynamically adjust speech grammar weights at runtime based upon usage data. Each of the speech grammar weights can be associated with an available speech command contained within a speech grammar to which the speech grammar weights apply. The usage data can indicate a relative frequency with which each of the available speech commands is utilized.
    Type: Grant
    Filed: March 6, 2006
    Date of Patent: March 6, 2012
    Assignee: Nuance Communications, Inc.
    Inventors: Oscar J. Blass, Harvey M. Ruback, Roberto Vila
  • Patent number: 8027437
    Abstract: A method for delivering a message to a recipient in an environment with ambient noise includes the steps of recording the ambient noise in the environment at a certain time interval, analyzing the recorded ambient noise to obtain an average power P_noise or an RMS amplitude A_noise of the ambient noise, providing a predetermined desired SNR_desired, calculating an average signal power P_signal or an RMS amplitude A_signal of the message to be delivered based on the P_noise or A_noise and the desired SNR_desired, and adjusting a volume of the message to be delivered according to the P_signal or A_signal. Alternatively, the actual SNR_actual will be computed and the message will be repeated if the SNR_actual falls below the SNR_min. Systems for delivering a message to a recipient in an environment with ambient noise and computer-readable media having computer-executable instructions for carrying out the methods are also provided.
    Type: Grant
    Filed: December 18, 2006
    Date of Patent: September 27, 2011
    Assignee: Nuance Communications, Inc.
    Inventors: Paritosh D. Patel, Oscar J. Blass, Roberto Vila, Jie Z. Zeng, Anatol Blass
  • Patent number: 8019605
    Abstract: The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can automatically process pre-recorded audio to derive speech assets for a concatenative TTS voice. The pre-recorded audio can include sets of recorded phrases used by a speech user interface (SUI). A set of unfulfilled speech assets needed for full phonetic coverage of the concatenative TTS voice can be determined. A reduced script can be constructed that includes a set of phrases, which when read by a voice talent result in a reduced corpus. When the reduced corpus is automatically processed, a reduced set of speech assets result. The reduced set includes each of the unfulfilled speech assets. When this reduced corpus is combined with existing speech assets the result will be a voice with a complete set of speech assets.
    Type: Grant
    Filed: May 14, 2007
    Date of Patent: September 13, 2011
    Assignee: Nuance Communications, Inc.
    Inventors: Ciprian Agapi, Oscar J. Blass, Paritosh D. Patel, Roberto Vila
  • Patent number: 7747445
    Abstract: The present invention discloses a method for a voice-enabled computing environment. The method includes a step of receiving a series of voice commands related to an abstraction, wherein the voice command specifies an abstraction type. The method further includes responding to the voice command to replay the commands, executing at least one programmatic action related to the abstraction, where the programmatic action is executed demonstrating characteristics specific to the abstraction type associated with the voice command. The present method applies, but is not limited to, the execution of collections of commands that may have diverging timing characteristics.
    Type: Grant
    Filed: July 12, 2006
    Date of Patent: June 29, 2010
    Assignee: Nuance Communications, Inc.
    Inventors: Ciprian Agapi, Musaed A. Almutawa, Oscar J. Blass, Brennan D. Monteiro, Roberto Vila
  • Patent number: 7747446
    Abstract: A system and method for voice recognition interaction is provided. The system can have a processor for receiving a voice signal and determining a command based on the voice signal. The system can also have a confirmation interface operably connected to the processor, where the confirmation interface is capable of receiving a confirmation signal from a user and providing the confirmation signal to the processor. The system can have a user identifying device for determining an identity of the user. The processor can determine a confirmation criteria based at least in part on the identity of the user or a type of the command. The satisfaction of the confirmation criteria can be applied to allow or prevent performance of the command.
    Type: Grant
    Filed: December 12, 2006
    Date of Patent: June 29, 2010
    Assignee: Nuance Communications, Inc.
    Inventors: Oscar J. Blass, Musaed A. Almutawa, Paritosh D. Patel, Robert Vila
  • Publication number: 20090043583
    Abstract: The present invention discloses a solution for customizing synthetic voice characteristics in a user specific fashion. The solution can establish a communication between a user and a voice response system. A data store can be searched for a speech profile associated with the user. When a speech profile is found, a set of speech output characteristics established for the user from the profile can be determined. Parameters and settings of a text-to-speech engine can be adjusted in accordance with the determined set of speech output characteristics. During the established communication, synthetic speech can be generated using the adjusted text-to-speech engine. Thus, each detected user can hear a synthetic speech generated by a different voice specifically selected for that user. When no user profile is detected, a default voice or a voice based upon a user's speech or communication details can be used.
    Type: Application
    Filed: August 8, 2007
    Publication date: February 12, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Ciprian AGAPI, Oscar J. BLASS, Oswaldo GAGO, Roberto VILA
  • Publication number: 20080312929
    Abstract: The present invention discloses a text-to-speech system that provides output variability. The system can include a finite state grammar, a variability engine and a text-to-speech engine. The finite state grammar can contain a phrase rule consisting of one or more phrase elements. The phrase rule can deterministically generate a variable text phrase based upon at least one random number. The phrase rule can include a definition for each of the phrase elements. Each definition can be associated with at least one defined text string. The variability engine can construct a random text phrase responsive to receiving an action command, wherein said finite state grammar is used to create the text phrase. The variability engine can also rely on user-specified weights to adjust the output probabilities. The text-to-speech engine can convert the text phrase generated by the variability engine into speech output.
    Type: Application
    Filed: June 12, 2007
    Publication date: December 18, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: OSCAR J. BLASS, PARITOSH D. PATEL, HARVEY M. RUBACK, ROBERTO VILA
  • Publication number: 20080288256
    Abstract: The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can automatically process pre-recorded audio to derive speech assets for a concatenative TTS voice. The pre-recorded audio can include sets of recorded phrases used by a speech user interface (SUI). A set of unfulfilled speech assets needed for full phonetic coverage of the concatenative TTS voice can be determined. A reduced script can be constructed that includes a set of phrases, which when read by a voice talent result in a reduced corpus. When the reduced corpus is automatically processed, a reduced set of speech assets result. The reduced set includes each of the unfulfilled speech assets. When this reduced corpus is combined with existing speech assets the result will be a voice with a complete set of speech assets.
    Type: Application
    Filed: May 14, 2007
    Publication date: November 20, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: CIPRIAN AGAPI, OSCAR J. BLASS, PARITOSH D. PATEL, ROBERTO VILA
  • Publication number: 20080282185
    Abstract: The present invention discloses a method and system for notifying a user of a password/passcode change event. In the invention, a computing system can automatically detect a password/passcode change event. At least one user affected by the event can be determined. A pre-existing user defined message can be retrieved from a data store. A password/passcode change message can be automatically conveyed to the affected user. The change message can include content of the pre-existing user defined message. Further, in one embodiment, an automatically generated activity report associated with a password/passcode can be presented to either a user making a password/passcode change or as part of the conveyed change message.
    Type: Application
    Filed: May 11, 2007
    Publication date: November 13, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: OSCAR J. BLASS, PATRICK COMMARFORD, RODRIGO J. PASTRANA, ROBERTO VILA
  • Publication number: 20080282164
    Abstract: The present invention discloses a method for handling contact information in a communication device. The method can include a step of presenting content within a visual interface of the communication device. A contact detection algorithm can execute to detect contact items included in the content. A visual indication can be presented next to each detected contact item. User input can be received, which causes the communication device to perform a contact action relating to one of the contact items. The contact action can add contact information to an address book, can initiate a communication session, and/or can place the contact item in a memory space for later use by an application other than one in which the content was presented.
    Type: Application
    Filed: May 11, 2007
    Publication date: November 13, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: OSCAR J. BLASS, MEN T. LIEN, RODRIGO JOSE PASTRANA, ROBERTO VILA
  • Publication number: 20080262847
    Abstract: The present invention discloses a concept and a use of audio anchors within voice-enabled interfaces. Audio anchors can be user configurable points from which audio playback occurs. In the invention, a user can identify an interface position at which an audio anchor is to be established. The computing device can determine an anchor direction setting, with values that include forward playback and backward playback. Interface items can then be audibly enumerated from the audio anchor in a direction indicated by the anchor direction setting. For example, if a set of interface items are alphabetically ordered items and if an audio anchor is set at a first item beginning with a letter “G” and an anchor direction is set to indicate backward playback, then the interface items beginning with letters “A-F” can be audibly played in reverse alphabetical order. Additionally, a rate of audio playback can be user adjustable.
    Type: Application
    Filed: April 19, 2007
    Publication date: October 23, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Ciprian Agapi, Oscar J. Blass, Paritosh D. Patel, Roberto Vila
  • Publication number: 20080147386
    Abstract: A method for delivering a message to a recipient in an environment with ambient noise includes the steps of recording the ambient noise in the environment at a certain time interval, analyzing the recorded ambient noise to obtain an average power P_noise or an RMS amplitude A_noise of the ambient noise, providing a predetermined desired SNR_desired, calculating an average signal power P_signal or an RMS amplitude A_signal of the message to be delivered based on the P_noise or A_noise and the desired SNR_desired, and adjusting a volume of the message to be delivered according to the P_signal or A_signal. Alternatively, the actual SNR_actual will be computed and the message will be repeated if the SNR_actual falls below the SNR_min. Systems for delivering a message to a recipient in an environment with ambient noise and computer-readable media having computer-executable instructions for carrying out the methods are also provided.
    Type: Application
    Filed: December 18, 2006
    Publication date: June 19, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Paritosh D. Patel, Oscar J. Blass, Roberto Vila, Jie Z. Zeng, Anatol Blass
  • Publication number: 20080140400
    Abstract: A system and method for voice recognition interaction is provided. The system can have a processor for receiving a voice signal and determining a command based on the voice signal. The system can also have a confirmation interface operably connected to the processor, where the confirmation interface is capable of receiving a confirmation signal from a user and providing the confirmation signal to the processor. The system can have a user identifying device for determining an identity of the user. The processor can determine a confirmation criteria based at least in part on the identity of the user or a type of the command. The satisfaction of the confirmation criteria can be applied to allow or prevent performance of the command.
    Type: Application
    Filed: December 12, 2006
    Publication date: June 12, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: OSCAR J. BLASS, Musaed A. Almutawa, Paritosh D. Patel, Robert Vila
  • Publication number: 20080059446
    Abstract: A method for searching Web pages that begins with the identification of query criteria entered into a search provider. A set of Web pages that satisfies the query criteria are determined. Then, a page ranking is ascertained for each Web page in the set. The Web pages are presented in order by page ranking. The page ranking is based upon at least one relevancy factor that includes a browsing-time factor. The browsing-time factor can be calculated from browsing behavior exhibited by users, who provided similar query criteria. The set of users from which the browsing-time factor is calculated can include a current user, a set of users sharing characteristics with the current user, and/or a general set of users. Browsing behavior can include time spent at a Web page, where the browsed Web page is a page that was previously presented as a search result for the similar query criteria.
    Type: Application
    Filed: July 26, 2006
    Publication date: March 6, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: OSCAR J. BLASS, OSWALDO GAGO, BRENNAN D. MONTEIRO, PARITOSH D. PATEL, ROBERTO VILA
  • Publication number: 20080046250
    Abstract: The present invention discloses a solution for assuring user-defined voice commands are unambiguous. The solution can include a step of identifying a user attempt to enter a user-defined voice command into a voice-enabled system. A safety analysis can be performed on the user-defined voice command to determine a likelihood that the user-defined voice command will be confused with preexisting voice commands recognized by the voice-enabled system. When a high likelihood of confusion is determined by the safety analysis, a notification can be presented that the user-defined voice command is subject to confusion. A user can then define a different voice command or can choose to continue to use the potentially confusing command, possibly subject to a system imposed confusion mitigating condition or action.
    Type: Application
    Filed: July 26, 2006
    Publication date: February 21, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: CIPRIAN AGAPI, OSCAR J. BLASS, BRENNAN D. MONTEIRO, ROBERTO VILA
  • Publication number: 20080046251
    Abstract: A method for state transition in voice systems including: generating one or more stackable state macros, each of the one or more stackable state macros including a plurality of commands; saving the current state before executing another macro; enabling restoring the previous state after a plurality of commands is completed, allowing a user to utter voice commands to restore the individual state of components or the voice systems as a whole to the previous state or to a known home state.
    Type: Application
    Filed: August 15, 2006
    Publication date: February 21, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Ciprian Agapi, Musaed A. Almutawa, Oscar J. Blass, Patrick M. Commarford, Roberto Vila
  • Publication number: 20080015863
    Abstract: The present invention discloses a method for a voice-enabled computing environment. The method includes a step of receiving a series of voice commands related to an abstraction, wherein the voice command specifies an abstraction type. The method further includes responding to the voice command to replay the commands, executing at least one programmatic action related to the abstraction, where the programmatic action is executed demonstrating characteristics specific to the abstraction type associated with the voice command. The present method applies, but is not limited to, the execution of collections of commands that may have diverging timing characteristics.
    Type: Application
    Filed: July 12, 2006
    Publication date: January 17, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: CIPRIAN AGAPI, MUSAED A. ALMUTAWA, OSCAR J. BLASS, BRENNAN D. MONTEIRO, ROBERTO VILA
  • Patent number: 7280968
    Abstract: A method for digitally generating speech with improved prosodic characteristics can include receiving a speech input, determining at least one prosodic characteristic contained within the speech input, and generating a speech output including the prosodic characteristic within the speech output.
    Type: Grant
    Filed: March 25, 2003
    Date of Patent: October 9, 2007
    Assignee: International Business Machines Corporation
    Inventor: Oscar J. Blass
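
Illustrative note on patent 8027437 (ambient-noise message delivery): the abstract describes deriving a target signal level from the measured noise level and a desired signal-to-noise ratio, then either adjusting the volume or repeating the message when the actual SNR is too low. The sketch below is one possible reading of those steps, not the patented implementation. It assumes SNR is expressed in decibels as 20 * log10(A_signal / A_noise); the abstract does not state the units, and all function names here are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square amplitude of a non-empty sequence of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def target_signal_rms(noise_rms, snr_desired_db):
    """RMS amplitude the message must reach to hit the desired SNR.

    Assumption: SNR_dB = 20 * log10(A_signal / A_noise).
    """
    return noise_rms * 10 ** (snr_desired_db / 20)


def volume_gain(message_rms, noise_rms, snr_desired_db):
    """Linear gain to apply to the message (message_rms must be > 0)."""
    return target_signal_rms(noise_rms, snr_desired_db) / message_rms


def actual_snr_db(message_rms, noise_rms):
    """SNR in dB of a message as delivered (both arguments must be > 0)."""
    return 20 * math.log10(message_rms / noise_rms)


def should_repeat(message_rms, noise_rms, snr_min_db):
    """True when the delivered message fell below the minimum SNR."""
    return actual_snr_db(message_rms, noise_rms) < snr_min_db
```

For example, with a noise RMS of 0.1 and a desired SNR of 20 dB, the target message RMS is about 1.0, so a message recorded at RMS 0.5 would need a gain of about 2.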