Patents by Inventor Oscar J. Blass

Oscar J. Blass has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Performing a safety analysis for user-defined voice commands to ensure that the voice commands do not cause speech recognition ambiguities

Patent number: 8234120

Abstract: The present invention discloses a solution for assuring user-defined voice commands are unambiguous. The solution can include a step of identifying a user attempt to enter a user-defined voice command into a voice-enabled system. A safety analysis can be performed on the user-defined voice command to determine a likelihood that the user-defined voice command will be confused with preexisting voice commands recognized by the voice-enabled system. When a high likelihood of confusion is determined by the safety analysis, a notification can be presented that the user-defined voice command is subject to confusion. A user can then define a different voice command or can choose to continue to use the potentially confusing command, possibly subject to a system imposed confusion mitigating condition or action.

Type: Grant

Filed: July 26, 2006

Date of Patent: July 31, 2012

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Oscar J. Blass, Brennan D. Monteiro, Roberto Vila
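The safety analysis described in the abstract of patent 8234120 above can be illustrated with a minimal Python sketch. It is only a stand-in: it compares the spelling of commands with the standard library's difflib, whereas a real voice-enabled system would more plausibly compare phonetic or acoustic representations. The function name, the 0.8 threshold and the sample commands are assumptions made for illustration and are not taken from the patent.

```python
from difflib import SequenceMatcher

def confusable_commands(candidate, existing_commands, threshold=0.8):
    """Return existing commands whose text is very similar to the candidate.

    Text similarity is a hypothetical proxy for speech recognition confusion.
    """
    flagged = []
    for command in existing_commands:
        similarity = SequenceMatcher(None, candidate.lower(), command.lower()).ratio()
        if similarity >= threshold:
            flagged.append(command)
    return flagged

existing = ["call home", "play music"]
conflicts = confusable_commands("call phone", existing)
if conflicts:
    # The user could pick a different command or keep this one anyway.
    print("Warning: may be confused with", conflicts)
```

A non-empty result corresponds to the notification step in the abstract; what happens next (renaming the command or keeping it under a mitigating condition) is left to the caller.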
Printing to a text-to-speech output device

Patent number: 8170877

Abstract: A method for producing speech output can include the step of selecting a TTS output device from a plurality of available output devices. The selected output device can be associated with outputting content of an application responsive to a print command. According to the method, the print command can be detected, which results in the content of the application being conveyed to the selected TTS output device. The TTS output device can be associated with at least one text-to-speech engine. Upon content conveyance to the TTS output device, at least a portion of the content can be automatically converted using the text-to-speech engine. The speech converted content can be outputted.

Type: Grant

Filed: June 20, 2005

Date of Patent: May 1, 2012

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Oscar J. Blass, Charles T. Rutherfoord
Dynamically adjusting speech grammar weights based on usage

Patent number: 8131548

Abstract: A speech processing method can automatically and dynamically adjust speech grammar weights at runtime based upon usage data. Each of the speech grammar weights can be associated with an available speech command contained within a speech grammar to which the speech grammar weights apply. The usage data can indicate a relative frequency with which each of the available speech commands is utilized.

Type: Grant

Filed: March 6, 2006

Date of Patent: March 6, 2012

Assignee: Nuance Communications, Inc.

Inventors: Oscar J. Blass, Harvey M. Ruback, Roberto Vila
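One simple way to picture the usage-based weighting described in the abstract of patent 8131548 above is to set each command's weight to its relative frequency in a usage log. The Python sketch below does that. The function name, the additive smoothing term (which keeps unused commands recognizable) and the sample data are assumptions for illustration, not details from the patent.

```python
from collections import Counter

def update_grammar_weights(commands, usage_log, smoothing=1.0):
    """Map each available command to a weight proportional to how often it was used."""
    counts = Counter(usage_log)
    total = sum(counts[c] + smoothing for c in commands)
    return {c: (counts[c] + smoothing) / total for c in commands}

commands = ["play", "pause", "stop"]
usage_log = ["play", "play", "play", "stop"]
weights = update_grammar_weights(commands, usage_log)
for command, weight in weights.items():
    print(f"{command}: {weight:.2f}")
```

Calling the function again whenever the usage log grows would give the runtime adjustment the abstract refers to.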
System and method for improving message delivery in voice systems utilizing microphone and target signal-to-noise ratio

Patent number: 8027437

Abstract: A method for delivering a message to a recipient in an environment with ambient noise includes the steps of recording the ambient noise in the environment at a certain time interval, analyzing the recorded ambient noise to obtain an average power P_noise or an RMS amplitude A_noise of the ambient noise, providing a predetermined desired SNR_desired, calculating an average signal power P_signal or an RMS amplitude A_signal of the message to be delivered based on the P_noise or A_noise and the desired SNR_desired, and adjusting a volume of the message to be delivered according to the P_signal or A_signal. Alternatively, the actual SNR_actual will be computed and the message will be repeated if the SNR_actual falls below the SNR_min. Systems for delivering a message to a recipient in an environment with ambient noise and computer-readable media having computer-executable instructions for carrying out the methods are also provided.

Type: Grant

Filed: December 18, 2006

Date of Patent: September 27, 2011

Assignee: Nuance Communications, Inc.

Inventors: Paritosh D. Patel, Oscar J. Blass, Roberto Vila, Jie Z. Zeng, Anatol Blass
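As a rough illustration of the volume calculation described in the abstract of patent 8027437 above, the Python sketch below derives a playback gain from recorded noise samples and a target signal-to-noise ratio. It assumes the SNR is expressed in decibels on RMS amplitudes, that is 20 * log10(A_signal / A_noise), and that the noise is not pure silence. The function names and sample values are hypothetical and not taken from the patent.

```python
import math

def rms(samples):
    """Root-mean-square amplitude of a sequence of audio samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def playback_gain(noise_samples, message_samples, snr_desired_db):
    """Gain that scales the message to the RMS amplitude needed for the target SNR."""
    a_noise = rms(noise_samples)
    a_signal = a_noise * 10 ** (snr_desired_db / 20)  # required message amplitude
    return a_signal / rms(message_samples)

def needs_repeat(noise_samples, delivered_samples, snr_min_db):
    """True when the SNR actually achieved falls below the minimum."""
    snr_actual_db = 20 * math.log10(rms(delivered_samples) / rms(noise_samples))
    return snr_actual_db < snr_min_db

noise = [0.1, -0.1, 0.1, -0.1]
message = [0.5, -0.5, 0.5, -0.5]
print(playback_gain(noise, message, snr_desired_db=20))
print(needs_repeat(noise, message, snr_min_db=20))
```

Working in average power instead of amplitude would use 10 * log10(P_signal / P_noise); the two forms agree because power is proportional to the square of amplitude.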
Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets

Patent number: 8019605

Abstract: The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can automatically process pre-recorded audio to derive speech assets for a concatenative TTS voice. The pre-recorded audio can include sets of recorded phrases used by a speech user interface (SUI). A set of unfulfilled speech assets needed for full phonetic coverage of the concatenative TTS voice can be determined. A reduced script can be constructed that includes a set of phrases, which when read by a voice talent result in a reduced corpus. When the reduced corpus is automatically processed, a reduced set of speech assets result. The reduced set includes each of the unfulfilled speech assets. When this reduced corpus is combined with existing speech assets the result will be a voice with a complete set of speech assets.

Type: Grant

Filed: May 14, 2007

Date of Patent: September 13, 2011

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Oscar J. Blass, Paritosh D. Patel, Roberto Vila
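The idea of a reduced script in the abstract of patent 8019605 above can be pictured as a coverage problem: find the speech assets still missing after the pre-recorded audio is processed, then pick phrases that supply them. The Python sketch below uses a greedy set-cover heuristic. This is a swapped-in technique chosen for illustration, since the abstract does not say how phrases are selected. The unit labels and phrase names are hypothetical.

```python
def build_reduced_script(required_units, existing_units, candidate_phrases):
    """Greedily choose phrases until every unfulfilled unit is covered.

    candidate_phrases maps a phrase to the set of units it would supply.
    Returns the chosen phrases and any units no candidate can cover.
    """
    unfulfilled = set(required_units) - set(existing_units)
    script = []
    while unfulfilled:
        # Pick the phrase that supplies the most still-missing units.
        best = max(candidate_phrases, key=lambda p: len(unfulfilled & candidate_phrases[p]))
        gained = unfulfilled & candidate_phrases[best]
        if not gained:
            break  # remaining units cannot be covered by any candidate
        script.append(best)
        unfulfilled -= gained
    return script, unfulfilled

required = {"a", "b", "c", "d", "e"}
existing = {"a"}  # assets already derived from pre-recorded audio
candidates = {"p1": {"b", "c"}, "p2": {"c"}, "p3": {"d", "e", "b"}}
print(build_reduced_script(required, existing, candidates))
```

Greedy selection does not guarantee the shortest possible script, but it keeps the example short and usually comes close.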
Distinguishing among different types of abstractions consisting of plurality of commands specified by particular sequencing and or timing or no timing and sequencing using voice commands

Patent number: 7747445

Abstract: The present invention discloses a method for a voice-enabled computing environment. The method includes a step of receiving a series of voice commands related to an abstraction, wherein the voice command specifies an abstraction type. The method further includes responding to the voice command to replay the commands, executing at least one programmatic action related to the abstraction, where the programmatic action is executed demonstrating characteristics specific to the abstraction type associated with the voice command. The present method applies, but is not limited to, the execution of collections of commands that may have diverging timing characteristics.

Type: Grant

Filed: July 12, 2006

Date of Patent: June 29, 2010

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Musaed A. Almutawa, Oscar J. Blass, Brennan D. Monteiro, Roberto Vila
Voice recognition interactive system with a confirmation capability

Patent number: 7747446

Abstract: A system and method for voice recognition interaction is provided. The system can have a processor for receiving a voice signal and determining a command based on the voice signal. The system can also have a confirmation interface operably connected to the processor, where the confirmation interface is capable of receiving a confirmation signal from a user and providing the confirmation signal to the processor. The system can have a user identifying device for determining an identity of the user. The processor can determine a confirmation criteria based at least in part on the identity of the user or a type of the command. The satisfaction of the confirmation criteria can be applied to allow or prevent performance of the command.

Type: Grant

Filed: December 12, 2006

Date of Patent: June 29, 2010

Assignee: Nuance Communications, Inc.

Inventors: Oscar J. Blass, Musaed A. Almutawa, Paritosh D. Patel, Robert Vila
DYNAMIC MODIFICATION OF VOICE SELECTION BASED ON USER SPECIFIC FACTORS

Publication number: 20090043583

Abstract: The present invention discloses a solution for customizing synthetic voice characteristics in a user specific fashion. The solution can establish a communication between a user and a voice response system. A data store can be searched for a speech profile associated with the user. When a speech profile is found, a set of speech output characteristics established for the user from the profile can be determined. Parameters and settings of a text-to-speech engine can be adjusted in accordance with the determined set of speech output characteristics. During the established communication, synthetic speech can be generated using the adjusted text-to-speech engine. Thus, each detected user can hear a synthetic speech generated by a different voice specifically selected for that user. When no user profile is detected, a default voice or a voice based upon a user's speech or communication details can be used.

Type: Application

Filed: August 8, 2007

Publication date: February 12, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Ciprian AGAPI, Oscar J. BLASS, Oswaldo GAGO, Roberto VILA
USING FINITE STATE GRAMMARS TO VARY OUTPUT GENERATED BY A TEXT-TO-SPEECH SYSTEM

Publication number: 20080312929

Abstract: The present invention discloses a text-to-speech system that provides output variability. The system can include a finite state grammar, a variability engine and a text-to-speech engine. The finite state grammar can contain a phrase rule consisting of one or more phrase elements. The phrase rule can deterministically generate a variable text phrase based upon at least one random number. The phrase rule can include a definition for each of the phrase elements. Each definition can be associated with at least one defined text string. The variability engine can construct a random text phrase responsive to receiving an action command, wherein said finite state grammar is used to create the text phrase. The variability engine can also rely on user-specified weights to adjust the output probabilities. The text-to-speech engine can convert the text phrase generated by the variability engine into speech output.

Type: Application

Filed: June 12, 2007

Publication date: December 18, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: OSCAR J. BLASS, PARITOSH D. PATEL, HARVEY M. RUBACK, ROBERTO VILA
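The variability engine described in the abstract of publication 20080312929 above can be sketched in a few lines of Python. A phrase rule is modeled as an ordered list of phrase elements, each element has weighted text strings, and a seeded random number generator makes the output deterministic for a given random number. The grammar contents, the weights and the function name are assumptions for illustration. The sketch stops at the text phrase and does not call any text-to-speech engine.

```python
import random

# Hypothetical grammar: each phrase element maps to (text string, weight) pairs.
GRAMMAR = {
    "greeting": [("Hello", 3), ("Hi", 1), ("Good day", 1)],
    "status": [("your order has shipped", 1), ("your package is on its way", 1)],
    "closing": [("Thank you.", 2), ("Goodbye.", 1)],
}
PHRASE_RULE = ["greeting", "status", "closing"]

def generate_phrase(rule, grammar, seed):
    """Build one text phrase; the same seed always yields the same phrase."""
    rng = random.Random(seed)
    parts = []
    for element in rule:
        strings, weights = zip(*grammar[element])
        parts.append(rng.choices(strings, weights=weights, k=1)[0])
    return " ".join(parts)

print(generate_phrase(PHRASE_RULE, GRAMMAR, seed=7))
```

Raising a string's weight makes it more likely to be chosen, which corresponds to the user-specified weights mentioned in the abstract.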
REDUCING RECORDING TIME WHEN CONSTRUCTING A CONCATENATIVE TTS VOICE USING A REDUCED SCRIPT AND PRE-RECORDED SPEECH ASSETS

Publication number: 20080288256

Abstract: The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can automatically process pre-recorded audio to derive speech assets for a concatenative TTS voice. The pre-recorded audio can include sets of recorded phrases used by a speech user interface (SUI). A set of unfulfilled speech assets needed for full phonetic coverage of the concatenative TTS voice can be determined. A reduced script can be constructed that includes a set of phrases, which when read by a voice talent result in a reduced corpus. When the reduced corpus is automatically processed, a reduced set of speech assets result. The reduced set includes each of the unfulfilled speech assets. When this reduced corpus is combined with existing speech assets the result will be a voice with a complete set of speech assets.

Type: Application

Filed: May 14, 2007

Publication date: November 20, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: CIPRIAN AGAPI, OSCAR J. BLASS, PARITOSH D. PATEL, ROBERTO VILA
CUSTOMIZED MESSAGES FOR PASSWORD/PASSCODE CHANGE EVENTS

Publication number: 20080282185

Abstract: The present invention discloses a method and system for notifying a user of a password/passcode change event. In the invention, a computing system can automatically detect a password/passcode change event. At least one user affected by the event can be determined. A pre-existing user defined message can be retrieved from a data store. A password/passcode change message can be automatically conveyed to the affected user. The change message can include content of the pre-existing user defined message. Further, in one embodiment, an automatically generated activity report associated with a password/passcode can be presented to either a user making a password/passcode change or as part of the conveyed change message.

Type: Application

Filed: May 11, 2007

Publication date: November 13, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: OSCAR J. BLASS, PATRICK COMMARFORD, RODRIGO J. PASTRANA, ROBERTO VILA
INTERACTING WITH PHONE NUMBERS AND OTHER CONTACT INFORMATION CONTAINED IN BROWSER CONTENT

Publication number: 20080282164

Abstract: The present invention discloses a method for handling contact information in a communication device. The method can include a step of presenting content within a visual interface of the communication device. A contact detection algorithm can execute to detect contact items included in the content. A visual indication can be presented next to each detected contact item. User input can be received, which causes the communication device to perform a contact action relating to one of the contact items. The contact action can add contact information to an address book, can initiate a communication session, and/or can place the contact item in a memory space for later use by an application other than one in which the content was presented.

Type: Application

Filed: May 11, 2007

Publication date: November 13, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: OSCAR J. BLASS, MEN T. LIEN, RODRIGO JOSE PASTRANA, ROBERTO VILA
USER POSITIONABLE AUDIO ANCHORS FOR DIRECTIONAL AUDIO PLAYBACK FROM VOICE-ENABLED INTERFACES

Publication number: 20080262847

Abstract: The present invention discloses a concept and a use of audio anchors within voice-enabled interfaces. Audio anchors can be user configurable points from which audio playback occurs. In the invention, a user can identify an interface position at which an audio anchor is to be established. The computing device can determine an anchor direction setting, with values that include forward playback and backward playback. Interface items can then be audibly enumerated from the audio anchor in a direction indicated by the anchor direction setting. For example, if a set of interface items are alphabetically ordered items and if an audio anchor is set at a first item beginning with a letter “G” and an anchor direction is set to indicate backward playback, then the interface items beginning with letters “A-F” can be audibly played in reverse alphabetical order. Additionally, a rate of audio playback can be user adjustable.

Type: Application

Filed: April 19, 2007

Publication date: October 23, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Ciprian Agapi, Oscar J. Blass, Paritosh D. Patel, Roberto Vila
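The audio anchor example in the abstract of publication 20080262847 above (anchor at the first item beginning with "G", backward playback) can be traced with a short Python sketch. The function name and item list are hypothetical. The abstract only specifies the backward case, so the sketch assumes that forward playback starts at the anchor item itself.

```python
def enumerate_from_anchor(items, anchor_index, direction):
    """Return items in the order they would be audibly enumerated."""
    if direction == "forward":
        return items[anchor_index:]  # assumption: the anchor item is read first
    if direction == "backward":
        return items[:anchor_index][::-1]  # items before the anchor, reversed
    raise ValueError("direction must be 'forward' or 'backward'")

items = ["Apple", "Banana", "Fig", "Grape", "Guava", "Kiwi"]
# Place the anchor at the first item beginning with "G".
anchor = next(i for i, name in enumerate(items) if name.startswith("G"))
print(enumerate_from_anchor(items, anchor, "backward"))  # ['Fig', 'Banana', 'Apple']
```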
SYSTEM AND METHOD FOR IMPROVING MESSAGE DELIVERY IN VOICE SYSTEMS UTILIZING MICROPHONE AND TARGET SIGNAL-TO-NOISE RATIO

Publication number: 20080147386

Abstract: A method for delivering a message to a recipient in an environment with ambient noise includes the steps of recording the ambient noise in the environment at a certain time interval, analyzing the recorded ambient noise to obtain an average power P_noise or an RMS amplitude A_noise of the ambient noise, providing a predetermined desired SNR_desired, calculating an average signal power P_signal or an RMS amplitude A_signal of the message to be delivered based on the P_noise or A_noise and the desired SNR_desired, and adjusting a volume of the message to be delivered according to the P_signal or A_signal. Alternatively, the actual SNR_actual will be computed and the message will be repeated if the SNR_actual falls below the SNR_min. Systems for delivering a message to a recipient in an environment with ambient noise and computer-readable media having computer-executable instructions for carrying out the methods are also provided.

Type: Application

Filed: December 18, 2006

Publication date: June 19, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Paritosh D. Patel, Oscar J. Blass, Roberto Vila, Jie Z. Zeng, Anatol Blass
VOICE RECOGNITION INTERACTIVE SYSTEM

Publication number: 20080140400

Abstract: A system and method for voice recognition interaction is provided. The system can have a processor for receiving a voice signal and determining a command based on the voice signal. The system can also have a confirmation interface operably connected to the processor, where the confirmation interface is capable of receiving a confirmation signal from a user and providing the confirmation signal to the processor. The system can have a user identifying device for determining an identity of the user. The processor can determine a confirmation criteria based at least in part on the identity of the user or a type of the command. The satisfaction of the confirmation criteria can be applied to allow or prevent performance of the command.

Type: Application

Filed: December 12, 2006

Publication date: June 12, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: OSCAR J. BLASS, Musaed A. Almutawa, Paritosh D. Patel, Robert Vila
IMPROVING RESULTS FROM SEARCH PROVIDERS USING A BROWSING-TIME RELEVANCY FACTOR

Publication number: 20080059446

Abstract: A method for searching Web pages begins with the identification of query criteria entered into a search provider. A set of Web pages that satisfies the query criteria is determined. Then, a page ranking is ascertained for each Web page in the set. The Web pages are presented in order by page ranking. The page ranking is based upon at least one relevancy factor that includes a browsing-time factor. The browsing-time factor can be calculated from browsing behavior exhibited by users who provided similar query criteria. The set of users from which the browsing-time factor is calculated can include a current user, a set of users sharing characteristics with the current user, and/or a general set of users. Browsing behavior can include time spent at a Web page, where the browsed Web page is a page that was previously presented as a search result for the similar query criteria.

Type: Application

Filed: July 26, 2006

Publication date: March 6, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: OSCAR J. BLASS, OSWALDO GAGO, BRENNAN D. MONTEIRO, PARITOSH D. PATEL, ROBERTO VILA
PERFORMING A SAFETY ANALYSIS FOR USER-DEFINED VOICE COMMANDS TO ENSURE THAT THE VOICE COMMANDS DO NOT CAUSE SPEECH RECOGNITION AMBIGUITIES

Publication number: 20080046250

Abstract: The present invention discloses a solution for assuring user-defined voice commands are unambiguous. The solution can include a step of identifying a user attempt to enter a user-defined voice command into a voice-enabled system. A safety analysis can be performed on the user-defined voice command to determine a likelihood that the user-defined voice command will be confused with preexisting voice commands recognized by the voice-enabled system. When a high likelihood of confusion is determined by the safety analysis, a notification can be presented that the user-defined voice command is subject to confusion. A user can then define a different voice command or can choose to continue to use the potentially confusing command, possibly subject to a system imposed confusion mitigating condition or action.

Type: Application

Filed: July 26, 2006

Publication date: February 21, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: CIPRIAN AGAPI, OSCAR J. BLASS, BRENNAN D. MONTEIRO, ROBERTO VILA
ENHANCING ENVIRONMENT VOICE MACROS VIA SAVE/RESTORE STATE OF OBJECT WITHIN AN ENVIRONMENT CONTROLLED BY VOICE COMMANDS

Publication number: 20080046251

Abstract: A method for state transition in voice systems including: generating one or more stackable state macros, each of the one or more stackable state macros including a plurality of commands; saving the current state before executing another macro; enabling restoring the previous state after a plurality of commands is completed, allowing a user to utter voice commands to restore the individual state of components or the voice systems as a whole to the previous state or to a known home state.

Type: Application

Filed: August 15, 2006

Publication date: February 21, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Ciprian Agapi, Musaed A. Almutawa, Oscar J. Blass, Patrick M. Commarford, Roberto Vila
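The save and restore behavior described in the abstract of publication 20080046251 above maps naturally onto a stack of saved states. The Python sketch below is one possible reading. The class name, the device names and the choice to represent state as a dictionary are assumptions for illustration, and actual voice recognition is left out.

```python
class VoiceEnvironment:
    """Tracks device state and lets macros be undone in last-in, first-out order."""

    def __init__(self, home_state):
        self.home_state = dict(home_state)
        self.state = dict(home_state)
        self._saved = []  # stack of states captured before each macro

    def run_macro(self, commands):
        """Save the current state, then apply a list of (device, value) commands."""
        self._saved.append(dict(self.state))
        for device, value in commands:
            self.state[device] = value

    def restore_previous(self):
        """Return to the state saved before the most recent macro, if any."""
        if self._saved:
            self.state = self._saved.pop()

    def restore_home(self):
        """Discard saved states and return to the known home state."""
        self._saved.clear()
        self.state = dict(self.home_state)

env = VoiceEnvironment({"lights": "off", "tv": "off"})
env.run_macro([("lights", "dim"), ("tv", "on")])  # e.g. a "movie time" macro
env.run_macro([("lights", "on")])                 # e.g. a "reading" macro
env.restore_previous()
print(env.state)  # {'lights': 'dim', 'tv': 'on'}
```

Because each macro pushes its own snapshot, macros can be stacked and unwound one at a time, while the home state remains available as a fixed fallback.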
DISTINGUISHING AMONG DIFFERENT TYPES OF ABSTRACTIONS USING VOICE COMMANDS

Publication number: 20080015863

Abstract: The present invention discloses a method for a voice-enabled computing environment. The method includes a step of receiving a series of voice commands related to an abstraction, wherein the voice command specifies an abstraction type. The method further includes responding to the voice command to replay the commands, executing at least one programmatic action related to the abstraction, where the programmatic action is executed demonstrating characteristics specific to the abstraction type associated with the voice command. The present method applies, but is not limited to, the execution of collections of commands that may have diverging timing characteristics.

Type: Application

Filed: July 12, 2006

Publication date: January 17, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: CIPRIAN AGAPI, MUSAED A. ALMUTAWA, OSCAR J. BLASS, BRENNAN D. MONTEIRO, ROBERTO VILA
Synthetically generated speech responses including prosodic characteristics of speech inputs

Patent number: 7280968

Abstract: A method for digitally generating speech with improved prosodic characteristics can include receiving a speech input, determining at least one prosodic characteristic contained within the speech input, and generating a speech output including the prosodic characteristic within the speech output.

Type: Grant

Filed: March 25, 2003

Date of Patent: October 9, 2007

Assignee: International Business Machines Corporation

Inventor: Oscar J. Blass
