Patents by Inventor Ciprian Agapi

Ciprian Agapi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for prompting multi-token input speech

Patent number: 10521186

Abstract: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.
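The field-filling step in this abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that the user names each field before speaking its value; the abstract does not say how tokens are mapped to fields, and `fill_region` and the sample field names are hypothetical.

```python
def fill_region(region_fields, utterance):
    """Split a multi-token utterance into values for the fields of one input region.

    Assumption: each value is preceded by the spoken name of its field.
    """
    values, current = {}, None
    for token in utterance.split():
        if token.lower() in region_fields:
            # A field name starts a new value.
            current = token.lower()
            values[current] = []
        elif current is not None:
            # Any other token belongs to the field named most recently.
            values[current].append(token)
    return {field: " ".join(tokens) for field, tokens in values.items()}


address_region = {"city", "state", "zip"}
stored = fill_region(address_region, "city Boca Raton state Florida zip 33431")
print(stored)
```

A single utterance fills several fields of the selected region at once, which is the point of accepting multi-token input.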

Type: Grant

Filed: March 20, 2013

Date of Patent: December 31, 2019

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Soonthorn Ativanichayaphong, Leslie R. Wilson

Enhancing environment voice macros via a stackable save/restore state of an object within an environment controlled by voice commands for control of vehicle components

Patent number: 9583096

Abstract: A method for state transition in voice systems including: generating one or more stackable state macros, each of the one or more stackable state macros including a plurality of commands; saving the current state before executing another macro; enabling restoring the previous state after a plurality of commands is completed, allowing a user to utter voice commands to restore the individual state of components or the voice systems as a whole to the previous state or to a known home state. The method further utilizes voice commands not specific to the current state and is used specifically for automatically controlling a plurality of components of a vehicle.
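The save/restore behavior described in this abstract resembles a stack of saved states. The sketch below is one possible reading, not the patented implementation; the `VoiceEnvironment` class, its method names, and the component names are all hypothetical.

```python
from copy import deepcopy


class VoiceEnvironment:
    """Toy model of voice-controlled vehicle components (all names hypothetical)."""

    HOME_STATE = {"radio": "off", "windows": "closed", "climate": "auto"}

    def __init__(self):
        self.state = deepcopy(self.HOME_STATE)
        self._saved = []  # stack of saved states, most recent last

    def run_macro(self, commands):
        # Save the current state before the macro changes anything.
        self._saved.append(deepcopy(self.state))
        for component, value in commands:
            self.state[component] = value

    def restore_previous(self, component=None):
        # Pop the most recent saved state; restore one component or the whole system.
        if not self._saved:
            return
        previous = self._saved.pop()
        if component is None:
            self.state = previous
        else:
            self.state[component] = previous[component]

    def go_home(self):
        # Return to the known home state and discard all saved states.
        self.state = deepcopy(self.HOME_STATE)
        self._saved.clear()


env = VoiceEnvironment()
env.run_macro([("radio", "on"), ("windows", "open")])
env.run_macro([("climate", "cool")])
env.restore_previous()            # undoes the second macro entirely
after_first_restore = dict(env.state)
env.restore_previous("radio")     # restores only the radio from the first save
```

Because each macro pushes its own snapshot, nested macros unwind in reverse order, and `go_home` gives a state-independent way back to a known configuration.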

Type: Grant

Filed: August 15, 2006

Date of Patent: February 28, 2017

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Musaed A. Almutawa, Oscar J. Blass, Patrick M. Commarford, Roberto Vila

Dynamically extending the speech prompts of a multimodal application

Patent number: 9530411

Abstract: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.

Type: Grant

Filed: August 26, 2013

Date of Patent: December 27, 2016

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise

Patent number: 9396721

Abstract: Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.
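The mix-then-recognize loop in this abstract can be sketched as follows. The recognizer here is a deliberately trivial placeholder, since the abstract names no engine; `mix`, `evaluate_grammar`, `fake_recognize`, and the sample values are hypothetical.

```python
def mix(speech, noise):
    # Sample-wise sum; the noise is repeated as needed to cover the speech length.
    return [s + noise[i % len(noise)] for i, s in enumerate(speech)]


def evaluate_grammar(speech, expected, noises, recognize):
    """Report, per environment, whether the noisy utterance was still recognized.

    noises: {environment name: recorded background-noise samples}
    recognize: callable(samples) -> recognized phrase or None
    """
    report = {}
    for environment, noise in noises.items():
        result = recognize(mix(speech, noise))
        report[environment] = (result == expected)
    return report


def fake_recognize(samples):
    # Placeholder engine: "recognizes" only when no sample exceeds full scale.
    return "call home" if max(abs(x) for x in samples) <= 1.0 else None


speech = [0.2, -0.4, 0.5, -0.1]
noises = {"office": [0.05, -0.05], "highway": [0.7, -0.7]}
report = evaluate_grammar(speech, "call home", noises, fake_recognize)
print(report)
```

In practice the placeholder would be replaced by a real speech recognition engine loaded with the grammar under test; the per-environment report structure stays the same.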

Type: Grant

Filed: November 4, 2011

Date of Patent: July 19, 2016

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Michael H. Mirt

Records disambiguation in a multimodal application operating on a multimodal device

Patent number: 9349367

Abstract: Methods, apparatus, and products are disclosed for record disambiguation in a multimodal application operating on a multimodal device, the multimodal device supporting multiple modes of interaction including at least a voice mode and a visual mode, that include: prompting, by the multimodal application, a user to identify a particular record among a plurality of records; receiving, by the multimodal application in response to the prompt, a voice utterance from the user; determining, by the multimodal application, that the voice utterance ambiguously identifies more than one of the plurality of records; generating, by the multimodal application, a user interaction to disambiguate the records ambiguously identified by the voice utterance in dependence upon record attributes of the records ambiguously identified by the voice utterance; and selecting, by the multimodal application for further processing, one of the records ambiguously identified by the voice utterance in dependence upon the user interaction.
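The disambiguation flow in this abstract can be sketched as below. It assumes records are plain dictionaries and that the follow-up question is built from the first attribute that tells the ambiguous records apart; the function names and sample data are hypothetical.

```python
def matching_records(records, utterance):
    # Records whose name contains the spoken text (case-insensitive).
    spoken = utterance.lower()
    return [r for r in records if spoken in r["name"].lower()]


def distinguishing_attribute(candidates, attributes):
    # First attribute whose values are all different across the candidates.
    for attr in attributes:
        values = [r[attr] for r in candidates]
        if len(set(values)) == len(values):
            return attr
    return None


def disambiguation_prompt(candidates, attr):
    options = ", ".join(str(r[attr]) for r in candidates)
    return f"Which {attr} did you mean: {options}?"


def select(candidates, attr, answer):
    # Pick the candidate whose attribute value matches the user's answer.
    for r in candidates:
        if str(r[attr]).lower() == answer.lower():
            return r
    return None


records = [
    {"name": "John Smith", "city": "Boca Raton", "dept": "Sales"},
    {"name": "John Smith", "city": "Austin", "dept": "Sales"},
    {"name": "Mary Jones", "city": "Austin", "dept": "Legal"},
]
candidates = matching_records(records, "john smith")
attr = distinguishing_attribute(candidates, ["dept", "city"])
prompt = disambiguation_prompt(candidates, attr)
chosen = select(candidates, attr, "Austin")
print(prompt)
```

Here both matches share a department, so the sketch falls through to `city`, the first attribute that actually separates them.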

Type: Grant

Filed: April 24, 2008

Date of Patent: May 24, 2016

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Pradeep P. Mansey

Adjusting a speech engine for a mobile computing device based on background noise

Patent number: 9076454

Abstract: Methods, apparatus, and products are disclosed for adjusting a speech engine for a mobile computing device based on background noise, the mobile computing device operatively coupled to a microphone, that include: sampling, through the microphone, background noise for a plurality of operating environments in which the mobile computing device operates; generating, for each operating environment, a noise model in dependence upon the sampled background noise for that operating environment; and configuring the speech engine for the mobile computing device with the noise model for the operating environment in which the mobile computing device currently operates.

Type: Grant

Filed: January 25, 2012

Date of Patent: July 7, 2015

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Paritosh D. Patel

Method and system for defining standard catch styles for speech application code generation

Patent number: 8799001

Abstract: A method and system for defining standard catch styles used in generating speech application code for managing catch events, in which a style-selection menu that allows for selection of one or more catch styles is presented. Each catch style represents a system response to a catch event. A catch style can be selected from the style-selection menu. For each selected catch style, the system can prepare a response for each catch event. If the selected catch style requires playing a new audio message in response to a particular catch event, a contextual message can be entered in one or more text fields. The contextual message entered in each text field corresponds to the new audio message that will be played in response to the particular catch event. In certain catch styles, the entered contextual message is different for each catch event, while in other catch styles, the entered contextual message is the same for each catch event.

Type: Grant

Filed: November 17, 2003

Date of Patent: August 5, 2014

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Felipe Gomez, James R. Lewis, Vanessa V. Michelini, Sibyl C. Sullivan

Method and system for testing sections of large speech applications

Patent number: 8661411

Abstract: A method and system for testing code within a speech application. A test file can be automatically generated to verify the functionality of a new section of code within a graphical call flow builder application. A user can specify through a wizard two points on a path identifying the code section to be tested. The wizard can generate a test file and configure a path to a new subpath. Values are assigned to graphical call flow prompts along the path. Thus, the new code section is reached under the same path conditions for allowing repeatable testing. The system can include a test harness to test a new code section from within a context of the speech application, and a test controller for transitioning to the new code section. The test controller can run the test harness within the speech application to evaluate a functionality of the new code section.

Type: Grant

Filed: December 2, 2005

Date of Patent: February 25, 2014

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Brent D. Metz

System and method for call center agent quality assurance using biometric detection technologies

Patent number: 8654937

Abstract: A method, system and computer program for assessing the quality of a call recipient response during an interactive voice dialog. Embodiments of the present invention address deficiencies of the art in respect to biometric analysis and provide a novel and non-obvious method, system and computer program product for call center agent quality assurance using biometric technologies. A solution for automated monitoring of call center agents' skill, mood, professionalism and behavior using biometric technologies and for providing appropriate action to improve customer handling and satisfaction is provided. The solution provides an automated method for detecting potential problems and preemptively taking action to provide consistent, quality customer service.

Type: Grant

Filed: November 30, 2005

Date of Patent: February 18, 2014

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, Baiju D. Mandalia, Pradeep P. Mansey

Dynamically publishing directory information for a plurality of interactive voice response systems

Patent number: 8638909

Abstract: Some example embodiments include a method of dynamically publishing directory information for a plurality of interactive voice response (‘IVR’) systems. The method includes receiving, by the IVR directory service on behalf of one of the IVR systems, a web services update request. The method includes determining, by the IVR directory service in response to the web services update request, updated directory information for the IVR system. The method includes updating the IVR system directory with the updated directory information for the IVR system. The method includes generating an updated voice mode user interface to reflect the updated IVR system directory with the updated directory information for the IVR system. The generating includes creating one or more voice dialogs in accordance with the directory information, the one or more voice dialogs specifying a call flow defining the interaction between a caller and the IVR directory service.

Type: Grant

Filed: June 19, 2012

Date of Patent: January 28, 2014

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Fang Wang

Dynamically extending the speech prompts of a multimodal application

Publication number: 20130339033

Abstract: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.

Type: Application

Filed: August 26, 2013

Publication date: December 19, 2013

Applicant: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Automatic speech recognition with a selection list

Patent number: 8612230

Abstract: Methods, apparatus, and computer program products are described for automatic speech recognition (‘ASR’) that include accepting by the multimodal application speech input and visual input for selecting or deselecting items in a selection list, the speech input enabled by a speech recognition grammar; providing, from the multimodal application to the grammar interpreter, the speech input and the speech recognition grammar; receiving, by the multimodal application from the grammar interpreter, interpretation results including matched words from the grammar that correspond to items in the selection list and a semantic interpretation token that specifies whether to select or deselect items in the selection list; and determining, by the multimodal application in dependence upon the value of the semantic interpretation token, whether to select or deselect items in the selection list that correspond to the matched words.
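The select/deselect decision in this abstract can be sketched as follows. The `interpret` function is a stand-in for a grammar interpreter, and the token values `"select"` and `"deselect"`, the trigger words, and the item names are assumptions made for illustration.

```python
def interpret(utterance, items):
    # Stand-in for a grammar interpreter: returns matched words plus a semantic token.
    words = utterance.lower().split()
    token = "deselect" if words and words[0] in ("remove", "deselect") else "select"
    matched = [item for item in items if item.lower() in words]
    return {"matched": matched, "token": token}


def apply_result(selected, result):
    # Select or deselect the matched items depending on the token's value.
    updated = set(selected)
    if result["token"] == "select":
        updated.update(result["matched"])
    else:
        updated.difference_update(result["matched"])
    return updated


items = ["pepperoni", "mushrooms", "olives"]
selected = set()
selected = apply_result(selected, interpret("add pepperoni and olives", items))
selected = apply_result(selected, interpret("remove olives", items))
print(sorted(selected))
```

Keeping the token separate from the matched words lets one grammar serve both operations, with the application deciding what to do only after interpretation.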

Type: Grant

Filed: January 3, 2007

Date of Patent: December 17, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Soonthorn Ativanichayaphong, Charles W. Cross, Jr., Gerald M. McCobb

Systems and methods for prompting user speech in multimodal devices

Publication number: 20130227417

Abstract: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

Type: Application

Filed: March 20, 2013

Publication date: August 29, 2013

Applicant: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Soonthorn Ativanichayaphong, Leslie R. Wilson

Dynamically extending the speech prompts of a multimodal application

Patent number: 8521534

Abstract: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.

Type: Grant

Filed: September 12, 2012

Date of Patent: August 27, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Speech enabled media sharing in a multimodal application

Patent number: 8510117

Abstract: Speech enabled media sharing in a multimodal application including parsing, by a multimodal browser, one or more markup documents of a multimodal application; identifying, by the multimodal browser, in the one or more markup documents a web resource for display in the multimodal browser; loading, by the multimodal browser, a web resource sharing grammar that includes keywords for modes of resource sharing and keywords for targets for receipt of web resources; receiving, by the multimodal browser, an utterance matching a keyword for the web resource, a keyword for a mode of resource sharing and a keyword for a target for receipt of the web resource in the web resource sharing grammar thereby identifying the web resource, a mode of resource sharing, and a target for receipt of the web resource; and sending, by the multimodal browser, the web resource to the identified target for the web resource using the identified mode of resource sharing.

Type: Grant

Filed: July 9, 2009

Date of Patent: August 13, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Multimodal teleconferencing

Patent number: 8416714

Abstract: Multimodal teleconferencing including receiving, by a multimodal teleconferencing module, a speech utterance from one of a plurality of participants in the multimodal teleconference; identifying the participant making the speech utterance as a current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to the current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to one or more other participants in the multimodal teleconference; providing, by the multimodal teleconferencing module to a multimodal teleconferencing client for display to the current speaker, an identification of the speaker and the content retrieved for the speaker; and providing, by the multimodal teleconferencing module to one or more of multimodal teleconferencing clients for display to the other participants, an identification of the current speaker with the content retrieved for the one or more ot

Type: Grant

Filed: August 5, 2009

Date of Patent: April 9, 2013

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

System and methods for prompting user speech in multimodal devices

Patent number: 8417529

Abstract: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

Type: Grant

Filed: December 27, 2006

Date of Patent: April 9, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Soonthorn Ativanichayaphong, Leslie R. Wilson

Improving speech capabilities of a multimodal application

Patent number: 8380513

Abstract: Improving speech capabilities of a multimodal application including receiving, by the multimodal browser, a media file having a metadata container; retrieving, by the multimodal browser, from the metadata container a speech artifact related to content stored in the media file for inclusion in the speech engine available to the multimodal browser; determining whether the speech artifact includes a grammar rule or a pronunciation rule; if the speech artifact includes a grammar rule, modifying, by the multimodal browser, the grammar of the speech engine to include the grammar rule; and if the speech artifact includes a pronunciation rule, modifying, by the multimodal browser, the lexicon of the speech engine to include the pronunciation rule.
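The branching on artifact type in this abstract can be sketched as below. The dictionary layout of the artifact, the key names, and the sample rule strings are hypothetical; the abstract does not specify how the metadata container encodes them.

```python
def apply_speech_artifact(artifact, grammar, lexicon):
    """Extend a speech engine's grammar or lexicon from one speech artifact.

    artifact: dict read from a media file's metadata container (layout assumed)
    grammar:  list of grammar rules available to the speech engine
    lexicon:  dict mapping words to pronunciations
    """
    if "grammar_rule" in artifact:
        # A grammar rule extends what the engine can recognize.
        grammar.append(artifact["grammar_rule"])
    if "pronunciation_rule" in artifact:
        # A pronunciation rule extends how the engine sounds out a word.
        word, phonemes = artifact["pronunciation_rule"]
        lexicon[word] = phonemes


grammar = ["play | pause | stop"]
lexicon = {}
apply_speech_artifact({"grammar_rule": "play Sade"}, grammar, lexicon)
apply_speech_artifact({"pronunciation_rule": ("Sade", "sh aa d ey")}, grammar, lexicon)
print(grammar, lexicon)
```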

Type: Grant

Filed: May 19, 2009

Date of Patent: February 19, 2013

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Dynamically extending the speech prompts of a multimodal application

Publication number: 20130018658

Abstract: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.

Type: Application

Filed: September 12, 2012

Publication date: January 17, 2013

Applicant: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Method and arrangement for managing grammar options in a graphical callflow builder

Patent number: 8355918

Abstract: A method (10) in a speech recognition application callflow can include the steps of assigning (11) an individual option and a pre-built grammar to a same prompt, treating (15) the individual option as a valid output of the pre-built grammar if the individual option is a potential valid match to a recognition phrase (12) or an annotation (13) in the pre-built grammar, and treating (14) the individual option as an independent grammar from the pre-built grammar if the individual option fails to be a potential valid match to the recognition phrase or the annotation in the pre-built grammar.
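The two-way decision in this abstract can be sketched as below. It assumes a pre-built grammar is represented as sets of recognition phrases and annotations and that matching is a case-insensitive exact comparison; `classify_option` and the sample grammar are hypothetical.

```python
def classify_option(option, prebuilt):
    """Decide how an individual option assigned to a prompt should be treated.

    prebuilt: {"phrases": set of recognition phrases, "annotations": set of annotations}
    """
    normalized = option.strip().lower()
    if normalized in prebuilt["phrases"] or normalized in prebuilt["annotations"]:
        # The option is already something the pre-built grammar can return.
        return "valid output of pre-built grammar"
    # Otherwise it must be recognized on its own.
    return "independent grammar"


yes_no = {"phrases": {"yes", "yeah", "no", "nope"}, "annotations": {"true", "false"}}
print(classify_option("Yeah", yes_no))
print(classify_option("operator", yes_no))
```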

Type: Grant

Filed: January 5, 2012

Date of Patent: January 15, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Felipe Gomez, James R. Lewis, Vanessa V. Michelini
