Patents by Inventor Ciprian Agapi

Ciprian Agapi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Dynamically extending the speech prompts of a multimodal application

Patent number: 8290780

Abstract: Dynamically extending the speech prompts of a multimodal application including receiving, by the prompt generation engine, a media file having a metadata container; retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application; and modifying, by the prompt generation engine, the multimodal application to include the speech prompt.

Type: Grant

Filed: June 24, 2009

Date of Patent: October 16, 2012

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.
DYNAMICALLY PUBLISHING DIRECTORY INFORMATION FOR A PLURALITY OF INTERACTIVE VOICE RESPONSE SYSTEMS

Publication number: 20120257730

Abstract: Some example embodiments include a method of dynamically publishing directory information for a plurality of interactive voice response (‘IVR’) systems. The method includes receiving, by the IVR directory service on behalf of one of the IVR systems, a web services update request. The method includes determining, by the IVR directory service in response to the web services update request, updated directory information for the IVR system. The method includes updating the IVR system directory with the updated directory information for the IVR system. The method includes generating an updated voice mode user interface to reflect the updated IVR system directory with the updated directory information for the IVR system. The generating includes creating one more voice dialogs in accordance with the directory information, the one or more voice dialogs specifying a call flow defining the interaction between a caller and the IVR directory service.

Type: Application

Filed: June 19, 2012

Publication date: October 11, 2012

Applicant: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, JR., Fang Wang
Methods and system for creating and editing an XML-based speech synthesis document

Patent number: 8265936

Abstract: A method for creating and editing an XML-based speech synthesis document for input to a text-to-speech engine is provided. The method includes recording voice utterances of a user reading a pre-selected text and parsing the recorded voice utterances into individual words and periods of silence. The method also includes recording a synthesized speech output generated by a text-to-speech engine, the synthesized speech output being an audible rendering of the pre-selected text, and parsing the synthesized speech output into individual words and periods of silence. The method further includes annotating the XML-based speech synthesis document based upon a comparison of the recorded voice utterances and the recorded synthesized speech output.

Type: Grant

Filed: June 3, 2008

Date of Patent: September 11, 2012

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, Oswaldo Gago, Maria Elena Smith, Roberto Vila
METHOD AND ARRANGEMENT FOR MANAGING GRAMMAR OPTIONS IN A GRAPHICAL CALLFLOW BUILDER

Publication number: 20120209613

Abstract: A method (10) in a speech recognition application callflow can include the steps of assigning (11) an individual option and a pre-built grammar to a same prompt, treating (15) the individual option as a valid output of the pre-built grammar if the individual option is a potential valid match to a recognition phrase (12) or an annotation (13) in the pre-built grammar, and treating (14) the individual option as an independent grammar from the pre-built grammar if the individual option fails to be a potential valid match to the recognition phrase or the annotation in the pre-built grammar.

Type: Application

Filed: January 5, 2012

Publication date: August 16, 2012

Applicant: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Felipe Gomez, James R. Lewis, Vanessa V. Michelini
Performing a safety analysis for user-defined voice commands to ensure that the voice commands do not cause speech recognition ambiguities

Patent number: 8234120

Abstract: The present invention discloses a solution for assuring user-defined voice commands are unambiguous. The solution can include a step of identifying a user attempt to enter a user-defined voice command into a voice-enabled system. A safety analysis can be performed on the user-defined voice command to determine a likelihood that the user-defined voice command will be confused with preexisting voice commands recognized by the voice-enabled system. When a high likelihood of confusion is determined by the safety analysis, a notification can be presented that the user-defined voice command is subject to confusion. A user can then define a different voice command or can choose to continue to use the potentially confusing command, possibly subject to a system imposed confusion mitigating condition or action.

Type: Grant

Filed: July 26, 2006

Date of Patent: July 31, 2012

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Oscar J. Blass, Brennan D. Monteiro, Roberto Vila
Dynamically publishing directory information for a plurality of interactive voice response systems

Patent number: 8229081

Abstract: Methods, apparatus, and products are disclosed for dynamically publishing directory information for a plurality of interactive voice response (‘IVR’) systems through an IVR directory service that include: providing a description of a web services publication interface for the IVR directory service; receiving, on behalf of one or more IVR systems, web services publication requests through the publication interface; determining, in response to the web services publication requests, directory information for each IVR system requesting publication; adding the directory information for each IVR system to an IVR system directory; generating a voice mode user interface to reflect the directory information for each IVR system added to the IVR system directory; and interacting, using the voice mode user interface, with a caller to identify a particular IVR system in dependence upon the IVR system directory and query information provided by the caller and to connect the caller with the identified IVR system.

Type: Grant

Filed: April 24, 2008

Date of Patent: July 24, 2012

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Fang Wang
Signaling correspondence between a meeting agenda and a meeting discussion

Patent number: 8214242

Abstract: Signaling correspondence between a meeting agenda and a meeting discussion includes: receiving a meeting agenda specifying one or more topics for a meeting; analyzing, for each topic, one or more documents to identify topic keywords for that topic; receiving meeting discussions among participants for the meeting; identifying a current topic for the meeting in dependence upon the meeting agenda; determining a correspondence indicator in dependence upon the meeting discussions and the topic keywords for the current topic, the correspondence indicator specifying the correspondence between the meeting agenda and the meeting discussion; and rendering the correspondence indicator to the participants of the meeting.

Type: Grant

Filed: April 24, 2008

Date of Patent: July 3, 2012

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Brian D. Goodman, Frank L. Jania, Darren M. Shaw
ADJUSTING A SPEECH ENGINE FOR A MOBILE COMPUTING DEVICE BASED ON BACKGROUND NOISE

Publication number: 20120123776

Abstract: Methods, apparatus, and products are disclosed for adjusting a speech engine for a mobile computing device based on background noise, the mobile computing device operatively coupled to a microphone, that include: sampling, through the microphone, background noise for a plurality of operating environments in which the mobile computing device operates; generating, for each operating environment, a noise model in dependence upon the sampled background noise for that operating environment; and configuring the speech engine for the mobile computing device with the noise model for the operating environment in which the mobile computing device currently operates.

Type: Application

Filed: January 25, 2012

Publication date: May 17, 2012

Applicant: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, JR., Paritosh D. Patel
ADJUSTING A SPEECH ENGINE FOR A MOBILE COMPUTING DEVICE BASED ON BACKGROUND NOISE

Publication number: 20120123777

Abstract: Methods, apparatus, and products are disclosed for adjusting a speech engine for a mobile computing device based on background noise, the mobile computing device operatively coupled to a microphone, that include: sampling, through the microphone, background noise for a plurality of operating environments in which the mobile computing device operates; generating, for each operating environment, a noise model in dependence upon the sampled background noise for that operating environment; and configuring the speech engine for the mobile computing device with the noise model for the operating environment in which the mobile computing device currently operates.

Type: Application

Filed: January 25, 2012

Publication date: May 17, 2012

Applicant: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, JR., Paritosh D. Patel
Printing to a text-to-speech output device

Patent number: 8170877

Abstract: A method for producing speech output can include the step of selecting a TTS output device from a plurality of available output devices. The selected output device can be associated with outputting content of an application responsive to a print command. According to the method, the print command can be detected, which results in the content of the application being conveyed to the selected TTS output device. The TTS output device can be associated with at least one text-to-speech engine. Upon content conveyance to the TTS output device, at least a portion of the content can be automatically converted using the text-to-speech engine. The speech converted content can be outputted.

Type: Grant

Filed: June 20, 2005

Date of Patent: May 1, 2012

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Oscar J. Blass, Charles T. Rutherfoord
TESTING A GRAMMAR USED IN SPEECH RECOGNITION FOR RELIABILITY IN A PLURALITY OF OPERATING ENVIRONMENTS HAVING DIFFERENT BACKGROUND NOISE

Publication number: 20120053934

Abstract: Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.

Type: Application

Filed: November 4, 2011

Publication date: March 1, 2012

Applicant: Nuance Communications. Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, JR., Michael H. Mirt
Adjusting a speech engine for a mobile computing device based on background noise

Patent number: 8121837

Abstract: Methods, apparatus, and products are disclosed for adjusting a speech engine for a mobile computing device based on background noise, the mobile computing device operatively coupled to a microphone, that include: sampling, through the microphone, background noise for a plurality of operating environments in which the mobile computing device operates; generating, for each operating environment, a noise model in dependence upon the sampled background noise for that operating environment; and configuring the speech engine for the mobile computing device with the noise model for the operating environment in which the mobile computing device currently operates.

Type: Grant

Filed: April 24, 2008

Date of Patent: February 21, 2012

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Paritosh D. Patel
Adjusting music length to expected waiting time while caller is on hold

Patent number: 8102987

Abstract: A method of adjusting music length to expected waiting time while a caller is on hold includes choosing one or more media selections based upon their play duration and matching the selection(s) to the expected waiting time.

Type: Grant

Filed: October 16, 2008

Date of Patent: January 24, 2012

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, Thomas E. Creamer, James R. Lewis, Vanessa V. Michelini, Wallace J. Sadowski, Clifford J. Strohofer
Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise

Patent number: 8082148

Abstract: Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.

Type: Grant

Filed: April 24, 2008

Date of Patent: December 20, 2011

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Michael H. Mirt
Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets

Patent number: 8019605

Abstract: The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can automatically process pre-recorded audio to derive speech assets for a concatenative TTS voice. The pre-recording audio can include sets of recorded phrases used by a speech user interface (Sill). A set of unfulfilled speech assets needed for foil phonetic coverage of the concatenative TTS voice can be determined. A reduced script can be constructed that includes a set of phrases, which when read by a voice talent result in a reduced corpus. When the reduced corpus is automatically processed, a reduced set of speech assets result. The reduced set includes each of the unfulfilled speech assets. When this reduced corpus is combined with existing speech assets the result will be a voice with a complete set of speech assets.

Type: Grant

Filed: May 14, 2007

Date of Patent: September 13, 2011

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Oscar J. Blass, Paritosh D. Patel, Roberto Vila
Disambiguation systems and methods for use in generating grammars

Patent number: 8010343

Abstract: A method and system for addressing disambiguation issues in interactive applications by creating a disambiguation system for generating complex grammars that includes homonym detection and grouping, and provides optimization feedback that eliminates time-consuming and repetitive iterative steps during the grammar generation portion of the interactive application configuration.

Type: Grant

Filed: December 15, 2005

Date of Patent: August 30, 2011

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Brent D. Metz
Automatic generation of a callflow statistics application for speech systems

Patent number: 8005202

Abstract: A method, system and computer program for automatically generating call flow statistics in a voice application. Embodiments of the present invention address deficiencies of the art in respect to call flow statistics generation systems and provide a novel and non-obvious method, system and computer program product for automatically generating a call flow statistics-generating application and presenting updated statistics on a call flow representation. Various statistics collection points are identified on the visual representation. Upon running of the voice application, call flow statistics are gathered and presented for each statistics collection point. Call identifiers corresponding to each call path can be selected and call paths corresponding to the selected call identifier may be highlighted and their call statistics displayed.

Type: Grant

Filed: December 8, 2005

Date of Patent: August 23, 2011

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, James R. Lewis, Michael H. Mirt
System, apparatus, and methods for creating alternate-mode applications

Patent number: 7920681

Abstract: A system, apparatus, and method for creating alternate-mode interactive applications is provided. A system for creating an alternate-mode interactive application includes a selection module for selecting a voice-mode element from a set of voice-mode elements defining a voice-mode interactive application for accomplishing a predetermined user-directed task The system also includes a generation module for generating an alternate-mode element corresponding to the selected voice-mode element, the alternate-mode element having a modality different than the voice-mode element. The system further includes a construction module for constructing an alternate-mode interactive application based upon the generated alternate-mode element.

Type: Grant

Filed: November 5, 2004

Date of Patent: April 5, 2011

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, Felipe Gomez, James R. Lewis, Gary J. Pietrocarlo, Wallace J. Sadowski
Multimodal Teleconferencing

Publication number: 20110032845

Abstract: Multimodal teleconferencing including receiving, by a multimodal teleconferencing module, a speech utterance from one of a plurality of participants in the multimodal teleconference; identifying the participant making the speech utterance as a current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to the current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to one or more other participants in the multimodal teleconference; providing, by the multimodal teleconferencing module to a multimodal teleconferencing client for display to the current speaker, an identification of the speaker and the content retrieved for the speaker; and providing, by the multimodal teleconferencing module to one or more of multimodal teleconferencing clients for display to the other participants, an identification of the current speaker with the content retrieved for the one or more ot

Type: Application

Filed: August 5, 2009

Publication date: February 10, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, JR.
Dynamically Extending The Speech Prompts Of A Multimodal Application

Publication number: 20100332234

Abstract: Dynamically extending the speech prompts of a multimodal application including receiving, by the prompt generation engine, a media file having a metadata container; retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application; and modifying, by the prompt generation engine, the multimodal application to include the speech prompt.

Type: Application

Filed: June 24, 2009

Publication date: December 30, 2010

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, JR.

prev 1 2 3 4 5 next