Patents by Inventor Charles W. Cross, Jr.

Charles W. Cross, Jr. has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Web service support for a multimodal client processing a multimodal application

Patent number: 8788620

Abstract: Web service support for a multimodal client processing a multimodal application, the multimodal client providing an execution environment for the application and operating on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the application stored on an application server, includes: receiving, by the server, an application request from the client that specifies the application and device characteristics; determining, by a multimodal adapter of the server, modality requirements for the application; selecting, by the adapter, a modality web service in dependence upon the modality requirements and the characteristics for the device; determining, by the adapter, whether the device supports VoIP in dependence upon the characteristics; providing, by the server, the application to the client; and providing, by the adapter to the client in dependence upon whether the device supports VoIP, access to the modality web service for processing the appl

Type: Grant

Filed: April 4, 2007

Date of Patent: July 22, 2014

Assignee: International Business Machines Corporation

Inventors: Soonthorn Ativanichayaphong, Charles W. Cross, Jr., Gerald M. McCobb
Method and apparatus for voice-enabling an application

Patent number: 8768711

Abstract: A method of voice-enabling an application for command and control and content navigation can include the application dynamically generating a markup language fragment specifying a command and control and content navigation grammar for the application, instantiating an interpreter from a voice library, and providing the markup language fragment to the interpreter. The method also can include the interpreter processing a speech input using the command and control and content navigation grammar specified by the markup language fragment and providing an event to the application indicating an instruction representative of the speech input.

Type: Grant

Filed: June 17, 2004

Date of Patent: July 1, 2014

Assignee: Nuance Communications, Inc.

Inventors: Soonthorn Ativanichayaphong, Charles W. Cross, Jr., Brien H. Muschett
Invoking tapered prompts in a multimodal application

Patent number: 8744861

Abstract: Methods, apparatus, and computer program products are described for invoking tapered prompts in a multimodal application implemented with a multimodal browser and a multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes. Embodiments include identifying, by a multimodal browser, a prompt element in a multimodal application; identifying, by the multimodal browser, one or more attributes associated with the prompt element; and playing a speech prompt according to the one or more attributes associated with the prompt element.

Type: Grant

Filed: March 1, 2012

Date of Patent: June 3, 2014

Assignee: Nuance Communications, Inc.

Inventors: Soonthorn Ativanichayaphong, Charles W. Cross, Jr., Gerald M. McCobb
Providing expressive user interaction with a multimodal application

Patent number: 8725513

Abstract: Methods, apparatus, and products are disclosed for providing expressive user interaction with a multimodal application, the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to a speech engine through a VoiceXML interpreter, including: receiving, by the multimodal browser, user input from a user through a particular mode of user interaction; determining, by the multimodal browser, user output for the user in dependence upon the user input; determining, by the multimodal browser, a style for the user output in dependence upon the user input, the style specifying expressive output characteristics for at least one other mode of user interaction; and rendering, by the multimodal browser, the user output in dependence upon the style.

Type: Grant

Filed: April 12, 2007

Date of Patent: May 13, 2014

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Ellen M. Eide, Igor R. Jablokov
Displaying speech command input state information in a multimodal browser

Patent number: 8719034

Abstract: Methods, systems, and products are disclosed for displaying speech command input state information in a multimodal browser including displaying an icon representing a speech command type and displaying an icon representing the input state of the speech command. In typical embodiments, the icon representing a speech command type and the icon representing the input state of the speech command also includes attributes of a single icon. Typical embodiments include accepting from a user a speech command of the speech command type, changing the input state of the speech command, and displaying another icon representing the changed input state of the speech command. Typical embodiments also include displaying the text of the speech command in association with the icon representing the speech command type.

Type: Grant

Filed: September 13, 2005

Date of Patent: May 6, 2014

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Michael Charles Hollinger, Igor R. Jablokov, Benjamin D. Lewis, Hilary A. Pike, Daniel M. Smith, David W. Wintermute, Michael A. Zaitzeff
ESTABLISHING A MULTIMODAL PERSONALITY FOR A MULTIMODAL APPLICATION IN DEPENDENCE UPON ATTRIBUTES OF USER INTERACTION

Publication number: 20140122091

Abstract: Establishing a multimodal personality for a multimodal application, including evaluating, by the multimodal application, attributes of a user's interaction with the multimodal application; selecting, by the multimodal application, a vocal demeanor in dependence upon the values of the attributes of the user's interaction with the multimodal application; and incorporating, by the multimodal application, the vocal demeanor into the multimodal application.

Type: Application

Filed: November 26, 2013

Publication date: May 1, 2014

Applicant: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Hilary A. Pike
Pausing a VoiceXML dialog of a multimodal application

Patent number: 8713542

Abstract: Pausing a VoiceXML dialog of a multimodal application, including generating by the multimodal application a pause event; responsive to the pause event, temporarily pausing the dialog by the VoiceXML interpreter; generating by the multimodal application a resume event; and responsive to the resume event, resuming the dialog. Embodiments are implemented with the multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application is operatively coupled to a VoiceXML interpreter, and the VoiceXML interpreter is interpreting the VoiceXML dialog to be paused.

Type: Grant

Filed: February 27, 2007

Date of Patent: April 29, 2014

Assignee: Nuance Communications, Inc.

Inventors: Soonthorn Ativanichayaphong, Charles W. Cross, Jr., David Jaramillo, Gerald M. McCobb
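
As a rough illustration of the pause and resume flow described in the abstract for patent 8713542, the following Python sketch models an interpreter that stops advancing a dialog while paused. The class name `DialogInterpreter`, the event names `"pause"` and `"resume"`, and the idea of a dialog as a list of prompts are all assumptions made for this example; none of them come from the patent, and this is not the patented implementation.

```python
class DialogInterpreter:
    """Hypothetical stand-in for an interpreter stepping through a dialog."""

    def __init__(self, prompts):
        self.prompts = list(prompts)  # dialog modeled as an ordered list of prompts
        self.position = 0             # index of the next prompt to play
        self.paused = False

    def handle_event(self, event):
        """React to an event raised by the application (names are assumptions)."""
        if event == "pause":
            self.paused = True
        elif event == "resume":
            self.paused = False
        else:
            raise ValueError(f"unknown event: {event!r}")

    def step(self):
        """Return the next prompt, or None while paused or when the dialog is done."""
        if self.paused or self.position >= len(self.prompts):
            return None
        prompt = self.prompts[self.position]
        self.position += 1
        return prompt


interpreter = DialogInterpreter(["Welcome.", "What city?", "What date?"])
print(interpreter.step())          # first prompt plays
interpreter.handle_event("pause")
print(interpreter.step())          # nothing plays while paused
interpreter.handle_event("resume")
print(interpreter.step())          # dialog continues where it left off
```

The position is kept across the pause, so resuming continues from the prompt that would have played next rather than restarting the dialog.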
Establishing a multimodal personality for a multimodal application

Patent number: 8706500

Abstract: Methods, apparatus, and computer program products are described for establishing a multimodal personality for a multimodal application that include selecting, by the multimodal application, matching vocal and visual demeanors and incorporating, by the multimodal application, the matching vocal and visual demeanors as a multimodal personality into the multimodal application.

Type: Grant

Filed: November 1, 2011

Date of Patent: April 22, 2014

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Hilary A. Pike
ESTABLISHING A MULTIMODAL ADVERTISING PERSONALITY FOR A SPONSOR OF A MULTIMODAL APPLICATION

Publication number: 20140052449

Abstract: Establishing a multimodal advertising personality for a sponsor of a multimodal application, including associating one or more vocal demeanors with a sponsor of a multimodal application and presenting a speech portion of the multimodal application for the sponsor using at least one of the vocal demeanors associated with the sponsor.

Type: Application

Filed: July 29, 2013

Publication date: February 20, 2014

Applicant: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Hilary A. Pike
INDEXING DIGITIZED SPEECH WITH WORDS REPRESENTED IN THE DIGITIZED SPEECH

Publication number: 20140039899

Abstract: Indexing digitized speech with words represented in the digitized speech, with a multimodal digital audio editor operating on a multimodal device supporting modes of user interaction, the modes of user interaction including a voice mode and one or more non-voice modes, the multimodal digital audio editor operatively coupled to an ASR engine, including providing by the multimodal digital audio editor to the ASR engine digitized speech for recognition; receiving in the multimodal digital audio editor from the ASR engine recognized user speech including a recognized word, also including information indicating where, in the digitized speech, representation of the recognized word begins; and inserting by the multimodal digital audio editor the recognized word, in association with the information indicating where, in the digitized speech, representation of the recognized word begins, into a speech recognition grammar, the speech recognition grammar voice enabling user interface commands of the multimodal digital au

Type: Application

Filed: August 7, 2013

Publication date: February 6, 2014

Applicant: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Frank L. Jania
Dynamically publishing directory information for a plurality of interactive voice response systems

Patent number: 8638909

Abstract: Some example embodiments include a method of dynamically publishing directory information for a plurality of interactive voice response (‘IVR’) systems. The method includes receiving, by the IVR directory service on behalf of one of the IVR systems, a web services update request. The method includes determining, by the IVR directory service in response to the web services update request, updated directory information for the IVR system. The method includes updating the IVR system directory with the updated directory information for the IVR system. The method includes generating an updated voice mode user interface to reflect the updated IVR system directory with the updated directory information for the IVR system. The generating includes creating one or more voice dialogs in accordance with the directory information, the one or more voice dialogs specifying a call flow defining the interaction between a caller and the IVR directory service.

Type: Grant

Filed: June 19, 2012

Date of Patent: January 28, 2014

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Fang Wang
DYNAMICALLY EXTENDING THE SPEECH PROMPTS OF A MULTIMODAL APPLICATION

Publication number: 20130339033

Abstract: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.

Type: Application

Filed: August 26, 2013

Publication date: December 19, 2013

Applicant: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.
Automatic speech recognition with a selection list

Patent number: 8612230

Abstract: Methods, apparatus, and computer program products are described for automatic speech recognition (‘ASR’) that include accepting by the multimodal application speech input and visual input for selecting or deselecting items in a selection list, the speech input enabled by a speech recognition grammar; providing, from the multimodal application to the grammar interpreter, the speech input and the speech recognition grammar; receiving, by the multimodal application from the grammar interpreter, interpretation results including matched words from the grammar that correspond to items in the selection list and a semantic interpretation token that specifies whether to select or deselect items in the selection list; and determining, by the multimodal application in dependence upon the value of the semantic interpretation token, whether to select or deselect items in the selection list that correspond to the matched words.

Type: Grant

Filed: January 3, 2007

Date of Patent: December 17, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Soonthorn Ativanichayaphong, Charles W. Cross, Jr., Gerald M. McCobb
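
The last step in the abstract for patent 8612230, deciding whether to select or deselect the matched items based on a semantic interpretation token, can be sketched in a few lines of Python. The function name `apply_interpretation`, the token values `"select"` and `"deselect"`, and the use of plain sets are assumptions made for this example and are not taken from the patent; recognition itself is left out, and the matched words are passed in directly.

```python
# Hypothetical token values; the patent does not specify them.
SELECT = "select"
DESELECT = "deselect"


def apply_interpretation(selected, items, matched_words, token):
    """Return a new set of selected items after one interpretation result.

    selected:      items currently selected in the list
    items:         all items that appear in the selection list
    matched_words: words the grammar interpreter matched in the speech input
    token:         semantic interpretation token (SELECT or DESELECT)
    """
    # Only words that correspond to items in the list are acted on.
    targets = {word for word in matched_words if word in items}
    if token == SELECT:
        return set(selected) | targets
    if token == DESELECT:
        return set(selected) - targets
    raise ValueError(f"unknown semantic interpretation token: {token!r}")


items = ["apples", "bananas", "cherries"]
selected = apply_interpretation(set(), items, ["apples", "cherries"], SELECT)
selected = apply_interpretation(selected, items, ["apples"], DESELECT)
print(sorted(selected))  # prints ['cherries']
```

Returning a new set instead of mutating the input keeps each interpretation result easy to test in isolation.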
Establishing a multimodal personality for a multimodal application in dependence upon attributes of user interaction

Patent number: 8600755

Abstract: Establishing a multimodal personality for a multimodal application, including evaluating, by the multimodal application, attributes of a user's interaction with the multimodal application; selecting, by the multimodal application, a vocal demeanor in dependence upon the values of the attributes of the user's interaction with the multimodal application; and incorporating, by the multimodal application, the vocal demeanor into the multimodal application.

Type: Grant

Filed: January 23, 2013

Date of Patent: December 3, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Hilary A. Pike
Synchronizing visual and speech events in a multimodal application

Patent number: 8571872

Abstract: Exemplary methods, systems, and products are disclosed for synchronizing visual and speech events in a multimodal application, including receiving from a user speech; determining a semantic interpretation of the speech; calling a global application update handler; identifying, by the global application update handler, an additional processing function in dependence upon the semantic interpretation; and executing the additional function. Typical embodiments may include updating a visual element after executing the additional function. Typical embodiments may include updating a voice form after executing the additional function. Typical embodiments also may include updating a state table after updating the voice form. Typical embodiments also may include restarting the voice form after executing the additional function.

Type: Grant

Filed: September 30, 2011

Date of Patent: October 29, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Michael C. Hollinger, Igor R. Jablokov, Benjamin D. Lewis, Hilary A. Pike, Daniel M. Smith, David W. Wintermute, Michael A. Zaitzeff
ESTABLISHING A PREFERRED MODE OF INTERACTION BETWEEN A USER AND A MULTIMODAL APPLICATION

Publication number: 20130283172

Abstract: Establishing a preferred mode of interaction between a user and a multimodal application, including evaluating, by a multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, user modal preference, and dynamically configuring multimodal content of the multimodal application in dependence upon the evaluation of user modal preference.

Type: Application

Filed: June 20, 2013

Publication date: October 24, 2013

Inventors: Charles W. Cross, Jr., Hilary A. Pike
Context-based grammars for automated speech recognition

Patent number: 8566087

Abstract: Methods, apparatus, and computer program products for providing a context-based grammar for automatic speech recognition, including creating by a multimodal application a context, the context comprising words associated with user activity in the multimodal application, and supplementing by the multimodal application a grammar for automatic speech recognition in dependence upon the context.

Type: Grant

Filed: September 13, 2012

Date of Patent: October 22, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Frank L. Jania
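
The two steps in the abstract for patent 8566087, creating a context from words associated with user activity and supplementing a grammar with it, might look roughly like the Python sketch below. The function names `build_context` and `supplement_grammar`, the simple whitespace tokenization, and the modeling of a grammar as a set of recognizable words are assumptions made for this example; the patent abstract does not specify any of them.

```python
def build_context(activity_texts):
    """Collect lowercase words from text the user has recently worked with."""
    words = set()
    for text in activity_texts:
        for token in text.lower().split():
            word = token.strip(".,;:!?\"'()")  # drop surrounding punctuation
            if word:
                words.add(word)
    return words


def supplement_grammar(grammar, context):
    """Return a new grammar (modeled as a set of words) extended by the context."""
    return set(grammar) | set(context)


base_grammar = {"open", "close"}
context = build_context(["Meeting with Alice, Friday."])
grammar = supplement_grammar(base_grammar, context)
print(sorted(grammar))
```

A real speech recognition grammar carries rules and structure rather than a flat word set; the set is used here only to keep the supplementing step visible.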
Dynamically extending the speech prompts of a multimodal application

Patent number: 8521534

Abstract: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.

Type: Grant

Filed: September 12, 2012

Date of Patent: August 27, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.
Indexing digitized speech with words represented in the digitized speech

Patent number: 8515757

Abstract: Indexing digitized speech with words represented in the digitized speech, with a multimodal digital audio editor operating on a multimodal device supporting modes of user interaction, the modes of user interaction including a voice mode and one or more non-voice modes, the multimodal digital audio editor operatively coupled to an ASR engine, including providing by the multimodal digital audio editor to the ASR engine digitized speech for recognition; receiving in the multimodal digital audio editor from the ASR engine recognized user speech including a recognized word, also including information indicating where, in the digitized speech, representation of the recognized word begins; and inserting by the multimodal digital audio editor the recognized word, in association with the information indicating where, in the digitized speech, representation of the recognized word begins, into a speech recognition grammar, the speech recognition grammar voice enabling user interface commands of the multimodal digital au

Type: Grant

Filed: March 20, 2007

Date of Patent: August 20, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Frank L. Jania
Speech enabled media sharing in a multimodal application

Patent number: 8510117

Abstract: Speech enabled media sharing in a multimodal application including parsing, by a multimodal browser, one or more markup documents of a multimodal application; identifying, by the multimodal browser, in the one or more markup documents a web resource for display in the multimodal browser; loading, by the multimodal browser, a web resource sharing grammar that includes keywords for modes of resource sharing and keywords for targets for receipt of web resources; receiving, by the multimodal browser, an utterance matching a keyword for the web resource, a keyword for a mode of resource sharing and a keyword for a target for receipt of the web resource in the web resource sharing grammar thereby identifying the web resource, a mode of resource sharing, and a target for receipt of the web resource; and sending, by the multimodal browser, the web resource to the identified target for the web resource using the identified mode of resource sharing.

Type: Grant

Filed: July 9, 2009

Date of Patent: August 13, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.
