Patents by Inventor Charles W. Cross, Jr.
Charles W. Cross, Jr. has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 8788620Abstract: Web service support for a multimodal client processing a multimodal application, the multimodal client providing an execution environment for the application and operating on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the application stored on an application server, includes: receiving, by the server, an application request from the client that specifies the application and device characteristics; determining, by a multimodal adapter of the server, modality requirements for the application; selecting, by the adapter, a modality web service in dependence upon the modality requirements and the characteristics for the device; determining, by the adapter, whether the device supports VoIP in dependence upon the characteristics; providing, by the server, the application to the client; and providing, by the adapter to the client in dependence upon whether the device supports VoIP, access to the modality web service for processing the applType: GrantFiled: April 4, 2007Date of Patent: July 22, 2014Assignee: International Business Machines CorporationInventors: Soonthorn Ativanichayaphong, Charles W. Cross, Jr., Gerald M. McCobb
-
Patent number: 8768711Abstract: A method of voice-enabling an application for command and control and content navigation can include the application dynamically generating a markup language fragment specifying a command and control and content navigation grammar for the application, instantiating an interpreter from a voice library, and providing the markup language fragment to the interpreter. The method also can include the interpreter processing a speech input using the command and control and content navigation grammar specified by the markup language fragment and providing an event to the application indicating an instruction representative of the speech input.Type: GrantFiled: June 17, 2004Date of Patent: July 1, 2014Assignee: Nuance Communications, Inc.Inventors: Soonthorn Ativanichayaphong, Charles W. Cross, Jr., Brien H. Muschett
-
Patent number: 8744861Abstract: Methods, apparatus, and computer program products are described for invoking tapered prompts in a multimodal application implemented with a multimodal browser and a multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes. Embodiments include identifying, by a multimodal browser, a prompt element in a multimodal application; identifying, by the multimodal browser, one or more attributes associated with the prompt element; and playing a speech prompt according to the one or more attributes associated with the prompt element.Type: GrantFiled: March 1, 2012Date of Patent: June 3, 2014Assignee: Nuance Communications, Inc.Inventors: Soonthorn Ativanichayaphong, Charles W. Cross, Jr., Gerald M. McCobb
-
Patent number: 8725513Abstract: Methods, apparatus, and products are disclosed for providing expressive user interaction with a multimodal application, the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to a speech engine through a VoiceXML interpreter, including: receiving, by the multimodal browser, user input from a user through a particular mode of user interaction; determining, by the multimodal browser, user output for the user in dependence upon the user input; determining, by the multimodal browser, a style for the user output in dependence upon the user input, the style specifying expressive output characteristics for at least one other mode of user interaction; and rendering, by the multimodal browser, the user output in dependence upon the style.Type: GrantFiled: April 12, 2007Date of Patent: May 13, 2014Assignee: Nuance Communications, Inc.Inventors: Charles W. Cross, Jr., Ellen M. Eide, Igor R. Jablokov
-
Patent number: 8719034Abstract: Methods, systems, and products are disclosed for displaying speech command input state information in a multimodal browser including displaying an icon representing a speech command type and displaying an icon representing the input state of the speech command. In typical embodiments, the icon representing a speech command type and the icon representing the input state of the speech command also includes attributes of a single icon. Typical embodiments include accepting from a user a speech command of the speech command type, changing the input state of the speech command, and displaying another icon representing the changed input state of the speech command. Typical embodiments also include displaying the text of the speech command in association with the icon representing the speech command type.Type: GrantFiled: September 13, 2005Date of Patent: May 6, 2014Assignee: Nuance Communications, Inc.Inventors: Charles W. Cross, Jr., Michael Charles Hollinger, Igor R. Jablokov, Benjamin D. Lewis, Hilary A. Pike, Daniel M. Smith, David W. Wintermute, Michael A. Zaitzeff
-
Publication number: 20140122091Abstract: Establishing a multimodal personality for a multimodal application, including evaluating, by the multimodal application, attributes of a user's interaction with the multimodal application; selecting, by the multimodal application, a vocal demeanor in dependence upon the values of the attributes of the user's interaction with the multimodal application; and incorporating, by the multimodal application, the vocal demeanor into the multimodal application.Type: ApplicationFiled: November 26, 2013Publication date: May 1, 2014Applicant: Nuance Communications, Inc.Inventors: Charles W. Cross, JR., Hilary A. Pike
-
Patent number: 8713542Abstract: Pausing a VoiceXML dialog of a multimodal application, including generating by the multimodal application a pause event; responsive to the pause event, temporarily pausing the dialogue by the VoiceXML interpreter; generating by the multimodal application a resume event; and responsive to the resume event, resuming the dialog. Embodiments are implemented with the multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application is operatively coupled to a VoiceXML interpreter, and the VoiceXML interpreter is interpreting the VoiceXML dialog to be paused.Type: GrantFiled: February 27, 2007Date of Patent: April 29, 2014Assignee: Nuance Communications, Inc.Inventors: Soonthorn Ativanichayaphong, Charles W. Cross, Jr., David Jaramillo, Gerald M. McCobb
-
Patent number: 8706500Abstract: Methods, apparatus, and computer program products are described for establishing a multimodal personality for a multimodal application that include selecting, by the multimodal application, matching vocal and visual demeanors and incorporating, by the multimodal application, the matching vocal and visual demeanors as a multimodal personality into the multimodal application.Type: GrantFiled: November 1, 2011Date of Patent: April 22, 2014Assignee: Nuance Communications, Inc.Inventors: Charles W. Cross, Jr., Hilary A. Pike
-
Publication number: 20140052449Abstract: Establishing a multimodal advertising personality for a sponsor of a multimodal application, including associating one or more vocal demeanors with a sponsor of a multimodal application and presenting a speech portion of the multimodal application for the sponsor using at least one of the vocal demeanors associated with the sponsor.Type: ApplicationFiled: July 29, 2013Publication date: February 20, 2014Applicant: Nuance Communications, Inc.Inventors: Charles W. Cross, JR., Hilary A. Pike
-
Publication number: 20140039899Abstract: Indexing digitized speech with words represented in the digitized speech, with a multimodal digital audio editor operating on a multimodal device supporting modes of user interaction, the modes of user interaction including a voice mode and one or more non-voice modes, the multimodal digital audio editor operatively coupled to an ASR engine, including providing by the multimodal digital audio editor to the ASR engine digitized speech for recognition; receiving in the multimodal digital audio editor from the ASR engine recognized user speech including a recognized word, also including information indicating where, in the digitized speech, representation of the recognized word begins; and inserting by the multimodal digital audio editor the recognized word, in association with the information indicating where, in the digitized speech, representation of the recognized word begins, into a speech recognition grammar, the speech recognition grammar voice enabling user interface commands of the multimodal digital auType: ApplicationFiled: August 7, 2013Publication date: February 6, 2014Applicant: Nuance Communications, Inc.Inventors: Charles W. Cross, JR., Frank L. Jania
-
Patent number: 8638909Abstract: Some example embodiments include a method of dynamically publishing directory information for a plurality of interactive voice response (‘IVR’) systems. The method includes receiving, by the IVR directory service on behalf of one of the IVR systems, a web services update request. The method includes determining, by the IVR directory service in response to the web services update request, updated directory information for the IVR system. The method includes updating the IVR system directory with the updated directory information for the IVR system. The method includes generating an updated voice mode user interface to reflect the updated IVR system directory with the updated directory information for the IVR system. The generating includes creating one more voice dialogs in accordance with the directory information, the one or more voice dialogs specifying a call flow defining the interaction between a caller and the IVR directory service.Type: GrantFiled: June 19, 2012Date of Patent: January 28, 2014Assignee: International Business Machines CorporationInventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr., Fang Wang
-
Publication number: 20130339033Abstract: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.Type: ApplicationFiled: August 26, 2013Publication date: December 19, 2013Applicant: Nuance Communications, Inc.Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, JR.
-
Patent number: 8612230Abstract: Methods, apparatus, and computer program products are described for automatic speech recognition (‘ASR’) that include accepting by the multimodal application speech input and visual input for selecting or deselecting items in a selection list, the speech input enabled by a speech recognition grammar; providing, from the multimodal application to the grammar interpreter, the speech input and the speech recognition grammar; receiving, by the multimodal application from the grammar interpreter, interpretation results including matched words from the grammar that correspond to items in the selection list and a semantic interpretation token that specifies whether to select or deselect items in the selection list; and determining, by the multimodal application in dependence upon the value of the semantic interpretation token, whether to select or deselect items in the selection list that correspond to the matched words.Type: GrantFiled: January 3, 2007Date of Patent: December 17, 2013Assignee: Nuance Communications, Inc.Inventors: Ciprian Agapi, Soonthorn Ativanichayaphong, Charles W. Cross, Jr., Gerald M. McCobb
-
Patent number: 8600755Abstract: Establishing a multimodal personality for a multimodal application, including evaluating, by the multimodal application, attributes of a user's interaction with the multimodal application; selecting, by the multimodal application, a vocal demeanor in dependence upon the values of the attributes of the user's interaction with the multimodal application; and incorporating, by the multimodal application, the vocal demeanor into the multimodal application.Type: GrantFiled: January 23, 2013Date of Patent: December 3, 2013Assignee: Nuance Communications, Inc.Inventors: Charles W. Cross, Jr., Hilary A. Pike
-
Patent number: 8571872Abstract: Exemplary methods, systems, and products are disclosed for synchronizing visual and speech events in a multimodal application, including receiving from a user speech; determining a semantic interpretation of the speech; calling a global application update handler; identifying, by the global application update handler, an additional processing function in dependence upon the semantic interpretation; and executing the additional function. Typical embodiments may include updating a visual element after executing the additional function. Typical embodiments may include updating a voice form after executing the additional function. Typical embodiments also may include updating a state table after updating the voice form. Typical embodiments also may include restarting the voice form after executing the additional function.Type: GrantFiled: September 30, 2011Date of Patent: October 29, 2013Assignee: Nuance Communications, Inc.Inventors: Charles W. Cross, Jr., Michael C. Hollinger, Igor R. Jablokov, Benjamin D. Lewis, Hilary A. Pike, Daniel M. Smith, David W. Wintermute, Michael A. Zaitzeff
-
Publication number: 20130283172Abstract: Establishing a preferred mode of interaction between a user and a multimodal application, including evaluating, by a multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, user modal preference, and dynamically configuring multimodal content of the multimodal application in dependence upon the evaluation of user modal preference.Type: ApplicationFiled: June 20, 2013Publication date: October 24, 2013Inventors: Charles W. Cross, JR., Hilary A. Pike
-
Patent number: 8566087Abstract: Methods, apparatus, and computer program products for providing a context-based grammar for automatic speech recognition, including creating by a multimodal application a context, the context comprising words associated with user activity in the multimodal application, and supplementing by the multimodal application a grammar for automatic speech recognition in dependence upon the context.Type: GrantFiled: September 13, 2012Date of Patent: October 22, 2013Assignee: Nuance Communications, Inc.Inventors: Charles W. Cross, Jr., Frank L. Jania
-
Patent number: 8521534Abstract: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.Type: GrantFiled: September 12, 2012Date of Patent: August 27, 2013Assignee: Nuance Communications, Inc.Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.
-
Patent number: 8515757Abstract: Indexing digitized speech with words represented in the digitized speech, with a multimodal digital audio editor operating on a multimodal device supporting modes of user interaction, the modes of user interaction including a voice mode and one or more non-voice modes, the multimodal digital audio editor operatively coupled to an ASR engine, including providing by the multimodal digital audio editor to the ASR engine digitized speech for recognition; receiving in the multimodal digital audio editor from the ASR engine recognized user speech including a recognized word, also including information indicating where, in the digitized speech, representation of the recognized word begins; and inserting by the multimodal digital audio editor the recognized word, in association with the information indicating where, in the digitized speech, representation of the recognized word begins, into a speech recognition grammar, the speech recognition grammar voice enabling user interface commands of the multimodal digital auType: GrantFiled: March 20, 2007Date of Patent: August 20, 2013Assignee: Nuance Communications, Inc.Inventors: Charles W. Cross, Jr., Frank L. Jania
-
Patent number: 8510117Abstract: Speech enabled media sharing in a multimodal application including parsing, by a multimodal browser, one or more markup documents of a multimodal application; identifying, by the multimodal browser, in the one or more markup documents a web resource for display in the multimodal browser; loading, by the multimodal browser, a web resource sharing grammar that includes keywords for modes of resource sharing and keywords for targets for receipt of web resources; receiving, by the multimodal browser, an utterance matching a keyword for the web resource, a keyword for a mode of resource sharing and a keyword for a target for receipt of the web resource in the web resource sharing grammar thereby identifying the web resource, a mode of resource sharing, and a target for receipt of the web resource; and sending, by the multimodal browser, the web resource to the identified target for the web resource using the identified mode of resource sharing.Type: GrantFiled: July 9, 2009Date of Patent: August 13, 2013Assignee: Nuance Communications, Inc.Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.