Patents by Inventor Charles W. Cross

Charles W. Cross has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Automatic speech recognition with a selection list

Patent number: 8612230

Abstract: Methods, apparatus, and computer program products are described for automatic speech recognition (‘ASR’) that include accepting by the multimodal application speech input and visual input for selecting or deselecting items in a selection list, the speech input enabled by a speech recognition grammar; providing, from the multimodal application to the grammar interpreter, the speech input and the speech recognition grammar; receiving, by the multimodal application from the grammar interpreter, interpretation results including matched words from the grammar that correspond to items in the selection list and a semantic interpretation token that specifies whether to select or deselect items in the selection list; and determining, by the multimodal application in dependence upon the value of the semantic interpretation token, whether to select or deselect items in the selection list that correspond to the matched words.

Type: Grant

Filed: January 3, 2007

Date of Patent: December 17, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, Soonthorn Ativanichayaphong, Charles W. Cross, Jr., Gerald M. McCobb

Establishing a multimodal personality for a multimodal application in dependence upon attributes of user interaction

Patent number: 8600755

Abstract: Establishing a multimodal personality for a multimodal application, including evaluating, by the multimodal application, attributes of a user's interaction with the multimodal application; selecting, by the multimodal application, a vocal demeanor in dependence upon the values of the attributes of the user's interaction with the multimodal application; and incorporating, by the multimodal application, the vocal demeanor into the multimodal application.

Type: Grant

Filed: January 23, 2013

Date of Patent: December 3, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Hilary A. Pike

Synchronizing visual and speech events in a multimodal application

Patent number: 8571872

Abstract: Exemplary methods, systems, and products are disclosed for synchronizing visual and speech events in a multimodal application, including receiving from a user speech; determining a semantic interpretation of the speech; calling a global application update handler; identifying, by the global application update handler, an additional processing function in dependence upon the semantic interpretation; and executing the additional function. Typical embodiments may include updating a visual element after executing the additional function. Typical embodiments may include updating a voice form after executing the additional function. Typical embodiments also may include updating a state table after updating the voice form. Typical embodiments also may include restarting the voice form after executing the additional function.

Type: Grant

Filed: September 30, 2011

Date of Patent: October 29, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Michael C. Hollinger, Igor R. Jablokov, Benjamin D. Lewis, Hilary A. Pike, Daniel M. Smith, David W. Wintermute, Michael A. Zaitzeff

Establishing a preferred mode of interaction between a user and a multimodal application

Publication number: 20130283172

Abstract: Establishing a preferred mode of interaction between a user and a multimodal application, including evaluating, by a multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, user modal preference, and dynamically configuring multimodal content of the multimodal application in dependence upon the evaluation of user modal preference.

Type: Application

Filed: June 20, 2013

Publication date: October 24, 2013

Inventors: Charles W. Cross, Jr., Hilary A. Pike

Context-based grammars for automated speech recognition

Patent number: 8566087

Abstract: Methods, apparatus, and computer program products for providing a context-based grammar for automatic speech recognition, including creating by a multimodal application a context, the context comprising words associated with user activity in the multimodal application, and supplementing by the multimodal application a grammar for automatic speech recognition in dependence upon the context.

Type: Grant

Filed: September 13, 2012

Date of Patent: October 22, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Frank L. Jania

Dynamically extending the speech prompts of a multimodal application

Patent number: 8521534

Abstract: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.

Type: Grant

Filed: September 12, 2012

Date of Patent: August 27, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Indexing digitized speech with words represented in the digitized speech

Patent number: 8515757

Abstract: Indexing digitized speech with words represented in the digitized speech, with a multimodal digital audio editor operating on a multimodal device supporting modes of user interaction, the modes of user interaction including a voice mode and one or more non-voice modes, the multimodal digital audio editor operatively coupled to an ASR engine, including providing by the multimodal digital audio editor to the ASR engine digitized speech for recognition; receiving in the multimodal digital audio editor from the ASR engine recognized user speech including a recognized word, also including information indicating where, in the digitized speech, representation of the recognized word begins; and inserting by the multimodal digital audio editor the recognized word, in association with the information indicating where, in the digitized speech, representation of the recognized word begins, into a speech recognition grammar, the speech recognition grammar voice enabling user interface commands of the multimodal digital audio editor.

Type: Grant

Filed: March 20, 2007

Date of Patent: August 20, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Frank L. Jania

Speech enabled media sharing in a multimodal application

Patent number: 8510117

Abstract: Speech enabled media sharing in a multimodal application including parsing, by a multimodal browser, one or more markup documents of a multimodal application; identifying, by the multimodal browser, in the one or more markup documents a web resource for display in the multimodal browser; loading, by the multimodal browser, a web resource sharing grammar that includes keywords for modes of resource sharing and keywords for targets for receipt of web resources; receiving, by the multimodal browser, an utterance matching a keyword for the web resource, a keyword for a mode of resource sharing and a keyword for a target for receipt of the web resource in the web resource sharing grammar thereby identifying the web resource, a mode of resource sharing, and a target for receipt of the web resource; and sending, by the multimodal browser, the web resource to the identified target for the web resource using the identified mode of resource sharing.

Type: Grant

Filed: July 9, 2009

Date of Patent: August 13, 2013

Assignee: Nuance Communications, Inc.

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Establishing a multimodal advertising personality for a sponsor of a multimodal application

Patent number: 8498873

Abstract: Establishing a multimodal advertising personality for a sponsor of a multimodal application, including associating one or more vocal demeanors with a sponsor of a multimodal application and presenting a speech portion of the multimodal application for the sponsor using at least one of the vocal demeanors associated with the sponsor.

Type: Grant

Filed: June 28, 2012

Date of Patent: July 30, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Hilary A. Pike

Establishing a preferred mode of interaction between a user and a multimodal application

Patent number: 8494858

Abstract: Establishing a preferred mode of interaction between a user and a multimodal application, including evaluating, by a multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, user modal preference, and dynamically configuring multimodal content of the multimodal application in dependence upon the evaluation of user modal preference.

Type: Grant

Filed: February 14, 2012

Date of Patent: July 23, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Hilary A. Pike

Multimodal teleconferencing

Patent number: 8416714

Abstract: Multimodal teleconferencing including receiving, by a multimodal teleconferencing module, a speech utterance from one of a plurality of participants in the multimodal teleconference; identifying the participant making the speech utterance as a current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to the current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to one or more other participants in the multimodal teleconference; providing, by the multimodal teleconferencing module to a multimodal teleconferencing client for display to the current speaker, an identification of the speaker and the content retrieved for the speaker; and providing, by the multimodal teleconferencing module to one or more of multimodal teleconferencing clients for display to the other participants, an identification of the current speaker with the content retrieved for the one or more other participants.

Type: Grant

Filed: August 5, 2009

Date of Patent: April 9, 2013

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Improving speech capabilities of a multimodal application

Patent number: 8380513

Abstract: Improving speech capabilities of a multimodal application including receiving, by the multimodal browser, a media file having a metadata container; retrieving, by the multimodal browser, from the metadata container a speech artifact related to content stored in the media file for inclusion in the speech engine available to the multimodal browser; determining whether the speech artifact includes a grammar rule or a pronunciation rule; if the speech artifact includes a grammar rule, modifying, by the multimodal browser, the grammar of the speech engine to include the grammar rule; and if the speech artifact includes a pronunciation rule, modifying, by the multimodal browser, the lexicon of the speech engine to include the pronunciation rule.

Type: Grant

Filed: May 19, 2009

Date of Patent: February 19, 2013

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Establishing a multimodal personality for a multimodal application in dependence upon attributes of user interaction

Patent number: 8374874

Abstract: Establishing a multimodal personality for a multimodal application, including evaluating, by the multimodal application, attributes of a user's interaction with the multimodal application; selecting, by the multimodal application, a vocal demeanor in dependence upon the values of the attributes of the user's interaction with the multimodal application; and incorporating, by the multimodal application, the vocal demeanor into the multimodal application.

Type: Grant

Filed: September 11, 2006

Date of Patent: February 12, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Hilary A. Pike

Enabling speech within a multimodal program using markup

Patent number: 8359203

Abstract: A method for speech enabling an application can include the step of specifying a speech input within a speech-enabled markup. The speech-enabled markup can also specify an application operation that is to be executed responsive to the detection of the speech input. After the speech input has been defined within the speech-enabled markup, the application can be instantiated. The specified speech input can then be detected and the application operation can be responsively executed in accordance with the specified speech-enabled markup.

Type: Grant

Filed: September 20, 2011

Date of Patent: January 22, 2013

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Leslie R. Wilson, Steven G. Woodward

Dynamically extending the speech prompts of a multimodal application

Publication number: 20130018658

Abstract: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.

Type: Application

Filed: September 12, 2012

Publication date: January 17, 2013

Applicant: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.

Context-based grammars for automated speech recognition

Publication number: 20130006621

Abstract: Methods, apparatus, and computer program products for providing a context-based grammar for automatic speech recognition, including creating by a multimodal application a context, the context comprising words associated with user activity in the multimodal application, and supplementing by the multimodal application a grammar for automatic speech recognition in dependence upon the context.

Type: Application

Filed: September 13, 2012

Publication date: January 3, 2013

Applicant: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Frank L. Jania

Context-based grammars for automated speech recognition

Patent number: 8332218

Abstract: Methods, apparatus, and computer program products for providing a context-based grammar for automatic speech recognition, including creating by a multimodal application a context, the context comprising words associated with user activity in the multimodal application, and supplementing by the multimodal application a grammar for automatic speech recognition in dependence upon the context.

Type: Grant

Filed: June 13, 2006

Date of Patent: December 11, 2012

Assignee: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Frank L. Jania

Establishing a multimodal advertising personality for a sponsor of a multimodal application

Publication number: 20120271642

Abstract: Establishing a multimodal advertising personality for a sponsor of a multimodal application, including associating one or more vocal demeanors with a sponsor of a multimodal application and presenting a speech portion of the multimodal application for the sponsor using at least one of the vocal demeanors associated with the sponsor.

Type: Application

Filed: June 28, 2012

Publication date: October 25, 2012

Applicant: Nuance Communications, Inc.

Inventors: Charles W. Cross, Jr., Hilary A. Pike

Systems and methods for inputting graphical data into a graphical input field

Patent number: 8296149

Abstract: A system (20) for inputting graphical data into a graphical input field includes a graphical input device (22) for inputting the graphical data into the graphical input field, and a processor-executable voice-form module (28) responsive to an initial presentation of graphical data to the graphical input device. The voice-form module (28) causes a determination of whether the inputting of the graphical data into the graphical input field is complete. A method for inputting graphical data into a graphical input field includes initiating an input of graphical data via a graphical input device into the graphical input field, and actuating a voice-form module in response to initiating the input of graphical data into the graphical input field.

Type: Grant

Filed: January 30, 2009

Date of Patent: October 23, 2012

Assignee: International Business Machines Corporation

Inventors: Charles W. Cross, Jr., David Jaramillo, Marc White

Dynamically extending the speech prompts of a multimodal application

Patent number: 8290780

Abstract: Dynamically extending the speech prompts of a multimodal application including receiving, by the prompt generation engine, a media file having a metadata container; retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application; and modifying, by the prompt generation engine, the multimodal application to include the speech prompt.

Type: Grant

Filed: June 24, 2009

Date of Patent: October 16, 2012

Assignee: International Business Machines Corporation

Inventors: Ciprian Agapi, William K. Bodin, Charles W. Cross, Jr.
