Patents by Inventor Jordan R. Cohen

Jordan R. Cohen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

INTERACTION ASSISTANT

Publication number: 20180174585

Abstract: An interaction assistant conducts multiple turn interaction dialogs with a user in which context is maintained between turns, and the system manages the dialog to achieve an inferred goal for the user. The system includes a linguistic interface to a user and a parser for processing linguistic events from the user. A dialog manager of the system is configured to receive alternative outputs from the parser, and selecting an action and causing the action to be performed based on the received alternative outputs. The system further includes a dialog state for an interaction with the user, and the alternative outputs represent alternative transitions from a current dialog state to a next dialog state. The system further includes a storage for a plurality of templates, and wherein each dialog state is defined in terms of an interrelationship of one or more instances of the templates.

Type: Application

Filed: February 14, 2018

Publication date: June 21, 2018

Inventors: Jacob Andreas, Taylor D. Berg-Kirkpatrick, Pengyu Chen, Jordan R. Cohen, Laurence S. Gillick, David Leo Wright Hall, Daniel Klein, Michael Newman, Adam D. Pauls, Daniel L. Roth, Jesse Daniel Eskes Rusak, Andrew R. Volpe, Steven A. Wegmann
AUTOMATIC SPOKEN DIALOGUE SCRIPT DISCOVERY

Publication number: 20170147554

Abstract: A method for configuring an automated dialogue system uses traces of interactions via a graphical user interface (GUI) for an application. Each trace includes interactions in the context of a plurality of presentations of the GUI. Elements of one or more presentations of the GUI are identified, and templates are associated with portions of the trace. Each template has one or more defined inputs and a defined output. For each template of the plurality of templates, the portions of the traces are processed to automatically configure the template by specifying a procedure for providing values of inputs to the template via the GUI and obtaining a value of an output. The automated dialogue system is configured with the configured templates, thereby avoiding manual configuration of the dialogue system.

Type: Application

Filed: November 22, 2016

Publication date: May 25, 2017

Inventors: Pengyu Chen, Jordan R. Cohen, Laurence S. Gillick, David Leo Wright Hall, Daniel Klein, Adam D. Pauls, Daniel L. Roth, Jesse Daniel Eskes Rusak
INTERACTION ASSISTANT

Publication number: 20170140755

Abstract: An interaction assistant conducts multiple turn interaction dialogs with a user in which context is maintained between turns, and the system manages the dialog to achieve an inferred goal for the user. The system includes a linguistic interface to a user and a parser for processing linguistic events from the user. A dialog manager of the system is configured to receive alternative outputs from the parser, and selecting an action and causing the action to be performed based on the received alternative outputs. The system further includes a dialog state for an interaction with the user, and the alternative outputs represent alternative transitions from a current dialog state to a next dialog state. The system further includes a storage for a plurality of templates, and wherein each dialog state is defined in terms of an interrelationship of one or more instances of the templates.

Type: Application

Filed: November 10, 2016

Publication date: May 18, 2017

Inventors: Jacob Andreas, Taylor D. Being-Kirkpatrick, Pengyu Chen, Jordan R. Cohen, Laurence S Gillsick, David Leo Wright Hall, Daniel Klein, Michael Newman, Adam D. Pauls, Daniel L. Roth, Jesse Daniel Eskes Rusak, Andrew R. Volpe, Steven A. Wegmann
ATTENTIVE ASSISTANT

Publication number: 20170118344

Abstract: An approach to providing communication assistance to an operator of a vehicle makes use software having a first component executing on a personal device of the operator as well as a second component executing on a server in communication with the personal device. In some implementations, handling a call involves establishing a first two-way audio link between the server and the calling device is established, and a second two-way audio link between a server and the user device. The server passes some of the audio from the calling device to the user device, and monitors a user's voice input, of lack thereof, to determine how to handle the call.

Type: Application

Filed: October 20, 2016

Publication date: April 27, 2017

Inventors: Jordan R. Cohen, Daniel L. Roth, David Leo Wright Hall, Jesse Daniel Eskes Rusak, Andrew Robert Volpe, Sean Daniel True, Damon R. Pender, Laurence S. Gillick, Yan Virin
Methods and apparatus for formant-based voice synthesis

Patent number: 8706488

Abstract: In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.

Type: Grant

Filed: February 27, 2013

Date of Patent: April 22, 2014

Assignee: Nuance Communications, Inc.

Inventors: Michael D. Edgington, Laurence Gillick, Jordan R. Cohen
Methods and apparatus for formant-based voice systems

Patent number: 8447592

Abstract: In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.

Type: Grant

Filed: September 13, 2005

Date of Patent: May 21, 2013

Assignee: Nuance Communications, Inc.

Inventors: Michael D. Edgington, Laurence Gillick, Jordan R. Cohen
Word recognition using choice lists

Patent number: 7809574

Abstract: One aspect of the invention involves word recognition that uses scrollable choice lists in which choices are listed in character-order. Another aspect relates to a scrollable, visually-displayed word recognition choice list, where the recognition candidates on the choice list are each associated with a choice-selecting symbol the user can use to select a desired recognition candidate by pressing an associated button, and where the same choice-selecting symbol is used for different choices displayed on the display at different times as a result of scrolling. Another aspect of the invention relates to providing a choice list of best scoring characters for a particular character position in the spelling of a filter that is used to filter word recognition. Another aspect of the invention relates to a choice list used in word recognition in which the choice list can be scrolled horizontally.

Type: Grant

Filed: September 24, 2004

Date of Patent: October 5, 2010

Assignee: Voice Signal Technologies Inc.

Inventors: Daniel L. Roth, Jordan R. Cohen, David F. Johnston, Edward W. Porter
Speech recognition using automatic recognition turn off

Patent number: 7716058

Abstract: Large vocabulary speech recognition can automatically turn recognition off in one or more ways. A user command can turn on recognition that is automatically turned off after the next end of utterance. A plurality of buttons can each be associated with a different speech mode and the touch of a given button can turn on, and then automatically turn off, the given button's associated speech recognition mode. These selectable modes can include large vocabulary and alphabetic entry modes, or continuous and discrete modes. A first user input can start recognition that allows a sequence of vocabulary words to be recognized and a second user input can start recognition that turns off after one word has been recognized. A first user input can start recognition that allows a sequence of utterances to be recognized and a second user input can start recognition that allows only a single utterance to be recognized.

Type: Grant

Filed: September 24, 2004

Date of Patent: May 11, 2010

Assignee: Voice Signal Technologies, Inc.

Inventors: Daniel L. Roth, Jordan R. Cohen, David F. Johnston
Word recognition using word transformation commands

Patent number: 7634403

Abstract: Word recognition enables a user to have a selected transformation performed on a given word produced by word recognition. In one aspect of the invention, a selectable transformation changes the given word to a differently spelled word having the same word root. In another, a selectable transformation changes a given word to one or more of its homonyms. In yet another, a selectable transformation changes the given word between a representation that spells the word with letters and one that does not. In one aspect of the invention a user can select to display a choice list of transformed words corresponding to a given recognized word and then select to have one of the listed transformed words replace the given word. In another aspect of the invention word recognition favors recognition of words corresponding to a user selected part of speech.

Type: Grant

Filed: September 24, 2004

Date of Patent: December 15, 2009

Assignee: Voice Signal Technologies, Inc.

Inventors: Daniel L. Roth, Jordan R. Cohen, David F. Johnston
Combined speech recognition and text-to-speech generation

Patent number: 7577569

Abstract: Text-to-speech (TTS) generation is used in conjunction with large vocabulary speech recognition to say words selected by the speech recognition. The software for performing the large vocabulary speech recognition can share speech modeling data with the TTS software. TTS or recorded audio can be used to automatically say both recognized text and the names of recognized commands after their recognition. The TTS can automatically repeats text recognized by the speech recognition after each of a succession of end of utterance detections. A user can move a cursor back or forward in recognized text, and the TTS can speak one or more words at the cursor location after each such move. The speech recognition can be used to produces a choice list of possible recognition candidates and the TTS can be used to provide spoken output of one or more of the candidates on the choice list.

Type: Grant

Filed: September 24, 2004

Date of Patent: August 18, 2009

Assignee: Voice Signal Technologies, Inc.

Inventors: Daniel L. Roth, Jordan R. Cohen, David F. Johnston, Manfred G. Grabherr, Edward W. Porter
Speech recognition using ambiguous or phone key spelling and/or filtering

Patent number: 7526431

Abstract: Alphabetic filtering of the speech recognition of words uses a key press to indicate a desired character in an alphabetic filter string, where each key press represents two or more letters. The key presses can be disambiguated by recognizing a key-disambiguation utterance in association with a given key press. A user can select a desired recognition candidate from a choice list produced by such filtered word recognition. Ambiguous alphabetic filtering can be performed iteratively in response to the addition of successive ambiguous key presses. A user can select to re-recognize the utterance using filtering based on ambiguous key input after seeing the results of recognition without such filtering. Unambiguous alphabetic filtering can be performed by using multiple presses of an ambiguous key to disambiguate which letter is intended. A user can select between entering text by either large vocabulary speech recognition or by spelling text by pressing phone keys.

Type: Grant

Filed: September 24, 2004

Date of Patent: April 28, 2009

Assignee: Voice Signal Technologies, Inc.

Inventors: Daniel L. Roth, Jordan R. Cohen, David F. Johnston
Combined speech recognition and sound recording

Patent number: 7505911

Abstract: A handheld device with both large-vocabulary speech recognition and audio recoding allows users to switch between at least two of the following three modes: (1) recording audio without corresponding speech recognition; (2) recording with speech recognition; and (3) speech recognition without audio recording. A handheld device with both large-vocabulary speech recognition and audio recoding enables a user to select a portion of previously recorded sound and have speech recognition performed upon it. A system enables a user to search for a text label associated with portions of unrecognized recorded sound by uttering the label's words. A large-vocabulary system allows users to switch between playing back recorded audio and speech recognition with a single input, with successive audio playbacks automatically starting slightly before the end of prior playback. And a cell phone that allows both large-vocabulary speech recognition and audio recording and playback.

Type: Grant

Filed: December 5, 2004

Date of Patent: March 17, 2009

Inventors: Daniel L. Roth, Jordan R. Cohen, David F. Johnston, Edward W. Porter
Speech recognition using re-utterance recognition

Patent number: 7444286

Abstract: The present invention relates to speech recognition that enables a user to perform re-utterance recognition, in which speech recognition is performed upon both a second saying of a sequence of one or more words and upon an earlier saying of the same sequence to help the speech recognition better select one or more best scoring text sequences for the utterances.

Type: Grant

Filed: December 5, 2004

Date of Patent: October 28, 2008

Inventors: Daniel L. Roth, Jordan R. Cohen
Speech recognition using selectable recognition modes

Patent number: 7313526

Abstract: The present invention relates to speech recognition using selectable recognition modes. This includes innovations such as: large vocabulary speech recognition programming that supplies recognized words to external program as they are recognized, and allows a user to select between large vocabulary recognition of an utterance with and without language context from the prior utterance independently of state of the external program; allowing a user to select between continuous and discrete speech recognition that use substantially the same vocabulary; allowing a user to select between continuous and discrete large-vocabulary speech recognition modes; allowing a user to select between at least two different alphabetic entry speech recognition modes; and allowing a user to select from among four or more of the following recognitions modes when creating text: a large-vocabulary mode, an alphabetic entry mode, a number entry mode, and a punctuation entry mode.

Type: Grant

Filed: September 24, 2004

Date of Patent: December 25, 2007

Assignee: Voice Signal Technologies, Inc.

Inventors: Daniel L. Roth, Jordan R. Cohen, David F. Johnston, Manfred G. Grabherr
Methods, systems, and programming for performing speech recognition

Patent number: 7225130

Abstract: The present invention relates to: speech recognition using selectable recognition modes; using choice lists in large-vocabulary speech recognition; enabling users to select word transformations; speech recognition that automatically turns recognition off in one or more specified ways; phone key control of large-vocabulary speech recognition; speech recognition using phone key alphabetic filtering and spelling: speech recognition that enables a user to perform re-utterance recognition; the combination of speech recognition and text-to-speech (TTS) generation; the combination of speech recognition with handwriting and/or character recognition; and the combination of large-vocabulary speech recognition with audio recording and playback.

Type: Grant

Filed: September 6, 2002

Date of Patent: May 29, 2007

Assignee: Voice Signal Technologies, Inc.

Inventors: Daniel L. Roth, Jordan R. Cohen, David F. Johnston, Manfred G. Grabherr
Methods, systems, and programming for performing speech recognition

Publication number: 20040267528

Abstract: The present invention relates to: speech recognition using selectable recognition modes; using choice lists in large-vocabulary speech recognition; enabling users to select word transformations; speech recognition that automatically turns recognition off in one or more specified ways; phone key control of large-vocabulary speech recognition; speech recognition using phone key alphabetic filtering and spelling: speech recognition that enables a user to perform re-utterance recognition; the combination of speech recognition and text-to-speech (TTS) generation; the combination of speech recognition with handwriting and/or character recognition; and the combination of large-vocabulary speech recognition with audio recording and playback.

Type: Application

Filed: September 6, 2002

Publication date: December 30, 2004

Inventors: Daniel L. Roth, Jordan R. Cohen, David F. Johnston, Manfred G. Grabherr
Apparatus, methods, and programming for speech synthesis via bit manipulations of compressed database

Publication number: 20040073428

Abstract: Text-to-speech synthesis modifies the pitch of the sounds it concatenates to generate speech, when such sounds are in compressed, coded form, so as to make them sound better together. The pitch, duration, and energy of such concatenated sounds can be altered to better match, respectively, pitch, duration, and/or energy contours generated from phonetic spelling of the speech to be synthesized, which can, in turn, be derived from the text to be synthesized. The synthesized speech can be generated from the encoded sound of sub-word snippets as well as of one or more whole words. The duration of concatenated sounds can be changed by inserting or deleting sound frames associated with individual snippets. Such text-to-speech can be used to say words recognized by speech recognition, such as to provide feedback on the recognition. Such text-to-speech synthesis can be used in portable devices such as cellphones, PDAs, and/or wrist phones.

Type: Application

Filed: October 10, 2002

Publication date: April 15, 2004

Inventors: Igor Zlokarnik, Laurence S. Gillick, Jordan R. Cohen
Methods, systems, and programming for performing speech recognition

Publication number: 20040049388

Abstract: The present invention relates to: speech recognition using selectable recognition modes; using choice lists in large-vocabulary speech recognition; enabling users to select word transformations; speech recognition that automatically turns recognition off in one or more specified ways; phone key control of large-vocabulary speech recognition; speech recognition using phone key alphabetic filtering and spelling: speech recognition that enables a user to perform re-utterance recognition; the combination of speech recognition and text-to-speech (TTS) generation; the combination of speech recognition with handwriting and/or character recognition; and the combination of large-vocabulary speech recognition with audio recording and playback.

Type: Application

Filed: September 6, 2002

Publication date: March 11, 2004

Inventors: Daniel L. Roth, Jordan R. Cohen, David F. Johnston, Manfred G. Grabherr