Patents Assigned to SpeechWorks International, Inc.

Speech recognition and speaker verification using distributed speech processing

Patent number: 7571100

Abstract: Processing a speech utterance by communicating between a local computer and a remote computer using a hyper text communication session. The local computer sends a recording of a speech utterance to the remote computer in the session, and receives a result from the remote computer, the result based on a processing of the recording at the remote computer.

Type: Grant

Filed: December 3, 2002

Date of Patent: August 4, 2009

Assignee: Speechworks International, Inc.

Inventors: Philip Lenir, Andrew Hunt, Francois Corriveau
Multi-lingual speech recognition with cross-language context modeling

Patent number: 7149688

Abstract: An approach to multi-lingual speech recognition that permits different words in an utterance to be from different languages. Words from different languages are represented using different sets of sub-word units that are each associate with the corresponding language. Despite the use of different sets of sub-word units, the approach enables use of cross-word context at boundaries between words from different languages (cross-language context) to select appropriate variants of the sub-word units to match the context.

Type: Grant

Filed: November 4, 2002

Date of Patent: December 12, 2006

Assignee: SpeechWorks International, Inc.

Inventor: Johan Schalkwyk
Reduced unit database generation based on cost information

Patent number: 6988069

Abstract: An arrangement is provided for generating a reduced unit database of a desired size to be used in text to speech operations. A reduced unit database with a desired size is generated based on a full unit database. The reduction is carried out with respect to a text database with a plurality of sentences. Units from the full database are pruned to minimize an overall cost associated with using alternative units other than the units in the reduced unit database.

Type: Grant

Filed: January 31, 2003

Date of Patent: January 17, 2006

Assignee: Speechworks International, Inc.

Inventor: Michael Stuart Phillips
Linguistic prosodic model-based text to speech

Patent number: 6961704

Abstract: An arrangement is provided for text to speech processing based on linguistic prosodic models. Linguistic prosodic models are established to characterize different linguistic prosodic characteristics. When an input text is received, a target unit sequence is generated with a linguistic target that annotates target units in the target unit sequence with a plurality of linguistic prosodic characteristics so that speech synthesized in accordance with the target unit sequence and the linguistic target has certain desired prosodic properties. A unit sequence is selected in accordance with the target unit sequence and the linguistic target based on joint cost information evaluated using established linguistic prosodic models. The selected unit sequence is used to produce synthesized speech corresponding to the input text.

Type: Grant

Filed: January 31, 2003

Date of Patent: November 1, 2005

Assignee: Speechworks International, Inc.

Inventors: Michael S. Phillips, Daniel S. Faulkner, Marek A. Przezdzieci
Automatically retraining a speech recognition system

Patent number: 6789062

Abstract: A telephone-based interactive speech recognition system is retrained using variable weighting and incremental retraining. Variable weighting involves changing the relative influence of particular measurement data to be reflected in a statistical model. Statistical model data is determined based upon an initial set of measurement data determined from an initial set of speech utterances. When new statistical model data is to be generated to reflect new measurement data determined from new speech utterances, a weighting factor is applied to the new measurement data to generate weighted new measurement data. The new statistical model data is then determined based upon the initial set of measurement data and the weighted new measurement data. Incremental retraining involves generating new statistical model data using prior statistical model data to reduce the amount of prior measurement data that must be maintained and processed.

Type: Grant

Filed: February 25, 2000

Date of Patent: September 7, 2004

Assignee: SpeechWorks International, Inc.

Inventors: Michael S. Phillips, Krishna K. Govindarajan, Mark Fanty, Etienne Barnard
Method and apparatus for facilitating speech barge-in in connection with voice recognition systems

Patent number: 6785365

Abstract: A barge-in detector for use in connection with a speech recognition system forms a prompt replica for use in detecting the presence or absence of user input to the system. The replica is indicative of the prompt energy applied to an input of the system. The detector detects the application of user input to the system, even if concurrent with a prompt, and enables the system to quickly respond to the user input.

Type: Grant

Filed: July 24, 2001

Date of Patent: August 31, 2004

Assignee: Speechworks International, Inc.

Inventor: John N. Nguyen
Dynamic semantic control of a speech recognition system

Publication number: 20040006465

Abstract: A method and apparatus are provided for automatically recognizing words of spoken speech using a computer-based speech recognition system according to a dynamic semantic model. In an embodiment, the speech recognition system recognizes speech and generates one or more word strings, each of which is a hypothesis of the speech, and creates and stores a probability value or score for each of the word strings. The word strings are ordered by probability value. The speech recognition system also creates and stores, for each of the word strings, one or more keyword-value pairs that represent semantic elements and semantic values of the semantic elements for the speech that was spoken. One or more dynamic semantic rules are defined that specify how a probability value of a word string should be modified based on information about external conditions, facts, or the environment of the application in relation to the semantic values of that word string.

Type: Application

Filed: February 10, 2003

Publication date: January 8, 2004

Applicant: Speechworks International, Inc., a Delaware Corporation

Inventors: Michael S. Phillips, Etienne Barnard, Jean-Guy Dahan, Michael J. Metzger
Load-adjusted speech recogintion

Patent number: 6629075

Abstract: A speech recognition system includes a user interface configured to provide signals indicative of a user's speech. A speech recognizer of the system includes a processor configured to use the signals from the user interface to perform speech recognition operations to attempt to recognize speech indicated by the signals. A control mechanism is coupled to the voice recognizer and is configured to affect processor usage for speech recognition operations in accordance with a loading of the processor.

Type: Grant

Filed: June 9, 2000

Date of Patent: September 30, 2003

Assignee: SpeechWorks International, Inc.

Inventor: Johan Schalkwyk
Statistical computing and reporting for interactive speech applications

Patent number: 6606598

Abstract: A method and apparatus are disclosed for computing and reporting statistical information that describes the performance of an interactive speech application. The interactive speech application is developed and deployed for use by one or more callers. During execution, the interactive speech application stores, in a log, event information that describes each task carried out by the interactive speech application in response to interaction with the one or more callers. After the log is established, an analytical report is displayed. The report describes selective actions taken by the interactive speech application while executing, and selective actions taken by one or more callers while interacting with the interactive speech application. Information in the analytical report is selected so as to identify one or more potential performance problems in the interactive speech application. The analytical reports are generated based on the information stored in the event logs.

Type: Grant

Filed: September 21, 1999

Date of Patent: August 12, 2003

Assignee: SpeechWorks International, Inc.

Inventors: Mark A. Holthouse, Matthew T. Marx, John N. Nguyen
Segmentation approach for speech recognition systems

Patent number: 6535851

Abstract: Phonetic units are identified in a body of utterance data according to a novel segmentation approach. A body of received utterance data is processed and a set of candidate phonetic unit boundaries is determined that defines a set of candidate phonetic units. The set of candidate phonetic unit boundaries is determined based upon changes in Cepstral coefficient values, changes in utterance energy, changes in phonetic classification, broad category analysis (retroflex, back vowels, front vowels) and sonorant onset detection. The set of candidate phonetic unit boundaries is filtered by priority and proximity to other candidate phonetic units and by silence regions. The set of candidate phonetic units is filtered using no-cross region analysis to generate a set of filtered candidate phonetic units. No-cross region analysis generally involves discarding candidate phonetic units that completely span an energy up, energy down, dip or broad category type no-cross region.

Type: Grant

Filed: March 24, 2000

Date of Patent: March 18, 2003

Assignee: SpeechWorks, International, Inc.

Inventors: Mark Fanty, Michael S. Phillips
Dynamic semantic control of a speech recognition system

Patent number: 6519562

Abstract: A method and apparatus are provided for automatically recognizing words of spoken speech using a computer-based speech recognition system according to a dynamic semantic model. In an embodiment, the speech recognition system recognizes speech and generates one or more word strings, each of which is a hypothesis of the speech, and creates and stores a probability value or score for each of the word strings. The word strings are ordered by probability value. The speech recognition system also creates and stores, for each of the word strings, one or more keyword-value pairs that represent semantic elements and semantic values of the semantic elements for the speech that was spoken. One or more dynamic semantic rules are defined that specify how a probability value of a word string should be modified based on information about external conditions, facts, or the environment of the application in relation to the semantic values of that word string.

Type: Grant

Filed: February 25, 1999

Date of Patent: February 11, 2003

Assignee: Speechworks International, Inc.

Inventors: Michael S. Phillips, Etienne Barnard, Jean-Guy Dahan, Michael J. Metzger
Method and apparatus for dynamic adaptation of a large vocabulary speech recognition system and for use of constraints from a database in a large vocabulary speech recognition system

Patent number: 6501833

Abstract: The lexical network of a large-vocabulary speech recognition system is structured to effectuate the rapid and efficient addition of words to the system's active vocabulary. The lexical network is structured to include Phonetic Constraint Nodes, which organize the inter-word phonetic information in the network, and Word Class Nodes which organize the syntactic semantic information in the network. Network fragments, corresponding to phoneme pronunciations and labeled to specify permitted interconnections to each other and to phonetic constraint nodes, are precompiled to facilitate the rapid generation of pronunciations for new words and thereby enhance the rapid addition of words to the vocabulary even during speech recognition. Functions defined in accordance with linguistic constraints may be utilized during recognition. Different language models and different vocabularies for different portions of a discourse may also be invoked depending, in part, on the discourse history.

Type: Grant

Filed: October 3, 1997

Date of Patent: December 31, 2002

Assignee: SpeechWorks International, Inc.

Inventors: Michael S. Phillips, John N. Nguyen
Automatically determining words for updating in a pronunciation dictionary in a speech recognition system

Patent number: 6434521

Abstract: An approach for automatically determining the accuracy of a pronunciation dictionary in a speech recognition system involves comparing an expected pronunciation representation for a particular word from a pronunciation dictionary to one or more actual pronunciations of the particular word. An accuracy score for each of the phonemes that constitute the pronunciation of the particular word is determined from the comparison of the expected and actual pronunciations for the particular word. The accuracy score is evaluated against specified accuracy criteria to determine whether the expected pronunciation for the particular word satisfies the specified accuracy criteria. If the expected pronunciation does not satisfy the specified accuracy criteria for the particular word, then the expected pronunciation for the particular word in the pronunciation dictionary is identified as requiring updating.

Type: Grant

Filed: June 24, 1999

Date of Patent: August 13, 2002

Assignee: SpeechWorks International, Inc.

Inventor: Etienne Barnard
Method and system of reviewing the behavior of an interactive speech recognition application

Patent number: 6405170

Abstract: A method and apparatus are provided for improving the performance of an interactive speech application. The interactive speech application is developed and deployed for use by one or more callers. During execution, the interactive speech application stores, in a log, event information that describes each task carried out by the interactive speech application in response to interaction with the one or more callers. The application also stores one or more sets of audio information, in which each of the sets of audio information is associated with one or more utterances by one of the callers. Each of the sets of audio information is associated with one of the tasks represented in the log. After the log is established, an analytical report is displayed. The report describes selective actions taken by the interactive speech application while executing, and selective actions taken by one or more callers while interacting with the interactive speech application.

Type: Grant

Filed: September 22, 1998

Date of Patent: June 11, 2002

Assignee: SpeechWorks International, Inc.

Inventors: Michael S. Phillips, Mark A. Fanty, Krishna K. Govindarajan
Method and apparatus for improved speech recognition by modifying a pronunciation dictionary based on pattern definitions of alternate word pronunciations

Patent number: 6389394

Abstract: An approach for automatically modifying a pronunciation dictionary in a speech recognition system based on patterns of alternate pronunciations is described. A representation of the pronunciation dictionary, such as a plurality of dynamically linked phoneme values, is obtained. One or more pattern definitions are obtained. The pattern definitions specify zero or more phonemes to be substituted for zero or more phonemes of all words in the pronunciation dictionary. The linked phoneme values are modified by adding, for each path of each word, alternate paths that use each of the substitute phonemes according to the pattern definitions, thereby creating an expanded set of dynamically linked phoneme values. One or more example pronunciations of a particular word are then obtained. One or more best paths through the expanded set of phoneme values are determined for each of the example pronunciations and used to find the overall best path(s).

Type: Grant

Filed: February 9, 2000

Date of Patent: May 14, 2002

Assignee: SpeechWorks International, Inc.

Inventor: Mark Fanty
Method and apparatus for facilitating speech barge-in in connection with voice recognition systems

Patent number: 6266398

Abstract: A barge-in detector for use in connection with a speech recognition system forms a prompt replica for use in detecting the presence or absence of user input to the system. The replica is indicative of the prompt energy applied to an input of the system. The detector detects the application of user input to the system, even if concurrent with a prompt, and enables the system to quickly respond to the user input.

Type: Grant

Filed: March 12, 1998

Date of Patent: July 24, 2001

Assignee: Speechworks International, Inc.

Inventor: John N. Nguyen
System and method for developing interactive speech applications

Patent number: 6173266

Abstract: Dialogue modules are provided, with each dialogue module includes computer readable instructions for accomplishing a predefined interactive dialogue task in an interactive speech application. In response to user input, a subset of the plurality of dialogue modules are selected to accomplish their respective interactive dialogue tasks in the interactive speech application and are interconnected in an order defining the call flow of the application, and the application is generated. A graphical user interface represents the stored plurality of dialogue modules as icons in a graphical display in which icons for the subset of dialogue modules are selected in the graphical display in response to user input, the icons for the subset of dialogue modules are graphically interconnected into a graphical representation of the call flow of the interactive speech application, and the interactive speech application is generated based upon the graphical representation.

Type: Grant

Filed: May 6, 1998

Date of Patent: January 9, 2001

Assignee: SpeechWorks International, Inc.

Inventors: Matthew T. Marx, Jerry K. Carter, Michael S. Phillips, Mark A. Holthouse, Stephen D. Seabury, Jose L. Elizondo-Cecenas, Brett D. Phaneuf
Apparatus that detects voice energy during prompting by a voice recognition system

Patent number: 6061651

Abstract: A barge-in detector for use in connection with a speech recognition system forms a prompt replica for use in detecting the presence or absence of user input to the system. The replica is indicative of the prompt energy applied to an input of the system. The detector detects the application of user input to the system, even if concurrent with a prompt, and enables the system to quickly respond to the user input.

Type: Grant

Filed: March 12, 1998

Date of Patent: May 9, 2000

Assignee: Speechworks International, Inc.

Inventor: John N. Nguyen
Method and apparatus for continuous spelling speech recognition with early identification

Patent number: 5995928

Abstract: A speech recognition system capable of recognizing a word or a plurality of words based on a continuous spelling of the word(s) by a user. The system includes a speech recognition engine with a decoder running in forward mode such that the recognition engine continuously outputs an updated string of hypothesized letters based on the letters uttered by the user. The system further includes a spelling engine for comparing each string of hypothesized letters to a vocabulary list of words. The spelling engine returns a best match for the string of hypothesized letters. The system may also include an early identification unit for presenting the user with the best matching word(s) possibly before the user has completed spelling the desired word(s).

Type: Grant

Filed: October 2, 1996

Date of Patent: November 30, 1999

Assignee: Speechworks International, Inc.

Inventors: John N. Nguyen, Matthew T. Marx