Patents Assigned to SpeechWorks International, Inc.
  • Patent number: 7571100
    Abstract: Processing a speech utterance by communicating between a local computer and a remote computer using a hyper text communication session. The local computer sends a recording of a speech utterance to the remote computer in the session, and receives a result from the remote computer, the result based on a processing of the recording at the remote computer.
    Type: Grant
    Filed: December 3, 2002
    Date of Patent: August 4, 2009
    Assignee: Speechworks International, Inc.
    Inventors: Philip Lenir, Andrew Hunt, Francois Corriveau
  • Patent number: 7149688
    Abstract: An approach to multi-lingual speech recognition that permits different words in an utterance to be from different languages. Words from different languages are represented using different sets of sub-word units that are each associated with the corresponding language. Despite the use of different sets of sub-word units, the approach enables use of cross-word context at boundaries between words from different languages (cross-language context) to select appropriate variants of the sub-word units to match the context.
    Type: Grant
    Filed: November 4, 2002
    Date of Patent: December 12, 2006
    Assignee: SpeechWorks International, Inc.
    Inventor: Johan Schalkwyk
  • Patent number: 6988069
    Abstract: An arrangement is provided for generating a reduced unit database of a desired size to be used in text to speech operations. A reduced unit database with a desired size is generated based on a full unit database. The reduction is carried out with respect to a text database with a plurality of sentences. Units from the full database are pruned to minimize an overall cost associated with using alternative units other than the units in the reduced unit database.
    Type: Grant
    Filed: January 31, 2003
    Date of Patent: January 17, 2006
    Assignee: Speechworks International, Inc.
    Inventor: Michael Stuart Phillips
  • Patent number: 6961704
    Abstract: An arrangement is provided for text to speech processing based on linguistic prosodic models. Linguistic prosodic models are established to characterize different linguistic prosodic characteristics. When an input text is received, a target unit sequence is generated with a linguistic target that annotates target units in the target unit sequence with a plurality of linguistic prosodic characteristics so that speech synthesized in accordance with the target unit sequence and the linguistic target has certain desired prosodic properties. A unit sequence is selected in accordance with the target unit sequence and the linguistic target based on joint cost information evaluated using established linguistic prosodic models. The selected unit sequence is used to produce synthesized speech corresponding to the input text.
    Type: Grant
    Filed: January 31, 2003
    Date of Patent: November 1, 2005
    Assignee: Speechworks International, Inc.
    Inventors: Michael S. Phillips, Daniel S. Faulkner, Marek A. Przezdzieci
  • Patent number: 6789062
    Abstract: A telephone-based interactive speech recognition system is retrained using variable weighting and incremental retraining. Variable weighting involves changing the relative influence of particular measurement data to be reflected in a statistical model. Statistical model data is determined based upon an initial set of measurement data determined from an initial set of speech utterances. When new statistical model data is to be generated to reflect new measurement data determined from new speech utterances, a weighting factor is applied to the new measurement data to generate weighted new measurement data. The new statistical model data is then determined based upon the initial set of measurement data and the weighted new measurement data. Incremental retraining involves generating new statistical model data using prior statistical model data to reduce the amount of prior measurement data that must be maintained and processed.
    Type: Grant
    Filed: February 25, 2000
    Date of Patent: September 7, 2004
    Assignee: SpeechWorks International, Inc.
    Inventors: Michael S. Phillips, Krishna K. Govindarajan, Mark Fanty, Etienne Barnard
  • Patent number: 6785365
    Abstract: A barge-in detector for use in connection with a speech recognition system forms a prompt replica for use in detecting the presence or absence of user input to the system. The replica is indicative of the prompt energy applied to an input of the system. The detector detects the application of user input to the system, even if concurrent with a prompt, and enables the system to quickly respond to the user input.
    Type: Grant
    Filed: July 24, 2001
    Date of Patent: August 31, 2004
    Assignee: Speechworks International, Inc.
    Inventor: John N. Nguyen
  • Publication number: 20040006465
    Abstract: A method and apparatus are provided for automatically recognizing words of spoken speech using a computer-based speech recognition system according to a dynamic semantic model. In an embodiment, the speech recognition system recognizes speech and generates one or more word strings, each of which is a hypothesis of the speech, and creates and stores a probability value or score for each of the word strings. The word strings are ordered by probability value. The speech recognition system also creates and stores, for each of the word strings, one or more keyword-value pairs that represent semantic elements and semantic values of the semantic elements for the speech that was spoken. One or more dynamic semantic rules are defined that specify how a probability value of a word string should be modified based on information about external conditions, facts, or the environment of the application in relation to the semantic values of that word string.
    Type: Application
    Filed: February 10, 2003
    Publication date: January 8, 2004
    Applicant: Speechworks International, Inc., a Delaware Corporation
    Inventors: Michael S. Phillips, Etienne Barnard, Jean-Guy Dahan, Michael J. Metzger
  • Patent number: 6629075
    Abstract: A speech recognition system includes a user interface configured to provide signals indicative of a user's speech. A speech recognizer of the system includes a processor configured to use the signals from the user interface to perform speech recognition operations to attempt to recognize speech indicated by the signals. A control mechanism is coupled to the voice recognizer and is configured to affect processor usage for speech recognition operations in accordance with a loading of the processor.
    Type: Grant
    Filed: June 9, 2000
    Date of Patent: September 30, 2003
    Assignee: SpeechWorks International, Inc.
    Inventor: Johan Schalkwyk
  • Patent number: 6606598
    Abstract: A method and apparatus are disclosed for computing and reporting statistical information that describes the performance of an interactive speech application. The interactive speech application is developed and deployed for use by one or more callers. During execution, the interactive speech application stores, in a log, event information that describes each task carried out by the interactive speech application in response to interaction with the one or more callers. After the log is established, an analytical report is displayed. The report describes selective actions taken by the interactive speech application while executing, and selective actions taken by one or more callers while interacting with the interactive speech application. Information in the analytical report is selected so as to identify one or more potential performance problems in the interactive speech application. The analytical reports are generated based on the information stored in the event logs.
    Type: Grant
    Filed: September 21, 1999
    Date of Patent: August 12, 2003
    Assignee: SpeechWorks International, Inc.
    Inventors: Mark A. Holthouse, Matthew T. Marx, John N. Nguyen
  • Patent number: 6535851
    Abstract: Phonetic units are identified in a body of utterance data according to a novel segmentation approach. A body of received utterance data is processed and a set of candidate phonetic unit boundaries is determined that defines a set of candidate phonetic units. The set of candidate phonetic unit boundaries is determined based upon changes in Cepstral coefficient values, changes in utterance energy, changes in phonetic classification, broad category analysis (retroflex, back vowels, front vowels) and sonorant onset detection. The set of candidate phonetic unit boundaries is filtered by priority and proximity to other candidate phonetic units and by silence regions. The set of candidate phonetic units is filtered using no-cross region analysis to generate a set of filtered candidate phonetic units. No-cross region analysis generally involves discarding candidate phonetic units that completely span an energy up, energy down, dip or broad category type no-cross region.
    Type: Grant
    Filed: March 24, 2000
    Date of Patent: March 18, 2003
    Assignee: SpeechWorks International, Inc.
    Inventors: Mark Fanty, Michael S. Phillips
  • Patent number: 6519562
    Abstract: A method and apparatus are provided for automatically recognizing words of spoken speech using a computer-based speech recognition system according to a dynamic semantic model. In an embodiment, the speech recognition system recognizes speech and generates one or more word strings, each of which is a hypothesis of the speech, and creates and stores a probability value or score for each of the word strings. The word strings are ordered by probability value. The speech recognition system also creates and stores, for each of the word strings, one or more keyword-value pairs that represent semantic elements and semantic values of the semantic elements for the speech that was spoken. One or more dynamic semantic rules are defined that specify how a probability value of a word string should be modified based on information about external conditions, facts, or the environment of the application in relation to the semantic values of that word string.
    Type: Grant
    Filed: February 25, 1999
    Date of Patent: February 11, 2003
    Assignee: Speechworks International, Inc.
    Inventors: Michael S. Phillips, Etienne Barnard, Jean-Guy Dahan, Michael J. Metzger
  • Patent number: 6501833
    Abstract: The lexical network of a large-vocabulary speech recognition system is structured to effectuate the rapid and efficient addition of words to the system's active vocabulary. The lexical network is structured to include Phonetic Constraint Nodes, which organize the inter-word phonetic information in the network, and Word Class Nodes which organize the syntactic semantic information in the network. Network fragments, corresponding to phoneme pronunciations and labeled to specify permitted interconnections to each other and to phonetic constraint nodes, are precompiled to facilitate the rapid generation of pronunciations for new words and thereby enhance the rapid addition of words to the vocabulary even during speech recognition. Functions defined in accordance with linguistic constraints may be utilized during recognition. Different language models and different vocabularies for different portions of a discourse may also be invoked depending, in part, on the discourse history.
    Type: Grant
    Filed: October 3, 1997
    Date of Patent: December 31, 2002
    Assignee: SpeechWorks International, Inc.
    Inventors: Michael S. Phillips, John N. Nguyen
  • Patent number: 6434521
    Abstract: An approach for automatically determining the accuracy of a pronunciation dictionary in a speech recognition system involves comparing an expected pronunciation representation for a particular word from a pronunciation dictionary to one or more actual pronunciations of the particular word. An accuracy score for each of the phonemes that constitute the pronunciation of the particular word is determined from the comparison of the expected and actual pronunciations for the particular word. The accuracy score is evaluated against specified accuracy criteria to determine whether the expected pronunciation for the particular word satisfies the specified accuracy criteria. If the expected pronunciation does not satisfy the specified accuracy criteria for the particular word, then the expected pronunciation for the particular word in the pronunciation dictionary is identified as requiring updating.
    Type: Grant
    Filed: June 24, 1999
    Date of Patent: August 13, 2002
    Assignee: SpeechWorks International, Inc.
    Inventor: Etienne Barnard
  • Patent number: 6405170
    Abstract: A method and apparatus are provided for improving the performance of an interactive speech application. The interactive speech application is developed and deployed for use by one or more callers. During execution, the interactive speech application stores, in a log, event information that describes each task carried out by the interactive speech application in response to interaction with the one or more callers. The application also stores one or more sets of audio information, in which each of the sets of audio information is associated with one or more utterances by one of the callers. Each of the sets of audio information is associated with one of the tasks represented in the log. After the log is established, an analytical report is displayed. The report describes selective actions taken by the interactive speech application while executing, and selective actions taken by one or more callers while interacting with the interactive speech application.
    Type: Grant
    Filed: September 22, 1998
    Date of Patent: June 11, 2002
    Assignee: SpeechWorks International, Inc.
    Inventors: Michael S. Phillips, Mark A. Fanty, Krishna K. Govindarajan
  • Patent number: 6389394
    Abstract: An approach for automatically modifying a pronunciation dictionary in a speech recognition system based on patterns of alternate pronunciations is described. A representation of the pronunciation dictionary, such as a plurality of dynamically linked phoneme values, is obtained. One or more pattern definitions are obtained. The pattern definitions specify zero or more phonemes to be substituted for zero or more phonemes of all words in the pronunciation dictionary. The linked phoneme values are modified by adding, for each path of each word, alternate paths that use each of the substitute phonemes according to the pattern definitions, thereby creating an expanded set of dynamically linked phoneme values. One or more example pronunciations of a particular word are then obtained. One or more best paths through the expanded set of phoneme values are determined for each of the example pronunciations and used to find the overall best path(s).
    Type: Grant
    Filed: February 9, 2000
    Date of Patent: May 14, 2002
    Assignee: SpeechWorks International, Inc.
    Inventor: Mark Fanty
  • Patent number: 6266398
    Abstract: A barge-in detector for use in connection with a speech recognition system forms a prompt replica for use in detecting the presence or absence of user input to the system. The replica is indicative of the prompt energy applied to an input of the system. The detector detects the application of user input to the system, even if concurrent with a prompt, and enables the system to quickly respond to the user input.
    Type: Grant
    Filed: March 12, 1998
    Date of Patent: July 24, 2001
    Assignee: Speechworks International, Inc.
    Inventor: John N. Nguyen
  • Patent number: 6173266
    Abstract: Dialogue modules are provided, each of which includes computer readable instructions for accomplishing a predefined interactive dialogue task in an interactive speech application. In response to user input, a subset of the plurality of dialogue modules are selected to accomplish their respective interactive dialogue tasks in the interactive speech application and are interconnected in an order defining the call flow of the application, and the application is generated. A graphical user interface represents the stored plurality of dialogue modules as icons in a graphical display in which icons for the subset of dialogue modules are selected in the graphical display in response to user input, the icons for the subset of dialogue modules are graphically interconnected into a graphical representation of the call flow of the interactive speech application, and the interactive speech application is generated based upon the graphical representation.
    Type: Grant
    Filed: May 6, 1998
    Date of Patent: January 9, 2001
    Assignee: SpeechWorks International, Inc.
    Inventors: Matthew T. Marx, Jerry K. Carter, Michael S. Phillips, Mark A. Holthouse, Stephen D. Seabury, Jose L. Elizondo-Cecenas, Brett D. Phaneuf
  • Patent number: 6061651
    Abstract: A barge-in detector for use in connection with a speech recognition system forms a prompt replica for use in detecting the presence or absence of user input to the system. The replica is indicative of the prompt energy applied to an input of the system. The detector detects the application of user input to the system, even if concurrent with a prompt, and enables the system to quickly respond to the user input.
    Type: Grant
    Filed: March 12, 1998
    Date of Patent: May 9, 2000
    Assignee: Speechworks International, Inc.
    Inventor: John N. Nguyen
  • Patent number: 5995928
    Abstract: A speech recognition system capable of recognizing a word or a plurality of words based on a continuous spelling of the word(s) by a user. The system includes a speech recognition engine with a decoder running in forward mode such that the recognition engine continuously outputs an updated string of hypothesized letters based on the letters uttered by the user. The system further includes a spelling engine for comparing each string of hypothesized letters to a vocabulary list of words. The spelling engine returns a best match for the string of hypothesized letters. The system may also include an early identification unit for presenting the user with the best matching word(s) possibly before the user has completed spelling the desired word(s).
    Type: Grant
    Filed: October 2, 1996
    Date of Patent: November 30, 1999
    Assignee: Speechworks International, Inc.
    Inventors: John N. Nguyen, Matthew T. Marx
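
Illustrative sketch (not taken from any patent text): the abstract of patent 5995928 above describes a spelling engine that compares a growing string of hypothesized letters to a vocabulary list, plus an early identification unit that can present a match before the user finishes spelling. The Python below is a minimal sketch of that idea only. It assumes error-free letter hypotheses and plain prefix matching, which is a simplification; the function names `best_matches` and `early_identify` and the sample vocabulary are hypothetical and do not come from the patent.

```python
def best_matches(hypothesis, vocabulary):
    """Return the vocabulary words that start with the hypothesized letters.

    Assumption: exact, case-insensitive prefix matching stands in for the
    patent's "best match" comparison, which would need to tolerate
    recognition errors.
    """
    prefix = hypothesis.lower()
    return [word for word in vocabulary if word.lower().startswith(prefix)]


def early_identify(letters, vocabulary):
    """Feed letters one at a time and stop once exactly one word remains.

    Returns (word, letters_used). If no unique match is found, returns
    (None, number_of_letters_consumed).
    """
    hypothesis = ""
    for count, letter in enumerate(letters, start=1):
        hypothesis += letter
        candidates = best_matches(hypothesis, vocabulary)
        if len(candidates) == 1:
            # Unique match: report it without waiting for the remaining letters.
            return candidates[0], count
    return None, len(letters)


if __name__ == "__main__":
    vocabulary = ["boston", "bogota", "berlin"]  # hypothetical word list
    print(early_identify("boston", vocabulary))
    print(early_identify("berlin", vocabulary))
```

With this sample vocabulary, spelling "boston" is resolved after the third letter, because "bos" no longer matches "bogota". A real system would score inexact matches rather than require an exact prefix.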