Patents by Inventor James K. Baker

James K. Baker has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Apparatuses and methods for training and operating speech recognition systems

Patent number: 5850627

Abstract: A word recognition system can: respond to the input of a character string from a user by limiting the words it will recognize to words having a related, but not necessarily the same, string; score signals generated after a user has been prompted to generate a given word against words other than the prompted word to determine if the signal should be used to train the prompted word; vary the number of signals a user is prompted to generate to train a given word as a function of how well the training signals score against each other or prior models for the prompted word; create a new acoustic model of a phrase by concatenating prior acoustic models of the words in the phrase; obtain information from another program running on the same computer, such as its commands or the context of text being entered into it, and use that information to vary which words it can recognize; determine which program unit, such as an application program or dialog box, currently has input focus on its computer and create a vocabulary

Type: Grant

Filed: June 26, 1997

Date of Patent: December 15, 1998

Assignee: Dragon Systems, Inc.

Inventors: Joel M. Gould, Elizabeth E. Steele, Frank J. McGrath, Steven D. Squires, Peter S. Heitman, Joel W. Parke, Dean G. Sturtevant, Jed M. Roberts, James K. Baker
Lexical tree pre-filtering in speech recognition

Patent number: 5822730

Abstract: A speech recognition technique uses lexical tree pre-filtering to obtain lists of words for use in performing speech recognition. The lexical tree pre-filtering includes representing a vocabulary of words using a lexical tree and identifying a first subset of the vocabulary that may correspond to speech spoken beginning at a first time by propagating through the lexical tree information about the speech spoken beginning at the first time. A second subset of the vocabulary that may correspond to speech spoken beginning at a second time is identified by propagating through the lexical tree information about the speech spoken beginning at the second time. Words included in the speech are recognized by comparing speech spoken beginning at the first time with words from the first subset of the vocabulary and speech spoken beginning at the second time with words from the second subset of the vocabulary. The state of the lexical tree is not reset between identifying the first and second subsets.

Type: Grant

Filed: August 22, 1996

Date of Patent: October 13, 1998

Assignee: Dragon Systems, Inc.

Inventors: Robert Roth, James K. Baker, Laurence S. Gillick, Alan Walsh
Speech recognition system for languages with compound words

Patent number: 5754972

Abstract: A system and associated methods for recognizing compound words from an utterance containing a succession of one or more words from a predetermined vocabulary. At least one of the words in the utterance is a compound word including at least two formatives in succession, wherein those formatives are words in the vocabulary.

Type: Grant

Filed: May 10, 1995

Date of Patent: May 19, 1998

Assignee: Dragon Systems, Inc.

Inventors: James K. Baker, Paul G. Bamberg, Jed M. Roberts, Caroline B. Huang, Stijn VanEven, Claudia L. E. Ellermann
Systems and methods for word recognition

Patent number: 5680511

Abstract: In one aspect, the invention provides word recognition systems that operate to recognize an unrecognized or ambiguous word that occurs within a passage of words. The system can offer several words as choice words for inserting into the passage to replace the unrecognized word. The system can select the best choice word by using the choice word to extract from a reference source, sample passages of text that relate to the choice word. For example, the system can select the dictionary passage that defines the choice word. The system then compares the selected passage to the current passage, and generates a score that indicates the likelihood that the choice word would occur within that passage of text. The system can select the choice word with the best score to substitute into the passage. The passage of words being analyzed can be any word sequence including an utterance, a portion of handwritten text, a portion of typewritten text or other such sequence of words, numbers and characters.

Type: Grant

Filed: June 7, 1995

Date of Patent: October 21, 1997

Assignee: Dragon Systems, Inc.

Inventors: Janet M. Baker, Laurence S. Gillick, James K. Baker, Jonathan P. Yamron
Stackable container with recessed hinged lid and hinge means therefor

Patent number: 5452847

Abstract: A rectangular stackable container, such as a tote box, having a lid or cover which is hingedly-attached at the upper rim area along one or both sides of the bin portion of the container. The rim area of the bin portion contains a rim-reinforcing wire, portions of which are exposed and releasably engaged by novel hinge means which are fastened to the lid and support the lid recessed slightly below the level of the rim when the lid is in closed position. The hinge means preferably are extruded from polyvinyl chloride.

Type: Grant

Filed: December 22, 1994

Date of Patent: September 26, 1995

Assignee: United States Corrulite Corporation

Inventors: Stephen Harris, James K. Baker, Eugene Dow
Apparatus and methods for training speech recognition systems and their users and otherwise improving speech recognition performance

Patent number: 5428707

Abstract: A tutorial instructs how to use a word recognition system, such as one for speech recognition. It specifies a set of allowed response words for each of a plurality of states. It sends messages on how to use the recognizer in certain states, and, in others, presents exercises in which the user is to enter signals representing expected words. It scores each such signal against word models to select which response word corresponds to it, and then advances to a state associated with that selected response. This scoring is performed against a large vocabulary even though only a small number of responses are allowed, and the signal is rejected if too many non-allowed words score better than any allowed word. The system comes with multiple sets of standard signal models; it scores each against a given user's signals, selects the set which scores best, and then performs adaptive and batch training upon that set.

Type: Grant

Filed: November 13, 1992

Date of Patent: June 27, 1995

Assignee: Dragon Systems, Inc.

Inventors: Joel M. Gould, Elizabeth E. Steele, James K. Baker
Method for interactive speech recognition and training

Patent number: 5027406

Abstract: A method for creating word models for a large vocabulary, natural language dictation system. A user with limited typing skills can create documents with little or no advance training of word models. As the user is dictating, the user speaks a word which may or may not already be in the active vocabulary. The system displays a list of the words in the active vocabulary which best match the spoken word. By keyboard or voice command, the user may choose the correct word from the list or may choose to edit a similar word if the correct word is not on the list. Alternately, the user may type or speak the initial letters of the word. Then the recognition algorithm is called again satisfying the initial letters, and the choices displayed again. A word list is then also displayed from a large backup vocabulary. The best words to display from the backup vocabulary are chosen using a statistical language model and optionally word models derived from a phonemic dictionary.

Type: Grant

Filed: December 6, 1988

Date of Patent: June 25, 1991

Assignee: Dragon Systems, Inc.

Inventors: Jed Roberts, James K. Baker, Edward W. Porter
Method for representing word models for use in speech recognition

Patent number: 4903305

Abstract: A method is provided for deriving acoustic word representations for use in speech recognition. Initial word models are created, each formed of a sequence of acoustic sub-models. The acoustic sub-models from a plurality of word models are clustered, so as to group acoustically similar sub-models from different words, using, for example, the Kullback-Leibler information as a metric of similarity. Then each word is represented by cluster spelling representing the clusters into which its acoustic sub-models were placed by the clustering. Speech recognition is performed by comparing sequences of frames from speech to be recognized against sequences of acoustic models associated with the clusters of the cluster spelling of individual word models.

Type: Grant

Filed: March 23, 1989

Date of Patent: February 20, 1990

Assignee: Dragon Systems, Inc.

Inventors: Laurence Gillick, Dean Sturtevant, Robert S. Roth, James K. Baker, Janet M. Baker
Interactive speech recognition apparatus

Patent number: 4866778

Abstract: A speech recognition system which can perform multiple recognition passes on each word. If the recognizer is correct in its first pass, the operator may abort later passes by either pressing a key or speaking the next word. Otherwise, the operator may either wait for a second recognition pass to be performed against a larger vocabulary, or may specify one or more initial letters causing the second recognition pass to be performed against a vocabulary substantially restricted to words starting with those initial letters. Each time the user adds an additional letter to the initial string, any previous recognition is aborted and the re-recognition process is started anew with the new string. If the user types a control character after the initial string, then the string itself is used as the output of the recognizer. In one embodiment, a language model limits a relatively small vocabulary used in the first pass to the words most likely to occur given the language context of the dictated word.

Type: Grant

Filed: August 11, 1986

Date of Patent: September 12, 1989

Assignee: Dragon Systems, Inc.

Inventor: James K. Baker
Method for creating and using multiple-word sound models in speech recognition

Patent number: 4837831

Abstract: A first speech recognition method receives an acoustic description of an utterance to be recognized and scores a portion of that description against each of a plurality of cluster models representing similar sounds from different words. The resulting score for each cluster is used to calculate a word score for each word represented by that cluster. Preferably these word scores are used to prefilter vocabulary words, and the description of the utterance includes a succession of acoustic decriptions which are compared by linear time alignment against a succession of acoustic models. A second speech recognition method is also provided which matches an acoustic model with each of a succession of acoustic descriptions of an utterance to be recognized. Each of these models has a probability score for each vocabulary word. The probability scores for each word associated with the matching acoustic models are combined to form a total score for that word.

Type: Grant

Filed: October 15, 1986

Date of Patent: June 6, 1989

Assignee: Dragon Systems, Inc.

Inventors: Laurence Gillick, Paul G. Bamberg, James K. Baker, Robert S. Roth
Method for speech recognition

Patent number: 4805219

Abstract: A method determines if a portion of speech corresponds to a speech pattern by time aligning both the speech and a plurality of speech pattern models against a common time-aligning model. This compensates for speech variation between the speech and the pattern models. The method then compares the resulting time-aligned speech model against the resulting time-aligned pattern models to determine which of the patterns most probably corresponds to the speech. Preferably there are a plurality of time-aligning models, each representing a group of somewhat similar sound sequences which occur in different words. Each of these time-aligning models is scored for similarity against a portion of speech, and the time-aligned speech model and time-aligned pattern models produced by time alignment with the best scoring time-aligning model are compared to determine the likelihood that each speech pattern corresponds to the portion of speech. This is performed for each successive portion of speech.

Type: Grant

Filed: April 3, 1987

Date of Patent: February 14, 1989

Assignee: Dragon Systems, Inc.

Inventors: James K. Baker, Laurence Gillick
Method for speech analysis and speech recognition

Patent number: 4805218

Abstract: A method of speech analysis calculates one or more difference parameters for each of a sequence of acoustic frames, where each difference parameter is a function of the difference between an acoustic parameter in one frame and an acoustic parameter in a nearby frame. The method is used in speech recognition which compares the difference parameters of each frame against acoustic models representing speech units, where each speech-unit model has a model of the difference parameters associated with the frames of its speech unit. The difference parameters can be slope parameters or energy difference parameters. Slope parameters are derived by finding the difference between the energy of a given spectral parameter of a given frame and the energy, in a nearby frame, of a spectral parameter associated with a different frequency band. The resulting parameter indicates the extent to which the frequency of energy in the part of the spectrum represented by the given parameter is going up or going down.

Type: Grant

Filed: April 3, 1987

Date of Patent: February 14, 1989

Assignee: Dragon Systems, Inc.

Inventors: Paul G. Bamberg, James K. Baker, Laurence Gillick, Robert S. Roth
Speech recognition method

Patent number: 4803729

Abstract: Smoothed frame labeling associates phonetic frame labels with a given speech frame as a function of (a) the closeness with which the given frame compares to each of a plurality of acoustic models, (b) which frame labels correspond with a neighboring frame, and (c) transition probabilities which indicate, for the frame labels associated with the neighboring frame, which frame labels are probably associated with the given frame. The smoothed frame labeling is used to divide the speech into segments of frames having the same class of labels. The invention represents words as a collection of known diphone models, each of which models the sound before and after a boundary between segments derived by the smoothed frame labeling.

Type: Grant

Filed: April 3, 1987

Date of Patent: February 7, 1989

Assignee: Dragon Systems, Inc.

Inventor: James K. Baker
Speech recognition apparatus and method

Patent number: 4783803

Abstract: A system is disclosed for recognizing a pattern in a collection of data given a context of one or more other patterns previously identified. Preferably the system is a speech recognition system, the patterns are words and the collection of data is a sequence of acoustic frames. During the processing of each of a plurality of frames, for each word in an active vocabulary, the system updates a likelihood score representing a probability of a match between the word and the frame, combines a language model score based on one or more previously recognized words with that likelihood score, and prunes the word from the active vocabulary if the combined score is below a threshold. A rapid match is made between the frames and each word of an initial vocabulary to determine which words should originally be placed in the active vocabulary.

Type: Grant

Filed: November 12, 1985

Date of Patent: November 8, 1988

Assignee: Dragon Systems, Inc.

Inventors: James K. Baker, Paul G. Bamberg, Mark F. Sidell, Robert S. Roth
Speech recognition training method

Patent number: 4718088

Abstract: A speech recognition method and apparatus employ a speech processing circuitry for repetitively deriving from a speech input, at a frame repetition rate, a plurality of acoustic parameters. The acoustic parameters represent the speech input signal for a frame time. A plurality of template matching and cost processing circuitries are connected to a system bus, along with the speech processing circuitry, for determining, or identifying, the speech units in the input speech, by comparing the acoustic parameters with stored template patterns. The apparatus can be expanded by adding more template matching and cost processing circuitry to the bus thereby increasing the speech recognition capacity of the apparatus. Template pattern generation is advantageously aided by using a "joker" word to specify the time boundaries of utterances spoken in isolation, by finding the beginning and ending of an utterance surrounded by silence.

Type: Grant

Filed: March 27, 1984

Date of Patent: January 5, 1988

Assignee: Exxon Research and Engineering Company

Inventors: James K. Baker, John W. Klovstad, Chin-Hui Lee, Kalyan Ganesan
Speech recognition method

Patent number: 4713778

Abstract: A speech recognition method and apparatus employ a speech processing circuitry for repetitively deriving from a speech input, at a frame repetition rate, a plurality of acoustic parameters. The acoustic parameters represent the speech input signal for a frame time. A plurality of template matching and cost processing circuitries are connected to a system bus, along with the speech processing circuitry, for determining, or identifying, the speech units in the input speech, by comparing the acoustic parameters with stored template patterns. The apparatus can be expanded by adding more template matching and cost processing circuitry to the bus thereby increasing the speech recognition capacity of the apparatus. The template matching and cost processing circuitries provide distributed processing, on demand, of the acoustic parameters for generating through a dynamic programming technique the recognition decision.

Type: Grant

Filed: March 27, 1984

Date of Patent: December 15, 1987

Assignee: Exxon Research and Engineering Company

Inventor: James K. Baker
Parallel pattern verifier with dynamic time warping

Patent number: 4348553

Abstract: A speech recognition system is disclosed which employs a network of elementary local decision modules for matching an observed time-varying speech pattern against all possible time warpings of the stored prototype patterns. For each elementary speech segment, an elementary recognizer provides a score indicating the degree of correlation of the input speech segment with stored spectral patterns. Each local decision module receives the results of the elementary recognizer and, at the same time, receives an input from selected ones of the other local decision modules. Each local decision module specializes in a particular node in the network wherein each node matches the probability of how well the input segment of speech matches the particular sound segments in the sounds of the words spoken. Each local decision module takes the prior decisions of all preceding sound segments which are input from the other local decision modules and makes a selection of the locally optimum time warping to be permitted.

Type: Grant

Filed: July 2, 1980

Date of Patent: September 7, 1982

Assignee: International Business Machines Corporation

Inventors: James K. Baker, Janet M. Baker

prev … 4 5 6 7 8