Dynamic Time Warping Patents (Class 704/241)
  • Patent number: 6195636
    Abstract: In a system in which user equipment is connected to a packet network and a speech recognition application server is also connected to the packet network for performing speech recognition on speech data from the user equipment, a speech recognition system selectively performs feature extraction at a user end before transmitting speech data to be recognized. The feature extraction is performed only for speech which is to be recognized.
    Type: Grant
    Filed: February 19, 1999
    Date of Patent: February 27, 2001
    Assignee: Texas Instruments Incorporated
    Inventors: Joseph A. Crupi, Zoran Mladenovic, Edward B. Morgan, Bogdan R. Kosanovic, Negendra Kumar
  • Patent number: 6157911
    Abstract: A method and a system substantially eliminates an erroneous voice recognition of repetitive elements in word spotting. One preferred embodiment according to the current invention eliminates erroneous voice recognition of repetitive elements by selectively prolonging a response time of words containing repetitive elements. In order to substantially eliminate the errors, in another preferred embodiment according to the current invention, words containing repetitive elements are marked by a silent key word.
    Type: Grant
    Filed: March 27, 1998
    Date of Patent: December 5, 2000
    Assignee: Ricoh Company, Ltd.
    Inventor: Masaru Kuroda
  • Patent number: 6134527
    Abstract: A method of testing a new vocabulary word is performed using any set of enrollment utterances provided by the user or from an available database. The present method preferably does not use separate training and similarity test utterances. This allows any or all available repetitions of a vocabulary word being enrolled to be used for training (204), therefore improving the robustness of the trained models. Likewise, any or all training repetitions can also be utilized for similarity analysis (212), providing additional test samples which should further improve the detection of acoustically similar words. Additionally, the similarity analysis progresses incrementally and does not need to continue if a confusable word is found. Finally, first and second thresholds could be employed (212, 302) to provide greater flexibility for a user training a speech recognition system.
    Type: Grant
    Filed: January 30, 1998
    Date of Patent: October 17, 2000
    Assignee: Motorola, Inc.
    Inventors: Jeffrey Arthur Meunier, Edward Srenger, Steven Albrecht
  • Patent number: 6023676
    Abstract: A keyword recognition system for speaker dependent, dynamic time warping (DTW) recognition systems uses all of the trained word templates in the system, (keyword and vocabulary), to determine if an utterance is a keyword utterance or not. The utterance is selected as the keyword if a keyword score indicates a significant match to the keyword template and if the keyword score indicates a better match than do the entirety of scores to the vocabulary word templates.
    Type: Grant
    Filed: December 12, 1996
    Date of Patent: February 8, 2000
    Assignee: DSPC Israel, Ltd.
    Inventor: Adoram Erell
  • Patent number: 5987411
    Abstract: Methods and systems consistent with the present invention enroll a candidate phrase uttered by a user in a dictionary having at least one previously enrolled phrase. The system receives utterances of the candidate phrase and determines whether the first utterance is confusingly similar to a previously enrolled phrase and whether they are consistent with each other. The system then enrolls the candidate phrase in the dictionary according to these determinations.
    Type: Grant
    Filed: December 17, 1997
    Date of Patent: November 16, 1999
    Assignee: Northern Telecom Limited
    Inventors: Marco Petroni, Hung S. Ma
  • Patent number: 5960395
    Abstract: A method for matching an input pattern with a number of stored reference patterns using a dynamic programming matching technique is described. The reference patterns of a reference signal which are at the end of a dynamic programming path for a current input pattern are listed in an active list. The dynamic programming paths are propagated by processing the reference patterns on the active list, and a new active list is generated for the succeeding input pattern. The amount of processing required for each pattern on the active list is reduced by using a pointer which identifies the reference pattern which is the earliest in the sequence of patterns of the current reference signal listed on the new active list during the processing of a preceding dynamic programming path. In a second aspect, a speech recognition interface is used as a control system for a telephony system.
    Type: Grant
    Filed: February 6, 1997
    Date of Patent: September 28, 1999
    Assignee: Canon Kabushiki Kaisha
    Inventor: Eli Tzirkel-Hancock
  • Patent number: 5956678
    Abstract: In the recognition of coherently spoken words, a plurality of hypotheses is usually built up which end in various words during the recognition process and are then to be continued with further words. To keep the number of words yet to be continued as small as possible, especially in the case of a large vocabulary, it is known to carry out a look-ahead in a limited time space. It is suggested according to the invention to use the same phonemes for the look-ahead as for the actual recognition and to add together the differential sums obtained in the look-ahead for the evaluation of the partial hypothesis which has just ended and which is to be continued, and to compare this sum with a threshold value which depends on the extrapolated minimum total evaluation at the end of the time space of the look-ahead. The searching space for hypotheses to be continued can be limited by this in a particularly favorable manner.
    Type: Grant
    Filed: April 17, 1995
    Date of Patent: September 21, 1999
    Assignee: U.S. Philips Corporation
    Inventors: Reinhold Hab-Umbach, Hermann Ney
  • Patent number: 5933808
    Abstract: A system that synchronously segments a speech waveform using pitch period and a center of the pitch waveform. The pitch waveform center is determined by finding a local minimum of a centroid histogram waveform of the low-pass filtered speech waveform for one pitch period. The speech waveform can then be represented by one or more of such pitch waveforms or segments during speech compression, reconstruction or synthesis. The pitch waveform can be modified by frequency enhancement/filtering, waveform stretching/shrinking in speech synthesis or speech disguise. The utterance rate can also be controlled to speed up or slow down the speech.
    Type: Grant
    Filed: November 7, 1995
    Date of Patent: August 3, 1999
    Assignee: The United States of America as represented by the Secretary of the Navy
    Inventors: George S. Kang, Lawrence J. Fransen
  • Patent number: 5909665
    Abstract: To construct an inexpensive speech recognition system, a speech recogntion system includes an analyzing unit for extracting a sound, sequentially dividing the sound into a plurality of frames, converting each of the frames sequentially to first data, and sequentially storing the first data to an input pattern memory, a distance calculating unit for reading a predetermined number of the first data from the input pattern memory, reading one of second data from a standard pattern memory, calculating first distances between each of the predetermined number of the first data and the one of the second data, and a judging unit for judging a word representing the sound based on the first distances.
    Type: Grant
    Filed: May 29, 1997
    Date of Patent: June 1, 1999
    Assignee: NEC Corporation
    Inventor: Yasuko Kato
  • Patent number: 5907825
    Abstract: A method for determining the location of a pattern, when input in isolation, within a representative input signal is provided. The method aligns the input signal with a signal representative of a plurality of connected patterns, one of which is the same as the pattern within the input signal. The method then determines the location from the results of the aligning step. The location determined using this apparatus can be used to determine an isolated reference model by extracting features of the input signal from the location found. This isolated reference model can then be used to generate a continuous reference model for the pattern, by aligning the isolated reference model with the signals representative of a plurality of connected patterns, one of which is the pattern to be modelled.
    Type: Grant
    Filed: February 6, 1997
    Date of Patent: May 25, 1999
    Assignee: Canon Kabushiki Kaisha
    Inventor: Eli Tzirkel-Hancock
  • Patent number: 5854999
    Abstract: Compensatory values for compensating a reference pattern to match with an utterance environment of an input speech are employed for determining an environmental variation index to be input to a secondary matching controller, which is responsible for magnitudes of the index smaller than a threshold to hold a second matching section inoperative so that a recognition result of a primary matching of a previous compensated reference pattern is output, and for magnitudes of the index larger than the threshold to operate the second matching section to output a recognition result of a second matching based on a current compensated reference pattern to be stored as a subsequent reference pattern.
    Type: Grant
    Filed: June 24, 1996
    Date of Patent: December 29, 1998
    Assignee: NEC Corporation
    Inventor: Hiroshi Hirayama
  • Patent number: 5809465
    Abstract: A pattern recognition method of dynamic time warping of two sequences of feature sets onto each other is provided. The method includes the steps of creating a rectangular graph having the two sequences on its two axes, defining a swath of width r, where r is an odd number, centered about a diagonal line connecting the beginning point at the bottom left of the rectangle to the endpoint at the top right of the rectangle and also defining r-1 lines within the swath. The lines defining the swath are parallel to the diagonal line. Each array element k of an r-sized array is associated with a separate array of the r lines within the swath and for each row of the rectangle, the dynamic time warping method recursively generates new path values for each array element k as a function of the previous value of the array element k and of at least one of the current values of the two neighboring array elements k-1 and k+1 of the array element k.
    Type: Grant
    Filed: March 29, 1996
    Date of Patent: September 15, 1998
    Assignee: Advanced Recognition Technology
    Inventors: Gabriel Ilan, Jacob Goldberger
  • Patent number: 5799275
    Abstract: A speech recognition system automatically designates a scope of a partial reference pattern. Plural reference patterns, each of which ends in each of composing frames and starts from a preceding frame, are supposed and cumulative distances at every frame are calculated. A partial reference pattern that has a minimal distance value as compared with all other partial reference patterns is taken as a partial input speech recognizing result.
    Type: Grant
    Filed: June 18, 1996
    Date of Patent: August 25, 1998
    Assignees: The Japan Iron and Steel Federation, Sharp Kabushiki Kaisha, Real World Computing Partnership
    Inventors: Yoshiaki Itoh, Jiro Kiyama, Hiroshi Kojima, Susumu Seki, Ryuichi Oka
  • Patent number: 5778342
    Abstract: A pattern recognition system and method is disclosed. The method includes the steps of a) providing a noisy test feature set of the input signal, a plurality of reference feature sets of reference templates produced in a quiet environment, and a background noise feature set of background noise present in the input signal, b) producing adapted reference templates from the test feature set, the background noise feature set and the reference feature sets and c) determining match scores defining the match between each of the adapted reference templates and the test feature set. The method can also include adapting the scores before accepting a score as the result. The system and method are described for both Hidden Markov Model (HMM) and Dynamic Time Warping (DTW) scoring units. The system performs the steps of the method.
    Type: Grant
    Filed: February 1, 1996
    Date of Patent: July 7, 1998
    Assignee: DSPC Israel Ltd.
    Inventors: Adoram Erell, David Burshtein
  • Patent number: 5749073
    Abstract: In the first step of a sound morphing process, each sound which forms the basis for the morph is converted into one or more quantitative representations, such as spectrograms. After the representations have been obtained, the temporal axes of the two sounds are matched, so that similar components of the two sounds, such as onsets, harmonic regions and inharmonic regions, are aligned with one another. Other characteristics of the sounds, such as pitch, formant frequencies, or the like, are then matched. Once the energy in each of the sounds has been accounted for and matched to that of the other sound, the two sounds are cross-faded, to produce a representation of a new sound. This representation is then inverted, to generate the morphed sound.
    Type: Grant
    Filed: March 15, 1996
    Date of Patent: May 5, 1998
    Assignee: Interval Research Corporation
    Inventor: Malcolm Slaney
  • Patent number: 5737722
    Abstract: To determine the degree of correspondence between a first and a second pattern, the first and the second pattern are mapped to n first (V11 to V1n) and m second (V21 to V2m) feature vectors respectively. For points ({1, 1} to {n, m}) in a subarea of a matrix formed by the n first (V11 to V1n) and m second (V21 to V2m) feature vectors, the distance of the respective first (V11 to V1n) and the second (V21 to V2m) feature vectors is computed, and from the mean distance along an optimum path by means of a DP algorithm. Data regarding the gradient of the respective optimum path are determined during the computation for boundary points of the subarea, and the subarea is dynamically reduced for further computations on the basis of these data. The mean distance is used as the degree of correspondence.
    Type: Grant
    Filed: September 20, 1995
    Date of Patent: April 7, 1998
    Assignee: Alcatel N.V.
    Inventors: Dieter Kopp, Gebhard Thierer, Gregor Rozinaj