Patents Assigned to ATR Interpreting Telecommunications Research
-
Patent number: 6343266Abstract: An anaphora analyzing apparatus is disclosed for automatically estimating an anaphora referential relation or an antecedent of a noun for use in a natural language sentence. A storage unit stores analyzed results outputted from an analyzer, and an antecedent candidate generator detects a target component required for anaphora analysis in accordance with the current analyzed results and the past analyzed results stored in the storage unit, and generates antecedent candidates corresponding to the target component. Then, a candidate rejecting section rejects unnecessary candidates having no potential for anaphora referential relation among the antecedent candidates by using a predetermined rejecting criterion, and outputs the remaining antecedent candidates.Type: GrantFiled: February 25, 2000Date of Patent: January 29, 2002Assignee: ATR Interpreting Telecommunications Research LaboratoriesInventors: Michael Paul, Kazuhide Yamamoto, Eiichiro Sumita
-
Patent number: 6314399Abstract: An apparatus generates a statistical class sequence model called A class bi-multigram model from input training strings of discrete-valued units, where bigram dependencies are assumed between adjacent variable length sequences of maximum length N units, and where class labels are assigned to the sequences. The number of times all sequences of units occur are counted, as well as the number of times all pairs of sequences of units co-occur in the input training strings. An initial bigram probability distribution of all the pairs of sequences is computed as the number of times the two sequences co-occur, divided by the number of times the first sequence occurs in the input training string. Then, the input sequences are classified into a pre-specified desired number of classes. Further, an estimate of the bigram probability distribution of the sequences is calculated by using an EM algorithm to maximize the likelihood of the input training string computed with the input probability distributions.Type: GrantFiled: April 13, 1999Date of Patent: November 6, 2001Assignee: ATR Interpreting Telecommunications ResearchInventors: Sabine Deligne, Yoshinori Sagisaka, Hideharu Nakajima
-
Patent number: 6236963Abstract: In a speaker normalization processor apparatus, a vocal-tract configuration estimator estimates feature quantities of a vocal-tract configuration showing an anatomical configuration of a vocal tract of each normalization-target speaker, by looking up to a correspondence between vocal-tract configuration parameters and Formant frequencies previously determined based on a vocal tract model of the standard speaker, based on speech waveform data of each normalization-target speaker.Type: GrantFiled: March 16, 1999Date of Patent: May 22, 2001Assignee: ATR Interpreting Telecommunications Research LaboratoriesInventors: Masaki Naito, Li Deng, Yoshinori Sagisaka
-
Apparatus and method for producing analogically similar word based on pseudo-distances between words
Patent number: 6219633Abstract: In an analogically similar word production apparatus, based on three inputted unit strings, an analogically similar word which is a word analogically similar to inputted unit strings is produced at high speed without using attributes and without any finite state automaton. A pseudo-distance matrix memory stores therein only matrix elements sufficient for computation of limited pseudo distances between two letter strings, out of the elements of two pseudo-distance matrices, and more specifically, matrix elements including diagonal elements are computed by a preprocessing section and then stored in the pseudo-distance matrix memory.Type: GrantFiled: August 6, 1999Date of Patent: April 17, 2001Assignee: ATR Interpreting Telecommunications Research LaboratoriesInventor: Yves Lepage -
Patent number: 6041299Abstract: There are disclosed an apparatus for calculating a posteriori probabilities of phoneme symbols and a speech recognition apparatus using the apparatus for calculating a posteriori probabilities of phoneme symbols. A feature extracting section extracts speech feature parameters from a speech signal of an uttered speech sentence composed of an inputted character series, and a calculating section calculates a a posteriori probability of a phoneme symbol of the speech signal, by using a bidirectional recurrent neural network. The bidirectional recurrent neural network includes (a) an input layer for receiving the speech feature parameters extracted by the feature extracting means and a plurality of hypothetical phoneme symbol series signals, (b) an intermediate layer of at least one layer having a plurality of units, and (c) an output layer for outputting a a posteriori probability of each phoneme symbol.Type: GrantFiled: March 11, 1998Date of Patent: March 21, 2000Assignee: ATR Interpreting Telecommunications Research LaboratoriesInventors: Mike Schuster, Toshiaki Fukada
-
Patent number: 6009424Abstract: Provided is a similarity search apparatus for searching data at a higher speed than that of the prior art without limiting the types of letter of a search key. A unit position correspondence memory stores therein a table that expresses the ordinal number among units at which each unit in a search key inputted by means of a keyboard has appeared within the search key. A search section refers to the table stored in the unit position correspondence memory and operates every time units are read out one by one from a database memory including a plurality of units to generate a plurality of status parameters each of which includes a similarity, a position of coincidence and a skip number, which express with what number of units from the top of the search key the units read out from the database have coincided at what degree of similarity, and express how many units in the database have been skipped over subsequently.Type: GrantFiled: September 3, 1997Date of Patent: December 28, 1999Assignee: ATR Interpreting Telecommunications Research LaboratoriesInventors: Yves Lepage, Shinichi Ando
-
Patent number: 5983178Abstract: A speaker clustering apparatus generates HMMs for clusters based on feature quantities of a vocal-tract configuration of speech waveform data, and a speech recognition apparatus provided with the speaker clustering apparatus. In response to the speech waveform data of N speakers, an estimator estimates feature quantities of vocal-tract configurations, with reference to correspondence between vocal-tract configuration parameters and Formant frequencies predetermined based on a predetermined vocal tract model of a standard speaker. Further, a clustering processor calculates speaker-to-speaker distances between the N speakers based on the feature quantities of the vocal-tract configurations of the N speakers as estimated, and clusters the vocal-tract configurations of the N speakers using a clustering algorithm based on calculated speaker-to-speaker distances, thereby generating K clusters.Type: GrantFiled: December 10, 1998Date of Patent: November 9, 1999Assignee: ATR Interpreting Telecommunications Research LaboratoriesInventors: Masaki Naito, Li Deng, Yoshinori Sagisaka
-
Patent number: 5878390Abstract: A speech recognition apparatus which includes a speech recognition section for performing a speech recognition process on an uttered speech with reference to a predetermined statistical language model, based on a series of speech signal of the uttered speech sentence composed of a series of input words. The speech recognition section calculates a functional value of a predetermined erroneous sentence judging function with respect to speech recognition candidates, where the erroneous sentence judging representing a degree of unsuitability for the speech recognition candidates. When the calculated functional value exceeds a predetermined threshold value, the speech recognition section performs the speech recognition process by eliminating a speech recognition candidate corresponding to a calculated functional value.Type: GrantFiled: June 23, 1997Date of Patent: March 2, 1999Assignee: ATR Interpreting Telecommunications Research LaboratoriesInventors: Jun Kawai, Yumi Wakita
-
Patent number: 5839105Abstract: There is provided a speaker-independent model generation apparatus and a speech recognition apparatus which require a processing unit to have less memory capacity and which allow its computation time to be reduced, as compared with a conventional counterpart. A single Gaussian HMM is generated with a Baum-Welch training algorithm based on spoken speech data from a plurality of specific speakers. A state having a maximum increase in likelihood as a result of splitting one state in contextual or temporal domains is searched. Then, the state having a maximum increase in likelihood is split in a contextual or temporal domain corresponding to the maximum increase in likelihood. Thereafter, a single Gaussian HMM is generated with the Baum-Welch training algorithm, and these steps are iterated until the states within the single Gaussian HMM can no longer be split or until a predetermined number of splits is reached. Thus, a speaker-independent HMM is generated.Type: GrantFiled: November 29, 1996Date of Patent: November 17, 1998Assignee: ATR Interpreting Telecommunications Research LaboratoriesInventors: Mari Ostendorf, Harald Singer
-
Patent number: 5835893Abstract: In a word clustering apparatus for clustering words, a plurality of words is clustered to obtain a total tree diagram of a word dictionary representing a word clustering result, where the total tree diagram includes tree diagrams of an upper layer, a middle layer and a lower layer. In a speech recognition apparatus, a microphone converts an input utterance speech composed of a plurality of words into a speech signal, and a feature extractor extracts predetermined acoustic feature parameters from the converted speech signal. Then, a speech recognition controller executes a speech recognition process on the extracted acoustic feature parameters with reference to a predetermined Hidden Markov Model and the obtained total tree diagram of the word dictionary, and outputs a result of the speech recognition.Type: GrantFiled: April 18, 1996Date of Patent: November 10, 1998Assignee: ATR Interpreting Telecommunications Research LabsInventor: Akira Ushioda
-
Patent number: 5754681Abstract: In a signal pattern recognition apparatus, a plurality of feature transformation sections respectively transform an inputted signal pattern into vectors in a plurality of feature spaces corresponding respectively to predetermined classes using a predetermined transformation parameter corresponding to each of the classes so as to emphasize a feature of each of the classes, and a plurality of discriminant function sections respectively calculates a value of a discriminant function using a predetermined discriminant function representing a similarity measure of each of the classes for the transformed vectors in the plurality of feature spaces.Type: GrantFiled: June 22, 1995Date of Patent: May 19, 1998Assignee: ATR Interpreting Telecommunications Research LaboratoriesInventors: Hideyuki Watanabe, Tsuyoshi Yamaguchi, Shigeru Katagiri