Patents Examined by Richard J. Kim
-
Patent number: 5375188Abstract: A music/voice discriminating apparatus is composed of a signal processing portion for effecting the signal processing upon input acoustic signals, a music/voice deciding portion for discriminating whether the input acoustic signals are music or voice, a first signal processing portion for optimally setting acoustic parameters for the signal processing respectively for music or voice, and a second signal processing portion for controlling the acoustic parameters of the first signal processing portion in accordance with the decision results of the music/voice deciding portion so that it may become a desirable value set in the second parameter setting portion.Type: GrantFiled: June 8, 1992Date of Patent: December 20, 1994Assignee: Matsushita Electric Industrial Co., Ltd.Inventors: Mitsuhiko Serikawa, Akihisa Kawamura, Masaharu Matsumoto, Hiroko Numazu
-
Patent number: 5371853Abstract: Apparatus and method for encoding speech using a codebook excited linear predictive (CELP) speech processor and an algebraic codebook for use therewith. The CELP speech processor receives a digital speech input representative of human speech and performs linear predictive code analysis and perceptual weighting filtering to produce a short term speech information and a long term speech information. The CELP speech processor utilizes an organized, non-overlapping, algebraic codebook containing a predetermined number of vectors, uniformly distributed over a multi-dimensional sphere to generate a remaining speech residual. The short term speech information, long term speech information and remaining speech residual are combinable to form a quality reproduction of the digital speech input.Type: GrantFiled: October 28, 1991Date of Patent: December 6, 1994Assignee: University of Maryland at College ParkInventors: Yuhung Kao, John Baras
-
Patent number: 5369726Abstract: A phoneme estimator in a speech-recognition system includes energy detect circuitry for detecting the segments of a speech signal that should be analyzed for phoneme content. Speech-element processors then process the speech signal segments, calculating nonlinear representations of the segments. The nonlinear representation data is applied to speech-element modeling circuitry which reduces the data through speech element specific modeling. The reduced data are then subjected to further nonlinear processing. The results of the further nonlinear processing are again applied to speech-element modeling circuitry, producing phoneme isotype estimates. The phoneme isotype estimates are rearranged and consolidated, that is, the estimates are uniformly labeled and duplicate estimates are consolidated, forming estimates of words or phrases containing minimal numbers of phonemes. The estimates may then be compared with stored words or phrases to determine what was spoken.Type: GrantFiled: February 9, 1993Date of Patent: November 29, 1994Assignee: Eliza CorporationInventors: John P. Kroeker, Robert L. Powers
-
Patent number: 5369730Abstract: In an overlap addition unit, speech waveform data is subjected to overlap addition every period read out from a period storage unit, and in a simple addition unit, the waveform data obtained by the overlap addition and the aperiodic waveform data read out from an aperiodic waveform storage unit are added to each other. Thus, the aperiodic waveform is given to the speech waveform to improve the quality of synthesized speech.Type: GrantFiled: May 26, 1992Date of Patent: November 29, 1994Assignee: Hitachi, Ltd.Inventor: Shunichi Yajima
-
Patent number: 5361323Abstract: In a signal encoding device, a voice signal is divided into subsignals each comprising a predetermined number of successive samples, and data trains are generated by giving initial values to a recurrence equation. The generated data trains are divided into patterns each comprising the same number of samples as the predetermined number. The distance between each pattern produced by the pattern dividing unit and each of the subsignals are calculated. The pattern that provides the smallest distance with respect to each of the subsignals is identified, and the initial value set in the recurrence equation for generation of the data train that constitutes the pattern is output as coded data representing the respective subsignal.Type: GrantFiled: November 29, 1991Date of Patent: November 1, 1994Assignee: Sharp Kabushiki KaishaInventors: Yasumoto Murata, Shuichi Yoshikawa, Yuji Nishiwaki, Shuichi Kawama, Tomokazu Morio, Atsunori Kitoh
-
Patent number: 5355432Abstract: A speech recognition system includes an acoustic analyzer which produces a time sequence of acoustic parameters from an input speech signal in an utterance boundary thereof, and estimates a trajectory in a parameter space from the time sequence of acoustic parameters. The trajectory is re-sampled in the parameter space at predetermined constant intervals sequentially each time the acoustic parameters are produced by the acoustic analyzing means, thereby producing an input utterance pattern. The input utterance pattern is matched with reference speech patterns to recognize the input speech signal. The speech recognition system also has an utterance boundary detector for detecting the utterance boundary of the input speech signal. The trajectory is re-sampled while the utterance boundary is being detected by the utterance boundary detector.Type: GrantFiled: August 12, 1992Date of Patent: October 11, 1994Assignee: Sony CorporationInventors: Miyuki Tanaka, Masao Watari, Yasuhiko Kato
-
Patent number: 5353408Abstract: A code conversion table, in which a code of a voice with noise added thereto and a code of a voice without noise are associated with each other in terms of probability, is referred to in a code converter. Using the code converter, a code is obtained in a vector quantizer by vector-quantizing cepstrum coefficients extracted from the voice with noise added thereto, and is converted into a code of a voice obtained by suppressing the noise in the voice with noise added thereto. Linear predictive coefficients are obtained from the code, and the voice signal is reproduced in a synthesis filter according to the linear predictive coefficients.Type: GrantFiled: December 30, 1992Date of Patent: October 4, 1994Assignee: Sony CorporationInventors: Yasuhiko Kato, Masao Watari, Makoto Akabane
-
Patent number: 5353373Abstract: The set of possible excitation signals is subdivided into a plurality of subsets, the first of which provides the contribution to the coded signal necessary to set up a transmission at a minimum rate guaranteed by the network, while the others supply a contribution which, when added to that of the first subset, causes a rate increase by successive steps. At the receiving side, a decoded signal is generated by using the excitation contribution of the first subset alone if the coded signals are received at the minimum rate, while for rates higher than the minimum rate the contributions of the subsets which have allowed such rate increase are also used.Type: GrantFiled: December 4, 1991Date of Patent: October 4, 1994Assignee: SIP - Societa Italiana per l'Esercizio delle Telecomunicazioni P.A.Inventors: Rosario Drogo de Iacovo, Roberto Montagna, Daniele Sereno
-
Patent number: 5353376Abstract: Systems and methods for improved speech acquisition are disclosed including a plurality of linearly arrayed sensors to detect spoken input and to output signals in response thereto, a beamformer connected to the sensors to cancel a preselected noise portion of the signals to thereby produce a processed signal, and a speech recognition system to recognize the processed signal and to respond thereto. The beamformer may also include an adaptive filter with enable/disable circuitry for selectively training the adaptive filter a predetermined period of time. A highpass filter may also be used to filter a preselected noise portion of the sensed signals before the signals are forwarded to the beamformer. The speech recognition system may include a speaker independent base which is able to be adapted by a predetermined amount of training by a speaker, and which system includes a voice dialer or a speech coder for telecommunication.Type: GrantFiled: March 20, 1992Date of Patent: October 4, 1994Assignee: Texas Instruments IncorporatedInventors: Sang G. Oh, Vishu R. Viswanathan
-
Patent number: 5345536Abstract: A set of "m" feature parameters is generated every frame from reference speech which is spoken by at least one speaker and which represents recognition-object words, where "m" denotes a preset integer. A set of "n" types of standard patterns is previously generated on the basis of speech data of a plurality of speakers, where "n" denotes a preset integer. Matching between the feature parameters of the reference speech and each of the standard patterns is executed to generate a vector of "n" reference similarities between the feature parameters of the reference speech and each of the standard patterns every frame. The reference similarity vectors of respective frames are arranged into temporal sequences corresponding to the recognition-object words respectively. The reference similarity vector sequences are previously registered as dictionary similarity vector sequences. Input speech to be recognized is analyzed to generate "m" feature parameters from the input speech.Type: GrantFiled: December 17, 1991Date of Patent: September 6, 1994Assignee: Matsushita Electric Industrial Co., Ltd.Inventors: Masakatsu Hoshimi, Maki Miyata, Shoji Hiraoka, Katsuyuki Niyada
-
Patent number: 5337394Abstract: In the speech recognizer disclosed herein, alignment of an unknown speech sediment, represented by a finely gradiated sequence of frames, with a model sediment represented by a sequence of states is performed by first preparing respective coarse sequences representing the unknown and model segments thereby to define a coarse matrix representing possible alignments. The fine sequences correspondingly define a fine matrix. A best alignment of the coarse sequences is determined thereby to define a coarse path through the coarse matrix. The coarse path is overlaid on the fine matrix and a corridor is defined which includes fine matrix locations which lie within a preselected metric of the coarse path. Only transitions within the corridor are calculated in determining the fine alignment of the unknown speech segment with the model segment, thereby significantly reducing the number of computations required.Type: GrantFiled: June 9, 1992Date of Patent: August 9, 1994Assignee: Kurzweil Applied Intelligence, Inc.Inventor: Vladimir Sejnoha
-
Patent number: 5325463Abstract: An IC card with a built-in voice synthesizing function is provided which includes a solid-state memory containing vector-quantized coded data, a pattern generating means for generating patterns each composed of a prescribed number of digital data by repeatedly performing calculations using a recurrence equation with initial values given by coded data read out of the solid-state memory, a converter circuit for limiting the band of each pattern generated by the pattern generating means and converting into an analog voice signal, and a gain control circuit for adjusting the gain of the analog voice signal output from the converter circuit on the basis of gain data read out of the solid-state memory.Type: GrantFiled: January 30, 1992Date of Patent: June 28, 1994Assignee: Sharp Kabushiki KaishaInventors: Yasumoto Murata, Shuichi Yoshikawa, Yuji Nishiwaki
-
Patent number: 5325462Abstract: A method, system and process to improve the formant composition in a speech synthesis system so that the formants are more intelligible. The system employs a process in the memory of a processor to change the starting and ending frequency of phonemes from the frequency of the independent phonemes. The process examines preceding and succeeding ending phoneme frequency values to detect similar phoneme frequency values. If a dissimilar value is detected, then the invention provides for exchange of the formants to render the resulting speech more intelligible.Type: GrantFiled: August 3, 1992Date of Patent: June 28, 1994Assignee: International Business Machines CorporationInventor: Peter W. Farrett
-
Patent number: 5321793Abstract: A low-delay audio signal coding system, using analysis-by-synthesis techniques, has circuitry for adapting the spectral parameters and the prediction order of synthesis filters, and of perceptual weighting filters in the order at each frame, starting from the reconstructed signal relevant to the previous frame. In the case of a CELP coder, gain controls are also provided to adapt, starting from the reconstructed sinal, a factor, bound to the average power of the input signal, of the gain by which the innovation vectors are weighted.Type: GrantFiled: May 21, 1993Date of Patent: June 14, 1994Assignee: SIP--Societa Italiana per l'Esercizio delle Telecommunicazioni P.A.Inventors: Rosario Drogo De Iacovo, Roberto Montagna, Daniele Sereno
-
Patent number: 5317673Abstract: In a hidden Markov model-based speech recognition system, multilayer perceptrons (MLPs) are used in context-dependent estimation of a plurality of state-dependent observation probability distributions of phonetic classes. Estimation is obtained by the Bayesian factorization of the observation likelihood in terms of posterior probabilities of phone classes assuming the context and the input speech vector. The context-dependent estimation is employed as the state-dependent observation probabilities needed as parameter input to a hidden Markov model speech processor to identify the word sequence representing the unknown speech input of input speech vectors. Within the speech processor, models are provided which employ the observation probabilities in the recognition process.Type: GrantFiled: June 22, 1992Date of Patent: May 31, 1994Assignee: SRI InternationalInventors: Michael H. Cohen, Horacio E. Franco
-
Patent number: 5313557Abstract: A pen recorder comprises a hollow pen body having a proximal end and a distal end a solid state audio recorder in its interior. A retractable pen mechanism is mounted on the distal end of the pen body, and a microphone transducer and speaker transducer are mounted at the proximal end. The speaker transducer is transversely mounted across an open proximal end of the pen body so that the interior of the pen body acts as a resonator improving the sound quality of the speaker.Type: GrantFiled: December 17, 1991Date of Patent: May 17, 1994Assignee: MachinaInventor: Ralph Osterhout
-
Patent number: 5303374Abstract: A digital audio signal processing apparatus is provided having a predictive error generator for generating predictive error data by processing input digital data to acquire a plurality of different frequency characteristics. A selector selects one of the plural predictive error data. A requantizer requantizes the selected predictive error data. A corrector processes with a predetermined frequency characteristic, the requantization error induced during the operation of the requantizer, thereby correcting the requantization error caused in the requantizer. A frequency characteristic control selects at least two of the predictive error data obtained with the plural frequency characteristics, then calculates the selected predictive error data and controls the frequency characteristic in the corrector in accordance with the result of such calculation.Type: GrantFiled: October 15, 1991Date of Patent: April 12, 1994Assignee: Sony CorporationInventors: Satoshi Mitsuhashi, Masayuki Nishiguchi