Neural Network Patents (Class 704/232)

System and method of word graph matrix decomposition

Patent number: 7277850

Abstract: Disclosed is a system and method of decomposing a lattice transition matrix into a block diagonal matrix. The method is applicable to automatic speech recognition but can be used in other contexts as well, such as parsing, named entity extraction and any other methods. The method normalizes the topology of any input graph according to a canonical form.

Type: Grant

Filed: April 2, 2003

Date of Patent: October 2, 2007

Assignee: AT&T Corp.

Inventors: Dilek Z. Hakkani-Tur, Giuseppe Riccardi
Nonlinear mapping for feature extraction in automatic speech recognition

Patent number: 7254538

Abstract: The present invention successfully combines neural-net discriminative feature processing with Gaussian-mixture distribution modeling (GMM). By training one or more neural networks to generate subword probability posteriors, then using transformations of these estimates as the base features for a conventionally-trained Gaussian-mixture based system, substantial error rate reductions may be achieved. The present invention effectively has two acoustic models in tandem—first a neural net and then a GMM. By using a variety of combination schemes available for connectionist models, various systems based upon multiple features streams can be constructed with even greater error rate reductions.

Type: Grant

Filed: November 16, 2000

Date of Patent: August 7, 2007

Assignee: International Computer Science Institute

Inventors: Hynek Hermansky, Sangita Sharma, Daniel Ellis
Method for detecting the time sequences of a fundamental frequency of an audio response unit to be synthesized

Patent number: 7219061

Abstract: Predetermined macrosegments of the fundamental frequency are determined by a neural network, and these predefined macrosegments are reproduced by fundamental-frequency sequences stored in a database. The fundamental frequency is generated on the basis of a relatively large text section which is analyzed by the neural network. Microstructures from the database are received in the fundamental frequency. The fundamental frequency thus formed is thus optimized both with regard to its macrostructure and to its microstructure. As a result, an extremely natural sound is achieved.

Type: Grant

Filed: October 24, 2000

Date of Patent: May 15, 2007

Assignee: Siemens Aktiengesellschaft

Inventors: Caglayan Erdem, Martin Holzapfel
Method and device for selecting a sound algorithm

Patent number: 7206414

Abstract: The invention relates to a method for selecting a sound algorithm for processing an audio signal. The audio signal is analyzed and the type of audio signal is ascertained based on the analysis. The audio signal is classified as a music signal or another signal, and different sound algorithms are used for the further processing and subsequent output of the audio signal.

Type: Grant

Filed: September 30, 2002

Date of Patent: April 17, 2007

Assignee: Grundig Multimedia B.V.

Inventor: Donald Schulz
Method and apparatus for detecting prosodic phrase break in a text to speech (TTS) system

Patent number: 7136802

Abstract: Methods for processing speech data are described herein. In one aspect of the invention, an exemplary method includes receiving a text sentence comprising a plurality of words, each of the plurality of words having a part of speech (POS) tag, generating a POS sequence based on the POS tag of each of the plurality of words, detecting a prosodic phrase break through a recurrent neural network (RNN), based on the POS sequence, and generating a prosodic phrases boundary based on the prosodic phrase break. Other methods and apparatuses are also described.

Type: Grant

Filed: January 16, 2002

Date of Patent: November 14, 2006

Assignee: Intel Corporation

Inventors: Zhiwei Ying, Xiaohua Shi
Method and apparatus for efficient implementation and evaluation of state machines and programmable finite state automata

Patent number: 7119577

Abstract: A method and apparatus for efficient implementation and evaluation of state machines and programmable finite state automata is described. In one embodiment, a state machine architecture comprises a plurality of node elements, wherein each of the plurality of node elements represents a node of a control flow graph. The state machine architecture also comprises a plurality of interconnections to connect node elements, a plurality of state transition connectivity control logic to enable and disable connections within the plurality of interconnections to form the control flow graph with the plurality of node elements, and a plurality of state transition evaluation logic coupled to the interconnections and operable to evaluate input data against criteria, the plurality of state transition evaluation logic to control one or more state transitions between node elements in the control flow graph.

Type: Grant

Filed: August 27, 2003

Date of Patent: October 10, 2006

Assignee: Cisco Systems, Inc.

Inventor: Harshvardhan Sharangpani
Multistream network feature processing for a distributed speech recognition system

Patent number: 7089178

Abstract: A distributed voice recognition system and method for obtaining acoustic features and speech activity at multiple frequencies by extracting high frequency components thereof on a device, such as a subscriber station and transmitting them to a network server having multiple stream processing capability, including cepstral feature processing, MLP nonlinear transformation processing, and multiband temporal pattern architecture processing. The features received at the network server are processed using all three streams, wherein each of the three streams provide benefits not available in the other two, thereby enhancing feature interpretation. Feature extraction and feature interpretation may operate at multiple frequencies, including but not limited to 8 kHz, 11 kHz, and 16 kHz.

Type: Grant

Filed: April 30, 2002

Date of Patent: August 8, 2006

Assignee: Qualcomm Inc.

Inventors: Harinath Garudadri, Sunil Sivadas, Hynek Hermansky, Nelson H. Morgan, Charles C. Wooters, Andre Gustavo Adami, Maria Carmen Benitez Ortuzar, Lukas Burget, Stephane N. Dupont, Frantisek Grezl, Pratibha Jain, Sachin Kajarekar, Petr Motlicek
Methods and apparatuses for evaluation of regular expressions of arbitrary size

Patent number: 7085918

Abstract: Embodiments of the invention provide a programmable FSA building block, having a number of programmable registers and associated logic implemented therein, that provide the capability of contextually evaluating complex REs of arbitrary size against multiple data streams. Embodiments of the invention provide fully programmable hardware in which all of the states of an RE are instantiated and all of the states are fully connected. For one embodiment, the building blocks have a fixed number of states to facilitate implementation on a chip. For such an embodiment, an RE having an excessive number of states is implemented on two or more FSA building blocks and the FSA building blocks are then stitched together to effect evaluation of the RE. For one embodiment, two or more REs having a number of states less than the fixed number of states of a building block may be implemented with a single building block.

Type: Grant

Filed: January 8, 2004

Date of Patent: August 1, 2006

Assignee: Cisco Systems, Inc.

Inventors: Harshvardan Sharangpani, Manoj Khare, Kent Fielden, Rajesh Patil, Judge Kennedy Arora
Automatic monitoring and statistical analysis of dynamic process metrics to expose meaningful changes

Patent number: 7072899

Abstract: A selection module allows a user to specify at least one measure to be monitored in at least one dimension of a dimensional hierarchy. A control limit calculator extracts, for each specified measure and for each specified dimension, a time series from a multidimensional database for the specified measure in the specified dimension and automatically calculates one or more control limits for the specified measure in the specified dimension based on the extracted time series using a Statistical Process Control (SPC) technique. Thereafter, a monitoring module monitors newly acquired data including each specified measure in each specified dimension for an out-of-limits condition based on one or more automatically-calculated control limits. An alert module triggers an alert in response to an out-of-limits condition being detected.

Type: Grant

Filed: December 17, 2004

Date of Patent: July 4, 2006

Assignee: Proclarity, Inc.

Inventor: Robert C. Lokken
Method and apparatus for transcribing speech when a plurality of speakers are participating

Patent number: 6996526

Abstract: A method and apparatus are disclosed for transcribing speech when a number of speakers are participating. A number of different speech recognition systems, each with a different speaker model, are executed in parallel. When the identity of all of the participating speakers is known and a speaker model is available for each participant, each speech recognition system employs a different speaker model suitable for a corresponding participant. Each speech recognition system decodes the speech and generates a corresponding confidence score. The decoded output having the highest confidence score is selected for presentation to a user. When all participating speakers are not known, or when there are too many participants to implement a unique speaker model for each participant, a speaker independent speech recognition system is employed together with a speaker specific speech recognition system.

Type: Grant

Filed: January 2, 2002

Date of Patent: February 7, 2006

Assignee: International Business Machines Corporation

Inventors: Sara H. Basson, Peter Gustav Fairweather, Alexander Faisman, Dimitri Kanevsky, Jeffery Scott Sorensen
Class quantization for distributed speech recognition

Patent number: 6961696

Abstract: A system, method and computer readable medium for quantizing class information and pitch information of audio is disclosed. The method on an information processing system includes receiving audio and capturing a frame of the audio. The method further includes determining a pitch of the frame and calculating a codeword representing the pitch of the frame, wherein a first codeword value indicates an indefinite pitch. The method further includes determining a class of the frame, wherein the class is any one of at least two classes indicating an indefinite pitch and at least one class indicating a definite pitch. The method further includes calculating a codeword representing the class of the frame, wherein the codeword length is the maximum of the minimum number of bits required to represent the at least two classes and the minimum number of bits required to represent the at least one class.

Type: Grant

Filed: February 7, 2003

Date of Patent: November 1, 2005

Assignees: Motorola, Inc., International Business Machines Corporation

Inventors: Tenkasi V. Ramabadran, Alexander Sorin
Acoustic speech recognition method and system using stereo vision neural networks with competition and cooperation

Patent number: 6947890

Abstract: A method and system are provided for speech recognition. The speech recognition method includes the steps of preparing training data representing acoustic parameters of each of phonemes at each time frame; receiving an input signal representing a sound to be recognized and converting the input signal to input data; comparing the input data at each frame with the training data of each of the phonemes to derive a similarity measure of the input data with respect to each of the phonemes; and processing the similarity measures obtained in the comparing step using a neural net model governing development of activities of plural cells to conduct speech recognition of the input signal.

Type: Grant

Filed: May 30, 2000

Date of Patent: September 20, 2005

Inventors: Tetsuro Kitazoe, Sung-Ill Kim, Tomoyuki Ichiki
Efficient speech recognition system bases on an auditory model

Patent number: 6947891

Abstract: A speech recognition system that is insensitive to external noise and applicable to actual life includes an A/D converter that converts analog voice signals to digital signals. An FIR filtering section employs powers-of-two conversion to filter the digital signals converted at the A/D converter into numbers of channels. A characteristic extraction section immediately extracts speech characteristics having strong noise-resistance from the output signals of the FIR filtering section without using additional memories. A word boundary detection section discriminates the information of the start-point and the end-point of a voice signal on the basis of the characteristics extracted by the characteristic extraction section.

Type: Grant

Filed: January 22, 2001

Date of Patent: September 20, 2005

Assignee: Korea Advanced Institute of Science & Technology

Inventors: Soo Young Lee, Chang Min Kim
Telephony-data application interface apparatus and method for multi-modal access to data applications

Patent number: 6941273

Abstract: A voice-enabled system for online shopping provides a voice and telephony interface, as well a text and graphic interface, for shopping over the Internet using a browser or a telephone. The system allows customers to access an online shop, search for desired database items, select items, and finally pay for selected items using a credit card, over a phone line or the Internet. A telephony-Internet interface converts spoken queries into electronic commands for transmission to an online shop or database. Markup language-type pages transmitted to callers from the online-shop or database are parsed to extract selected information. The selected information is then reported to the callers via audio messaging.

Type: Grant

Filed: October 7, 1998

Date of Patent: September 6, 2005

Inventors: Masoud Loghmani, Fred F. Korangy
Method of speech recognition using variational inference with switching state space models

Patent number: 6931374

Abstract: A method is developed which includes 1) defining a switching state space model for a continuous valued hidden production-related parameter and the observed speech acoustics, and 2) approximating a posterior probability that provides the likelihood of a sequence of the hidden production-related parameters and a sequence of speech units based on a sequence of observed input values. In approximating the posterior probability, the boundaries of the speech units are not fixed but are optimally determined. Under one embodiment, a mixture of Gaussian approximation is used. In another embodiment, an HMM posterior approximation is used.

Type: Grant

Filed: April 1, 2003

Date of Patent: August 16, 2005

Assignee: Microsoft Corporation

Inventors: Hagai Attias, Leo Jingyu Lee, Li Deng
Methods for speech processing

Patent number: 6920423

Abstract: The invention relates to a method for speech processing in which input variables containing speech features are mapped onto output variables. In the mapping process, the input variables are weighted and/or identical maps are produced for different sets of input variables and at least one output variable.

Type: Grant

Filed: September 24, 2001

Date of Patent: July 19, 2005

Assignee: Siemens Aktiengesellschaft

Inventors: Achim Mueller, Hans-Georg Zimmermann
Compressing HMM prototypes

Patent number: 6907398

Abstract: A method is described for compressing the storage space required by HMM prototypes in an electronic memory. For this purpose prescribed HMM prototypes are mapped onto compressed HMM prototypes with the aid of a neural network (encoder). These can be stored with a smaller storage space than the uncompressed HMM prototypes. A second neural network (decoder) serves to reconstruct the HMM prototypes.

Type: Grant

Filed: September 6, 2001

Date of Patent: June 14, 2005

Assignee: Siemens Aktiengesellschaft

Inventor: Harald Hoege
Apparatus and method for selecting length of variable length coding bit stream using neural network

Patent number: 6885320

Abstract: An apparatus and method for selecting the length of a variable length code bitstream by using a neural network are provided. The apparatus for selecting the length of a variable length code bitstream includes a bitstream estimation length receiving unit which inputs a predetermined quantization DCT coefficient block to a neural network whose training is finished, and receives the estimation length of a bitstream corresponding to the quantization DCT coefficient block from the neural network; and a bitstream estimation length selection unit which receives user selection about an estimation length received by the bitstream estimation length receiving unit. According to the method and apparatus the length of a variable length code bit stream can be estimated such that a user can select a desired length of a bitstream in advance without performing variable length coding.

Type: Grant

Filed: January 21, 2004

Date of Patent: April 26, 2005

Assignee: Samsung Elecetronics Co., Ltd.

Inventor: So-young Kim
Method and apparatus for suppressing audible noise in speech transmission

Patent number: 6820053

Abstract: Method of suppressing audible noise in speech transmission by means of a multi-layer self-organizing fed-back neural network comprising a minima detection layer, a reaction layer, a diffusion layer and an integration layer, said layers defining a filter function F(f,T) for noise filtering.

Type: Grant

Filed: October 6, 2000

Date of Patent: November 16, 2004

Inventor: Dietmar Ruwisch
Method and device for recognising a phonetic sound sequence or character sequence

Publication number: 20040199389

Abstract: The invention relates to a method for recognizing a phonetic sound sequence or a character sequence, e.g.

Type: Application

Filed: February 12, 2004

Publication date: October 7, 2004

Inventor: Hans Geiger
Spatial image processor

Patent number: 6801655

Abstract: A spatial image processor neural network for processing image data to discriminate between first and second spatial configurations of component objects includes a photo transducer input array for converting an input image to pixel data and sending the data to a localized gain network (LGN) module, a parallel memory processor and neuron array for receiving the pixel data and processing the pixel data into component recognition vectors and chaotic oscillators for receiving the recognition vectors and sending feedback data to the LGN module as attention activations. The network further includes a temporal spatial retina for receiving both the pixel data and temporal feedback activations and generating temporal spatial vectors, which are processed by a temporal parallel processor into temporal component recognition vectors. A spatial recognition vector array receives the temporal component recognition vectors and forms an object representation of the first configuration of component objects.

Type: Grant

Filed: May 10, 2001

Date of Patent: October 5, 2004

Assignee: The United States of America as represented by the Secretary of the Navy

Inventor: Roger L. Woodall
Method of determining the topology of a network by transmitting and monitoring sequences of orthogonal signals

Patent number: 6728670

Abstract: A method of determining the topology of a network comprising: transmitting a signal comprised of a sequence of bursts of packets formed of orthogonal signals, monitoring devices in the network including the destination device for reception of the signal, and defining a sequence of devices within the network by sensing a sequence of reception of the signal in the devices from the source device toward the destination device.

Type: Grant

Filed: February 7, 2002

Date of Patent: April 27, 2004

Assignee: Peregrine Systems, Inc.

Inventors: David Schenkel, Michael Slavitch, Nicholas Dawes
Method and system for multilingual voice recognition

Publication number: 20040039570

Abstract: The present invention provides for a method and system of voice recognition, in particular for navigation in a hypertext navigation system. For each new word, a language identification stage, in particular embodied as a neural network, is used to determine the inclusion of the word in a language or a dialect with a given probability factor and the grapheme/phoneme relationship corresponding to the word with the greatest probability coefficient in the phonetic lexicon, or in at least one of the several phonetic lexica, is updated.

Type: Application

Filed: May 28, 2003

Publication date: February 26, 2004

Inventors: Steffen Harengel, Meinrad Niemoeller
Speech recognition in consumer electronic products

Patent number: 6665639

Abstract: A method and apparatus are described that allow inexpensive speech recognition in applications where this capability is not otherwise feasible because of cost or technical reasons, or because of inconvenience to the user. A relatively simple speaker independent recognition algorithm, capable of recognizing a limited number of utterances at any one time, is associated with the base unit of an electronics product. To function, the product requires information from an external medium and this medium also provides the data required to recognize several sets of utterances pertinent to other information provided by the external medium.

Type: Grant

Filed: January 16, 2002

Date of Patent: December 16, 2003

Assignee: Sensory, Inc.

Inventors: Todd F. Mozer, Forrest S. Mozer, Thomas North
Speech recognition system, speech recognition method, speech synthesis system, speech synthesis method, and program product

Publication number: 20030171921

Abstract: The object of the present invention is to keep a high success rate in recognition with a low-volume of sound signal, without being affected by noise.

Type: Application

Filed: March 4, 2003

Publication date: September 11, 2003

Applicant: NTT DoCoMo, Inc.

Inventors: Hiroyuki Manabe, Akira Hiraiwa, Toshiaki Sugimura
Model adaptation of neural tree networks and other fused models for speaker verification

Patent number: 6519561

Abstract: The model adaptation system of the present invention is a speaker verification system that embodies the capability to adapt models learned during the enrollment component to track aging of a user's voice. The system has the advantage of only requiring a single enrollment for the user. The model adaptation system and methods can be applied to several types of speaker recognition models including neural tree networks (NTN), Gaussian Mixture Models (GMMs), and dynamic time warping (DTW) or to multiple models (i.e., combinations of NTNs, GMMs and DTW). Moreover, the present invention can be applied to text-dependent or text-independent systems.

Type: Grant

Filed: November 3, 1998

Date of Patent: February 11, 2003

Assignee: T-Netix, Inc.

Inventors: Kevin Farrell, William Mistretta
Method and apparatus for training an ultra-large vocabulary, continuous speech, speaker independent, automatic speech recognition system and consequential database

Patent number: 6490557

Abstract: The present invention is embodied in a system and method for recognizing speech and transcribing speech in real time. The system includes a computer, which could be in a LAN or WAN linked to other computer systems through the Internet. The computer has a controller, or similar device, to filter background noise and convert incoming signals to digital format. The digital signals are transcribed to a word list, which is processed by an automatic speech recognition system. This system synchronizes and compares the lists and forwards the list to a speech recognition learning system, which stores the data on-site. The stored data is forwarded to an off-site storage system, and an off-site large scale learning system that processes the data from all sites on the wide area network system.

Type: Grant

Filed: March 3, 1999

Date of Patent: December 3, 2002

Inventor: John C. Jeppesen
Method and system for objectively evaluating speech

Patent number: 6446038

Abstract: A method and system for objectively evaluating the quality of speech in a voice communication system. A plurality of speech reference vectors is first obtained based on a plurality of clean speech samples. A corrupted speech signal is received and processed to determine a plurality of distortions derived from a plurality of distortion measures based on the plurality of speech reference vectors. The plurality of distortions are processed by a non-linear neural network model to generate a subjective score representing user acceptance of the corrupted speech signal. The non-linear neural network model is first trained on clean speech samples as well as corrupted speech samples through the use of backpropagation to obtain the weights and bias terms necessary to predict subjective scores from several objective measures.

Type: Grant

Filed: April 1, 1996

Date of Patent: September 3, 2002

Assignee: Qwest Communications International, Inc.

Inventors: Aruna Bayya, Marvin Vis
Method of controlling devices via speech signals, more particularly, in motorcars

Publication number: 20020065584

Abstract: The invention relates to a method of controlling function units of a motorcar or of devices (1a, 1b) installed in a motorcar, via speech signals, in which

Type: Application

Filed: August 22, 2001

Publication date: May 30, 2002

Inventors: Andreas Kellner, Alexander Fischer
Handwriting and speech recognizer using neural network with separate start and continuation output scores

Patent number: 6393395

Abstract: A method and system for recognizing user input information including cursive handwriting and spoken words. A time-delayed neural network having an improved architecture is trained at the word level with an improved method, which, along with preprocessing improvements, results in a recognizer with greater recognition accuracy. Preprocessing is performed on the input data and, for example, may include resampling the data with sample points based on the second derivative to focus the recognizer on areas of the input data where the slope change per time is greatest. The input data is segmented, featurized and fed to the time-delayed neural network which outputs a matrix of character scores per segment. The neural network architecture outputs a separate score for the start and the continuation of a character.

Type: Grant

Filed: January 7, 1999

Date of Patent: May 21, 2002

Assignee: Microsoft Corporation

Inventors: Angshuman Guha, Patrick M. Haluptzok, James A. Pittman
Matrix quantization with vector quantization error compensation and neural network postprocessing for robust speech recognition

Patent number: 6347297

Abstract: A speech recognition system utilizes both matrix and vector quantizers as front ends to a second stage speech classifier such as hidden Markov models (HMMs) and utilizes neural network postprocessing to, for example, improve speech recognition performance. Matrix quantization exploits the “evolution” of the speech short-term spectral envelopes as well as frequency domain information, and vector quantization (VQ) primarily operates on frequency domain information. Time domain information may be substantially limited which may introduce error into the matrix quantization, and the VQ may provide error compensation. The matrix and vector quantizers may split spectral subbands to target selected frequencies for enhanced processing and may use fuzzy associations to develop fuzzy observation sequence data. A mixer provides a variety of input data to the neural network for classification determination. The neural network's ability to analyze the input data generally enhances recognition accuracy.

Type: Grant

Filed: October 5, 1998

Date of Patent: February 12, 2002

Assignee: Legerity, Inc.

Inventors: Safdar M. Asghar, Lin Cong
Method and apparatus of hierarchically organizing an acoustic model for speech recognition and adaptation of the model to unseen domains

Patent number: 6324510

Abstract: A method of organizing an acoustic model for speech recognition is comprised of the steps of calculating a measure of acoustic dissimilarity of subphonetic units. A clustering technique is recursively applied to the subphonetic units based on the calculated measure of acoustic dissimilarity to automatically generate a hierarchically arranged model. Each application of the clustering technique produces another level of the hierarchy with the levels progressing from the least specific to the most specific. A technique for adapting the structure and size of a trained acoustic model to an unseen domain using only a small amount of adaptation data is also disclosed.

Type: Grant

Filed: November 6, 1998

Date of Patent: November 27, 2001

Assignee: Lernout & Hauspie Speech Products N.V.

Inventors: Alex Waibel, Juergen Fritsch
Voice detection in audio signals

Patent number: 6321194

Abstract: The presence of a voice in an audio signal is detected by sampling frequency components of the audio signal during a window that starts when a power of the audio signal reaches a predetermined threshold and stops when the audio signal's power drops below the predetermined threshold. An array of elements is generated based on the sampled frequency components. Each element in the array corresponds to a time-based sum of frequency components. Whether the audio signal corresponds to a voice is determined using one or values calculated from the generated array. The value may correspond either to a frequency-based sum of array elements or to the window. The calculated values are analyzed using fuzzy logic which generates a measure of a likelihood that the audio signal is a voice.

Type: Grant

Filed: April 27, 1999

Date of Patent: November 20, 2001

Assignee: Brooktrout Technology, Inc.

Inventor: Alexander Berestesky
Audio diagnostic system and method using frequency spectrum and neural network

Patent number: 6304865

Abstract: A method for testing an audio device with a trained neural network includes a loopback connector connecting the output port of the audio device to the input port of the audio device. A test signal is transmitted through the audio port and received at the input port. The test signal is converted into a frequency spectrum for analysis. The frequency spectrum is provided as input to a trained neural network, the neural network being previously trained to recognize the frequency spectrum pattern created by a properly working, or ideal, audio device. The neural network is trained by connecting the input port to the output port of an audio device from which the training is to occur. Prior to converting signals to a frequency spectrum, the waveform characteristics of the signal may be iteratively evaluated and recording levels adjusted so that the signal received has characteristics that can be tested by the neural network.

Type: Grant

Filed: October 27, 1998

Date of Patent: October 16, 2001

Assignee: Dell U.S.A., L.P.

Inventors: Alan K. Christensen, Christopher F. Broadbent
System and method for recognizing user-specified pen-based gestures using hidden markov models

Patent number: 6304674

Abstract: A method for recognizing user specified pen-based gestures uses Hidden Markov Models. A gesture recognizer is implemented which includes a fast pruning procedure. In addition, an incremental training method is utilized.

Type: Grant

Filed: August 3, 1998

Date of Patent: October 16, 2001

Assignee: Xerox Corporation

Inventors: Todd A. Cass, Lynn D. Wilcox, Tichomir G. Tenev
Computer voice recognition method verifying speaker identity using speaker and non-speaker data

Patent number: 6298323

Abstract: A method for recognizing a speaker in which a voice signal is spoken into a computer by a speaker and a feature vector is formed for the voice signal. The feature vector is compared to at least one stored reference feature vector and to at least one anti-feature vector. The reference feature vector is formed from a speech sample of a speaker to be verified. The anti-feature vector was formed from a speech sample that was spoken in by another speaker who is not the speaker to be verified. A 2-class classification is resolved by forming a similarity value and evaluating the similarity value on the basis of a predetermined range within which the similarity value must deviate from a predetermined value so that the voice signal can be classified as deriving from the speaker to be verified.

Type: Grant

Filed: July 25, 1997

Date of Patent: October 2, 2001

Assignee: Siemens Aktiengesellschaft

Inventor: Bernhard Kaemmerer
Method and apparatus for generating deterministic approximate weighted finite-state automata

Patent number: 6266634

Abstract: An approximate weighted finite-state automaton can be constructed in place of a weighted finite-state automaton so long as the approximate weighted finite-state automaton maintains a sufficient portion of the original best strings in the weighted finite-state automaton and sufficiently few spurious strings are introduced into the approximate weighted finite-state automaton compared to the weighted finite-state automaton. An approximate weighted finite-state automaton can be created from a non-deterministic weighted finite-state automaton during determinization by discarding the requirement that old states be used in place of new states only when an old state is identical to a new state. Instead, in an approximate weighted finite-state automaton, old states will be used in place of new states when each of the remainders of the new state is sufficiently close to the corresponding remainder of the old state.

Type: Grant

Filed: March 23, 2000

Date of Patent: July 24, 2001

Assignee: AT&T Corporation

Inventors: Adam Louis Buchsbaum, Raffaele Giancarlo, Jeffery Rex Westbrook
Device and method for analysis and filtration of sound

Patent number: 6243671

Abstract: A device for analysis and filtration of sound which comprises at least one frequency-linear filter, at least one frequency-logarithmic filter and a weighting means for the combining and non-linear weighting of the output signals from the frequency-linear filter and the frequency-logarithmic filter. The sound is fed parallel as an input signal to the two sets of filters. In turn, the output signals from the filters are fed to the weighting means where they are combined and weighted non-linearly on the basis of magnitudes relevant to the sound. A decision with respect to the identity of the sound is made in a decision means. The invention also relates to a method for analyzing and filtering sound with the aid of the above-mentioned device.

Type: Grant

Filed: January 4, 1999

Date of Patent: June 5, 2001

Inventors: Thomas Lagö, Sven Olsson
System and method capable of automatically switching information output format

Patent number: 6243675

Abstract: In an information processing system such as a navigation system or a portable information terminal unit, information output mode of a display unit to a user is switchable to a plurality of output modes. The navigation system switches the language to be used for information output to any one of Japanese language, English language, and German language. Not the input speech itself is used as a command, but whether the input speech is a Japanese language input, an English language input, or a German language input is determined. The language to be used for information output is switched depending on the determination result.

Type: Grant

Filed: August 8, 2000

Date of Patent: June 5, 2001

Assignee: Denso Corporation

Inventor: Takenori Ito
Pattern matching method and apparatus

Patent number: 6240389

Abstract: A method and apparatus is provided for matching a first sequence of patterns representative of a first signal with a second sequence of patterns representative of a second signal. The system uses a plurality of different pruning thresholds (th) to control the propagation of paths which represent possible matchings between a sequence of second signal patterns and a sequence of first signal patterns ending at the current first signal pattern. In particular, the pruning threshold used for a given path during the processing of a current first signal pattern depends upon the position, within the sequence of patterns representing the second signal, of the second signal pattern which is at the end of the given path.

Type: Grant

Filed: February 8, 1999

Date of Patent: May 29, 2001

Assignee: Canon Kabushiki Kaisha

Inventors: Robert Alexander Keiller, Eli Tzirkel-Hancock, Julian Richard Seward
Method for automatically generating pronunciation dictionary in speech recognition system

Patent number: 6236965

Abstract: A method for automatically generating a pronunciation dictionary in a speech recognition system is disclosed. Pronunciation patterns of a large scale pronunciation dictionary are learned through a neural network without resorting to a phonetic knowledge, and the pronunciation sequences for input words are accurately formed by utilizing an exception grapheme pronunciation dictionary, an exception word pronunciation dictionary for graphemes and words prohibiting the formation of an accurate pronunciation dictionary through the learning neural network, thereby reducing the size of the memory and the amount of calculations. A multi-layer perceptron for directly mapping phonemes relevant to respective graphemes is taught by utilizing a neural network, so as to form an exception word pronunciation dictionary data base, an exception grapheme pronunciation dictionary data base, and a phoneme output multi-layer perceptron parameter data base for respective graphemes.

Type: Grant

Filed: October 7, 1999

Date of Patent: May 22, 2001

Assignee: Electronic Telecommunications Research Institute

Inventors: Hoi-Rin Kim, Young-Jik Lee
Method and apparatus for signal classification using a multilayer network

Patent number: 6208963

Abstract: A method and apparatus for signal classification using a multilayer temporal relaxation network involves receiving an input signal feature vector, classifying a first signal feature, and classifying a second signal feature using contextual information. The multilayer temporal relaxation network applies a relaxation process that updates an activation value of a node in a first layer and updates an activation value of a node in a second layer. The multilayer network then generates a signal classification according to an activation value of a node in the multilayer network.

Type: Grant

Filed: June 24, 1998

Date of Patent: March 27, 2001

Inventors: Tony R. Martinez, R. Brian Moncur, D. Lynn Shepherd, Randall J. Parr, D. Randall Wilson, Carl Hal Hansen
Multiresolutional classifier with training system and method

Patent number: 6192353

Abstract: An improved method and system for training and classifying using a low complexity and high accuracy multiresolutional polynomial classifier (412) is presented. A method of training an multiresolutional polynomial classifier which reduces the complexity of existing classifiers allows models representing subgroups of classes to easily be created. The models which represent subgroups of classes are applied to an unidentified input to produce a coarse classification of the unidentified input using a low order classifier. Once a coarse classification of the unidentified input is performed, a more detailed classification is performed using another low complexity classifier.

Type: Grant

Filed: February 9, 1998

Date of Patent: February 20, 2001

Assignee: Motorola, Inc.

Inventors: Khaled Assaleh, William Michael Campbell, John Eric Kleider
Method of and a device for speech recognition employing neural network and markov model recognition techniques

Patent number: 6185528

Abstract: A method and a device for recognition of isolated words in large vocabularies are described, wherein recognition is performed through two sequential steps using neural networks and Markov models techniques, respectively, and the results of both techniques are adequately combined so as to improve recognition accuracy. The devices performing the combination also provide an evaluation of recognition reliability.

Type: Grant

Filed: April 29, 1999

Date of Patent: February 6, 2001

Assignee: CSELT - Centro Studi e Laboratori Telecomunicazioni S.p.A.

Inventors: Luciano Fissore, Roberto Gemello, Franco Ravera
Method, device and system for noise-tolerant language understanding

Patent number: 6178398

Abstract: A method (900), device (200) and system (100) provide, in response to text/linguistic input, one of a set of pre-determined meanings which is the most likely intended meaning of that input. A trained meaning discriminator is generated from an annotated training corpus and a meaning discriminator trainer. The trained meaning discriminator generates a meaning vector from an input utterance. The intended meaning encoder analyzes the meaning vector to determine the most likely intended meaning and confidence measures.

Type: Grant

Filed: November 18, 1997

Date of Patent: January 23, 2001

Assignee: Motorola, Inc.

Inventors: Richard John Peterson, Dale William Russell, Orhan Karaali, Harry Martin Bliss
Signal verification using signal processing arrangement for time varying band limited input signal

Patent number: 6175818

Abstract: A signal processing arrangement for a band-limited input signal, comprising a plurality N of signal comparators. Each signal comparator is adapted to compare the input signal with a plurality of different exemplar signals and to generate an output indicative of which of the exemplar signals corresponds most closely to the input signal. Each of the exemplar signals is arbitrarily derived independent of any expected input signal. The arrangement provides an N-part output signal which is indicative of the input signal, such that each part of the N-part output signal is derived from the output signal of a respective one of said N signal comparators.

Type: Grant

Filed: December 11, 1998

Date of Patent: January 16, 2001

Assignee: Domain Dynamics Limited

Inventor: Reginald Alfred King
Recognition apparatus using neural network, and learning method therefor

Patent number: 6151592

Abstract: A recognition apparatus and method using a neural network is provided. A neuron-like element stores a value of its inner condition. The neuron-like element also updates a values of its internal status on the basis of an output from the neuron-like element itself, outputs from other neuron-like elements and an external input, and an output value generator a value of its internal status into an external output. Accordingly, the neuron-like element itself can retain the history of input data. This enables the time series data, such as speech, to be processed without providing any special devices in the neural network.

Type: Grant

Filed: January 20, 1998

Date of Patent: November 21, 2000

Assignee: Seiko Epson Corporation

Inventor: Mitsuhiro Inazumi
Pattern classifier with training system and methods of operation therefor

Patent number: 6131089

Abstract: Classifiers (110) and a comparator (112) perform an identification method (400) to identify a class as one of a predetermined set of classes. The identification method is based on determining the observation costs associated with the unidentified class. The identification method includes combining models representing the predetermined set of classes and the unidentified vectors representing the class. The predetermined class associated with the largest observation cost is identified as the class. Additionally, a unique, low-complexity training method (300) includes creating the models which represent the predetermined set of classes.

Type: Grant

Filed: May 4, 1998

Date of Patent: October 10, 2000

Assignee: Motorola, Inc.

Inventors: William Michael Campbell, Bruce Alan Fette
Method and apparatus for discriminative utterance verification using multiple confidence measures

Patent number: 6125345

Abstract: A multiple confidence measures subsystem of an automated speech recognition system allows otherwise independent confidence measures to be integrated and used for both training and testing on a consistent basis. Speech to be recognized is input to a speech recognizer and a recognition verifier of the multiple confidence measures subsystem. The speech recognizer generates one or more confidence measures. The speech recognizer preferably generates a misclassification error (MCE) distance as one of the confidence measures. The recognized speech output by the speech recognizer is input to the recognition verifier, which outputs one or more confidence measures. The recognition verifier preferably outputs a misverification error (MVE) distance as one of the confidence measures. The confidence measures output by the speech recognizer and the recognition verifier are normalized and then input to an integrator.

Type: Grant

Filed: September 19, 1997

Date of Patent: September 26, 2000

Assignee: AT&T Corporation

Inventors: Piyush C. Modi, Mazin G. Rahim
Training process for the classification of a perceptual signal

Patent number: 6119083

Abstract: Training apparatus and method for establishing the network definition function of a trainable processing apparatus for analyzing a signal, includes providing a training sequence having a first signal and a distorted version of the first signal, receiving the training sequence and generating a distortion perception measure for indicating the extent to which the distortion would be perceptible to a human observer, and applying the distortion perception measure to the trainable processing apparatus to determine the network definition function.

Type: Grant

Filed: March 19, 1998

Date of Patent: September 12, 2000

Assignee: British Telecommunications public limited company

Inventors: Michael P Hollier, Philip Gray

prev … 5 6 7 8 9 10 next