Patents Examined by Robert Louis Sax
  • Patent number: 5956463
    Abstract: The invention relates to an automated system for monitoring wildlife auditory data and recording same for subsequent analysis and identification. The system comprises one or more microphones coupled to a recording apparatus for recording wildlife vocalizations in digital format. The resultant recorded data is preprocessed, segmented, and analyzed by means of a neural network to identify the respective species. The system minimizes the need for human intervention and subjective interpretation of the recorded sounds.
    Type: Grant
    Filed: October 7, 1996
    Date of Patent: September 21, 1999
    Assignee: Ontario Hydro
    Inventors: Paul H. Patrick, Narayan Ramani, William G. Hanson, Ronald W. Sheehan, Robert L. Jennette
  • Patent number: 5950162
    Abstract: The present invention teaches a method (400), device and system (300) utilizing at least one of: mapping a sequence of phones to a sequence of articulatory features and utilizing prominence and boundary information, in addition to a predetermined set of rules for type, phonetic context, syntactic and prosodic context for phones to provide provide a system that generates segment durations efficiently with a small training set.
    Type: Grant
    Filed: October 30, 1996
    Date of Patent: September 7, 1999
    Assignee: Motorola, Inc.
    Inventors: Gerald Corrigan, Orhan Karaali, Noel Massey
  • Patent number: 5950154
    Abstract: A noise filter technique estimates noise in speech that has been processed by Call Multiplication Equipment. The received signal has speech frames and interspersed fill-noise frames inserted at a satellite signal receiving station. The filtering technique removes the fill-noise from the signal. The remaining speech frames are analyzed such that the speech frames having the lowest power values are used to create a histogram of power/frequency. This histogram contains information from which the noise-in-speech power spectrum is derived.
    Type: Grant
    Filed: July 15, 1996
    Date of Patent: September 7, 1999
    Assignee: AT&T Corp.
    Inventors: Raymond Stephen Medaugh, Ronald Shaya
  • Patent number: 5946654
    Abstract: A speech model is produced for use in determining whether a speaker associated with the speech model produced an unidentified speech sample. First a sample of speech of a particular speaker is obtained. Next, the contents of the sample of speech are identified using speech recognition. Finally, a speech model associated with the particular speaker is produced using the sample of speech and the identified contents thereof. The speech model is produced without using an external mechanism to monitor the accuracy with which the contents were identified.
    Type: Grant
    Filed: February 21, 1997
    Date of Patent: August 31, 1999
    Assignee: Dragon Systems, Inc.
    Inventors: Michael Jack Newman, Laurence S. Gillick, Yoshiko Ito
  • Patent number: 5943649
    Abstract: A method for configuring an audio interface for a speech recognition application in a computer system comprises the steps of: (a) identifying at least one sound card present in the computer system capable of supporting speech; (b) displaying a at least one graphical user interface (GUI) prompting a user selection and connection of a microphone as an audio input device to the at least one sound card; (c) in the event the selected microphone is of an electret type and the selected sound card is of a type having a high gain range, displaying a second GUI prompting the user to connect a signal strength attenuator in series with the microphone, the microphone and the attenuator together having an operating characteristic corresponding to a dynamic microphone; (d) in the event the selected microphone is of a dynamic type, omitting step (c); and, (e) testing the connected microphone without regard to whether the connected microphone is of the electret or dynamic type.
    Type: Grant
    Filed: October 29, 1997
    Date of Patent: August 24, 1999
    Assignee: International Business Machines Corporation
    Inventors: Frank Fado, Peter Guasti, Amado Nassiff, Ronald VanBuskirk
  • Patent number: 5940793
    Abstract: A method and apparatus accesses a database where entries are linked to at least two sets of patterns. One or more patterns of a first set of patterns are recognized within a received signal. The recognized patterns are used to identify entries and compile a list of patterns in a second set of patterns to which those entries are also linked. The list is then used to recognize a second received signal. The received signals may, for example, be voice signals or signals indicating the origin or destination of the received signals.
    Type: Grant
    Filed: April 21, 1997
    Date of Patent: August 17, 1999
    Assignee: British Telecommunications public limited company
    Inventors: David J. Attwater, Steven J. Whittaker, Francis J. Scahill, Alison D. Simons
  • Patent number: 5937385
    Abstract: An automated system generates and revises grammars for speech recognizers in a speech recognition system. Given an initial grammar, expressed in terms of non-terminals in Backus-Naur Form (BNF) notation, a sentence generator generates a list of all sentences accepted by the grammar. From this list, a corpus of inappropriate or irrelevant sentences which are accepted by the grammar (counter-examples) is identified. A grammar revisor program uses the original grammar and the list of counter examples, to generate a pruned list from which a revised grammar is generated. The revision process is iterated several times either concatenating or merging pairs of non-terminals until the revised grammar is deemed satisfactory in that it accepts as legal only relevant sentences. The revised grammar is used by the speech recognizer, thus reducing errors in the overall system.
    Type: Grant
    Filed: October 20, 1997
    Date of Patent: August 10, 1999
    Assignee: International Business Machines Corporation
    Inventors: Wlodek Wlodzimierz Zadrozny, Nandakishore Kambhatla
  • Patent number: 5924061
    Abstract: A low-complexity method and apparatus for performing signal decomposition in a low bit-rate WI speech encoder. A time-ordered sequence of sets of time-domain parameters is generated based on samples of a speech signal to be coded, each set of time-domain parameters corresponding to a waveform characterizing the speech signal. A cross correlation is then performed between two or more of said sets of time-domain parameters to produce a set of signals which represents relatively high rates of evolution of characterizing waveform shape across the time-ordered sequence of sets. Finally, the speech signal is coded based on the produced set of signals. A set of signals which represents relatively low rates of evolution of characterizing waveform shape across the time-ordered sequence of sets may also be produced.
    Type: Grant
    Filed: March 10, 1997
    Date of Patent: July 13, 1999
    Assignee: Lucent Technologies Inc.
    Inventor: Yair Shoham
  • Patent number: 5924063
    Abstract: A speech signal encoder includes a speech analyzer for determining short-term prediction codes at a predetermined time interval. The prediction codes indicate frequency characteristics of a speech signal. A reverse filter is provided for calculating residual signals of first synthesis filter. The residual signals are defined by the short-term prediction codes. A residual code book stores past residual signals. Further, a plurality of delay codes, each of which represents pitch correlation of the speech signal, are predetermined. A vector generator issues, using the residual code book, delay residual vectors each of which corresponds to the delay code. A filter is provided for generating a synthesis signal using second synthesis filter which receives the delay residual vectors and which is defined by the short-term prediction codes. A distance between the speech signal and the synthesis signal is calculated. Subsequently, a pitch path estimator estimates a pitch path which varies smoothly.
    Type: Grant
    Filed: December 27, 1995
    Date of Patent: July 13, 1999
    Assignee: NEC Corporation
    Inventors: Keiichi Funaki, Kazunori Ozawa
  • Patent number: 5913192
    Abstract: A speaker identification system includes a speaker-independent phrase recognizer. The speaker-independent phrase recognizer scores a password utterance against all the sets of phonetic transcriptions in a lexicon database to determine the N best speaker-independent scores, determines the N best sets of phonetic transcriptions based on the N best speaker-independent scores, and determines the N best possible identities. A speaker-dependent phrase recognizer retrieves the hidden Markov model corresponding to each of the N best possible identities, and scores the password utterance against each of the N hidden Markov models to generate a speaker-dependent score for each of the N best possible identities. A score processor coupled to the outputs of the speaker-independent phrase recognizer and the speaker-dependent phrase recognizer determines a putative identity. A verifier coupled to the score processor authenticates the determined putative identity.
    Type: Grant
    Filed: August 22, 1997
    Date of Patent: June 15, 1999
    Assignee: AT&T Corp
    Inventors: Sarangarajan Parthasarathy, Aaron Edward Rosenberg
  • Patent number: 5909667
    Abstract: A computer voice operated dictations system having a text editing mode provided for editing dictated text. The text editing mode navigates a user through the dictated text by highlighting a correction window comprised of a plurality of words comprising a portion of the text. The correction window is comprised of "m" words and the value of m is preferably determined, based upon the voice recognition capabilities of a particular voice recognition engine associated with the dictation system. The greater the ability of the speech recognition engine to recognize and distinguish spoken words, the larger the value of m. A correction vocabulary is defined which consists of the plurality of words which are contained within the correction window and a plurality of correction commands. The voice recognition computer dictation system is configured for processing a spoken utterance which corresponds to an error word contained within the plurality of words highlighted in the correction window is processed.
    Type: Grant
    Filed: March 5, 1997
    Date of Patent: June 1, 1999
    Assignee: International Business Machines Corporation
    Inventors: Kyriakos Leontiades, Kris Coe
  • Patent number: 5907825
    Abstract: A method for determining the location of a pattern, when input in isolation, within a representative input signal is provided. The method aligns the input signal with a signal representative of a plurality of connected patterns, one of which is the same as the pattern within the input signal. The method then determines the location from the results of the aligning step. The location determined using this apparatus can be used to determine an isolated reference model by extracting features of the input signal from the location found. This isolated reference model can then be used to generate a continuous reference model for the pattern, by aligning the isolated reference model with the signals representative of a plurality of connected patterns, one of which is the pattern to be modelled.
    Type: Grant
    Filed: February 6, 1997
    Date of Patent: May 25, 1999
    Assignee: Canon Kabushiki Kaisha
    Inventor: Eli Tzirkel-Hancock
  • Patent number: 5903866
    Abstract: A low-complexity method and apparatus for performing waveform interpolation in a low bit-rate WI speech decoder, wherein interpolation between received waveforms is performed with use of spline coefficients generated based thereupon. Specifically, two signals are received from a WI encoder, each comprising a set of frequency domain parameters representing a speech signal segment of a corresponding pitch period. Then, spline coefficients are generated from each of the received signals, wherein each set of spline coefficients comprises a spline representation of a time domain transformation of the corresponding set of frequency domain parameters. Finally, the decoder interpolates between the spline representations to generate interpolated time domain data which is used to synthesize a reconstructed speech signal. In certain embodiments of the present invention, the time scale of at least one of the spline representations is modified to enable the interpolation therebetween.
    Type: Grant
    Filed: March 10, 1997
    Date of Patent: May 11, 1999
    Assignee: Lucent Technologies Inc.
    Inventor: Yair Shoham
  • Patent number: 5903871
    Abstract: A voice recording and/or reproducing apparatus has a memory for storing information unique to a user and/or information unique to the apparatus. When recording a voice file, the unique information is recorded in index information. Preferably, a unique file number is also recorded. In order that a voice file can be recognized by a personal computer or other information apparatus, a predetermined identifier consisting of one to three characters is added to the name of the voice file. When reproducing, a voice file having a name including the predetermined identifier consisting of one to three characters is selected and reproduced.
    Type: Grant
    Filed: April 15, 1997
    Date of Patent: May 11, 1999
    Assignee: Olympus Optical Co., Ltd.
    Inventors: Nobuo Terui, Masatoshi Tobinai
  • Patent number: 5899971
    Abstract: Speech recognition with a speaker-independent speech recognition method is possible for voice signals newly added by a user using the computer unit. Imaging of the voice signal onto stored phoneme feature vectors is implemented for each voice signal to be newly incorporated. A number of possibilities of imaging are compared to one another and the best phoneme representation of the imaging is employed as phoneme presentation for the new word. The voice signal is then stored in a user memory with the phoneme presentation.
    Type: Grant
    Filed: March 19, 1997
    Date of Patent: May 4, 1999
    Assignee: Siemens Aktiengesellschaft
    Inventor: Luc De Vos
  • Patent number: 5897618
    Abstract: A voice command interface is implemented in a data processing system whereby a user may select any application which is currently running on the data processing system without having to manually modify a list of programs which may be accessed by a voice command and without having to request the program by its full name. To allow a user to access a program which is currently running on the data processing system, control within the data processing system ensures that a name of the program or application which is opened by an external user is added to a window list automatically. Similarly, a program which is no longer enabled by a user and running on the data processing system will be automatically removed from the window list. Additionally, the data processing system and methodology implemented therein allow an external user to speak merely a portion of a title of an application to access the application using a voice command.
    Type: Grant
    Filed: March 10, 1997
    Date of Patent: April 27, 1999
    Assignee: International Business Machines Corporation
    Inventors: Philip Albert Loats, Xiaotong Wang
  • Patent number: 5897617
    Abstract: Diphones are prepared for text-to-speech converting by selectively pronouncing a set of selected diphones and processing each such diphone for persistent storage. Finally, each processed diphone is stored in an individually addressable manner. In particular, amongst such set as spoken by a single person, on a basis of homophony each diphone is assigned to one or more diverse languages. Sharing of selective diphones amongst more than one language diminishes required storage. The storage may entail language-specific processing qualifiers.
    Type: Grant
    Filed: August 14, 1996
    Date of Patent: April 27, 1999
    Assignee: U.S. Philips Corporation
    Inventor: Rene P. G. Collier
  • Patent number: 5893063
    Abstract: A voice command interface is implemented in a data processing system whereby a user may select any application which is currently running on the data processing system without having to manually modify a list of programs which may be accessed by a voice command and without having to request the program by its full name. To allow a user to access a program which is currently running on the data processing system, control within the data processing system ensures that a name of the program or application which is opened by an external user is added to a window list automatically. Similarly, a program which is no longer enabled by a user and running on the data processing system will be automatically removed from the window list. Additionally, the data processing system and methodology implemented therein allow an external user to speak merely a portion of a title of an application to access the application using a voice command.
    Type: Grant
    Filed: March 10, 1997
    Date of Patent: April 6, 1999
    Assignee: International Business Machines Corporation
    Inventors: Philip Albert Loats, William Joseph Tracey, II, Xiaotong Wang
  • Patent number: 5890113
    Abstract: An analyzing unit 1 converts an input speech into a feature vector time series. A reference pattern storing unit 3 stores the feature vector time series obtained by the same manner as in the analyzing unit. A matching unit 2 correlates for time axis the input speech feature vector time series and the reference patterns to one another. An environmental adapting unit 4 performs the environmental adaptation between the input speech feature vector time series and the reference patterns according to the result of matching in the matching unit 2. A speaker adapting unit 6 performs the adaptation concerning the speaker between the environmentally adapted reference patterns from the environmental adapting unit 4 and the input speech feature vector time series.
    Type: Grant
    Filed: December 13, 1996
    Date of Patent: March 30, 1999
    Assignee: NEC Corporation
    Inventor: Keizaburo Takagi
  • Patent number: 5890115
    Abstract: A wavetable speech synthesis apparatus includes a wavetable memory for defining a plurality of primitive speech sounds. The primitive speech elements are individually assigned to a memory cell designated by an instrument identification in the wavetable memory. Various primitive speech elements are defined and selected from among sound bites, entire words and phrases, frequently-occurring syllables, phonemes or smaller atomic speech elements. The primitive speech elements generate primitive sounds that are played back at a selected pitch, duration, attack velocity and envelope, sustain, and decay velocity and envelope. Various types of speaker qualities or identities are assigned to different frequency ranges of the speech elements. The wavetable memory includes a speech sample database and a speech reference database. The speech sample database supplies speech signals that are processed by the wavetable synthesizer according to information contained in the speech reference database.
    Type: Grant
    Filed: March 7, 1997
    Date of Patent: March 30, 1999
    Assignee: Advanced Micro Devices, Inc.
    Inventor: Terry Lynn Cole