Patents Examined by Donald L. Storm
  • Patent number: 6587816
    Abstract: A method for estimating a pitch frequency of an audio signal includes computing a first transform of the signal to a frequency domain over a first time interval, and computing a second transform of the signal to the frequency domain over a second time interval, which contains the first time interval. A line spectrum of the signal is found, based on the first and second transforms, the spectrum including spectral lines having respective line amplitudes and line frequencies. A utility function that is periodic in the frequencies of the lines in the spectrum is then computed. This function is indicative, for each candidate pitch frequency in a given pitch frequency range, of a compatibility of the spectrum with the candidate pitch frequency. The pitch frequency of the speech signal is estimated responsive to the utility function.
    Type: Grant
    Filed: July 14, 2000
    Date of Patent: July 1, 2003
    Assignee: International Business Machines Corporation
    Inventors: Dan Chazan, Meir Zibulski, Ron Hoory
  • Patent number: 6584438
    Abstract: A frame erasure compensation method in a variable-rate speech coder includes quantizing, with a first encoder, a pitch lag value for a current frame and a first delta pitch lag value equal to the difference between the pitch lag value for the current frame and the pitch lag value for the previous frame. A second, predictive encoder quantizes only a second delta pitch lag value for the previous frame (equal to the difference between the pitch lag value for the previous frame and the pitch lag value for the frame prior to that frame). If the frame prior to the previous frame is processed as a frame erasure, the pitch lag value for the previous frame is obtained by subtracting the first delta pitch lag value from the pitch lag value for the current frame. The pitch lag value for the erasure frame is then obtained by subtracting the second delta pitch lag value from the pitch lag value for the previous frame.
    Type: Grant
    Filed: April 24, 2000
    Date of Patent: June 24, 2003
    Assignee: Qualcomm Incorporated
    Inventors: Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy
  • Patent number: 6571211
    Abstract: A portable audio recorder stores voice information files in the form of digital data. Header data is associated with each voice information file. The header data includes the name of the person who dictated the voice information file and the serial number of the recorder unit itself. This information may be used in connection with management of the voice information files in the recorder or after the files are transferred to another device such as a personal computer. The header data may also include the serial number of the recorder and a work facility location to which the file pertains. The user of the recorder may be permitted to select a format for the header data from among a plurality of pre-stored formats. Specific transaction data may be downloaded to the recorder, and used to build a header for a voice file, which is then uploaded with the header to another device.
    Type: Grant
    Filed: November 12, 1998
    Date of Patent: May 27, 2003
    Assignee: Dictaphone Corporation
    Inventors: John J. Dwyer, David K. Godin, Stephen Rothschild, John J. Pawlowski
  • Patent number: 6564185
    Abstract: The invention relates to a method and apparatus for recognition processing of continuous words of a group which is structured by a plurality of words such that a recognition result of all of the words which structures the continuous words is effectively and accurately confirmed. All of the continuous words which have been input are recognition processed, the recognition result of all of the continuous words is output, a response from a speaker showing an affirmative/negative recognition result is input and recognition processed. If affirmative is determined, the recognition result at that time is confirmed for all of the continuous words. If negative is determined, for each word from a first to an nth (third in this case) which structures continuous words, the content showing affirmative/negative from the speaker is recognized, affirmative or negative is determined, and the recognition result at that time is confirmed as a recognition processing target word.
    Type: Grant
    Filed: August 10, 1999
    Date of Patent: May 13, 2003
    Assignee: Seiko Epson Corporation
    Inventors: Yasunaga Miyazawa, Mitsuhiro Inazumi, Hiroshi Hasegawa, Masahisa Ikejiri
  • Patent number: 6542864
    Abstract: An apparatus and method for data processing that improves estimation of spectral parameters of speech data and reduces algorithmic delay in a data coding operation. Estimation of spectral parameters is improved by adaptively adjusting a gain function used to enhance data based on whether the data contains information speech and noise or noise only. Delay is reduced by extracting coding parameters using incompletely processed data. This data is formed by multiplying a less current portion of an input data frame with a synthesis window and a more current portion of the data frame with an inverse analysis window, and performing an overlap-add process on the data frame and a similarly processed previous data frame.
    Type: Grant
    Filed: October 2, 2001
    Date of Patent: April 1, 2003
    Assignee: AT&T Corp.
    Inventors: Richard Vandervoort Cox, Rainer Martin
  • Patent number: 6539355
    Abstract: A bandwidth expanding method and apparatus in which frequency characteristics of high-frequency components of broad band signals can be adjusted to the liking of the user, overflow due to addition is prevented from occurring without power variations being perceived by a user, the number of broad band formants is reduced, and emphasis is attached to the rough structure of the spectrum, so that the produced broad band speech signals can be improved in quality. To this end, in a speech bandwidth expansion device, frequency characteristics of the frequency components not less than 3400 Hz are adjusted by preset alterable parameter values and summed to the original narrow band speech components. If overflow has occurred in a sample, the high-range gain of the sample is lowered to a level below the overflow level before proceeding to addition.
    Type: Grant
    Filed: October 14, 1999
    Date of Patent: March 25, 2003
    Assignee: Sony Corporation
    Inventors: Shiro Omori, Masayuki Nishiguchi
  • Patent number: 6535603
    Abstract: An improved thermal design for passively cooled telecommunication repeater housings for use with wire transmission in the local loop outside plant is achieved by replacing the known convection based heat transfer designs with a design based on solid thermal conduction. A thermal chassis includes thermal collection, transfer and distribution members that collect the repeater modules' waste heat through respective thermal interfaces, transfer the waste heat along respective thermal conduction paths to the environmental enclosure, and then distribute the waste heat over a substantial portion of the enclosure's available surface area to form an enlarged thermal interface for convectively transferring the waste heat to the ambient air. Heat transfer is further improved by expanding the enclosure's external surface area and fabricating the distribution members so that they are in permanent and intimate thermal contact with the enclosure's expanded surface area.
    Type: Grant
    Filed: February 8, 2001
    Date of Patent: March 18, 2003
    Assignee: Anacapa Technology, Inc.
    Inventor: Erich K. Laetsch
  • Patent number: 6535854
    Abstract: Home networks low-cost digital interfaces are introduced that integrate entertainment, communication and computing electronics into consumer multimedia. Normally, these are low-cost, easy to use systems, since they allow the user to remove or add any kind of network devices with the bus being active. To improve the user interface a speech unit (2) is proposed that enables all devices (11) connected to the bus system (31) to be controlled by a single speech recognition device. The properties of this device, e.g. the vocabulary can be dynamically and actively extended by the consumer devices (11) connected to the bus system (31). The proposed technology is independent from a specific bus standard, e.g. the IEEE 1394 standard, and is well-suited for all kinds of wired wireless home networks. The speech unit (2) receives data and messages from the device. The speech unit (2) recognizes speaker-dependent commands. A Speech synthesizer synthesizes messages.
    Type: Grant
    Filed: October 19, 1998
    Date of Patent: March 18, 2003
    Assignee: Sony International (Europe) GmbH
    Inventors: Peter Buchner, Silke Goronzy, Ralf Kompe, Stefan Rapp
  • Patent number: 6519559
    Abstract: A signal processing unit is disclosed for selectively routing an unfiltered input signal and a noise reduced version of the unfiltered input signal to an output port in response to a noise power estimate. Routing the unfiltered input signal to the output port when the noise power estimate is less than a noise floor threshold avoids degrading the information content of an input signal having a power level close to the noise floor. A first attenuation factor and a second attenuation factor can be applied to the unfiltered input signal. A method is disclosed for parsing a signal into a plurality of frames, selecting a maximum value for each frame, and averaging the maximum values to form a noise floor threshold.
    Type: Grant
    Filed: July 29, 1999
    Date of Patent: February 11, 2003
    Assignee: Intel Corporation
    Inventor: Sudheer Sirivara
  • Patent number: 6510223
    Abstract: An improved service access and heat transfer design for passively cooled telecommunication repeater housings for use with wire transmission in the local loop outside plant is achieved by a cover, sealable to the housing's sidewall, removable to provide field replaceable, plug-in access to the repeater modules and voltage protector assemblies wherein the voltage protector assemblies can be installed and removed without first removing the repeater modules protected by those voltage protector assemblies and by replacing the known convection based heat transfer designs with a design based on solid thermal conduction. Thermal sleeves, which mount the repeater modules, collect the repeater modules' waste heat through thermal interfaces, transfer the waste heat along thermal conduction paths to the housing's sidewalls, and then distribute the waste heat over a substantial portion of the housing's sidewalls.
    Type: Grant
    Filed: September 14, 2001
    Date of Patent: January 21, 2003
    Assignee: Anacapa Technology, Inc.
    Inventor: Erich K. Laetsch
  • Patent number: 6502066
    Abstract: Formants, corresponding to input speech units based either on a known text or the results of a speech recognition procedure, are generated from a formant synthesizer. A frequency response is generated based on the synthesized formants. A second frequency response is generated based on a speech signal which is received and which corresponds to utterances of speech units. The synthesized formants are modified based on a comparison of the frequency response corresponding to the synthesized formants and specific proportional characteristics of a frequency response of the input speech signal. In one illustrative embodiment, the comparison is then recalculated and further modifications are made accordingly to improve accuracy. In one illustrative embodiment, time aligning and frequency warping are utilized as modification functions.
    Type: Grant
    Filed: April 2, 2001
    Date of Patent: December 31, 2002
    Assignee: Microsoft Corporation
    Inventor: Michael D. Plumpe
  • Patent number: 6496795
    Abstract: The present invention is embodied in a system and method for performing spectral analysis of a digital signal having a discrete duration by spectrally decomposing the digital signal at predefined frequencies uniformly distributed over a sampling frequency interval into complex frequency coefficients so that magnitude and phase information at each frequency is immediately available to produce a modulated complex lapped transform (MCLT). The present invention includes a MCLT processor, an acoustic echo cancellation device and a noise reducer integrated with an encoder/decoder device.
    Type: Grant
    Filed: May 5, 1999
    Date of Patent: December 17, 2002
    Assignee: Microsoft Corporation
    Inventor: Henrique S. Malvar
  • Patent number: 6496797
    Abstract: An apparatus and method for speech compression includes dividing the speech spectrum into a plurality of frames, assigning frame classifications to the plurality of frames, and determining the speech modeling parameters based on the assigned frame classification. The voiced part of the speech spectrum and the unvoiced part of the speech spectrum are synthesized separately using an Analysis by Synthesis allowing a correct correspondence between voiced and unvoiced parts of the reconstructed signal. Particularly, a frequency response of a special simulated signal based on the previous and current frames is used as an approximating function. The simulated signal is synthesized at the encoder side in the way it will be generated at the decoder side. Also, a better of two encoding methods is selected to encode the spectral magnitudes. A wavelet encoder and an inter-frame predictive encoder illustrate the invention's efficient, yet accurate reconstruction of synthesized digital speech.
    Type: Grant
    Filed: April 1, 1999
    Date of Patent: December 17, 2002
    Assignee: LG Electronics Inc.
    Inventors: Victor V. Redkov, Anatoli I. Tikhotski, Alexandr L. Maiboroda, Eugene V. Djourinski
  • Patent number: 6496798
    Abstract: A system controller (106) includes a speech encoder (107) that encodes a low bit rate digital voice message. The speech encoder sets values of words of a header of the encoded message. The values of the words define a quantity of frames in the voice message, N, and define a vocoder rate used for the encoded message. The speech encoder sets a state of each indicator in each frame status field of N frame status fields that are transmitted after the header of the encoded message. The speech encoder assembles N frame data fields, wherein each of the frame data fields comprises a set of data words. The N frame data fields follow the N frame status fields. Each set of data words conforms to at least one of the vocoder rate and the states of the indicators. A decoder (3310) decodes the encoded low bit rate digital message.
    Type: Grant
    Filed: September 30, 1999
    Date of Patent: December 17, 2002
    Assignee: Motorola, Inc.
    Inventors: Jian-Cheng Huang, Floyd Simpson, Sunil Satyamurti, Oleg Andric, Kenneth Finlon
  • Patent number: 6493667
    Abstract: In order to achieve low error rates in a speech recognition system, for example, in a system employing rank-based decoding, we discriminate the most confusable incorrect leaves from the correct leaf by lowering their ranks. That is, we increase the likelihood of the correct leaf of a frame, while decreasing the likelihoods of the confusable leaves. In order to do this, we use the auxiliary information from the prediction of the neighboring frames to augment the likelihood computation of the current frame. We then use the residual errors in the predictions of neighboring frames to discriminate between the correct (best) and incorrect leaves of a given frame. We present a new methodology that incorporates prediction error likelihoods into the overall likelihood computation to improve the rank position of the correct leaf.
    Type: Grant
    Filed: August 5, 1999
    Date of Patent: December 10, 2002
    Assignee: International Business Machines Corporation
    Inventors: Peter V. de Souza, Yuqing Gao, Michael Picheny, Bhuvana Ramabhadran
  • Patent number: 6490556
    Abstract: A half duplex switching device includes an input connection for receiving an input audio signal, and classification module coupled to the input connection. The classification module provides an output which indicates a classification of the input signal based upon a density of the input audio signal, an energy level of the input audio signal, and classification data provided with the input audio signal. A switching device is coupled to the classification module and determines if the received input audio signal contains speech signals based upon the output of the classification module. The communication receiving device can be used in both communication systems which provide continuous speech signals, and communication systems which remove silence and only provide speech signals.
    Type: Grant
    Filed: May 28, 1999
    Date of Patent: December 3, 2002
    Assignee: Intel Corporation
    Inventors: David L. Graumann, Claudia M. Henry
  • Patent number: 6490550
    Abstract: A system and method for IP-based telephone communication utilizing speech-generated text is disclosed. An embodiment of the present invention includes an interface to the Internet for sending and receiving voice and video signals. In addition, the embodiment generates text signals corresponding to voice signals generated by the user. A transmission signal is generated from the text signals and the voice signals for transmission over the Internet. Further, the embodiment of the present invention includes an application which is capable of receiving video and/or speech-generated data transmitted by another device, and concurrently displaying the speech-generated data and video to a user. The speech-generated data may be converted to an audio signal and applied to a speaker concurrently with the video and speech-generated data. In this way, a user is capable of easily communicating speech-generated information to another user during periods of voice signal loss.
    Type: Grant
    Filed: November 30, 1998
    Date of Patent: December 3, 2002
    Assignee: Ericsson Inc.
    Inventor: Farzad Hiri
  • Patent number: 6487530
    Abstract: A system and method for speech recognition includes a speaker-independent set of stored word representations derived from speech of many users deemed to be typical speakers and for use by all users, and may further include speaker-dependent sets of stored word representations specific to each user. Utterances from a user which match stored words in either set according to the ordering rules are reported as words.
    Type: Grant
    Filed: March 30, 1999
    Date of Patent: November 26, 2002
    Assignee: Nortel Networks Limited
    Inventors: Lin Lin, Ping Lin
  • Patent number: 6484137
    Abstract: An audio reproducing apparatus comprises: audio decoding means for decoding an input audio signal frame by frame; data expanding/compressing means for subjecting data in a decoded frame to time-scale modification process; a frame sequence table which contains a sequence determined according to a given speed rate in which respective frames are expanded/compressed; frame counting means for counting the number of frames of the input audio signal; and data expansion/compression control means for instructing the dalta expanding/compressing means to subject the frame to one of time-scale compression process, time-scale expansion process, and process without time-scale modification process, with reference to the frame sequence table based on a count value output from the frame counting means, the data expanding/compressing means subjecting the audio signal to time-scale modification process in accordance with an instruction signal from the data expansion/compression control means.
    Type: Grant
    Filed: October 29, 1998
    Date of Patent: November 19, 2002
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Hirotsugu Taniguchi, Masayuki Misaki, Junichi Tagawa, Michio Matsumoto
  • Patent number: 6480604
    Abstract: A balanced spectrum limiter for limiting the frequency spectrum of signals transmitted through a telephone or communication system includes one or more filters and an energy surge protection circuit. Each filter includes a pair of inductors wound around a single core and a capacitor connected between the two inductors. The energy surge protection circuit can be any type of energy surge protection circuit known in the art. The range of frequencies which can be transmitted is determined by the filter. The filter is removably connected to the energy surge protection circuit so that filters of varying parameters may be easily interchanged.
    Type: Grant
    Filed: October 1, 1999
    Date of Patent: November 12, 2002
    Assignee: Porta Systems Corporation
    Inventor: Prem G. Chandran