Patents Examined by W. R. Young
  • Patent number: 7031916
    Abstract: A method of initializing an ITU Recommendation G.729 Annex B voice activity detection (VAD) device is disclosed, having the steps of (1) extracting a set of parameters from a signal that characterize the signal; (2) calculating an energy measure of the signal from the set of parameters; (3) comparing the energy measure with a reference value; (4) determining an initial value for an average of a noise characteristic of the signal; and (5) counting the number of times the energy measure equals or exceeds the reference level. Also disclosed is a method of converging an ITU Recommendation G.
    Type: Grant
    Filed: June 1, 2001
    Date of Patent: April 18, 2006
    Assignee: Texas Instruments Incorporated
    Inventors: Dunling Li, Daniel C. Thomas, Gokhan Sisli
  • Patent number: 7031911
    Abstract: A method and computer-readable medium are provided that construct a collocation mistake pattern database for use in writing in a first language by a person whose native language is a second language. The method includes obtaining a bilingual corpus having sentences in first and second languages and extracting second language word pairs from the second language sentences in the corpus. For each second language word pair extracted from the corpus, a corresponding first language word pair is extracted from the corresponding first language sentence in the corpus to determine a correct first language translation for the second language word pair. Also, for each second language word pair extracted from the corpus, a set of combinations of first language translation words corresponding to the second language word pair is created.
    Type: Grant
    Filed: June 28, 2002
    Date of Patent: April 18, 2006
    Assignee: Microsoft Corporation
    Inventors: Ming Zhou, Ting Liu
  • Patent number: 7027978
    Abstract: A system control portion of a voice data recording and reproducing apparatus converts inputted voice signals into digitized voice data, adds header information stored in a table composed of a rewritable nonvolatile storage medium to the converted voice data and records them in a semiconductor memory as a recording medium. A PC to which such a voice data recording and reproducing apparatus can be connected acquires header information stored in the data table, and, when the changing of the header information is designated, sends the changed header information to the voice data recording and reproducing apparatus. Based upon the sent header information, the system control portion of the voice recording and reproducing apparatus rewrites the header information in the data table.
    Type: Grant
    Filed: January 29, 2001
    Date of Patent: April 11, 2006
    Assignee: Olympus Optical Co., Ltd.
    Inventor: Hideo Okano
  • Patent number: 7027976
    Abstract: Methods and apparatus for document based ambiguous character resolution. An application searches a document for words that do not contain ambiguous characters and adds them to a dictionary, then searches the document for words that do contain ambiguous characters. For each ambiguous word, a set of candidate solutions is created by resolving the ambiguous characters in all possible ways. The dictionary is searched for words matching members of the candidate solution set. When a single member is matched, the ambiguous characters are resolved accordingly. When no member or more than one member is matched, a user is prompted to resolve the ambiguous characters. Alternatively, when more than one member is matched, the ambiguous characters are resolved to obtain the largest word, the smallest word, the most words, or the fewest words.
    Type: Grant
    Filed: January 29, 2001
    Date of Patent: April 11, 2006
    Assignee: Adobe Systems Incorporated
    Inventor: Richard L. Sites
  • Patent number: 7024357
    Abstract: An apparatus for detecting at least one tone having a known frequency and duration in an input signal. The input signal is input over a period of time which is divided into frame portions including at least an initial frame portion and a last frame portion. An energy signal indicative of the energy of the input signal during each frame portion is generated. A signal filter receives the energy signal and generates a noise indicator for each frame portion based on whether noise is detected in the energy signal. A dynamic threshold determiner generates an energy threshold for each frame portion. The energy threshold for the initial frame portion is generated based on a minimum expected value of the energy signal for a subsequent frame portion. The energy thresholds for frame portions subsequent to the initial frame portion are generated based on values of the energy signals during previous frame portions and the noise indicator.
    Type: Grant
    Filed: March 22, 2004
    Date of Patent: April 4, 2006
    Assignee: Legerity, Inc.
    Inventor: John G. Bartkowiak
  • Patent number: 7024354
    Abstract: In response to a coded speech signal output from a speech coder, a speech decoder decodes the coded speech signal into a reproduction speech signal. If the reproduction speech signal meets predetermined conditions, for example, “silence”, “unvoiced sound”, and the like, the speech decoder further operates as follows. The speech decoder calculates spectral parameters based on the reproduction speech signal, and calculates an excitation signal on the basis of the reproduction speech signal and the spectral parameters. In the calculation, a level of the excitation signal is also obtained. The speech decoder smoothes in time at least one of the spectral parameters and the level of the excitation signal. The speech decoder synthesizes the excitation signal by using the synthesis filter constructed with the spectral parameters, so as to reproduce the speech signal. The speech signal has an excellent quality even if a bit rate is low.
    Type: Grant
    Filed: November 6, 2001
    Date of Patent: April 4, 2006
    Assignee: NEC Corporation
    Inventor: Kazunori Ozawa
  • Patent number: 7024362
    Abstract: A method for estimating mean opinion score or naturalness of synthesized speech is provided. The method includes using an objective measure that has components derived directly from textual information used to form synthesized utterances. The objective measure has a high correlation with mean opinion score such that a relationship can be formed between the objective measure and corresponding mean opinion score. An estimated mean opinion score can be obtained easily from the relationship when the objective measure is applied to utterances of a modified speech synthesizer.
    Type: Grant
    Filed: February 11, 2002
    Date of Patent: April 4, 2006
    Assignee: Microsoft Corporation
    Inventors: Min Chu, Hu Peng
  • Patent number: 7020605
    Abstract: A speech coding system is provided with time-domain noise attenuation. The speech coding system has an encoder operatively connected to a decoder via a communication medium. A preprocessor processes a digitized speech signal from an analog-to-digital converter. Speech coding systems are used to encode and decode a bitstream. Gains from the speech coding are adjusted by a gain factor Gf that provides time-domain background noise attenuation.
    Type: Grant
    Filed: February 13, 2001
    Date of Patent: March 28, 2006
    Assignee: Mindspeed Technologies, Inc.
    Inventor: Yang Gao
  • Patent number: 7016833
    Abstract: A method and system for speech characterization. One embodiment includes a method for speaker verification which includes collecting data from a speaker, wherein the data comprises acoustic data and non-acoustic data. The data is used to generate a template that includes a first set of “template” parameters. The method further includes receiving a real-time identity claim from a claimant, and using acoustic data and non-acoustic data from the identity claim to generate a second set of parameters. The method further includes comparing the first set of parameters to the second set of parameters to determine whether the claimant is the speaker. The first set of parameters and the second set of parameters include at least one purely non-acoustic parameter, including a non-acoustic glottal shape parameter derived from averaging multiple glottal cycle waveforms.
    Type: Grant
    Filed: June 12, 2001
    Date of Patent: March 21, 2006
    Assignee: The Regents of the University of California
    Inventors: Todd J. Gable, Lawrence C. Ng, John F. Holzrichter, Greg C. Burnett
  • Patent number: 7016829
    Abstract: A method of training a natural language processing unit applies a candidate learning set to at least one component of the natural language unit. The natural language unit is then used to generate a meaning set from a first corpus. A second meaning set is generated from a second corpus using a second natural language unit and the two meaning sets are compared to each other to form a score for the candidate learning set. This score is used to determine whether to modify the natural language unit based on the candidate learning set.
    Type: Grant
    Filed: May 4, 2001
    Date of Patent: March 21, 2006
    Assignee: Microsoft Corporation
    Inventors: Eric D. Brill, Arul A. Menezes
  • Patent number: 7016837
    Abstract: An initial combination HMM 16 is generated from a voice HMM 10 having multiplicative distortions and an initial noise HMM of additive noise, and at the same time, a Jacobian matrix J is calculated by a Jacobian matrix calculating section 19. Noise variation Namh (cep), in which an estimated value Ha^(cep) of the multiplicative distortions that are obtained from voice that is actually uttered, additive noise Na(cep) that is obtained in a non-utterance period, and additive noise Nm(cep) of the initial noise HMM 17 are combined, is multiplied by a Jacobian matrix, wherein the result of the multiplication and initial combination HMM 16 are combined, and an adaptive HMM 26 is generated. Thereby, an adaptive HMM 26 that is matched to the observation value series RNah(cep) generated from actual utterance voice can be generated in advance.
    Type: Grant
    Filed: September 18, 2001
    Date of Patent: March 21, 2006
    Assignee: Pioneer Corporation
    Inventors: Hiroshi Seo, Mitsuya Komamura, Soichi Toyama
  • Patent number: 7016692
    Abstract: A channel for location estimation based on a wireless data communication from a mobile station is selected based on one or more of signal duration, variability and power level/signal-to-noise ratio of at least a portion of the wireless signals transmitted on the selected channel by the mobile station under the applicable configuration. Acceptable channels reducing location estimation error over alternatives include the access channel for Short Message Service (SMS) systems, the reverse pilot channel or the enhanced access channel for IS2000 systems, and the reverse link traffic channel for 1×EV-DO or 1×EV-DV systems. Location estimation is performed on wireless data communications on the selected channel.
    Type: Grant
    Filed: March 20, 2002
    Date of Patent: March 21, 2006
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Purva R. Rajkotia
  • Patent number: 7016840
    Abstract: A speech synthesis apparatus (10) comprises speech segment disassembling means (101) for disassembling the speech segments each including at least one phoneme into a plurality of pitch waveforms, phase characteristic transforming means (103) for transforming the phase characteristics of the pitch waveforms into a uniformed phase characteristic, pitch waveform classifying means (104) for classifying the pitch waveforms into a plurality of groups, pitch waveform registering means (106) for registering the pitch waveforms in the database (111) by extracting one pitch waveform from among the pitch waveforms in each of the groups, and synthesizing means (107) for synthesizing the speech with the pitch waveforms registered in the database (111). The speech synthesis apparatus (10) thus constructed can synthesize a natural speech using a relatively small database capacity.
    Type: Grant
    Filed: September 12, 2001
    Date of Patent: March 21, 2006
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Ryo Mochizuki, Toshiyuki Isono, Hirofumi Nishimura
  • Patent number: 7013261
    Abstract: A system provides accelerated morphological analysis and in particular a speed-up of morphological look-up via a caching mechanism. The system determines whether each incoming token in a token stream is unique or recurring. Unique tokens, which occur for the first time in the token stream, are marked with a unique numerical identification (ID). A pointer is added to recurring tokens, which already occurred in the token stream, and directed towards the unique numerical ID which was defined for the respective token when occurring for the first time. A morphological look-up is performed on the unique tokens. Subsequently, the tokens carrying the pointer are detected and replaced with the results of morphological look-up stored under the unique numerical ID of the respective unique token.
    Type: Grant
    Filed: October 16, 2001
    Date of Patent: March 14, 2006
    Assignee: Xerox Corporation
    Inventor: Andreas Eisele
  • Patent number: 7013273
    Abstract: A system and associated method of converting audio data from a television signal into textual data for display as a closed caption on a display device is provided. The audio data is decoded and audio speech signals are filtered from the audio data. The audio speech signals are parsed into phonemes by a speech recognition module. The parsed phonemes are grouped into words and sentences responsive to a database of words corresponding to the grouped phonemes. The words are converted into text data which is formatted for presentation on the display device as closed captioned textual data.
    Type: Grant
    Filed: March 29, 2001
    Date of Patent: March 14, 2006
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventor: Michael Kahn
  • Patent number: 7013280
    Abstract: A method for correcting ambiguations in directory assistance systems includes the steps of receiving and processing a directory assistance request from a caller. If the processing results in an ambiguation of at least two names, audio information is provided to the caller. The audio information includes, at least in part, playback of an audio recording of at least one person included in the ambiguation. The audio playback helps the caller resolve the ambiguation. A voice activated directory assistance system having structure for resolving ambiguations is also disclosed.
    Type: Grant
    Filed: February 27, 2001
    Date of Patent: March 14, 2006
    Assignee: International Business Machines Corporation
    Inventors: Brent L. Davis, Reza Ghasemi, Susan M. Hill, Tracy Kong, John R. Lauria, Vanessa V. Michelini
  • Patent number: 7010476
    Abstract: A system constructs finite-state networks. The system initially compiles an intermediate finite-state network from a source file of regular expressions. The intermediate finite-state network includes a delimited subpath that defines a substring having the form of a regular expression. The system subsequently produces an output finite-state network in which the delimited subpath is replaced with an FSN compiled from the substring encoded by the delimited subpath.
    Type: Grant
    Filed: December 18, 2000
    Date of Patent: March 7, 2006
    Assignee: Xerox Corporation
    Inventors: Lauri J. Karttunen, Kenneth R. Beesley
  • Patent number: 7009917
    Abstract: A layer jump control apparatus of an optical drive. The layer jump control apparatus has a pick up head, a preamplifier, a controller producing a focusing control signal FC, a low pass filter for receiving FC and producing a layer distance balancing signal LB, and a driving device to send a driving force to the pick up head. When the optical drive does not perform the layer jump process, the driving device receives FC. When the optical drive performs the layer jump process, the driving device receives a kicking signal and LB to determine the driving force in the kicking process; the driving device receives a braking signal and LB to determine the driving force in the braking process; and the driving device receives LB to determine the driving force in the holding process and the waiting process.
    Type: Grant
    Filed: November 28, 2001
    Date of Patent: March 7, 2006
    Assignee: Mediatek Incorporation
    Inventors: Shih-Chun Chiang, Chen-Hsing Lo
  • Patent number: 7010486
    Abstract: The invention relates to a speech recognition system and a method of calculating iteration values for free parameters λ_α^ortho(n) of a maximum-entropy speech model MESM with the aid of the generalized-iterative scaling training algorithm in a computer-supported speech recognition system in accordance with the formula λ_α^ortho(n+1) = G(λ_α^ortho(n), m_α^ortho, . . . ), where n is an iteration parameter, G a mathematical function, α an attribute in the MESM and m_α^ortho a desired orthogonalized boundary value in the MESM for the attribute α. It is an object of the invention to further develop the system and method so that they make a fast computation of the free parameters λ possible without a change of the original training object. According to the invention this object is achieved in that the desired orthogonalized boundary value m_α^ortho is calculated by a linear combination of the desired boundary value m_α with desired boundary values m_β from attributes β that have a larger range than the attribute α.
    Type: Grant
    Filed: February 13, 2002
    Date of Patent: March 7, 2006
    Assignee: Koninklijke Philips Electronics, N.V.
    Inventor: Jochen Peters
  • Patent number: RE39013
    Abstract: Disclosed is an optical recording and reproducing apparatus comprising a light source directing a light spot toward a recording medium, a detection system detecting light reflected from the recording medium to derive an electrical signal from the reflected light, an information processing circuit modulating the intensity of the light spot according to writing pulses to record information on the recording medium and using the electrical signal to reproduce information from the recording medium, and a tracking servo circuit carrying out tracking servo operation on the basis of the electrical signal and including an extracting circuit connected to a source of extracting pulses having a pulse width at least equal to the writing pulse width so that writing pulse parts contained in the electrical signal are extracted during recording information, whereby a track offset occurring during information recording can be minimized, and the stability of the tracking servo system can be improved.
    Type: Grant
    Filed: August 5, 2003
    Date of Patent: March 14, 2006
    Assignee: Hitachi, Ltd.
    Inventors: Toshimitsu Kaku, Kazuo Shigematsu, Hisataka Sugiyama, Takeshi Maeda, Masahiro Takasago
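
Illustrative sketch for patent 7027976 (document based ambiguous character resolution): the abstract describes building a dictionary from the unambiguous words of a document, expanding each ambiguous word into all candidate readings, and accepting a candidate only when exactly one matches the dictionary. The Python below is a minimal sketch of that flow and not the patented implementation. The "~" marker, the READINGS table, and the function names are hypothetical, chosen only for illustration.

```python
from itertools import product

# Hypothetical table: "~" stands in for an ambiguous character that may be
# either a line-break artifact (empty string) or a real hyphen.
READINGS = {"~": ["", "-"]}


def is_ambiguous(word):
    """True when the word contains at least one ambiguous character."""
    return any(ch in READINGS for ch in word)


def candidates(word):
    """Resolve every ambiguous character in all possible ways."""
    options = [READINGS.get(ch, [ch]) for ch in word]
    return {"".join(parts) for parts in product(*options)}


def resolve_document(words):
    """Map each ambiguous word to its resolution, or None if a user must decide."""
    # Pass 1: words without ambiguous characters form the dictionary.
    dictionary = {w for w in words if not is_ambiguous(w)}
    # Pass 2: accept a candidate only when exactly one matches the dictionary.
    resolved = {}
    for w in words:
        if is_ambiguous(w):
            matches = candidates(w) & dictionary
            resolved[w] = next(iter(matches)) if len(matches) == 1 else None
    return resolved


words = ["database", "well-known", "data~base", "well~known", "co~op"]
result = resolve_document(words)
```

Here "co~op" maps to None because neither reading occurs elsewhere in the document, which corresponds to the case where the abstract says the user is prompted. The alternative tie-breaking rules the abstract mentions (largest word, smallest word, most words, fewest words) are not modeled.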
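
Illustrative sketch for patent 7013261 (accelerated morphological analysis): the abstract describes marking first-time tokens with a unique numerical ID, replacing recurring tokens with a pointer to that ID, running the morphological look-up only on the unique tokens, and then substituting the stored results for the pointers. The Python below is a minimal sketch of that caching idea under stated assumptions and not the patented implementation. The function names are hypothetical, and toy_lookup is a stand-in for a real morphological analyzer.

```python
def analyze_stream(tokens, lookup):
    """Run `lookup` once per distinct token and return one result per token."""
    ids = {}      # token text -> unique numerical ID
    marked = []   # ("unique", id, token) or ("pointer", id)
    for tok in tokens:
        if tok in ids:
            # Recurring token: keep only a pointer to the first occurrence.
            marked.append(("pointer", ids[tok]))
        else:
            # First occurrence: assign the next unique numerical ID.
            ids[tok] = len(ids)
            marked.append(("unique", ids[tok], tok))

    # Morphological look-up is performed on the unique tokens only.
    results = {e[1]: lookup(e[2]) for e in marked if e[0] == "unique"}

    # Unique entries and pointers are both replaced by the stored result.
    return [results[e[1]] for e in marked]


calls = []


def toy_lookup(token):
    """Hypothetical analyzer: records each call and strips a plural 's'."""
    calls.append(token)
    return token[:-1] if token.endswith("s") else token


analyses = analyze_stream(["the", "cats", "chase", "the", "cats"], toy_lookup)
```

With five input tokens, toy_lookup is called three times, once per distinct token, which is the saving the abstract attributes to the caching mechanism.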