Patents Examined by W. R. Young

Controlling a weighting filter based on the spectral content of a speech signal

Patent number: 7010480

Abstract: A method for preparing a speech signal for encoding comprises determining whether the spectral content of an input speech signal is representative of a defined spectral characteristic (e.g., a defined characteristic slope). A frequency specific filter component of a weighting filter is controlled based on the determination of the spectral content of the speech signal or/and its location in the encoder. A core weighting filter component of the weighting filter may be maintained regardless of the spectral content of the speech signal.

Type: Grant

Filed: September 13, 2001

Date of Patent: March 7, 2006

Assignee: Mindspeed Technologies, Inc.

Inventors: Yang Gao, Huan-Yu Su
Method and device for operating voice-controlled systems in motor vehicles

Patent number: 7010129

Abstract: A device for operating voice-controlled systems, such as communication and/or intercommunication systems in motor vehicles, includes a plurality of microphones and at least one loudspeaker. Voice signals received by the microphones are transmitted to the at least one loudspeaker. The voice signals are subjected to a low-value frequency shift before being transmitted to the loudspeaker(s) or to the input of a voice-controlled device to thereby suppress feedback.

Type: Grant

Filed: May 4, 1999

Date of Patent: March 7, 2006

Assignee: Volkswagen AG

Inventors: Klaus Schaaf, Juergen Schultz, Volker Thoermann
Method and apparatus for performing speech segmentation

Patent number: 7010481

Abstract: In a method for performing a segmentation operation upon a synthesizing speech signal and an input speech signal, a synthesized speech signal and a speech element duration signal are generated from the synthesizing speech signal A first feature parameter is extracted from the synthesized speech signal, and a second feature parameter is extracted from the input speech signal. A dynamic programming matching operation is performed upon the second feature parameter with reference to the first feature parameter and the speech element duration signal to obtain segmentation points of the input speech signal.

Type: Grant

Filed: March 27, 2002

Date of Patent: March 7, 2006

Assignee: NEC Corporation

Inventor: Takuya Takizawa
Compressing messages on a per semantic component basis while maintaining a degree of human readability

Patent number: 7010478

Abstract: A text message is first parsed into its constituent semantic components such as header fields and body components. Then, different compression methods may be performed on each semantic component depending on the importance of the semantic component, the context of the semantic component, the characteristics of the semantic component, and whether or not the semantic component uses natural language expressions. For example, it is determined what compression method, if any, is to be performed on the semantic component. Each semantic component may be compressed individually. Since text compression takes the unique features of each semantic component into consideration rather than considering the text message as a monolithic text unit, a more intuitive text compression results.

Type: Grant

Filed: February 12, 2001

Date of Patent: March 7, 2006

Assignee: Microsoft Corporation

Inventors: Sharad Mathur, Gregory P. Baribault
Speech encoding apparatus, speech encoding method, speech decoding apparatus, and speech decoding method

Patent number: 7006966

Abstract: The present invention comprises: first periodicity providing means for emphasizing periodicity of a fixed code vector output from at least one fixed excitation code book by use of a first periodicity emphasis coefficient adaptively determined based on a predetermined rule; and second periodicity providing means for emphasizing periodicity of a fixed code vector output from at least one fixed excitation code book by use of a predetermined second periodicity emphasis coefficient.

Type: Grant

Filed: February 27, 2002

Date of Patent: February 28, 2006

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventors: Tadashi Yamaura, Hirohisa Tasaki
System and method of pattern recognition in very high-dimensional space

Patent number: 7006969

Abstract: A system and method of recognizing speech comprises an audio receiving element and a computer server. The audio receiving element and the computer server perform the process steps of the method. The method involves training a stored set of phonemes by converting them into n-dimensional space, where n is a relatively large number. Once the stored phonemes are converted, they are transformed using single value decomposition to conform the data generally into a hypersphere. The received phonemes from the audio-receiving element are also converted into n-dimensional space and transformed using single value decomposition to conform the data into a hypersphere. The method compares the transformed received phoneme to each transformed stored phoneme by comparing a first distance from a center of the hypersphere to a point associated with the transformed received phoneme and a second distance from the center of the hypersphere to a point associated with the respective transformed stored phoneme.

Type: Grant

Filed: November 1, 2001

Date of Patent: February 28, 2006

Assignee: AT&T Corp.

Inventor: Bishnu Saroop Atal
Audio signal processing apparatus and method thereof

Patent number: 7003469

Abstract: A digital audio signal to be replayed is processed in a waveform thereof. A frequency bandwidth of the audio signal is expanded through conversion of a sampling frequency, and then the audio signal is low-pass-filtered with a low-pass cut-off frequency corresponding to the converted sampling frequency. An interval of time between two waveform peaks of the audio signal is detected, and then difference data between current data of the audio signal and past data thereof is calculated. The difference data are subject to weighting depending on the interval, and then output data are produced based on both the low-pass-filtered audio signal and the weighted difference data. This processing, which can be realized by activation of software, improves audio quality when compressed audio data is replayed.

Type: Grant

Filed: September 4, 2001

Date of Patent: February 21, 2006

Assignee: Victor Company of Japan, Ltd.

Inventors: Kazuhito Okayama, Toshiharu Kuwaoka
Destination device initiated caller identification

Patent number: 7003466

Abstract: A method, system, and program for origin device initiated caller identification are provided. In response to detecting a call extended to a destination device, extending a request from said destination device to an origin device requesting a voice utterance of the caller at said origin device. A caller identity associated with the voice utterance is identified at the destination device, such that a callee receiving the call at the destination device is informed of the caller identity before choosing whether to speak with the caller.

Type: Grant

Filed: December 12, 2001

Date of Patent: February 21, 2006

Assignee: International Business Machines Corporation

Inventors: Michael Wayne Brown, Joseph Herbert McIntyre, Michael A. Paolini, James Mark Weaver, Scott Lee Winters
Method, apparatus, and program for envelope generation, audio compression, and audio expansion

Patent number: 7003468

Abstract: An envelope generator (20), comprises: an input terminal (20a) for having a signal inputted therein; a first integrator (21) for generating intermediate state of envelopes with a first attack time and a first release time in response to changes in level of said signal inputted through said input terminal (20a) to impart said intermediate state of envelopes to said signal; a second integrator (22) for respectively modifying said intermediate state of envelopes into final state of envelopes with a second attack time and a second release time in response to changes in level of said signal imparted said intermediate state of envelopes; and an output terminal (20d) for outputting said signal with said final state of envelopes therethrough. The envelope generator (20) thus constructed can make gain signal follow rapid fluctuations in level of an audio signal, and can impart a relatively high quality for compressing and expanding level of the audio signal not to break in shape.

Type: Grant

Filed: June 27, 2001

Date of Patent: February 21, 2006

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventor: Kiyoomi Utsumi
System and method for providing network coordinated conversational services

Patent number: 7003463

Abstract: A system and method for providing automatic and coordinated sharing of conversational resources, e.g., functions and arguments, between network-connected servers and devices and their corresponding applications. In one aspect, a system for providing automatic and coordinated sharing of conversational resources inlcudes a network having a first and second network device, the first and second network device each comprising a set of conversational resources, a dialog manager for managing a conversation and executing calls requesting a conversational service, and a communication stack for communicating messages over the network using conversational protocols, wherein the conversational protocols establish coordinated network communication between the dialog managers of the first and second network device to automatically share the set of conversational resources of the first and second network device, when necessary, to perform their respective requested conversational service.

Type: Grant

Filed: October 1, 1999

Date of Patent: February 21, 2006

Assignee: International Business Machines Corporation

Inventors: Stephane H. Maes, Ponani Gopalakrishnan
Method and system for processing, storing, retrieving and presenting information with an extendable interface for natural and artificial languages

Patent number: 6999934

Abstract: A method and system for processing, storing, retrieving and presenting information with an extendable interface for natural and artificial languages. The system includes an interpreter, a knowledge base, and an input/output module. Making use of an internal representation based on sorted-type theory, the system stores information in the knowledge base, answers queries from clients, and processes erroneous or contradictory information according to a dynamically adjustable set of rules. The system also stores language definitions in the knowledge base, enabling the system to communicate with clients in a variety of natural and artificial languages. New languages may be added to the system by presenting definitions expressed in a language already incorporated within the system.

Type: Grant

Filed: May 24, 2004

Date of Patent: February 14, 2006

Assignee: Holtran Technology Ltd.

Inventors: Victor Gluzberg, Alexander Brenner
Fast convergence method for bit allocation stage of MPEG audio layer 3 encoders

Patent number: 6999919

Abstract: A method for an improved QSS (bit allocator) algorithm is disclosed. The disclosed method is capable of greatly improving determination time; thereby, improving the efficiency of converting a signal from an audio format to an MP3 format. The starting point of the QSS determination for a present frame (N) is the QSS of a previous frame (N?1). This starting point provides for improved efficiency for determining actual QSS of frame N as QSS[N?1] will be closer to QSS[N] than an arbitrary starting point. Thus, fewer iterations are required to determine QSS[N] as compared to conventional encoders. The algorithm of the present is more efficient than conventional methods in that it makes use of the fact that audio signal statistics usually do not change abruptly during the period of one audio frame to another.

Type: Grant

Filed: February 20, 2001

Date of Patent: February 14, 2006

Assignee: Intervideo, Inc.

Inventors: Shahab Layeghi, Fahri Surucu
Disc driving device and wobble information detection method

Patent number: 6999391

Abstract: In an optical disc, a wobble signal is to be detected by a simple configuration. In the optical disc, the address information, modulated onto a sinusoidal cannier signal by adding even harmonics signals to the sinusoidal carrier signal and by changing the polarity of the harmonics signals, is formed into the wobble signal. In detecting the wobble signal from the optical disc to demodulate the address information, in a method for detecting the wobble signal, an even harmonics signal and data clocks are generated, and the even harmonics signal so generated are multiplied with the reproduced wobble signal. The resulting product signal is integrated every data clock. The sign of the digital information is verified based on the integrated value at an end edge of the data clock.

Type: Grant

Filed: October 10, 2002

Date of Patent: February 14, 2006

Assignees: Koninklijke Philips Electronics N.V., Matsushita Electric Industrial Co., Ltd., Sony Corporation

Inventors: Jacobus Petrus Josephus Heemskerk, Cornelis Marinus Schep, Aalbert Stek, Shinichi Tanaka, Shigeru Furumiya, Shoei Kobayashi, Nobuyoshi Kobayashi
Method and apparatus for phonetic context adaptation for improved speech recognition

Patent number: 6999925

Abstract: The present invention provides a computerized method and apparatus for automatically generating from a first speech recognizer a second speech recognizer which can be adapted to a specific domain. The first speech recognizer can include a first acoustic model with a first decision network and corresponding first phonetic contexts. The first acoustic model can be used as a starting point for the adaptation process. A second acoustic model with a second decision network and corresponding second phonetic contexts for the second speech recognizer can be generated by re-estimating the first decision network and the corresponding first phonetic contexts based on domain-specific training data.

Type: Grant

Filed: November 13, 2001

Date of Patent: February 14, 2006

Assignee: International Business Machines Corporation

Inventors: Volker Fischer, Siegfried Kunzmann, Eric-W. Janke, A. Jon Tyrrell
Optical disc playback apparatus

Patent number: 6999394

Abstract: An optical disc playback apparatus that performs appropriate waveform equalization according to a difference in characteristics of reproduced signals between land parts and groove parts in an optical disc to which high-density recording is performed using a land/groove recording method. The optical disc playback apparatus includes a reproduction device, an A/D conversion circuit for sampling a reproduced signal to be converted into a multi-bit digital signal, a filter for subjecting the multi-bit digital signal to digital equalization, a filter coefficient learning device for adaptively controlling filter coefficients for the filter corresponding to the land part and the groove part to minimize an equalization error, and a switch signal generation device for generating a signal for switching between land and groove, thereby changing the filter coefficient adaptively to the land or the groove.

Type: Grant

Filed: July 17, 2002

Date of Patent: February 14, 2006

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Kouichi Urita, Youichi Ogura, Shinichiro Sato
Linear discriminant based sound class similarities with unit value normalization

Patent number: 6996527

Abstract: A common requirement in automatic speech recognition is to recognize a set of words for any speaker without training the system for each new speaker. A speech recognition system is provided utilizing linear discriminant based phonetic similarities with inter-phonetic unit value normalization. Linear discriminant analysis is utilized using training data with both in-class and out-class sample training utterances for generating linear discriminant vectors for each of the phonetic units. The dot product of each linear discriminant vector and the time spectral pattern vectors generated from the input speech are computed. The resultant raw similarity vectors are then normalized utilizing normalization look-up tables for providing similarity vectors which are utilized by a word matcher for word recognition.

Type: Grant

Filed: July 26, 2001

Date of Patent: February 7, 2006

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Robert C. Boman, Philippe R. Morin, Ted H. Applebaum
Method and apparatus for transcribing speech when a plurality of speakers are participating

Patent number: 6996526

Abstract: A method and apparatus are disclosed for transcribing speech when a number of speakers are participating. A number of different speech recognition systems, each with a different speaker model, are executed in parallel. When the identity of all of the participating speakers is known and a speaker model is available for each participant, each speech recognition system employs a different speaker model suitable for a corresponding participant. Each speech recognition system decodes the speech and generates a corresponding confidence score. The decoded output having the highest confidence score is selected for presentation to a user. When all participating speakers are not known, or when there are too many participants to implement a unique speaker model for each participant, a speaker independent speech recognition system is employed together with a speaker specific speech recognition system.

Type: Grant

Filed: January 2, 2002

Date of Patent: February 7, 2006

Assignee: International Business Machines Corporation

Inventors: Sara H. Basson, Peter Gustav Fairweather, Alexander Faisman, Dimitri Kanevsky, Jeffery Scott Sorensen
Method and apparatus for accessing a content site with a sound sequence

Patent number: 6996532

Abstract: A browser with a sound input receives a sound sequence that can be used to access a content site. The sound sequence encodes characters according to a predetermined scheme. These characters are extracted by the browser and may either directly constitute the URL of the content site, or include a site code that can be translated into the content site URL by a service system. In this latter case, the browser contacts the service system to have the site code translated into the content site URL. Once in possession of the content site URL, the browser contacts that site. Where the encoded characters represent a site code requiring translation, the URL of the translation service system is preferably included in the set of encoded characters with the encoding being such that the URL encodes to a musical tune.

Type: Grant

Filed: December 4, 2001

Date of Patent: February 7, 2006

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: Andrew Thomas
Auxiliary channel masking in an audio signal

Patent number: 6996521

Abstract: A method is provided for embedding data into an audio signal and determining data embedded into an audio signal. In the method for embedding data into an audio signal, the audio signal is based on a first set of data and includes a phase component. The method modifies at least a portion of the phase component of the audio signal to embed a second set of data into the audio signal. The modified audio signal can be made to differ with respect to the audio signal in a manner at least one of (i) substantially imperceptible and (ii) imperceptible to a listener of the first set of data depending on the extent that the phase component of the audio signal is modified. In the method for determining data embedded into an audio signal, the audio signal is based on a first set of data of an original audio signal and includes a phase component. The method determines a second set of data embedded into the audio signal based on the phase component of the audio signal.

Type: Grant

Filed: October 4, 2001

Date of Patent: February 7, 2006

Assignee: The University of Miami

Inventors: Alexander I. Iliev, Michael S. Scordilis
Pickup head actuator

Patent number: 6996037

Abstract: A pickup head actuator for carrying a plurality of near-field optical I/O elements, far-field optical I/O elements, or magnetic I/O elements. Besides performing focusing and tracking servo, the attraction due to magnetic inductor and tracking magnet enables a base to float at different heights in its normal state, suitable for different types of I/O elements.

Type: Grant

Filed: May 22, 2002

Date of Patent: February 7, 2006

Assignee: Industrial Technology Research Institute

Inventors: Tai-Ting Huang, Chi-Lone Chang, Chau-Yuan Ke

prev 1 2 3 4 5 6 7 8 … next