Patents by Inventor Huan-Yu Su

Huan-Yu Su has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Double talk detector for echo cancellation in a speech communication system

Patent number: 6804203

Abstract: A speech communication system is provided that uses pitch information, pitch lags, pitch gains, energy and/or other speech characteristics about the outgoing speech and the unknown signal on a frame basis to determine if the unknown signal is an echo signal of the outgoing speech or if the unknown signal also contains speech from a second talker (double talk). Additionally, a plurality of frames of these characteristics of the outgoing speech signal and the unknown incoming signal may be buffered so that the analysis and comparison can be made more efficiently and quickly in the frame domain as opposed to a time domain. Multiple characteristics may be optionally weighted and then analyzed. The system and method may further determine a level of confidence, based on any criterion, in the determination that may then be used to adjust the gain of a filter.

Type: Grant

Filed: September 15, 2000

Date of Patent: October 12, 2004

Assignee: Mindspeed Technologies, Inc.

Inventors: Adil Benyassine, Patrick D. Ryan, Huan-Yu Su
Complexity resource manager for multi-channel speech processing

Patent number: 6789058

Abstract: A multi-channel speech processor for encoding speech in a packet network environment is disclosed. In one illustrative aspect, a complexity resource manager (CRM) is executed by a controller or processor. The CRM manages the level of complexity of encoding which is used by a signal processing unit (SPU) to convert the speech signal into packet data. In general, the CRM determines the level of complexity of encoding based on a calculated complexity budget, where the complexity budget is determined based on the time required to process prior speech signal channels and the time available to process the remaining channels. In this way, the CRM is able to control the overall complexity of the speech processor through its ability to signal the SPU to encode speech signal in a complexity reduced mode based on the calculated complexity budget under certain conditions.

Type: Grant

Filed: October 15, 2002

Date of Patent: September 7, 2004

Assignee: Mindspeed Technologies, Inc.

Inventors: Eyal Shlomot, Huan-Yu Su
Codebook tables for multi-rate encoding and decoding with pre-gain and delayed-gain quantization tables

Patent number: 6757649

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Grant

Filed: April 8, 2003

Date of Patent: June 29, 2004

Assignee: Mindspeed Technologies Inc.

Inventors: Yang Gao, Adil Benyassine, Jes Thyssen, Eyal Shlomot, Huan-yu Su
Encoding and decoding speech signals variably based on signal classification

Patent number: 6735567

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Grant

Filed: April 8, 2003

Date of Patent: May 11, 2004

Assignee: Mindspeed Technologies, Inc.

Inventors: Yang Gao, Adil Benyassine, Jes Thyssen, Eyal Shlomot, Huan-yu Su
Complexity resource manager for multi-channel speech processing

Publication number: 20040073433

Abstract: A multi-channel speech processor for encoding speech in a packet network environment is disclosed. In one illustrative aspect, a complexity resource manager (CRM) is executed by a controller or processor. The CRM manages the level of complexity of encoding which is used by a signal processing unit (SPU) to convert the speech signal into packet data. In general, the CRM determines the level of complexity of encoding based on a calculated complexity budget, where the complexity budget is determined based on the time required to process prior speech signal channels and the time available to process the remaining channels. In this way, the CRM is able to control the overall complexity of the speech processor through its ability to signal the SPU to encode speech signal in a complexity reduced mode based on the calculated complexity budget under certain conditions.

Type: Application

Filed: October 15, 2002

Publication date: April 15, 2004

Applicant: Conexant Systems, Inc.

Inventors: Eyal Shlomot, Huan-Yu Su
Conversion scheme for use between DTX and non-DTX speech coding systems

Patent number: 6721712

Abstract: In an exemplary conversion scheme, a frame of a first speech signal comprising a plurality of frames encoded at a plurality of first rates, including a first non-speech rate, is received. The rate of the received frame is determined, and if the received frame is encoded at the first non-speech rate, then the received frame is re-encoded at either a second or third non-speech rate to generate a frame of a second speech signal. Moreover, a system for converting a speech signal comprises a receiver for receiving a frame of a first speech signal and a processor capable of determining the encoding rate of the received frame and re-encoding the received frame at either a second or third non-speech rate if the received frame was originally encoded at a first non-speech rate.

Type: Grant

Filed: January 24, 2002

Date of Patent: April 13, 2004

Assignee: Mindspeed Technologies, Inc.

Inventors: Adil Benyassine, Eyal Shlomot, Huan-Yu Su
Dynamic signal detector system and method

Patent number: 6697776

Abstract: A digitized signal detection system where the bit rate encoding is changed dynamically to provide encoding for different type signals and formats at bit rates optimized to properly reconstruct the input signal whether speech or non-speech and therefore can transfer signals of different character on a frame by frame basis. A change of encoding format can make the system a speech or music recognizer dependent what is to be listened for. Three basic components a recognizer which categorizes the type of input signal, an evaluator which evaluates the category of quality of the reconstructed signal and a recommender which make as recommendation based on the quality to change standards to encode the signals received pursuant to a standard which provides for improved quality. The dynamic signal detector receives the input signal directly and extracts the parameters for evaluation. These parameters are tested and a determination made if a switch of standards are required. To improve the reconstructed signal.

Type: Grant

Filed: July 31, 2000

Date of Patent: February 24, 2004

Assignee: Mindspeed Technologies, Inc.

Inventors: Gilles G. Fayad, Huan-Yu Su
Speech coding system with a music classifier

Patent number: 6694293

Abstract: The invention provides a speech coding system with a music classifier. An encoder is disposed to receive an input signal and provides a bitstream based upon a speech coding of a portion of the input signal. The encoder provides a classification of the input signal as one of noise, speech, and music. The music classifier analyzes or determines signal properties of the input signal. The music classifier compares the signal properties to thresholds to determine the classification of the input signal.

Type: Grant

Filed: February 13, 2001

Date of Patent: February 17, 2004

Assignee: Mindspeed Technologies, Inc.

Inventors: Adil Benyassine, Huan-Yu Su
User configurable levels of security for a speaker verification system

Patent number: 6691089

Abstract: A text-prompted speaker verification system that can be configured by users based on a desired level of security. A user is prompted for a multiple-digit (or multiple-word) password. The number of digits or words used for each password is defined by the system in accordance with a user set preferred level of security. The level of training required by the system is defined by the user in accordance with a preferred level of security. The set of words used to generate passwords can also be user configurable based upon the desired level of security. The level of security associated with the frequency of false accept errors verses false reject errors is user configurable for each particular application.

Type: Grant

Filed: September 30, 1999

Date of Patent: February 10, 2004

Assignee: Mindspeed Technologies Inc.

Inventors: Huan-yu Su, Khaled Assaleh
System of encoding and decoding speech signals

Publication number: 20030200092

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Application

Filed: April 8, 2003

Publication date: October 23, 2003

Inventors: Yang Gao, Adil Benyassine, Jes Thyssen, Eyal Shlomot, Huan-Yu Su
Speech communication system and method for handling lost frames

Patent number: 6636829

Abstract: An exemplary decoder comprises a receiver that receives parameters of a speech signal on a frame-by-frame basis, a control logic for decoding parameters and for resynthesizing the speech signal, the control logic including a minimum spacing indicative of a minimum difference required between LSFs of consecutive frames, a frame recovery logic that, when a lost frame detector detects a lost frame, sets the minimum spacing for the lost frame to a first value which is greater than the minimum spacing for the previously received frame, and/or uses pitch lag parameters of a plurality of previously received frames to extrapolate a pitch lag parameter for the lost frame, and/or sets gain parameter of a subframe of the lost frame in a first manner if the lost gain parameter is an adaptive codebook gain parameter and in a second manner if the lost gain parameter is a fixed codebook gain parameter.

Type: Grant

Filed: July 14, 2000

Date of Patent: October 21, 2003

Assignee: Mindspeed Technologies, Inc.

Inventors: Adil Benyassine, Eyal Shlomot, Huan-Yu Su
Speech coder output transformation method for reducing audible noise

Patent number: 6618700

Abstract: In a cellular telephone system where a digital cellular telephone is connected to a regular telephone through the public switched telephone network (PSTN), a speech encoder/decoder is used with an A/&mgr;-Law encoder/decoder causing annoying audible noise at very low levels because of the quantization characteristics of the A/&mgr;-Law encoder/decoder. This noise is eliminated by adding a digital constant to the output of the speech coder, shifting the low level signal away from zero. The resulting DC level added to the speech signal is inaudible to the PSTN telephone user and does not degrade speech quality. Alternatively, the constant added to the output of the speech coder is confined to a small value added to the speech coder output to move the entire speech coder output during the silence period, between speech periods, above zero or below zero.

Type: Grant

Filed: September 7, 2000

Date of Patent: September 9, 2003

Assignee: Mindspeed Technologies, Inc.

Inventors: Jes Thyssen, Huan-Yu Su
Speech coding employing hybrid linear prediction coding

Patent number: 6606591

Abstract: A speech coding system that employs hybrid linear prediction coding during extraction of linear prediction coefficients within ITU-Recommendation speech coding standards. The present invention is operable within linear prediction speech coding systems including code-excited linear prediction speech coding systems, and it provides for a substantially improved perceptual quality of reproduced speech signals when compared to conventional speech coding methods that employ the commonly known auto-correlation method that is based on minimizing the linear prediction coding (LPC) prediction error energy. The invention is operable to provide for high perceptual quality of reproduced speech signals having substantial differences of energy in various frequency bands.

Type: Grant

Filed: April 13, 2000

Date of Patent: August 12, 2003

Assignee: Conexant Systems, Inc.

Inventor: Huan-yu Su
System of encoding and decoding speech signals

Patent number: 6604070

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Grant

Filed: September 15, 2000

Date of Patent: August 5, 2003

Assignee: Conexant Systems, Inc.

Inventors: Yang Gao, Adil Benyassine, Jes Thyssen, Eyal Shlomot, Huan-yu Su
Bitstream protocol for transmission of encoded voice signals

Patent number: 6581032

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Grant

Filed: September 15, 2000

Date of Patent: June 17, 2003

Assignee: Conexant Systems, Inc.

Inventors: Yang Gao, Adil Benyassine, Jes Thyssen, Eyal Shlomot, Huan-yu Su
Target signal reference shifting employed in code-excited linear prediction speech coding

Patent number: 6581030

Abstract: A speech coding system that employs target signal reference shifting in code-excited linear prediction speech coding. The speech coding system performs modification of a target signal that is used to perform speech coding of a speech signal. The modified target signal that is generated from a preliminary target signal is then used to calculate an adaptive codebook gain that is used to perform speech coding of the speech signal. The speech coding performed in accordance with the present invention provides for a substantially reduced bit-rate of operation when compared to conventional speech coding methods that inherently require a significant amount of bandwidth to encode a fractional pitch lag delay during pitch prediction that is performed within conventional code-excited linear prediction speech coding systems.

Type: Grant

Filed: April 13, 2000

Date of Patent: June 17, 2003

Assignee: Conexant Systems, Inc.

Inventor: Huan-yu Su
Codebook tables for encoding and decoding

Patent number: 6574593

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Grant

Filed: September 15, 2000

Date of Patent: June 3, 2003

Assignee: Conexant Systems, Inc.

Inventors: Yang Gao, Adil Benyassine, Huan-yu Su, Eyal Shlomot, Jes Thyssen
Speech coding having continuous long term preprocessing without any delay

Patent number: 6523002

Abstract: A zero delay continuous long term (LT) pre-processing method operable in a speech codec that introduces no delay. The present invention provides an elegant solution to perform long term (LT) pre-processing of the pitch lag of a speech signal to save a large number of bits required in various speech coding methods, including the code-excited linear prediction method. The present invention is ideal for speech coding standards and methods that any undesirable delay at the end of a speech frame of the speech signal. The present invention overcomes a significant limitation in the art of speech coding, in that, a speech coding system that performs the invention is operable while providing real time operation and introducing no delay whatsoever.

Type: Grant

Filed: September 30, 1999

Date of Patent: February 18, 2003

Assignee: Conexant Systems, Inc.

Inventors: Yang Gao, Huan-yu Su
Intelligent discontinuous transmission and comfort noise generation scheme for pulse code modulation speech coders

Patent number: 6510409

Abstract: A fully backward compatible intelligent discontinued transmission (DTX) and comfort noise generation (CNG) scheme that is operable in pulse code modulation (PCM) speech coding systems. The scheme, for example, provides a speech encoder comprising a speech signal analysis circuitry configured to calculates a predetermined plurality of parameters from the speech signal, a voice activity detector configured to determine voice activity in the speech signal, where the speech encoder enters a discontinued transmission mode of the voice activity detector does not detect voice activity, and a transmitter configured to transmit one or more speech samples of the speech signal after the speech encoder enters the discontinued transmission mode, where the one or more speech samples are capable of use by a remote speech decoder to extract a parameter from the one or more speech samples in order generate a background noise base on the parameter.

Type: Grant

Filed: January 18, 2000

Date of Patent: January 21, 2003

Assignee: Conexant Systems, Inc.

Inventor: Huan-Yu Su
Speech classification and parameter weighting used in codebook search

Patent number: 6493665

Abstract: A multi-rate speech coded supports a plurality of encoding bit rate modes by adaptively selecting encoding bit rate modes to match communication channel restrictions. In higher bit rate encoding modes, an accurate representation of speech through CELP (code excited linear prediction) and other associated modeling parameters are generated for higher quality decoding and reproduction. To support lower bit rate encoding modes, a variety of techniques are applied many of which involve the classification of the input signal. For each bit rate mode selected, pluralities of fixed or innovation subcodebooks are selected for use in generating innovation vectors. The fixed codebook contains pulse subcodebooks and noise-like subcodebooks. To assist in selection of one of the subcodebooks, an adaptive weighting approach is applied in a searching procedure wherein residual classification and various parameters are used to generate a weighting function that is used to favor one subcodebook over another.

Type: Grant

Filed: September 18, 1998

Date of Patent: December 10, 2002

Assignee: Conexant Systems, Inc.

Inventors: Huan-Yu Su, Yang Gao

prev 1 2 3 4 5 6 next