Specialized Information Patents (Class 704/206)

Pitch (Class 704/207)

Voiced or unvoiced (Class 704/208)

Formant (Class 704/209)

Silence decision (Class 704/210)

Maxima search method for sensed signals

Patent number: 8204741

Abstract: An apparatus and method for selecting a set of channels from a plurality channels in a signal processor, the method comprising sampling each one of a plurality of channels and obtaining a binary representation of each one of the samples, arranging each one of the binary representations of samples into a series of bit planes from a most significant bit plane containing the most significant bit of each binary representation, to a least significant bit plane containing the least significant bit of each binary representation, determining those bit planes having binary representations that conform to a predetermined value criteria, and selecting a set of channels by summing bits from each one of those determined bit planes that conform to the predetermined value criteria.

Type: Grant

Filed: March 29, 2004

Date of Patent: June 19, 2012

Assignee: Cochlear Limited

Inventors: Konstadinos Hatzianestis, Tony Nygard
Speaker intent analysis system

Patent number: 8200494

Abstract: A speaker intent analysis system and method for validating the truthfulness and intent of a plurality of participants' responses to questions. A computer stores, retrieves, and transmits a series of questions to be answered audibly by participants. The participants' answers are received by a data processor. The data processor analyzes and records the participants' speech parameters for determining the likelihood of dishonesty. In addition to analyzing participants' speech parameters for distinguishing stress or other abnormality, the processor may be equipped with voice recognition software to screen responses that while not dishonest, are indicative of possible malfeasance on the part of the participants. Once the responses are analyzed, the processor produces an output that is indicative of the participant's credibility. The output may be sent to proper parties and/or devices such as a web page, computer, e-mail, PDA, pager, database, report, etc. for appropriate action.

Type: Grant

Filed: October 25, 2010

Date of Patent: June 12, 2012

Inventor: David Bezar
Cue-based audio coding/decoding

Patent number: 8200500

Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.

Type: Grant

Filed: March 14, 2011

Date of Patent: June 12, 2012

Assignee: Agere Systems Inc.

Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
Method and device for performing frame erasure concealment to higher-band signal

Patent number: 8200481

Abstract: The present invention discloses a method for performing a frame erasure concealment to a higher-band signal, including: calculating a periodic intensity of a higher-band signal with respect to a lower-band signal; judging whether the periodic intensity of the higher-band signal is higher than or equal to a preconfigured threshold; if the periodic intensity of the higher-band signal is higher than or equal to the preconfigured threshold, using a pitch period repetition method to perform the frame erasure concealment to the higher-band signal of a current lost frame; and if the periodic intensity of the higher-band signal is lower than the preconfigured threshold, using a previous frame data repetition method to perform the frame erasure concealment to the higher-band signal of the current lost frame. The present invention further discloses a device for performing a frame erasure concealment to a higher-band signal and a speech decoder. The problem that the quality of the voice signal is lowered is avoided.

Type: Grant

Filed: May 29, 2008

Date of Patent: June 12, 2012

Assignee: Huawei Technologies Co., Ltd.

Inventors: Jianfeng Xu, Lei Miao, Chen Hu, Qing Zhang, Lijing Xu, Wei Li, Zhengzhong Du, Yi Yang, Fengyan Qi, Wuzhou Zhan, Dongqi Wang
Unified treatment of resolved and unresolved harmonics

Patent number: 8185382

Abstract: One embodiment of the present invention provides a post-processing method of a modulation envelope resulting from an interference of two harmonics in a filter band. According to one embodiment, the method comprising filtering the modulation envelope with a band-pass filter bank, wherein a combination of demodulation and application of the band-pass filter on the modulation envelope enables use of identical techniques for resolved and unresolved harmonics. One embodiment of the present invention provides a method of determining whether a frequency band of an input signal includes unresolved harmonics. According to a further embodiment, in response to a determination that the frequency band includes unresolved harmonics, the method comprises obtaining a modulation envelope of the frequency band by demodulating the frequency band, obtaining one or more frequency bands from the modulation envelope, and determining an evidence value that one of the frequency bands originates from one of fundamental frequencies.

Type: Grant

Filed: May 31, 2005

Date of Patent: May 22, 2012

Assignee: Honda Research Institute Europe GmbH

Inventors: Frank Joublin, Martin Heckmann
Coding Generic Audio Signals at Low Bitrates and Low Delay

Publication number: 20120101813

Abstract: A mixed time-domain/frequency-domain coding device and method for coding an input sound signal, wherein a time-domain excitation contribution is calculated in response to the input sound signal. A cut-off frequency for the time-domain excitation contribution is also calculated in response to the input sound signal, and a frequency extent of the time-domain excitation contribution is adjusted in relation to this cut-off frequency. Following calculation of a frequency-domain excitation contribution in response to the input sound signal, the adjusted time-domain excitation contribution and the frequency-domain excitation contribution are added to form a mixed time-domain/frequency-domain excitation constituting a coded version of the input sound signal. In the calculation of the time-domain excitation contribution, the input sound signal may be processed in successive frames of the input sound signal and a number of sub-frames to be used in a current frame may be calculated.

Type: Application

Filed: October 25, 2011

Publication date: April 26, 2012

Applicant: VOICEAGE CORPORATION

Inventors: Tommy Vaillancourt, Milan Jelinek
Scalable decoder and scalable decoding method

Patent number: 8160868

Abstract: A scalable decoder capable of avoiding deterioration in subjective quality of a listener. The scalable decoder for decoding core layer encoding data and extension layer encoding data including an extension layer gain coefficient, wherein a voice analysis section detects variation in power of a core layer decoding voice signal being obtained from the core layer encoding data, a gain attenuation rate calculating section (140) sets the attenuation intensity variable depending on variation in power, and a gain attenuation section (143) attenuates the extension layer gain coefficient in a second period preceding a first period according to a set attenuation intensity when extension layer encoding data in the first period is missing, thus interpolating the extension layer gain coefficient in the first period.

Type: Grant

Filed: March 13, 2006

Date of Patent: April 17, 2012

Assignee: Panasonic Corporation

Inventors: Takuya Kawashima, Hiroyuki Ehara
Received voice playback apparatus

Patent number: 8145476

Abstract: A disclosed received voice playback apparatus includes a characteristic acquiring unit configured to acquire first frequency characteristic values obtained by resolving digital vocal signals that are based on received vocal signals into predetermined frequency bands, wherein each first frequency characteristic value corresponds to one of the predetermined frequency bands; a setting unit configured to obtain second frequency characteristic values, wherein each second frequency characteristic value is set for one of the predetermined frequency bands; a computing unit configured to compute a gain for each of the predetermined frequency bands based on a difference between the first frequency characteristic value and the second frequency characteristic value; and a characteristic changing unit configured to change the first frequency characteristic values of the digital vocal signals by multiplying the digital vocal signals by each of the gains corresponding to one of the predetermined frequency bands of the digit

Type: Grant

Filed: January 10, 2008

Date of Patent: March 27, 2012

Assignee: Ricoh Company, Ltd.

Inventor: Yukihiro Imai
Feature extraction for identification and classification of audio signals

Patent number: 8140331

Abstract: Characteristic features are extracted from an audio sample based on its acoustic content. The features can be coded as fingerprints, which can be used to identify the audio from a fingerprints database. The features can also be used as parameters to separate the audio into different categories.

Type: Grant

Filed: July 4, 2008

Date of Patent: March 20, 2012

Inventor: Xia Lou
Systems, methods, and apparatus for gain coding

Patent number: 8140324

Abstract: A wideband speech encoder according to one embodiment includes a lowband encoder and a highband encoder. The lowband encoder is configured to encode a lowband portion of a wideband speech signal as a set of filter parameters and an encoded excitation signal. The highband encoder is configured to calculate values for coding parameters that specify a spectral envelope and a temporal envelope of a highband portion of the wideband speech signal. The temporal envelope is based on a highband excitation signal that is derived from the encoded excitation signal. In one such example, the temporal envelope is based on a difference in levels between the highband portion and a synthesized highband signal, wherein the synthesized highband signal is generated according to the highband excitation signal and a set of highband filter parameters.

Type: Grant

Filed: April 3, 2006

Date of Patent: March 20, 2012

Assignee: QUALCOMM Incorporated

Inventors: Koen Bernard Vos, Ananthapadmanabhan Aasanipalai Kandhadai
Methods, apparatuses and system for encoding and decoding signal

Patent number: 8135593

Abstract: Methods and apparatuses for encoding a signal and decoding a signal and a system for encoding and decoding are provided. The method for encoding a signal includes performing a classification decision process on high frequency signals of input signals, adaptively encoding the high frequency signals according to the result of the classification decision process, and outputting a bitstream including codes of low frequency signals of the input signals, adaptive codes of the high frequency signals, and the result of the classification decision process. The classification decision process is performed on the high frequency signals, and adaptive encoding or adaptive decoding is performed according to the result of the classification decision process, so the quality of voice and audio output signals is improved.

Type: Grant

Filed: May 3, 2011

Date of Patent: March 13, 2012

Assignee: Huawei Technologies Co., Ltd.

Inventors: Lei Miao, Zexin Liu, Longyin Chen, Chen Hu, Wei Xiao, Herve Marcel Taddei, Qing Zhang
System for distinguishing desired audio signals from noise

Patent number: 8131544

Abstract: A system distinguishes a primary audio source and background noise to improve the quality of an audio signal. A speech signal from a microphone may be improved by identifying and dampening background noise to enhance speech. Stochastic models may be used to model speech and to model background noise. The models may determine which portions of the signal are speech and which portions are noise. The distinction may be used to improve the signal's quality, and for speaker identification or verification.

Type: Grant

Filed: November 12, 2008

Date of Patent: March 6, 2012

Assignee: Nuance Communications, Inc.

Inventors: Tobias Herbig, Oliver Gaupp, Franz Gerl
System and method for automatically adjusting floor controls for a conversation

Patent number: 8126705

Abstract: A system and method for automatically adjusting floor controls for a conversation is provided. Audio streams are received, which each originate from an audio source. Floor controls for a current configuration including at least a portion of the audio streams are maintained. Conversational characteristics shared by two or more of the audio sources are determined. Possible configurations for the audio streams are identified based on the conversational characteristics. An analysis of the current configuration and the possible configurations is performed. A change threshold is applied to the analysis. When the analysis satisfies the change threshold, the floor controls are automatically adjusted. The audio streams are mixed into one or more outputs based on the adjusted floor controls.

Type: Grant

Filed: November 9, 2009

Date of Patent: February 28, 2012

Assignee: Palo Alto Research Center Incorporated

Inventors: Paul Masami Aoki, Margaret H. Szymanski, James D. Thornton, Daniel H. Wilson, Allison Gyle Woodruff
Encoding apparatus and encoding method

Patent number: 8121850

Abstract: An encoding device and an encoding method are provided for encoding by reducing the number of samples to be processed when encoding higher-band spectrum data according to lower-band spectrum data in a wide-band signal. The device and the method can obtain a high-quality decoded signal even if a large quantization distortion is caused in the lower-band spectrum data. When encoding higher-band spectrum data in a signal to be encoded, according to lower-band spectrum data in the signal, only for a part (a head portion) of the higher-band spectrum data, the lower-band spectrum data after being quantized is subjected to approximate partial search and higher-band spectrum data is generated according to the search result.

Type: Grant

Filed: May 9, 2007

Date of Patent: February 21, 2012

Assignee: Panasonic Corporation

Inventors: Tomofumi Yamanashi, Kaoru Sato, Toshiyuki Morii
Voice processing apparatus and program

Patent number: 8117031

Abstract: A voice processing apparatus has a storage device that stores registration information containing a characteristic parameter of a given voice. The voice processing apparatus is further provided with a judgment unit, a management unit and a notification unit. The judgment unit judges whether an input voice is appropriate or not for creating or updating the registration information based on a degree of a difference between an inter-band correlation matrix of an input voice acquired this time and an inter-band correlation matrix of another input voice that is judged as being appropriate last time. The management unit creates or updates the registration information based on a characteristic parameter of the input voice when the judgment unit judges that the input voice is appropriate. The notification unit notifies a speaker of the input voice when the judgment unit judges that the input voice is inappropriate.

Type: Grant

Filed: December 20, 2007

Date of Patent: February 14, 2012

Assignee: Yamaha Corporation

Inventors: Takehiko Kawahara, Yasuo Yoshioka
Sound encoder and sound encoding method for generating a second layer decoded signal based on a degree of variation in a first layer decoded signal

Patent number: 8099275

Abstract: A sound encoder having an improved quantization performance while suppressing an increase of the bit rate to a lowest level. In a second layer encoder, a standard deviation calculator calculates a standard deviation ?c of a first layer decoding spectrum after decoding a scale factor ratio multiplication and outputs the standard deviation ?c to a selector. The selector selects a linear transform function as a function for a nonlinear transform of a residual spectrum according to the standard deviation ?c A nonlinear transform function selects one of prepared nonlinear transform functions #1 to #N according to a result of the selection by the selector, and outputs the selected one to an inverse transformer. The inverse transformer subjects an inverse transform (expansion) to a residual spectrum candidate that is stored in a residual spectrum code book using the nonlinear transform function outputted from the nonlinear transform function and outputs the result to an adder.

Type: Grant

Filed: October 25, 2005

Date of Patent: January 17, 2012

Assignee: Panasonic Corporation

Inventor: Masahiro Oshikiri
Dynamic modification of a high-order perceptual attribute of an audio signal

Patent number: 8086448

Abstract: Processing an audio signal associated with a sound recording made available to be rendered to an end user is disclosed. The audio signal is received. A high-order perceptual attribute of the audio signal as rendered is changed by modifying the audio signal. The modification may be based on real-time analysis of the audio signal.

Type: Grant

Filed: March 29, 2004

Date of Patent: December 27, 2011

Assignee: Creative Technology Ltd

Inventors: Michael Goodwin, Carlos Avendano, Ramkumar Sridharan, Martin Wolters
Apparatus and method for processing signal, recording medium, and program

Patent number: 8065141

Abstract: A signal processing apparatus includes a decoding unit, an analyzing unit, a synthesizing unit, and a selecting unit. The decoding unit decodes an input encoded audio signal and outputs a playback audio signal. When loss of the encoded audio signal occurs, the analyzing unit analyzes the playback audio signal output before the loss occurs and generates a linear predictive residual signal. The synthesizing unit synthesizes a synthesized audio signal on the basis of the linear predictive residual signal. The selecting unit selects one of the synthesized audio signal and the playback audio signal and outputs the selected audio signal as a continuous output audio signal.

Type: Grant

Filed: August 24, 2007

Date of Patent: November 22, 2011

Assignee: Sony Corporation

Inventor: Yuuji Maeda
System enhancement of speech signals

Patent number: 8050914

Abstract: A system enhances speech by detecting a speaker's utterance through a first microphone positioned a first distance from a source of interference. A second microphone may detect the speaker's utterance at a different position. A monitoring device may estimate the power level of a first microphone signal. A synthesizer may synthesize part of the first microphone signal by processing the second microphone signal. The synthesis may occur when power level is below a predetermined level.

Type: Grant

Filed: November 12, 2008

Date of Patent: November 1, 2011

Assignee: Nuance Communications, Inc.

Inventors: Gerhard Uwe Schmidt, Mohamed Krini
Signal classifying method and apparatus

Patent number: 8050916

Abstract: A signal classifying method and apparatus are disclosed. The signal classifying method includes: obtaining a spectrum fluctuation parameter of a current signal frame determined as a foreground frame, and buffering the spectrum fluctuation parameter; obtaining a spectrum fluctuation variance of the current signal frame according to spectrum fluctuation parameters of all buffered signal frames, and buffering the spectrum fluctuation variance; and calculating a ratio of signal frames whose spectrum fluctuation variance is above or equal to a first threshold to all the buffered signal frames, and determining the current signal frame as a speech frame if the ratio is above or equal to a second threshold or determining the current signal frame as a music frame if the ratio is below the second threshold.

Type: Grant

Filed: April 12, 2011

Date of Patent: November 1, 2011

Assignee: Huawei Technologies Co., Ltd.

Inventors: Yuanyuan Liu, Zhe Wang, Eyal Shlomot
Methods of identification using voice sound analysis

Patent number: 8036891

Abstract: Methods of using individually distinctive patterns of voice characteristics to identify a speaker include computing the reassigned spectrogram of each of at least two voice samples, pruning each reassigned spectrogram to remove noise and other computational artifacts, and comparing (either visually or with the aid of a processor) the strongest points to determine whether the voice samples belong to the same speaker.

Type: Grant

Filed: June 26, 2008

Date of Patent: October 11, 2011

Assignee: California State University, Fresno

Inventor: Sean Fulop
Identification of the presence of speech in digital audio data

Patent number: 8036884

Abstract: The present invention provides a method, a computer-software-product and an apparatus for enabling a determination of speech related audio data within a record of digital audio data. The method comprises steps for extracting audio features from the record of digital audio data, for classifying one or more subsections of the record of digital audio data, and for marking at least a part of the record of digital audio data classified as speech. The classification of the digital audio data record is performed on the basis of the extracted audio features and with respect to at least one predetermined audio class.

Type: Grant

Filed: February 24, 2005

Date of Patent: October 11, 2011

Assignee: Sony Deutschland GmbH

Inventors: Yin Hay Lam, Josep Maria Sola I Caros
Method and apparatus for low bit rate speech coding detection

Patent number: 8032366

Abstract: To increase channel capacity, mobile phone carriers have deployed speech coders, such as Advanced MultiBand Excitation coding (AMBE), in networks to reduce the bit rate of each call. One undesired consequence of employing such speech coders is that the voice quality can be much worse as compared to higher bit-rate speech coders. A method or corresponding apparatus in an example embodiment of the present invention performs voice quality enhancement transparently within a network by detecting use of a coder applying rate reduction to a speech signal and known to have an adverse effect on a coded speech signal. Upon detection of the use of such coder, the coded speech signal is corrected based on components introduced into the coded speech signal due to the rate reduction. As a result of applying the voice quality enhancement, adverse effects of speech coders can be reduced, while maintaining high quality voice signals.

Type: Grant

Filed: May 16, 2008

Date of Patent: October 4, 2011

Assignee: Tellabs Operations, Inc.

Inventors: Daniel Mapes-Riordan, Steve R. Page
Speech codecs

Patent number: 8019599

Abstract: A method and apparatus include a voice activity detection module configured to detect silent frames, and a codec mode selection module configured to determine a codec mode. The voice activity detection module includes a receiver configured to receive a frame, a first determiner configured to determine a first set of parameters from the frame, and a providing unit configured to provide the first set of parameters to the codec mode selection module. The codec mode selection module includes a second determiner configured to determine a second set of parameters in dependence on the first set of parameters, and a selector configured to select a codec mode in dependence on the second set of parameters.

Type: Grant

Filed: September 23, 2009

Date of Patent: September 13, 2011

Assignee: Nokia Corporation

Inventor: Jari Makinen
Band based audio coding and decoding apparatuses, methods, and recording media for scalability

Patent number: 8015017

Abstract: Audio coding and decoding apparatuses and methods which support fine granularity scalability (FGS) using harmonic information of a high-band audio signal or wideband error audio signal when performing wideband audio coding and decoding, and recording mediums on which the methods are stored. The audio coding method includes detecting harmonics of a high-band audio signal or wideband error audio signal of an input audio signal; determining an order of the detected harmonics; and coding the detected harmonics based on the determined order.

Type: Grant

Filed: January 24, 2006

Date of Patent: September 6, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Hosang Sung, Rakesh Taori, Kangeun Lee
Scalable encoder, scalable decoder, and scalable encoding method

Patent number: 8010349

Abstract: A scalable encoder enabling improvement of the encoding efficiency in the second layer and improvement of the quality of the original signal decoded using the encoding signal in the second layer. A predictive coefficient encoder of the scalable encoder has a predictive coefficient codebook where candidates of the predictive coefficient are recorded. After searching the predictive coefficient codebook, the scale factor of the first layer decoded signal inputted from a scale factor calculator is multiplied, and a predictive coefficient which most approximates the multiplication result to the scale factor of the original signal inputted from the scale factor calculator is determined and encoded, and the coded code is inputted to a multiplexer.

Type: Grant

Filed: October 11, 2005

Date of Patent: August 30, 2011

Assignee: Panasonic Corporation

Inventor: Masahiro Oshikiri
Decimated bisectional pitch refinement

Patent number: 8010350

Abstract: A method and system for refining an estimated pitch period estimate based on a coarse pitch useful for performing frame loss concealment in an audio decoder as well as for other applications. A normalized correlation at the coarse pitch lag is computed and used as the current best candidate. The normalized correlation is then evaluated at the midpoint of the refinement pitch range on either side of the current best candidate. If the normalized correlation at either midpoint is greater than the current best lag, the midpoint with the maximum correlation is selected as the current best lag. After each iteration, the refinement range is decreased by a factor of two and centered on the current best lag. This bisectional search continues until the pitch has been refined to an acceptable tolerance or until the refinement range has been exhausted. During each step of the bisectional pitch refinement, the signal is decimated to reduce the complexity of computing the normalized correlation.

Type: Grant

Filed: April 13, 2007

Date of Patent: August 30, 2011

Assignee: Broadcom Corporation

Inventor: Robert W. Zopf
Methods and systems for sample rate conversion

Patent number: 8005667

Abstract: Methods and systems for sample rate conversion convert a sampled signal to a higher data rate signal. Conversion pulses are received, having a conversion rate that is higher than the sample rate of the sampled signal. Sample points are then reconstructed from the sampled signal, in real time, on either side of a conversion pulse. An interpolation is performed between the reconstructed sample points, at the time of the conversion pulse. The interpolation results are outputted in real time. The process is repeated for additional conversion pulses. The outputted interpolated amplitudes form the higher data rate signal having a data rate equal to the conversion rate. Sample rate conversion is thus performed in real time according to the higher data rate clock, rather than with fixed ratios. As a result, when the higher data rate clock is affected by, for example, jitter or other frequency variations, the higher data rate samples immediately track the lower data rate samples.

Type: Grant

Filed: August 4, 2008

Date of Patent: August 23, 2011

Assignee: Broadcom Corporation

Inventor: Hoang Nhu
Formants extracting method combining spectral peak picking and roots extraction

Patent number: 8000959

Abstract: In a formants extracting method capable of precisely obtaining formants as resonance frequencies of voice with less computational complexity, the method includes searching a maximum value by a spectral peak-picking method, judging whether the number of formants corresponding to a zero at the obtained maximum point are two, and analyzing a pertinent root by roots polishing when the number of the formants are judged as two. The number of the formants are judged by applying Cauchy's integral formula, wherein Cauchy's integral formula is not applied repeatedly but only once at a surrounding portion of the maximum value in a z-domain.

Type: Grant

Filed: October 6, 2004

Date of Patent: August 16, 2011

Assignee: LG Electronics Inc.

Inventor: Chan-Woo Kim
Method and apparatus for estimating degree of similarity between voices

Patent number: 7996213

Abstract: A similarity degree estimation method is performed by two processes. In a first process, an inter-band correlation matrix is created from spectral data of an input voice such that the spectral data are divided into a plurality of discrete bands which are separated from each other with spaces therebetween along a frequency axis, a plurality of envelope components of the spectral data are obtained from the plurality of the discrete bands, and elements of the inter-band correlation matrix are correlation values between the respective envelope components of the input voice. In a second process, a degree of similarity is calculated between a pair of input voices to be compared with each other by using respective inter-band correlation matrices obtained for the pair of the input voices through the inter-band correlation matrix creation process.

Type: Grant

Filed: March 20, 2007

Date of Patent: August 9, 2011

Assignee: Yamaha Corporation

Inventors: Mikio Tohyama, Michiko Kazama, Satoru Goto, Takehiko Kawahara, Yasuo Yoshioka
Adaptive grouping of parameters for enhanced coding efficiency

Patent number: 7991610

Abstract: The present invention is based on the finding that parameters including a first set of parameters of a representation of a first portion of an original signal and including a second set of parameters of a representation of a second portion of the original signal can be efficiently encoded, when the parameters are arranged in a first sequence of tuples and in a second sequence of tuples, wherein the first sequence of tuples comprises tuples of parameters having two parameters from a single portion of the original signal and wherein the second sequence of tuples comprises tuples of parameters having one parameter from the first portion and one parameter from the second portion of the original signal. An efficient encoding can be achieved using a bit estimator to estimate the number of necessary bits to encode the first and the second sequence of tuples, wherein only the sequence of tuples is encoded, that results in the lower number of bits.

Type: Grant

Filed: October 5, 2005

Date of Patent: August 2, 2011

Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung e.V.

Inventors: Ralph Sperschneider, Jürgen Herre, Karsten Linzmeier, Johannes Hilpert
METHODS AND SYSTEMS FOR IDENTIFYING SPEECH SOUNDS USING MULTI-DIMENSIONAL ANALYSIS

Publication number: 20110178799

Abstract: Methods and systems of identifying speech sound features within a speech sound are provided. The sound features may be identified using a multi-dimensional analysis that analyzes the time, frequency, and intensity at which a feature occurs within a speech sound, and the contribution of the feature to the sound. Information about sound features may be used to enhance spoken speech sounds to improve recognizability of the speech sounds by a listener.

Type: Application

Filed: July 24, 2009

Publication date: July 21, 2011

Applicant: The Board of Trustees of the University of Illinois

Inventors: Jont B. Allen, Feipeng Li
System and method for voice user interface navigation

Patent number: 7974836

Abstract: A Voice User Interface (VUI) or Interactive Voice Response (IVR) system utilizes three levels of navigation (e.g. Main Menu, Services, and Helper Commands) in presenting information units arranged in sets. The units are “spoken” by a system in a group to a human user and the group of information at each level is preceded by a tone that is unique to the level. When navigating the levels, the tones of the levels are in a musical progression, e.g. the three-note blues progression I, IV, V, for preceding the groups of information, respectively. The musical progression returns to the tonic of the musical key when the navigation returns to the level one of the first group of information.

Type: Grant

Filed: December 8, 2009

Date of Patent: July 5, 2011

Assignee: Verizon Business Global LLC

Inventor: Paul T. Schultz
System and method for switching between a first filter and a second filter for a received audio signal

Patent number: 7970604

Abstract: System, method and computer-readable medium are disclosed for using filters signal processing. The system includes a module that receives information regarding a first filter, a module that receives information regarding a second filter, and a module that receives date to indicate switching between the first filter and the second filter across the spectrum of the received audio signal, and a module that processes the received audio signal according to the received data and switching between the first filter and the second filter, wherein at least one of the first filter and the second filter represent a merger of two initial filters.

Type: Grant

Filed: March 3, 2009

Date of Patent: June 28, 2011

Assignee: AT&T Intellectual Property II, L.P.

Inventors: James David Johnston, Shyh-Shiaw Kuo
Cue-based audio coding/decoding

Patent number: 7941320

Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.

Type: Grant

Filed: August 27, 2009

Date of Patent: May 10, 2011

Assignee: Agere Systems, Inc.

Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
APPARATUS AND METHOD FOR IMPROVING VOICE QUALITY IN PORTABLE TERMINAL

Publication number: 20110106530

Abstract: An apparatus and method for improving a sound quality of a portable terminal are provided. Particularly, an apparatus and method for tuning to a voice quality that a user desires using the portable terminal, in case where the user determines an abnormal sound quality, are provided. The apparatus includes a sound quality improving unit for selecting a Digital Signal Processing (DSP) filter value corresponding to a voice quality that a user desires among a plurality of DSP filters for controlling the voice quality according to a user's voice quality.

Type: Application

Filed: October 20, 2010

Publication date: May 5, 2011

Applicant: SAMSUNG ELECTRONICS CO. LTD.

Inventor: Tae-Seon KIM
DETERMINING AN UPPERBAND SIGNAL FROM A NARROWBAND SIGNAL

Publication number: 20110099004

Abstract: A method for determining an upperband speech signal from a narrowband speech signal is disclosed. A list of narrowband line spectral frequencies (LSFs) is determined from the narrowband speech signal. A first pair of adjacent narrowband LSFs that have a lower difference between them than every other pair of adjacent narrowband LSFs in the list is determined. A first feature that is a mean of the first pair of adjacent narrowband LSFs is determined. Upperband LSFs are determined based on at least the first feature using codebook mapping.

Type: Application

Filed: October 22, 2010

Publication date: April 28, 2011

Applicant: QUALCOMM Incorporated

Inventors: Venkatesh Krishnan, Daniel J. Sinder, Ananthapadmanabhan Arasanipalai Kandhadai
Vocoder system and method for vocal sound synthesis

Patent number: 7933768

Abstract: A vocoder system for improving the performance expression of an output sound while lightening the computational load. The system includes formant detection means and division means in which the center frequencies have been fixed. The modulation level with which the levels of each of the frequency bands that have been divided in the division means are set by a setting means based on the levels of each of the frequency bands that correspond to those that have been detected in the formant detection means and formant information with which the formants are changed. Therefore, it is possible to improve the performance expression of the output sound with a light computational load and without the need to calculate and change the filter figure of each filter for each sample in order to change the center frequency and bandwidth of each of the filters comprising the division means.

Type: Grant

Filed: March 23, 2004

Date of Patent: April 26, 2011

Assignee: Roland Corporation

Inventor: Tadao Kikumoto
Method and apparatus for estimating harmonic information, spectral envelope information, and degree of voicing of speech signal

Patent number: 7912709

Abstract: A degree of voicing is extracted using the characteristic of harmonic peaks existing in a constant period by converting an input speech or audio signal to a speech signal of the frequency domain, selecting the greatest peak in a first pitch period of the converted speech signal as a harmonic peak, thereafter selecting a peak having the greatest spectral value among peaks existing in each peak search range of the speech signal as a harmonic peak, extracting harmonic spectral envelope information by performing interpolation of the selected harmonic peaks, extracting non-harmonic spectral envelope information by performing interpolation of the non-harmonic peaks, and comparing the two pieces of envelope information to each other.

Type: Grant

Filed: April 4, 2007

Date of Patent: March 22, 2011

Assignee: Samsung Electronics Co., Ltd

Inventor: Hyun-Soo Kim
Tower of numeric types

Patent number: 7890320

Abstract: Various technologies and techniques are disclosed for providing a numeric tower that represents a structure supporting statically defined numeric data types. The numeric data types each are operable to implement a different but accurate representation of a particular value. Numeric operations are supported for the numeric tower that can be performed with any of the statically defined numeric data types. The numeric tower is extensible, and allows for additional statically defined numeric data types to be added, as well as operations. The numeric tower is also operable to detect overflow situations. For example, suppose a result of an operation will result in an overflow situation because the operation does not fit within a range supported by the particular numeric type. The system converts the numeric type to a different one of the numeric data types when the result does not fit within a range supported by the first one.

Type: Grant

Filed: April 17, 2007

Date of Patent: February 15, 2011

Assignee: Microsoft Corporation

Inventors: Melitta Andersen, Ryan Byington, Brian Grunkemeyer, James S. Miller, Anthony J. Moore, Ariel Weinstein
Speech Intelligibility Improvement Method and Apparatus

Publication number: 20110015922

Abstract: Prevalence detection is advantageously applied to the result of specific spectral discrimination to adaptively determine prevalent frequencies existing within an audio signal containing speech. Prevalent frequencies in this audio signal so isolated are attenuated in a highly selective manner, thus reducing the masking potential of pervasive resonances and obfuscative energy within the speech itself over low energy language-imparting speech elements.

Type: Application

Filed: July 20, 2010

Publication date: January 20, 2011

Inventor: Larry Joseph Kirn
Scalable encoding apparatus, scalable decoding apparatus, scalable encoding method, scalable decoding method, communication terminal apparatus, and base station apparatus

Patent number: 7848925

Abstract: A scalable encoding apparatus, a scalable decoding apparatus and the like are disclosed which can achieve a band scalable LSP encoding that exhibits both a high quantization efficiency and a high performance. In these apparatuses, a narrow band-to-wide band converter receives and converts a quantized narrow band LSP to a wide band, and then outputs the quantized narrow band LSP as converted (i.e., a converted wide band LSP parameter) to an LSP-to-LPC converter. The LSP-to-LPC converter converts the quantized narrow band LSP as converted to a linear prediction coefficient and then outputs it to a pre-emphasizer. The pre-emphasizer calculates and outputs the pre-emphasized linear prediction coefficient to an LPC-to-LSP converter. The LPC-to-LSP converter converts the pre-emphasized linear prediction coefficient to a pre-emphasized quantized narrow band LSP as wide band converted, and then outputs it to a prediction quantizer.

Type: Grant

Filed: September 15, 2005

Date of Patent: December 7, 2010

Assignee: Panasonic Corporation

Inventor: Hiroyuki Ehara
Apparatus and method for detecting degree of voicing of speech signal

Patent number: 7835905

Abstract: In order to detect a degree of voicing of a speech signal, an input speech signal is converted to a speech signal in the frequency domain, a pitch value is calculated from the speech signal, a plurality of harmonic peaks existing in the speech signal are detected, and a difference obtained by comparing the pitch value to an interval between adjacent harmonic peaks among the detected harmonic peaks is detected as the degree of voicing included in the speech signal.

Type: Grant

Filed: April 4, 2007

Date of Patent: November 16, 2010

Assignee: Samsung Electronics Co., Ltd

Inventor: Hyun-Soo Kim
Method and apparatus for low bit rate encoding and decoding

Patent number: 7835907

Abstract: An apparatus and method of low bit rate encoding and reproducing. The method includes transforming input audio signals in a time domain into spectral signals in a frequency domain, extracting important-spectrum components from the spectral signals in the frequency domain, and quantizing the important-spectrum components, extracting residual-spectrum components other than the important-spectrum components from the spectral signals in the frequency domain, and calculating and quantizing a noise level of the residual-spectrum components, and encoding the quantized important-spectrum components and the quantized noise level losslessly, and outputting encoded bitstreams.

Type: Grant

Filed: December 21, 2005

Date of Patent: November 16, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Junghoe Kim, Eunmi Oh, Boris Kudryashov, Konstantin Osipov
Method for Estimating a Fundamental Frequency of a Speech Signal

Publication number: 20100286981

Abstract: The invention provides a method for estimating a fundamental frequency of a speech signal comprising the steps of receiving a signal spectrum of the speech signal, filtering the signal spectrum to obtain a refined signal spectrum, determining a cross-power spectral density using the refined signal spectrum and the signal spectrum, transforming the cross-power spectral density into the time domain to obtain a cross-correlation function, and estimating the fundamental frequency of the speech signal based on the cross-correlation function.

Type: Application

Filed: May 3, 2010

Publication date: November 11, 2010

Applicant: NUANCE COMMUNICATIONS, INC.

Inventors: Mohamed Krini, Gerhard Schmidt
Robust decoder

Patent number: 7831421

Abstract: Techniques and tools related to delayed or lost coded audio information are described. For example, a concealment technique for one or more missing frames is selected based on one or more factors that include a classification of each of one or more available frames near the one or more missing frames. As another example, information from a concealment signal is used to produce substitute information that is relied on in decoding a subsequent frame. As yet another example, a data structure having nodes corresponding to received packet delays is used to determine a desired decoder packet delay value.

Type: Grant

Filed: May 31, 2005

Date of Patent: November 9, 2010

Assignee: Microsoft Corporation

Inventors: Hosam A. Khalil, Tian Wang, Kazuhito Koishida, Xiaoqin Sun, Wei-Ge Chen
Speaker intent analysis system

Patent number: 7822611

Abstract: A speaker intent analysis system and method for validating the truthfulness and intent of a plurality of participants' responses to questions. A computer stores, retrieves, and transmits a series of questions to be answered audibly by participants. The participants' answers are received by a data processor. The data processor analyzes and records the participants' speech parameters for determining the likelihood of dishonesty. In addition to analyzing participants' speech parameters for distinguishing stress or other abnormality, the processor may be equipped with voice recognition software to screen responses that while not dishonest, are indicative of possible malfeasance on the part of the participants. Once the responses are analyzed, the processor produces an output that is indicative of the participant's credibility. The output may be sent to proper parties and/or devices such as a web page, computer, e-mail, PDA, pager, database, report, etc. for appropriate action.

Type: Grant

Filed: February 20, 2003

Date of Patent: October 26, 2010

Inventor: David B. Bezar
SYSTEM AND METHOD FOR FREQUENCY TO PHASE BALANCING FOR TIMBRE-ACCURATE LOW BIT RATE AUDIO ENCODING

Publication number: 20100241423

Abstract: Embodiments of a system and method for encoding audio data have been described. In one embodiment, the method includes transforming frequency domain data in a plurality of signal windows of an audio dataset from a cosine/sine format to a magnitude/cosine/sine format. The magnitude/cosine/sine format disproportionately represents a magnitude of the frequency domain data over a phase of the frequency domain data. The above transformation may be a pre-processing stage of vector quantization usable to produce a codebook.

Type: Application

Filed: March 18, 2009

Publication date: September 23, 2010

Inventors: Stanley Wayne Jackson, Jay T. Dresser
SPEECH ANALYSIS DEVICE, SPEECH ANALYSIS AND SYNTHESIS DEVICE, CORRECTION RULE INFORMATION GENERATION DEVICE, SPEECH ANALYSIS SYSTEM, SPEECH ANALYSIS METHOD, CORRECTION RULE INFORMATION GENERATION METHOD, AND PROGRAM

Publication number: 20100217584

Abstract: A speech analysis device which accurately analyzes an aperiodic component included in speech in a practical environment where there is background noise includes: a frequency band division unit which divides, into bandpass signals each associated with a corresponding one of frequency bands, an input signal representing a mixed sound of background noise and speech; a noise interval identification unit which identifies a noise interval and a speech interval of the input signal; an SNR calculation unit which calculates an SN ratio; a correlation function calculation unit which calculates an autocorrelation function of each bandpass signal; a correction amount determination unit which determines a correction amount for an aperiodic component ratio, based on the calculated SN ratio; and an aperiodic component ratio calculation unit which calculates, for each frequency band, an aperiodic component ratio of the aperiodic component, based on the determined correction amount and the calculated autocorrelation function.

Type: Application

Filed: May 4, 2010

Publication date: August 26, 2010

Inventors: Yoshifumi Hirose, Takahiro Kamai
Signal processing using pilot based coding

Patent number: 7774199

Abstract: An apparatus for processing a signal and method thereof are disclosed. Data coding and entropy coding are performed with interconnection, and grouping is used to enhance coding efficiency. The present invention includes the steps of obtaining a difference value and index information and entropy-decoding the index information and identifying an entropy table corresponding to the entropy-decoded index information and entropy-decoding the difference value using the identified entropy table and obtaining data using a reference value corresponding to a plurality of data and the decoded difference value.

Type: Grant

Filed: October 9, 2006

Date of Patent: August 10, 2010

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Hyen-O Oh, Dong Soo Kim, Jae Hyun Lim, Yang-Won Jung, Hyo Jin Kim

prev 1 2 3 4 5 6 7 8 9 next