Analysis By Synthesis Patents (Class 704/220)

Audio encoder, audio decoder and related methods for processing multi-channel audio signals using complex prediction

Patent number: 8655670

Abstract: An encoder, based on a combination of two audio channels, obtains a first combination signal as a mid-signal and a residual signal derivable using a predicted side signal derived from the mid signal. The first combination signal and the prediction residual signal are encoded and written into a data stream together with the prediction information. A decoder generates decoded first and second channel signals using the prediction residual signal, the first combination signal and the prediction information. A real-to-imaginary transform may be applied for estimating the imaginary part of the spectrum of the first combination signal. For calculating the prediction signal used in the derivation of the prediction residual signal, the real-valued first combination signal is multiplied by a real portion of the complex prediction information and the estimated imaginary part of the first combination signal is multiplied by an imaginary portion of the complex prediction information.

Type: Grant

Filed: October 5, 2012

Date of Patent: February 18, 2014

Assignees: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V., Dolby International AB

Inventors: Heiko Purnhagen, Pontus Carlsson, Lars Villemoes, Julien Robillard, Matthias Neusinger, Christian Helmrich, Johannes Hilpert, Nikolaus Rettelbach, Sascha Disch, Bernd Edler
Audio encoder, audio decoder, method for encoding an audio information, method for decoding an audio information and computer program using an iterative interval size reduction

Patent number: 8655669

Abstract: An audio decoder has an arithmetic decoder for providing decoded spectral values on the basis of an arithmetically-encoded representation and a frequency-domain-to-time-domain converter for providing a time-domain audio representation. The arithmetic decoder selects a mapping rule describing a mapping of a code value onto a symbol code in dependence on a numeric current context value describing a current context state. The arithmetic decoder determines the numeric current context value in dependence on a plurality of previously decoded spectral values. The arithmetic decoder evaluates at least one table using an iterative interval size reduction to determine whether the numeric current context value is identical to a table context value described by an entry of the table or lies within an interval described by entries of the table, and derives a mapping rule index value describing a selected mapping table. An audio encoder also uses an iterative interval table size reduction.

Type: Grant

Filed: April 19, 2012

Date of Patent: February 18, 2014

Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.

Inventors: Guillaume Fuchs, Vignesh Subbaraman, Nikolaus Rettelbach, Markus Multrus, Marc Gayer, Patrick Warmbold, Christian Griebel, Oliver Weiss
Speech conversion

Patent number: 8650035

Abstract: A speech conversion system facilitates voice communications. A database comprises a plurality of conversion heuristics, at least some of the conversion heuristics being associated with identification information for at least one first party. At least one speech converter is configured to convert a first speech signal received from the at least one first party into a converted first speech signal different than the first speech signal.

Type: Grant

Filed: November 18, 2005

Date of Patent: February 11, 2014

Assignee: Verizon Laboratories Inc.

Inventor: Adrian E. Conway
Audio encoder, audio decoder, method for encoding and audio information, method for decoding an audio information and computer program using a hash table describing both significant state values and interval boundaries

Patent number: 8645145

Abstract: An audio decoder includes an arithmetic decoder for providing a plurality of decoded spectral values on the basis of an arithmetically encoded representation of the spectral values, and a frequency-domain-to-time-domain converter for providing a time-domain audio representation using the decoded spectral values. The arithmetic decoder selects a mapping rule describing a mapping of a code value onto a symbol code in dependence on a context state described by a numeric current context value. The arithmetic decoder determines the numeric current context value in dependence on a plurality of previously decoded spectral values. The arithmetic decoder evaluates a hash table, entries of which define both significant state values and boundaries of intervals of numeric context values, in order to select the mapping rule. A mapping rule index value is individually associated to a numeric context value being a significant state value.

Type: Grant

Filed: July 12, 2012

Date of Patent: February 4, 2014

Assignee: Fraunhoffer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.

Inventors: Vignesh Subbaraman, Guillaume Fuchs, Markus Multrus, Nikolaus Rettelbach, Marc Gayer, Oliver Weiss, Christian Griebel, Patrick Warmbold
System and method for method for improving speech intelligibility of voice calls using common speech codecs

Patent number: 8645142

Abstract: System and method to improve intelligibility of coded speech, the method including: receiving an encoded speech signal from a network; extracting an encoded media data stream and one or more control data packets from the encoded speech signal; decoding the encoded media data stream to produce a decoded speech signal; boosting an upper spectral portion of the decoded speech signal to produce a boosted speech signal; and outputting the boosted speech signal. In another embodiment, the method may include: receiving an uncoded speech signal; processing the uncoded speech signal, wherein the processing comprises generating an unencoded data stream from the uncoded speech signal; boosting an upper spectral portion of the unencoded data stream to produce a boosted speech signal; encoding the boosted speech signal to produce an encoded speech signal; and outputting the boosted speech signal.

Type: Grant

Filed: March 27, 2012

Date of Patent: February 4, 2014

Assignee: Avaya Inc.

Inventors: Heinz Teutsch, John Cornelius Lynch
Classification of voice messages based on analysis of the content of the message and user-provisioned tagging rules

Patent number: 8638911

Abstract: One or more tags are associated with a voice or multimedia message. These tags can be applied to the message based on one or more of an analysis of the message, rules, caller information, presence information, user input and GPS information. Based on the assigned and associated tags, one or more of message handling, classification and one or more actions can be automatically invoked to assist with management of messages. An interface is also provided that allows for the management of the assigned tags as well as the editing and creation of new tags and rules to assist with message management.

Type: Grant

Filed: July 24, 2009

Date of Patent: January 28, 2014

Assignee: Avaya Inc.

Inventors: Mehmet Balasaygun, Michael J. Killian
Speech synthesizing apparatus, method, and program

Patent number: 8630857

Abstract: Disclosed is a speech synthesizing apparatus including a segment selection unit that selects a segment suited to a target segment environment from candidate segments, includes a prosody change amount calculation unit that calculates prosody change amount of each candidate segment based on prosody information of candidate segments and the target segment environment, a selection criterion calculation unit that calculates a selection criterion based on the prosody change amount, a candidate selection unit that narrows down selection candidates based on the prosody change amount and the selection criterion, and an optimum segment search unit than searches for an optimum segment from among the narrowed-down candidate segments.

Type: Grant

Filed: February 15, 2008

Date of Patent: January 14, 2014

Assignee: NEC Corporation

Inventors: Masanori Kato, Reishi Kondo, Yasuyuki Mitsui
Very low bit rate signal coder and decoder

Patent number: 8620660

Abstract: Improved oscillator-based source modeling methods for estimating model parameters, for evaluating model quality for restoring the input from the model parameters, and for improving performance over known in the art methods are disclosed. An application of these innovations to speech coding is described. The improved oscillator model is derived from the information contained in the current input signal as well as from some form of data history, often the restored versions of the earlier processed data. Operations can be performed in real time, and compression can be achieved at a user-specified level of performance and, in some cases, without information loss. The new model can be combined with methods in the existing art in order to complement the properties of these methods, to improve overall performance. The present invention is effective for very low bit-rate coding/compression and decoding/decompression of digital signals, including digitized speech and audio signals.

Type: Grant

Filed: October 29, 2010

Date of Patent: December 31, 2013

Assignee: The United States of America, as Represented by the Secretary of the Navy

Inventors: Anton Yen, Irina Gorodnitsky
Method and apparatus for transforming between different filter bank domains

Patent number: 8620671

Abstract: Filter banks may have different structures and different individual output signal domains. Often a translation between different filter bank domains is desirable. Usually, mapping matrices are used that, however, vary over frequency. This requires a significant amount of lookup tables. A method for transforming first data frames of a first filter bank domain to second data frames of a different second filter bank domain, comprises steps of transcoding sub-bands of the first filter bank domain into sub-bands of an intermediate domain that corresponds to said second filter bank domain but has warped phase, and transcoding the sub-bands of the intermediate domain to sub-bands of the second filter bank domain, wherein a phase correction is performed on the sub-bands of the intermediate domain.

Type: Grant

Filed: February 19, 2009

Date of Patent: December 31, 2013

Assignee: Thomson Licensing

Inventors: Peter Jax, Sven Kordon
Automatic determination of stability and validity of Speech Transmission Index measurements

Publication number: 20130297300

Abstract: The Speech Transmission Index (STI) uses the modulation transfer function to characterize speech communication channels with a single index. This index accurately predicts speech intelligibility. In order to measure the STI, a device or software is needed that utilizes an artificial test signal to estimate the modulation transfer function, and to derive the STI from that function. In practice, many STI measurements are found to be invalid and inaccurate, for two reasons: (1) the measurements are carried out in test conditions for which by design the STI is not intended; (2) equipment is used that does not meet specifications and requirements. A procedure is described to automatically verify if conditions for obtaining valid STI results are met. If possible, corrections are applied to compensate for incorrect settings and poorly adjusted measuring equipment. If automatic correction of the problem is not possible, then a warning is generated.

Type: Application

Filed: May 4, 2012

Publication date: November 7, 2013

Inventor: Sander Jeroen van Wijngaarden
Apparatus and method for encoding a multi-channel audio signal

Patent number: 8577045

Abstract: An encoding apparatus comprises a frame processor (105) which receives a multi channel audio signal comprising at least a first audio signal from a first microphone (101) and a second audio signal from a second microphone (103). An ITD processor 107 then determines an inter time difference between the first audio signal and the second audio signal and a set of delays (109, 111) generates a compensated multi channel audio signal from the multi channel audio signal by delaying at least one of the first and second audio signals in response to the inter time difference signal. A combiner (113) then generates a mono signal by combining channels of the compensated multi channel audio signal and a mono signal encoder (115) encodes the mono signal. The inter time difference may specifically be determined by an algorithm based on determining cross correlations between the first and second audio signals.

Type: Grant

Filed: September 9, 2008

Date of Patent: November 5, 2013

Assignee: Motorola Mobility LLC

Inventor: Jonathan A. Gibbs
Automatic polarity adaptation for ambient noise cancellation

Patent number: 8571226

Abstract: A sound reproducing device has a loudspeaker arranged to produce sound from an audio signal provided by an audio signal source. A microphone is positioned to pick up ambient noise and generate a microphone signal which comprises the noise. An ambient noise cancellation (ANC) system receives the microphone signal from the microphone and generates anti-noise corresponding to the ambient noise in the microphone signal. An automatic polarity adaptation (AAP) system monitors the ANC system and, when a decision criterion is fulfilled, causes a switch in polarity for the generated anti-noise.

Type: Grant

Filed: December 10, 2010

Date of Patent: October 29, 2013

Assignees: Sony Corporation, Sony Mobile Communications AB

Inventor: Peter Isberg
Encoding device, decoding device, and method thereof

Patent number: 8560328

Abstract: A decoding device is capable of flexibly calculating high-band spectrum data with a high accuracy in accordance with an encoding band selected by an upper-node layer of the encoding side. In this device: a first layer decoder decodes first layer encoded information to generate a first layer decoded signal; a second layer decoder decodes second layer encoded information to generate a second layer decoded signal; a spectrum decoder performs a band extension process by using the second layer decoded signal and the first layer decoded signal up-sampled in an up-sampler so as to generate an all-band decoded signal; and a switch outputs the first layer decoded signal or the all-band decoded signal according to the control information generated in a controller.

Type: Grant

Filed: December 14, 2007

Date of Patent: October 15, 2013

Assignee: Panasonic Corporation

Inventors: Tomofumi Yamanashi, Masahiro Oshikiri
Parameter decoding apparatus and parameter decoding method

Patent number: 8538765

Abstract: A parameter decoding apparatus includes a prediction residue decoder that finds a quantized prediction residue based on encoded information included in a current frame subject to decoding and an auto-regressive predictor produces a predicted parameter by multiplying a predictive coefficient with a past decoded parameter. An adder decodes a parameter by adding the quantized prediction residue and the predicted parameter, wherein the prediction residue decoder, when the current frame is erased, finds a current-frame quantized prediction residue from a weighted linear sum of a parameter decoded in the past and a future-frame quantized prediction residue.

Type: Grant

Filed: May 17, 2013

Date of Patent: September 17, 2013

Assignee: Panasonic Corporation

Inventor: Hiroyuki Ehara
Searching for symbol string

Patent number: 8532988

Abstract: A method for searching for an input symbol string, includes receiving (B) an input symbol string, proceeding (C) in a trie data structure to a calculation point indicated by the next symbol, calculating (D) distances at the calculation point, selecting (E) repeatedly the next branch to follow (C) to the next calculation point to repeat the calculation (D). After the calculation (G), selecting the symbol string having the shortest distance to the input symbol string on the basis of the performed calculations. To minimize the number of calculations, not only the distances are calculated (D) at the calculation points, but also the smallest possible length difference corresponding to each distance, and on the basis of each distance and corresponding length difference a reference value is calculated, and the branch is selected (E) in such a manner that next the routine proceeds from the calculation point producing the lowest reference value.

Type: Grant

Filed: July 3, 2003

Date of Patent: September 10, 2013

Assignee: Syslore Oy

Inventor: Jorkki Hyvonen
Handoffs between different voice encoder systems

Patent number: 8521520

Abstract: Provided are methods and systems of managing handoffs in a wireless communication system having different types of vocoders. Some embodiments include translating state memory of a first vocoder to a second vocoder using a state memory transcoder. The state memory may be delayed to align differences in algorithmic delays between the first vocoder and the second vocoder. In one embodiment, a speech signal may be decoded from the first vocoder, delayed, and encoded to the second vocoder. Furthermore, for a period of time during and/or adjacent to the handoff, the first vocoder may output with decreasing amplitude while the second vocoder outputs with increasing amplitude. Such techniques may be used alone or in combination.

Type: Grant

Filed: February 3, 2010

Date of Patent: August 27, 2013

Assignee: General Electric Company

Inventors: Richard Louis Zinser, Michael James Hartman, John Erik Hershey
Adaptive audio signal source vector quantization device and adaptive audio signal source vector quantization method that search for pitch period based on variable resolution

Patent number: 8521519

Abstract: An adaptive sound source vector quantization device includes a first pitch cycle instructor, a search range calculator, and a second pitch cycle instructor. The first pitch cycle instructor successively instructs pitch cycle search candidates in a predetermined search range having a search resolution which transits over a predetermined pitch cycle candidate for the first sub-frame. The search range calculator calculates a predetermined range before and after the pitch cycle of the first sub-frame as the pitch cycle search range for the second sub-frame, if the predetermined range includes the predetermined pitch cycle search candidate. In the predetermined range, the search resolution transits over a boundary defined by the predetermined pitch cycle. The second pitch cycle instructor successively instructs the pitch cycle search candidates in the search range for the second sub-frame.

Type: Grant

Filed: February 29, 2008

Date of Patent: August 27, 2013

Assignee: Panasonic Corporation

Inventors: Kaoru Sato, Toshiyuki Morii
Apparatus and method for synthesizing an output signal

Patent number: 8515759

Abstract: An apparatus for synthesizing a rendered output signal having a first audio channel and a second audio channel includes a decorrelator stage for generating a decorrelator signal based on a downmix signal, and a combiner for performing a weighted combination of the downmix signal and a decorrelated signal based on parametric audio object information, downmix information and target rendering information. The combiner solves the problem of optimally combining matrixing with decorrelation for a high quality stereo scene reproduction of a number of individual audio objects using a multichannel downmix.

Type: Grant

Filed: April 23, 2008

Date of Patent: August 20, 2013

Assignee: Dolby International AB

Inventors: Jonas Engdegard, Heiko Purnhagen, Barbara Resch, Lars Villemoes, Cornelia Falch, Juergen Herre, Johannes Hilpert, Andreas Hoelzer, Leonid Terentiev
Modulation device, modulation method, demodulation device, and demodulation method

Patent number: 8498860

Abstract: A modulation device including: a modulation unit for modulating a carrier in an audible sound range by an encoded transmission signal to generate a modulated signal; a masker sound generation unit for generating a masker signal outputted as a masker sound for making the modulated signal harder to hear when transmitted with the modulated signal; and an acoustic signal generation unit for inserting the masker signal in the modulated signal to generate an acoustic signal.

Type: Grant

Filed: October 2, 2006

Date of Patent: July 30, 2013

Assignee: NTT DoCoMo, Inc.

Inventor: Hosei Matsuoka
HANDHELD DEVICE WIRELESS MUSIC STREAMING FOR GAMEPLAY

Publication number: 20130190088

Abstract: Systems and methods are disclosed for streaming of audio data of separate streams in at least two different formats. In some embodiments handheld game devices are in wireless communication and a first of the handheld game devices streams audio data during game play to a second of the handheld game devices. In some embodiments the audio data includes audio data from a plurality of streams of audio data. In some embodiments the streams of audio data include streams of audio data in different formats, generally different compressed formats, some of which may be selected based on whether a device includes circuitry specifically configured to decompress audio data in a specific data format.

Type: Application

Filed: March 8, 2013

Publication date: July 25, 2013

Applicant: Activision Publishing, Inc.

Inventors: Gregory Keith Oberg, Jesse Nathaniel Booth
Automated sound segment selection method and system

Patent number: 8494844

Abstract: A computerized method and system is provided for automatically selecting from a digitized sound sample a segment of the sample that is optimal for the purpose of measuring clinical metrics for voice and speech assessment. A quality measure based on quality parameters of segments of the sound sample is applied to candidate segments to identify the highest quality segment within the sound sample. The invention can optionally provide feedback to the speaker to help the speaker increase the quality of the sound sample provided. The invention also can optionally perform sound pressure level calibration and noise calibration. The invention may optionally compute clinical metrics on the selected segment and may further include a normative database method or system for storing and analyzing clinical measurements.

Type: Grant

Filed: November 19, 2009

Date of Patent: July 23, 2013

Assignee: Human Centered Technologies, Inc.

Inventor: David N. Fernandes
Audio encoder and decoder with long term prediction

Patent number: 8494863

Abstract: The present invention teaches a new audio coding system that can code both general audio and speech signals well at low bit rates. A proposed audio coding system comprises a linear prediction unit for filtering an input signal based on an adaptive filter; a transformation unit for transforming a frame of the filtered input signal into a transform domain; a quantization unit for quantizing a transform domain signal; a long term prediction unit for determining an estimation of the frame of the filtered input signal based on a reconstruction of a previous segment of the filtered input signal; and a transform domain signal combination unit for combining, in the transform domain, the long term prediction estimation and the transformed input signal to generate the transform domain signal.

Type: Grant

Filed: December 30, 2008

Date of Patent: July 23, 2013

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Arijit Biswas, Heiko Purnhagen, Kristofer Kjoerling, Barbara Resch, Lars Villemoes, Per Hedelin
Method and apparatus for transmitting speech data to a remote device in a distributed speech recognition system

Patent number: 8494849

Abstract: A method of transmitting speech data to a remote device in a distributed speech recognition system, includes the steps of: dividing an input speech signal into frames; calculating, for each frame, a voice activity value representative of the presence of speech activity in the frame; grouping the frames into multiframes, each multiframe including a predetermined number of frames; calculating, for each multiframe, a voice activity marker representative of the number of frames in the multiframe representing speech activity; and selectively transmitting, on the basis of the voice activity marker associated with each multiframe, the multiframes to the remote device.

Type: Grant

Filed: June 20, 2005

Date of Patent: July 23, 2013

Assignee: Telecom Italia S.p.A.

Inventors: Ivano Salvatore Collotta, Donato Ettorre, Maurizio Fodrini, Pierluigi Gallo, Roberto Spagnolo
System and method for modeling speech spectra

Patent number: 8489392

Abstract: A system and method for modeling speech in such a way that both voiced and unvoiced contributions can co-exist at certain frequencies. In various embodiments, three spectral bands (or bands of up to three different types) are used. In one embodiment, the lowest band or group of bands is completely voiced, the middle band or group of bands contains both voiced and unvoiced contributions, and the highest band or group of bands is completely unvoiced. The embodiments of the present invention may be used for speech coding and other speech processing applications.

Type: Grant

Filed: September 13, 2007

Date of Patent: July 16, 2013

Assignee: Nokia Corporation

Inventors: Jani Nurminen, Sakari Himanen
Arrangement for creating and using a phonetic-alphabet representation of a name of a party to a call

Patent number: 8484034

Abstract: A first party creates and edits a phonetic-alphabet representation of its name. The phonetic representation is conveyed to a second party as “caller-identification” information by messages that set up a call between the parties. The phonetic representation of the name is displayed to the second party, converted to speech, and/or converted to an alphabet of a language of the second party and then displayed to the second party.

Type: Grant

Filed: March 31, 2008

Date of Patent: July 9, 2013

Assignee: Avaya Inc.

Inventors: Paul Roller Michaelis, David Mohler, Charles Wrobel
Parameter decoding device, parameter encoding device, and parameter decoding method

Patent number: 8468015

Abstract: A parameter decoding device performs a parameter compensation process so as to suppress degradation of a main observation quality in a prediction quantization. The parameter decoding device includes first amplifiers which multiply inputted quantization prediction residual vectors by a weighting coefficient. A further amplifier multiplies the preceding frame decoding LSF vector yn?1 by the weighting coefficient. An additional amplifier multiplies the code vector xn+1 outputted from a codebook by the weighting coefficient ?0. An adder calculates the total of the vectors outputted from the amplifiers, the further amplifier, and the additional amplifier. A selector switch selects the vector outputted from the adder if the frame erasure coding Bn of the current frame indicates that ‘the n-th frame is an erased frame’ and the frame erasure coding Bn+1 of the next frame indicates that ‘the n+1-th frame is a normal frame’.

Type: Grant

Filed: November 9, 2007

Date of Patent: June 18, 2013

Assignee: Panasonic Corporation

Inventor: Hiroyuki Ehara
Method and arrangement for smoothing of stationary background noise

Patent number: 8457953

Abstract: In a method of smoothing background noise in a telecommunication speech session; receiving and decoding S1O a signal representative of a speech session, the signal comprising both a speech component and a background noise component. Subsequently, determining LPC parameters S20 and an excitation signal S30 for the received signal. Thereafter, synthesizing and outputting (S40) an output signal based on the determined LPC parameters and excitation signal. In addition, modifying S35 the determined excitation signal by reducing power and spectral fluctuations of the excitation signal to provide a smoothed output signal.

Type: Grant

Filed: February 13, 2008

Date of Patent: June 4, 2013

Assignee: Telefonaktiebolaget LM Ericsson (Publ)

Inventor: Stefan Bruhn
Methods for improving high frequency reconstruction

Patent number: 8447621

Abstract: The present invention proposes a new method and a new apparatus for enhancement of audio source coding systems utilizing high frequency reconstruction (HFR). It utilizes a detection mechanism on the encoder side to assess what parts of the spectrum will not be correctly reproduced by the HFR method in the decoder. Information on this is efficiently coded and sent to the decoder, where it is combined with the output of the HFR unit.

Type: Grant

Filed: August 9, 2011

Date of Patent: May 21, 2013

Assignee: Dolby International AB

Inventors: Kristofer Kjörling, Per Ekstrand, Holger Hörich
Distortion-based method and apparatus for buffer control in a communication system

Patent number: 8442819

Abstract: Methods and apparatus are disclosed for controlling a buffer in a communication system, such as a digital audio broadcasting (DAB) communication system. A more consistent perceptual quality over time provides for a more pleasing auditory experience to a listener. The disclosed bit allocation process determines, for each frame, a distortion d[k] at which the frame is to be encoded. The distortion d[k] is determined to minimize (i) the probability for a buffer overflow, and (ii) the variation of perceived distortion over time. A buffer level is controlled by partitioning a signal into a sequence of successive frames; estimating a distortion rate for a number of frames; and selecting a distortion such that the variance of the buffer level is bounded by a specified value.

Type: Grant

Filed: April 13, 2006

Date of Patent: May 14, 2013

Assignee: Agere Systems LLC

Inventor: Christof Faller
Method and apparatus for adaptive sub-band allocation of spectral coefficients

Patent number: 8438012

Abstract: An apparatus and method for adaptive sub-band allocation of spectral coefficients are disclosed. The sizes of sub-bands are determined according to the distribution of spectral coefficients transformed from an input speech/audio signal to perform more elaborate quantization in units of sub-bands. Thus, quantization noise of the spectral coefficients is reduced, and sound quality in a frequency region is enhanced, thereby improving the quality of the signal.

Type: Grant

Filed: September 9, 2009

Date of Patent: May 7, 2013

Assignee: Electronics and Telecommunications Research Institute

Inventors: Hyun Woo Kim, Hyun Joo Bae, Byung Sun Lee
Confidence tying for unsupervised synthetic speech adaptation

Patent number: 8438029

Abstract: Disclosed are apparatus and methods for generating synthesized utterances. A computing device can receive speech data corresponding to spoken utterances of a particular speaker. Textual elements of an input text corresponding to the speech data can be recognized. Confidence levels associated with the recognized textual elements can be determined. Speech-synthesis parameters of decision trees can be adapted based on the speech data, recognized textual elements, and associated confidence levels. Each adapted decision tree can map individual elements of a text to individual of the speech-synthesis parameters. A second input text can be received. The second input text can be mapped to speech-synthesis parameters using the adapted decision trees. A synthesized spoken utterance can be generated corresponding to the second input text using the speech-synthesis parameters. At least some of the speech-synthesis parameters are configured to simulate the particular speaker.

Type: Grant

Filed: August 22, 2012

Date of Patent: May 7, 2013

Assignee: Google Inc.

Inventors: Matthew Nicholas Stuttle, Byungha Chun
Method and arrangement for speech coding in wireless communication systems

Patent number: 8438018

Abstract: The present invention relates to speech coding in wireless and wireline communication systems. The present invention provides a method of saving bandwidth by a controlled dropping of speech frames at an encoder in a sending communication device. The dropping is controlled in a manner to minimize the effects on the speech quality after the decoding in the receiving communication device, by assuring that the state mismatch between the encoder and the decoder is removed or at least significantly reduced. This is achieved by letting the encoder run an ECU algorithm with a similar behavior as the one running in the decoder in the receiving communication device.

Type: Grant

Filed: February 6, 2006

Date of Patent: May 7, 2013

Assignee: Telefonaktiebolaget LM Ericsson (Publ)

Inventors: Ingemar Johansson, Jonas Svedberg
Audio encoder, decoder, and encoding method thereof

Patent number: 8423371

Abstract: An encoder capable of reducing the degradation of the quality of the decoded signal in the case of band expansion in which the high band of the spectrum of an input signal is estimated from the low band. In this encoder, a first layer encoder encodes an input signal and generates first encoded information, a first layer decoder decodes the first encoded information and generates a first decoded signal, a characteristic judger analyzes the intensity of the harmonic structure of the input signal and generates harmonic characteristic information representing the analysis result, and a second layer encoder changes, on the basis of the harmonic characteristic information, the numbers of bits allocated to parameters included in second encoded information created by encoding the difference between the input signal and the first decoded signal before creating the second information.

Type: Grant

Filed: December 22, 2008

Date of Patent: April 16, 2013

Assignee: Panasonic Corporation

Inventors: Tomofumi Yamanashi, Masahiro Oshikiri
Method and device for efficient quantization of transform information in an embedded speech and audio codec

Patent number: 8396707

Abstract: A method and device for coding an input sound signal in at least one lower layer and at least one upper layer of an embedded codec comprises, in the at least one lower layer, coding the input sound signal to produce coding parameters, wherein coding the input sound signal comprises producing a synthesized sound signal. An error signal is computed as a difference between the input sound signal and the synthesized sound signal and a spectral mask is calculated as a function of a minima of a spectrum related to the input sound signal. In the at least one upper layer, the error signal is coded to produce coding coefficients, the spectral mask is applied to the coding coefficients, and the masked coding coefficients are quantized. Applying the spectral mask to the coding coefficients reduces the quantization noise produced upon quantizing the coding coefficients.

Type: Grant

Filed: September 25, 2008

Date of Patent: March 12, 2013

Assignee: VoiceAge Corporation

Inventors: Tommy Vaillancourt, Redwan Salami
Concealment of transmission error in a digital audio signal in a hierarchical decoding structure

Patent number: 8391373

Abstract: A method is provided for concealing a transmission error in a digital signal chopped into a plurality of successive frames associated with different time intervals in which, on reception, the signal may comprise erased frames and valid frames, the valid frames comprising information relating to the concealment of frame loss. The method is implemented during a hierarchical decoding using a core decoding and a transform-based decoding using windows introducing a time delay of less than a frame with respect to the core decoding. The method includes concealing a first set of missing samples for the erased frame, implemented in a first time interval; a step of concealing a second set of missing samples utilizing information of said valid frame and implemented in a second time interval; and a step of transition between the first and the second set of missing samples to obtain at least part of the missing frame.

Type: Grant

Filed: March 20, 2009

Date of Patent: March 5, 2013

Assignee: France Telecom

Inventors: David Virette, Pierrick Philippe, Balazs Kovesi
Method, device and system for enhancement layer signal encoding and decoding

Patent number: 8380526

Abstract: A method, device and system for signal encoding and decoding are disclosed. The method includes: encoding a core layer signal to obtain a core layer signal code; selecting an enhancement sample point that requires enhancement layer signal encoding according to the core layer signal code and the number of bits that can be used by an enhancement layer; obtaining an enhancement layer signal code of the enhancement sample point; and outputting a bit stream, where the bit stream includes the core layer signal code and the enhancement layer signal code. In embodiments of the present invention, according to the number of bits that can be used by the enhancement layer, the enhancement sample point that requires enhancement layer signal encoding is selected; the enhancement layer signal of the selected enhancement sample point is encoded and decoded; when no sufficient bits are available for the enhancement layer, the enhancement quality of the core layer can be improved.

Type: Grant

Filed: May 19, 2011

Date of Patent: February 19, 2013

Assignee: Huawei Technologies Co., Ltd.

Inventors: Chen Hu, Zexin Liu, Lei Miao, Longyin Chen, Qing Zhang, Wei Xiao, Herve Marcel Taddei
Encoder and decoder using inter channel prediction based on optimally determined signals

Patent number: 8374883

Abstract: An encoder improves inter-channel prediction (ICP) performance in scalable stereo sound encoding using an ICP. In the encoder, ICP analysis units use, as reference signal candidates, a frequency coefficient in the low-band portion of a side residual signal, a frequency coefficient in each sub-band portion of a monaural residual signal, and a frequency coefficient in the low-band portion of the monaural residual signal, respectively, and perform an ICP analysis between the these respective candidates and a frequency coefficient in each sub-band portion of the side residual signal to generate first, second, and third ICP coefficients.

Type: Grant

Filed: October 31, 2008

Date of Patent: February 12, 2013

Assignee: Panasonic Corporation

Inventors: Haishan Zhong, Zongxian Liu, Kok Seng Chong, Koji Yoshida
Hierarchical encoding/decoding device

Patent number: 8374853

Abstract: A system for coding a hierarchical audio signal, comprising, at least, a core layer using parametric coding by analysis by synthesis in a first frequency band, a band extension layer for widening said first frequency band into a second frequency band, or wideband. The system also comprises a wideband audio coding quality enhancement layer based on transform coding using a spectral parameter obtained from said band extension layer. Application to transmitting speech and/or audio signals over packet networks.

Type: Grant

Filed: July 7, 2006

Date of Patent: February 12, 2013

Assignee: France Telecom

Inventors: Stéphane Ragot, David Virette
Distributed apparatus and method for a perceptual quality measurement service

Patent number: 8370132

Abstract: Apparatus and methods are provided for measuring perceptual quality of a signal transmitted over a communication network, such as a circuit-switching network, packet-switching network, or a combination thereof. In accordance with one embodiment, a distributed apparatus is provided for measuring perceptual quality of a signal transmitted over a communication network. The distributed apparatus includes communication ports located at various locations in the network. The distributed apparatus may also include a signal processor including a processor for providing non-intrusive measurement of the perceptual quality of the signal. The distributed apparatus may further include recorders operatively connected to the communication ports and to the signal processor, wherein at least one of the recorders processes the signal at one of the communication ports and the recorder sends the signal to the signal processor to measure the perceptual quality of the signal.

Type: Grant

Filed: November 21, 2005

Date of Patent: February 5, 2013

Assignee: Verizon Services Corp.

Inventor: Adrian E. Conway
Voice encoding device, voice decoding device, and methods therefor

Patent number: 8364495

Abstract: An encoding device capable of realizing a scalable CODEC of a high performance. In this encoding device, an LPC analyzing unit (551) analyzes an input voice (301) efficiently with a synthesized LPC parameter obtained from a core decoder (305), to acquire an encoded LPC coefficient. An adaptive code note (552) is stored with its sound source codes, as acquired from the core decoder (305). The adaptive code note (552) and a stochastic code note (553) send sound source samples to a gain adjusting unit (554). This gain adjusting unit (554) multiplies the individual sound source samples by an amplification based on the gain parameters acquired from the core decoder (305), and then adds the products to acquire sound source vectors. These vectors are sent to an LPC synthesizing unit (555). This LPC synthesizing unit (555) filters the sound source vectors acquired at the gain adjusting unit (554), with the LPC parameter, to acquire a synthetic signal.

Type: Grant

Filed: September 1, 2005

Date of Patent: January 29, 2013

Assignee: Panasonic Corporation

Inventor: Toshiyuki Morii
Voice encoding device and voice encoding method

Patent number: 8364472

Abstract: Provided is an audio encoding device which can detect an optimal pitch pulse when using pitch pulse information as redundant information.

Type: Grant

Filed: February 29, 2008

Date of Patent: January 29, 2013

Assignee: Panasonic Corporation

Inventor: Hiroyuki Ehara
Method and apparatus for extending the bandwidth of a speech signal

Patent number: 8355906

Abstract: A bandwidth extension module, and an associated method and computer-readable medium, suitable for use in artificially extending the bandwidth of a lowband speech signal. The bandwidth extension module comprises a band-pass filter configured to produce a band-pass signal from the lowband speech signal; at least one carrier frequency modulator, each carrier frequency modulator configured to pitch-synchronously modulate the band-pass signal about a respective carrier frequency, the at least one carrier frequency modulator collectively producing a highband speech signal component; a synthesis filter configured to determine a highband speech signal based on the highband speech signal component; and a summation module configured to combine the lowband speech signal with the highband speech signal to obtain a bandwidth-extended speech signal.

Type: Grant

Filed: May 21, 2010

Date of Patent: January 15, 2013

Assignee: Apple Inc.

Inventors: Peter Kabal, Rafi Rabipour, Yasheng Qian
Encoding device, decoding device, and method thereof

Patent number: 8352249

Abstract: An encoding device improves the sound quality of a stereo signal while maintaining a low bit rate. The encoding device includes: an LP inverse filter which LP-inverse-filters a left signal L(n) by using an inverse quantization linear prediction coefficient AdM(z) of a monaural signal; a T/F conversion unit which converts the left sound source signal Le(n) from a temporal region to a frequency region; an inverse quantizer which inverse-quantizes encoded information Mqe; spectrum division units which divide a high-frequency component of the sound source signal Mde(f) and the left signal Le(f) into a plurality of bands; and scale factor calculation units which calculate scale factors ai and ssi by using a monaural sound source signal Mdeh,i(f), a left sound source signal Leh,i(f), Mdeh,i(f), and right sound source signal Reh,i(f) of each divided band.

Type: Grant

Filed: November 4, 2008

Date of Patent: January 8, 2013

Assignee: Panasonic Corporation

Inventors: Kok Seng Chong, Koji Yoshida, Masahiro Oshikiri
Selection of encoding modes and/or encoding rates for speech compression with closed loop re-decision

Patent number: 8346544

Abstract: In a device configurable to encode speech performing an closed loop re-decision may comprise representing a speech signal by amplitude components and phase components for a current frame and a past frame. In a first closed loop stage, a first set of compressed components and a first set of uncompressed components for a current frame may be generated. A first set of features may be generated by comparing current and past frame amplitude and/or phase components. In a second closed loop stage, a second set of compressed components for the current frame may be generated by compressing the first set of compressed components and compressing the first set of uncompressed components. Generation of a second set of features may be based on the second set of compressed components from the current frame and a combination of amplitude and/or phase components from the past frame.

Type: Grant

Filed: January 22, 2007

Date of Patent: January 1, 2013

Assignee: QUALCOMM Incorporated

Inventors: Sharath Manjunath, Ananthapadmanabhan Aasanipalai Kandhadai, Eddie L. T. Choy
Audio encoding method and device

Patent number: 8340305

Abstract: Audio encoding method and device comprising the transmission, in addition to the data representing a frequency-limited signal, of information relating to a temporal filter that is to be applied to the entire enhanced signal, both in its transmitted low-frequency part and in its reconstituted high-frequency part. The application of this filter for reshaping the reconstituted high-frequency part and the correction of compression artefacts present in the transmitted low-frequency part. In this way, the application of the temporal filter, simple and inexpensive, to all or part of the reconstituted signal, makes it possible to obtain a signal of good perceived quality.

Type: Grant

Filed: December 28, 2007

Date of Patent: December 25, 2012

Assignee: Mobiclip

Inventor: Alexandre Delattre
Multi-reference LPC filter quantization and inverse quantization device and method

Patent number: 8332213

Abstract: A multi-reference quantization device and method for quantizing an input LPC filter, comprises a plurality of differential quantizers using respective, different references, and a selector of a reference amongst the different references of the differential quantizers using a reference selection criterion. The input LPC filter is differentially quantized by the differential quantizer using the selected reference. A device and method for inverse quantizing a multi-reference differentially quantized LPC filter extracted from a bitstream, comprises an extractor from the bitstream of information about a reference amongst a plurality of possible references used for quantizing the multi-reference differentially quantized LPC filter, and a differential inverse quantizer using the reference corresponding to the extracted reference information to inverse quantize the multi-reference differentially quantized LPC filter.

Type: Grant

Filed: July 10, 2009

Date of Patent: December 11, 2012

Assignee: VoiceAge Corporation

Inventors: Philippe Gournay, Bruno Bessette, Redwan Salami
Method and apparatus for an audio signal processing

Patent number: 8326609

Abstract: An apparatus for processing an audio signal and method thereof are disclosed, by which the audio signal can be efficiently processed. The present invention includes obtaining start position information of a sub-frame from a header of the main frame and processing an audio signal based on the start position information of the sub-frame, wherein the main frame includes a plurality of sub-frames.

Type: Grant

Filed: June 29, 2007

Date of Patent: December 4, 2012

Assignee: LG Electronics Inc.

Inventor: Hyeon O Oh
Audio compression

Patent number: 8326638

Abstract: For audio encoding and decoding, in order to enhance coded audio signals, the audio signal is divided into at least a low frequency band and a high frequency band, the high frequency band is divided into at least two high frequency sub-band signals, and parameters are generated that refer at least to the low frequency band signal sections which match best with high-frequency sub-band signals.

Type: Grant

Filed: November 4, 2005

Date of Patent: December 4, 2012

Assignee: Nokia Corporation

Inventor: Mikko Tammi
Speech processing and speech synthesis using a linear combination of bases at peak frequencies for spectral envelope information

Patent number: 8321208

Abstract: An information extraction unit extracts spectral envelope information of L-dimension from each frame of speech data by discrete Fourier transform. The spectral envelope information is represented by L points. A basis storage unit stores N bases (L>N>1). Each basis is differently a frequency band having a maximum as a peak frequency in a spectral domain having L-dimension. A value corresponding to a frequency outside the frequency band along a frequency axis of the spectral domain is zero. Two frequency bands of which two peak frequencies are adjacent along the frequency axis partially overlap. A parameter calculation unit minimizes a distortion between the spectral envelope information and a linear combination of each basis with a coefficient for each of L points of the spectral envelope information by changing the coefficient, and sets the coefficient of each basis from which the distortion is minimized to a spectral envelope parameter of the spectral envelope information.

Type: Grant

Filed: December 3, 2008

Date of Patent: November 27, 2012

Assignee: Kabushiki Kaisha Toshiba

Inventors: Masatsune Tamura, Katsumi Tsuchiya, Takehiko Kagoshima
MDCT domain post-filtering apparatus and method for quality enhancement of speech

Patent number: 8315853

Abstract: A post-filtering apparatus and method for speech enhancement in a modified discrete cosine transform (MDCT) domain are disclosed. In the apparatus and method, previous and current MDCT coefficients are used for obtaining a speech spectrum coefficient similar to a real speech spectrum, and a convex function is used for transforming the speech spectrum coefficient and obtaining a post-filter coefficient so that difference can increase in the case where the speech spectrum coefficient is small but decrease in the case where the coefficient is large. Then, the post-filter coefficient is applied to the MDCT coefficient. With this configuration, both the current and previous MDCT values are used, so that it is possible to obtain a spectrum coefficient similar to the real speech spectrum and to obtain a more accurate filter coefficient. Further, the coefficient is adaptively transformed through the convex function, thereby enhancing speech quality.

Type: Grant

Filed: June 5, 2008

Date of Patent: November 20, 2012

Assignee: Electronics and Telecommunications Research Institute

Inventors: Hyun-woo Kim, Jong-mo Sung, Mi-suk Lee, Do-young Kim, Byung-sun Lee

prev 1 2 3 4 5 6 7 … next