Excitation Patterns Patents (Class 704/223)
  • Patent number: 10381015
    Abstract: A technology of accurately coding and decoding coefficients which are convertible into linear prediction coefficients even for a frame in which the spectrum variation is great while suppressing an increase in the code amount as a whole is provided. A coding device includes: a first coding unit that obtains a first code by coding coefficients which are convertible into linear prediction coefficients of more than one order; and a second coding unit that obtains a second code by coding at least quantization errors of the first coding unit if (A-1) an index Q commensurate with how high the peak-to-valley height of a spectral envelope is, the spectral envelope corresponding to the coefficients which are convertible into the linear prediction coefficients of more than one order, is larger than or equal to a predetermined threshold value Th1 and/or (B-1) an index Q? commensurate with how short the peak-to-valley height of the spectral envelope is, is smaller than or equal to a predetermined threshold value Th1?.
    Type: Grant
    Filed: July 25, 2018
    Date of Patent: August 13, 2019
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 10339948
    Abstract: Disclosed are a method and apparatus for encoding and decoding a high frequency for bandwidth extension. The method includes: estimating a weight; and generating a high frequency excitation signal by applying the weight between random noise and a decoded low frequency spectrum.
    Type: Grant
    Filed: September 11, 2017
    Date of Patent: July 2, 2019
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Ki-hyun Choo
  • Patent number: 10290300
    Abstract: Embodiments are disclosed for recognizing speech in a computing system. An example speech recognition method includes receiving metadata at a generation unit that includes a database of accented substrings, generating, via the generation unit, accent-corrected phonetic data for words included in the metadata, the accent-corrected phonetic data representing different pronunciations of the words included in the metadata based on the accented substrings stored in the database, receiving, at a voice recognition engine, extracted speech data derived from utterances input by a user to the speech recognition system, and receiving, at the voice recognition engine, the accent-corrected phonetic data.
    Type: Grant
    Filed: July 24, 2015
    Date of Patent: May 14, 2019
    Assignee: Harman International Industries, Incorporated
    Inventor: Rajat Pashine
  • Patent number: 10283143
    Abstract: A time-varying pitch of a signal may be estimated by processing a sequence of frames of the speech signal. An estimated fractional chirp rate may be computed for each frame of the sequence of frames, and the estimated fractional chirp rates may be used to compute a pitch template for the sequence, where the pitch template indicates the time-varying pitch of the signal subject to a scale factor. A first pitch estimate for each frame of the sequence of frames may be computed by computing a scale factor and multiplying the pitch template by the scale factor. A second pitch estimate may be computed from the first pitch estimate by identifying peaks in the frequency representations using the first pitch estimates and fitting a parametric function to the peaks.
    Type: Grant
    Filed: March 20, 2017
    Date of Patent: May 7, 2019
    Assignee: Friday Harbor LLC
    Inventors: David C. Bradley, Jeremy Semko
  • Patent number: 10236007
    Abstract: An audio encoder for encoding an audio signal, includes: a first encoding processor for encoding a first audio signal portion in a frequency domain, wherein the first encoding processor includes: a time frequency converter for converting the first audio signal portion into a frequency domain representation having spectral lines up to a maximum frequency of the first audio signal portion; a spectral encoder for encoding the frequency domain representation; a second encoding processor for encoding a second different audio signal portion in the time domain; a cross-processor for calculating, from the encoded spectral representation of the first audio signal portion, initialization data of the second encoding processor, so that the second encoding processing is initialized to encode the second audio signal portion immediately following the first audio signal portion in time in the audio signal.
    Type: Grant
    Filed: January 24, 2017
    Date of Patent: March 19, 2019
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Martin Dietz, Markus Multrus, Guillaume Fuchs, Emmanuel Ravelli, Matthias Neusinger, Markus Schnell, Benjamin Schubert, Bernhard Grill
  • Patent number: 10217453
    Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.
    Type: Grant
    Filed: October 14, 2016
    Date of Patent: February 26, 2019
    Assignee: SoundHound, Inc.
    Inventors: Mark Stevans, Monika Almudafar-Depeyrot, Keyvan Mohajer
  • Patent number: 10152982
    Abstract: An audio signal processing device comprises a discontinuity detector configured to determine an occurrence of a discontinuity from a sudden increase of an amplitude of decoded audio obtained by decoding the first audio packet which is received correctly after an occurrence of a packet loss, and a discontinuity corrector for correcting the discontinuity of the decoded audio.
    Type: Grant
    Filed: September 15, 2017
    Date of Patent: December 11, 2018
    Assignee: NTT DOCOMO, INC.
    Inventors: Kimitaka Tsutsumi, Kei Kikuiri, Atsushi Yamaguchi
  • Patent number: 10147435
    Abstract: An audio signal, having first and second regions of frequency spectrum, is coded. Spectral peaks in the first region are encoded by a first coding method. For a segment of the audio signal, a relation between energy of bands in the first and second regions is determined. A relation between the energy of the band in the second region and energy of neighboring bands in the second region is determined. A determination is made whether available bits are sufficient for encoding at least one non-peak segment of the first region and the band in the second region. Responsive to first and second relations fulfilling a respective predetermined criterion and a sufficient number of bits, encoding the band in the second region using a second coding method different from the first coding method, and otherwise, subjecting the band in the second region to BandWidth Extension BWE or noise fill.
    Type: Grant
    Filed: July 20, 2017
    Date of Patent: December 4, 2018
    Assignee: Telefonaktiebolaget L M Ericsson (publ)
    Inventors: Erik Norvell, Volodya Grancharov
  • Patent number: 10083698
    Abstract: A speech coding method of reducing error propagation due to voice packet loss, is achieved by limiting or reducing a pitch gain only for the first subframe or the first two subframes within a speech frame, the excitation of a next frame is obtained according to the reduced or limited pitch gain value of the first subframe, and the next frame is encoded according to the obtained excitation. The method is used for a voiced speech class.
    Type: Grant
    Filed: August 15, 2017
    Date of Patent: September 25, 2018
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Yang Gao
  • Patent number: 10074376
    Abstract: A technology of accurately coding and decoding coefficients which are convertible into linear prediction coefficients even for a frame in which the spectrum variation is great while suppressing an increase in the code amount as a whole is provided. A coding device includes: a first coding unit that obtains a first code by coding coefficients which are convertible into linear prediction coefficients of more than one order; and a second coding unit that obtains a second code by coding at least quantization errors of the first coding unit if (A-1) an index Q commensurate with how high the peak-to-valley height of a spectral envelope is, the spectral envelope corresponding to the coefficients which are convertible into the linear prediction coefficients of more than one order, is larger than or equal to a predetermined threshold value Th1 and/or (B-1) an index Q? commensurate with how short the peak-to-valley height of the spectral envelope is, is smaller than or equal to a predetermined threshold value Th1?.
    Type: Grant
    Filed: March 16, 2015
    Date of Patent: September 11, 2018
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 10044547
    Abstract: A digital code recovery circuit includes a data transmitter that outputs either input data or a preamble code as transmitter data. A radio frequency interconnect (RFI) transmitter modulates carrier signals based on the transmitter data and transmits the modulated carrier signals over a channel to an RFI receiver that demodulates the carrier signals to obtain recovered transmitter data. A calibration storage device stores preamble data and a calibration circuit receives the recovered transmitter data. If the recovered transmitter data originated from the preamble code, the calibration circuit determines a set of digital calibration adjustments from the recovered transmitter data and the preamble data. If the recovered transmitter data originated from the input data, the calibration circuit applies the set of digital calibration adjustments to the recovered transmitter data to obtain adjusted digital code and outputs the adjusted digital code.
    Type: Grant
    Filed: October 30, 2015
    Date of Patent: August 7, 2018
    Assignee: TAIWAN SEMICONDUCTOR MANUFACTURING COMPANY, LTD.
    Inventors: Fu-Lung Hsueh, William Wu Shen, Lan-Chou Cho
  • Patent number: 10038485
    Abstract: A codebook C is provided in a MIMO transmitter as well as a MIMO receiver. The codebook C will include M codewords ci, where i is a unique codeword index for each codeword ci. Each codeword defines weighting factors to apply to the MIMO signals, and may correspond to channel matrices or vectors to apply to the MIMO signals prior to transmission from the respective antennas of the MIMO transmitter. The present invention creates codeword subsets Si for each codeword ci of the codebook C. Each codeword subset Si defines L codewords cj, which are selected from all the codewords ci in the codebook C. The codewords cj in a codeword subset Si are the L codewords in the entire codebook that best correlate with the corresponding codeword ci.
    Type: Grant
    Filed: December 22, 2017
    Date of Patent: July 31, 2018
    Assignee: Apple Inc.
    Inventors: Wen Tong, Hosein Nikopour, Amir Khandani, Hua Xu, Ming Jia, Peiying Zhu, Dong-sheng Yu
  • Patent number: 10002605
    Abstract: A method and system for achieving emotional text to speech. The method includes: receiving text data; generating emotion tag for the text data by a rhythm piece; and achieving TTS to the text data corresponding to the emotion tag, where the emotion tags are expressed as a set of emotion vectors; where each emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories. A system for the same includes: a text data receiving module; an emotion tag generating module; and a TTS module for achieving TTS, wherein the emotion tag is expressed as a set of emotion vectors; and wherein emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories.
    Type: Grant
    Filed: December 12, 2016
    Date of Patent: June 19, 2018
    Assignee: International Business Machines Corporation
    Inventors: Shenghua Bao, Jian Chen, Yong Qin, Qin Shi, Zhiwei Shuang, Zhong Su, Liu Wen, Shi Lei Zhang
  • Patent number: 9972325
    Abstract: In accordance with an embodiment, a method of encoding an audio/speech signal includes determining a mixed codebook vector based on an incoming audio/speech signal, where the mixed codebook vector includes a sum of a first codebook entry from a first codebook and a second codebook entry from a second codebook. The method further includes generating an encoded audio signal based on the determined mixed codebook vector, and transmitting a coded excitation index of the determined mixed codebook vector.
    Type: Grant
    Filed: February 15, 2013
    Date of Patent: May 15, 2018
    Assignee: Huawei Technologies Co., Ltd.
    Inventor: Yang Gao
  • Patent number: 9911425
    Abstract: A device and method for quantizing a gain of a fixed contribution of an excitation in a frame, including sub-frames, of a coded sound signal, wherein the gain of the fixed excitation contribution is estimated in a sub-frame using a parameter representative of a classification of the frame. The gain of the fixed excitation contribution is then quantized in the sub-frame using the estimated gain. The device and method is used in jointly quantizing gains of adaptive and fixed contributions of an excitation in a frame of a coded sound signal. For retrieving a quantized gain of a fixed contribution of an excitation in a sub-frame of a frame, the gain of the fixed excitation contribution is estimated using a parameter representative of a classification of the frame, a gain codebook supplies a correction factor in response to a received, gain codebook index, and a multiplier multiplies the estimated gain by the correction factor to provide a quantized gain of the fixed excitation contribution.
    Type: Grant
    Filed: March 17, 2017
    Date of Patent: March 6, 2018
    Assignee: VOICEAGE CORPORATION
    Inventor: Vladimir Malenovsky
  • Patent number: 9886960
    Abstract: A signal encoding method and device are disclosed. The method includes, when an encoding manner of a previous frame of a currently-input frame is a continuous encoding manner, predicting a comfort noise that is generated by a decoder according to the currently-input frame when the currently-input frame is encoded into an SID frame, determining an actual silence signal, determining a deviation degree between the comfort noise and the actual silence signal, determining an encoding manner of the currently-input frame according to the deviation degree, and encoding the currently-input frame according to the encoding manner of the currently-input frame. It is determined, according to the deviation degree between the comfort noise and the actual silence signal, that the encoding manner of the currently-input frame is the hangover frame encoding manner or the SID frame encoding manner, which can save communication bandwidth.
    Type: Grant
    Filed: November 25, 2015
    Date of Patent: February 6, 2018
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Zhe Wang
  • Patent number: 9848316
    Abstract: The present invention relates to a method for performing a D2D discovery and a terminal using the same. Particularly, the present invention relates to a method for performing a D2D discovery, which compares a congestion level of another discovery slot selected randomly or according to a preset pattern with a threshold or a congestion level of a current discovery slot and determines a discovery slot to transmit a discovery message and a transmission period of the discovery message according to a result of the comparison, and a terminal using the same.
    Type: Grant
    Filed: July 10, 2014
    Date of Patent: December 19, 2017
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Sangkyu Baek, Seunghoon Park, Kyungkyu Kim, Hyunseok Ryu, Chiwoo Lim, Youngbin Chang
  • Patent number: 9837096
    Abstract: A system, apparatus and a method for transmitting continuous audio data configured to mitigate data discontinuities in a receiving device. The method may mitigate data discontinuities by transmitting a continuous stream of audio data that has reduced changes to the audio data characteristics. The method may transmit filler audio data when no application audio data is available. The application audio data and the filler audio data are processed to reduce changes to the audio data characteristics in each stream.
    Type: Grant
    Filed: May 20, 2015
    Date of Patent: December 5, 2017
    Assignee: 2236008 Ontario, Inc.
    Inventors: Joe Mammone, Michael Mead Truman
  • Patent number: 9804820
    Abstract: A system comprising at least one processor configured to perform: receiving a first request to access a first user profile of a first user from a first device configured to execute a first virtual assistant to interact with the first user; in response to receiving the first request, providing the first device with access to information in the first user profile so that the first virtual assistant is able to customize, based on the accessed information, its behavior when interacting with the first user; receiving a second request to access the first user profile from a second device configured to execute a second virtual assistant to interact with the first user; and in response to receiving the second request, providing the second device with access to the information so that the second virtual assistant is able to customize, based on the accessed information, its behavior when interacting with the first user.
    Type: Grant
    Filed: December 16, 2013
    Date of Patent: October 31, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Holger Quast, Carey Radebaugh, Sean P. Brown, Kenneth S. Harper, Andrew R. Wyatt
  • Patent number: 9799345
    Abstract: An apparatus for decoding an encoded audio signal including bandwidth extension control data indicating either a first harmonic bandwidth extension mode or a second non-harmonic bandwidth extension mode, includes: an input interface for receiving the encoded audio signal including the bandwidth extension control data indicating either the first harmonic bandwidth extension mode or the second non-harmonic bandwidth extension mode; a processor for decoding the audio signal using the second non-harmonic bandwidth extension mode; and a controller for controlling the processor to decode the audio signal using the second non-harmonic bandwidth extension mode, even when the bandwidth extension control data indicates the first harmonic bandwidth extension mode for the encoded signal.
    Type: Grant
    Filed: June 8, 2016
    Date of Patent: October 24, 2017
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Andreas Niedermeier, Stephan Wilde, Daniel Fischer, Matthias Hildenbrand, Marc Gayer, Max Neuendorf
  • Patent number: 9754580
    Abstract: A system for carrying out voice pattern recognition and a method for achieving same. The system includes an arrangement for acquiring an input voice, a signal processing library for extracting acoustic and prosodic features of the acquired voice, a database for storing a recognition dictionary, at least one instance of a prosody detector for carrying out a prosody detection process on extracted respective prosodic features, communicating with an end user application for applying control thereto.
    Type: Grant
    Filed: October 12, 2015
    Date of Patent: September 5, 2017
    Assignee: TECHNOLOGIES FOR VOICE INTERFACE
    Inventors: Danny Lionel Weissberg, Stas Tiomkin
  • Patent number: 9626988
    Abstract: A speech masking apparatus includes a microphone and a speaker. The microphone can detect a human voice. The speaker can output a masking language which can include phonemes resembling human speech. At least one component of the masking language can have a pitch, a volume, a theme, and/or a phonetic content substantially matching a pitch, a volume, a theme, and/or a phonetic content of the voice.
    Type: Grant
    Filed: March 10, 2014
    Date of Patent: April 18, 2017
    Assignee: Medical Privacy Solutions, LLC
    Inventors: Babak Arvanaghi, Joel Fechter
  • Patent number: 9570063
    Abstract: A method and system for achieving emotional text to speech. The method includes: receiving text data; generating emotion tag for the text data by a rhythm piece; and achieving TTS to the text data corresponding to the emotion tag, where the emotion tags are expressed as a set of emotion vectors; where each emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories. A system for the same includes: a text data receiving module; an emotion tag generating module; and a TTS module for achieving TTS, wherein the emotion tag is expressed as a set of emotion vectors; and wherein emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories.
    Type: Grant
    Filed: July 23, 2015
    Date of Patent: February 14, 2017
    Assignee: International Business Machines Corporation
    Inventors: Shenghua Bao, Jian Chen, Yong Qin, Qin Shi, Zhiwei Shuang, Zhong Su, Liu Wen, Shi Lei Zhang
  • Patent number: 9532062
    Abstract: A mechanism is disclosed to adapt in real-time the encoding bitrate of a video source in an adaptive video streaming system with the aim of avoiding playback interruptions and to guarantee the best possible video quality. The mechanism is made of a feedback loop in which a controller computes the encoding bitrate that the sender (the video server) transmits to a receiver (client) employing a packet switching network such as the Internet. The mechanism can be used in video streaming systems employing scalable encoding (f.i. H.264 SVC, VP8, VP9) or multi bit-rate encoding (stream-switching). The automatic control that computes the video encoding bitrate can be executed either at the server or at the client.
    Type: Grant
    Filed: November 20, 2014
    Date of Patent: December 27, 2016
    Assignee: Quavlive S.R.L.
    Inventors: Luca De Cicco, Saverio Mascolo
  • Patent number: 9514757
    Abstract: A stereo signal encoding device is provided that enables a lower bitrate without decreasing quality when applying an intermittent transmission technique to a stereo signal. A stereo encoding unit generates first stereo encoded data by encoding the stereo signal when the stereo signal of the current frame is an audio section. A stereo DTX encoding unit is a means for encoding the stereo signal when the stereo signal of the current frame is a non-audio section. The stereo DTX encoding unit generates second stereo encoded data by encoding each of a monaural signal spectral parameter that is a spectral parameter of a monaural signal generated using the first channel signal and the second channel signal, first channel signal information relating to the first channel signal, and second channel signal information relating to the second channel signal.
    Type: Grant
    Filed: October 17, 2011
    Date of Patent: December 6, 2016
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Masahiro Oshikiri, Hiroyuki Ehara
  • Patent number: 9437207
    Abstract: Various of the disclosed embodiments relate to systems and methods for extracting audio information, e.g. a textual description of speech, from a speech recording while retaining the anonymity of the speaker. In certain embodiments, a third party may perform various aspects of the anonymization and speech processing. Certain embodiments facilitate anonymization in compliance with various legislative requirements even when third parties are involved.
    Type: Grant
    Filed: April 3, 2013
    Date of Patent: September 6, 2016
    Assignee: PULLSTRING, INC.
    Inventors: Oren M Jacob, Martin Reddy, Brian Langner
  • Patent number: 9419696
    Abstract: A base station receives a message comprising at least one index identifying a subset of a second plurality of beamforming codewords. Each of the second plurality of beamforming codewords is identifiable by an index presented by a number of bits. The number of bits is greater than or equal to log2(N), N being number of the second plurality of beamforming codewords, and less than number of bits in representation of a corresponding beamforming codeword. The base station transmits signals employing a first plurality of beamforming codewords being selected employing the subset of the second plurality of beamforming codewords.
    Type: Grant
    Filed: July 12, 2015
    Date of Patent: August 16, 2016
    Assignee: COMCAST CABLE COMMUNICATIONS, LLC
    Inventor: Esmael Hejazi Dinan
  • Patent number: 9401156
    Abstract: There is provided a method of using an adaptive tilt compensation by a speech decoder. The method comprises receiving a bit stream including a plurality of parameters representative of a speech signal; identifying an adaptive code vector and a fixed code vector using the plurality of parameters; scaling the adaptive code vector and the fixed code vector to generate a scaled adaptive code vector and a scaled fixed code vector; summing the scaled adaptive code vector and the scaled fixed code vector to generate a synthesized output; calculating a first reflection coefficient based on the plurality of parameters representative of the speech signal; multiplying the first reflection coefficient by a factor to generate a tilt factor; and applying the tilt factor to the synthesized output based on an encoding bit rate.
    Type: Grant
    Filed: June 27, 2008
    Date of Patent: July 26, 2016
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Huan-Yu Su, Yang Gao
  • Patent number: 9390721
    Abstract: The present invention pertains to a speech decoding device that is capable of preventing degradation in sound quality associated with an adjustment of the slope of a spectrum of an output signal (a decoding signal), making it less likely that a loss of bandwidth sensitivity due to the attenuation of a higher band region is perceived. For each frame of the bandwidth extension layer decoding signal, a filter assessment unit (304) determines whether or not to apply a low-pass filter to the bandwidth extension layer decoding signal on the basis of a change in energy in the bandwidth extension layer decoding signal. A low-pass filtering unit (306) filters the bandwidth extension layer decoding signal of the frames to which the low-pass filter is to be applied, as determined by the filter assessment unit (304), using the low-pass filter.
    Type: Grant
    Filed: December 20, 2012
    Date of Patent: July 12, 2016
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Katsunori Daimou, Masahiro Oshikiri
  • Patent number: 9331758
    Abstract: The embodiments of the present invention disclose a method and an apparatus for sub-sampling of a codebook in the LTE-A system where a precoding matrix W is a product of two matrices W1 and W2, i.e. W=W1W2, codebooks for W, W1 and W2 are denoted as C, C1 and C2 respectively, and r indicates a rank. The method includes sub-sampling the codebook C such that the sub-sampled codebook C has a size of equal to or less than 4 bits. In the sub-sampling, codewords evenly distributed in the codebook C are extracted, where some or all of the codewords are in a form of discrete Fourier transform (DFT) vector to be suitable for evenly linear arrays, and the other codewords are suitable for cross-polarized linear arrays.
    Type: Grant
    Filed: October 6, 2011
    Date of Patent: May 3, 2016
    Assignee: Alcatel Lucent
    Inventors: Lu Wu, Hongwei Yang, Hao Liu
  • Patent number: 9325544
    Abstract: A method of updating the state of a decoder that decodes a wide-band signal including a plurality of sub-band signals, including: receiving the plurality of sub-band signals; for each sub-band signal, storing portions of that sub-band signal in a respective buffer; responsive to determining that a portion of the wide-band signal is degraded, performing a packet loss concealment algorithm to determine wide-band replacement data for the degraded portion; selecting a portion of the sub-band signal stored in each buffer in dependence on the determined wide-band replacement data; and updating the state of the decoder using the selected portions.
    Type: Grant
    Filed: October 31, 2012
    Date of Patent: April 26, 2016
    Assignee: CSR TECHNOLOGY INC.
    Inventors: Amin Fazeldehkordi, Rogerio G. Alves
  • Patent number: 9263049
    Abstract: Various techniques are disclosed for improving packet loss concealment to reduce artifacts by using audio character measures of the audio signal. These techniques include attenuation to a noise fill instead of attenuation to silence, varying how long to wait before attenuating the extrapolation, varying the rate of attenuation of the extrapolation, attenuating periodic extrapolation at a different rate than non-periodic extrapolation, and performing period extrapolation on successively longer fill data based on the audio character measures, adjusting weighting between periodic and non-periodic extrapolation based on the audio character measures, and adjusting weighting between periodic extrapolation and non-periodic extrapolation non-linearly.
    Type: Grant
    Filed: October 25, 2010
    Date of Patent: February 16, 2016
    Assignee: Polycom, Inc.
    Inventor: Eric David Elias
  • Patent number: 9225354
    Abstract: A coding method, a decoding method, a coder, and a decoder are disclosed herein. A coding method includes: obtaining the pulse distribution, on a track, of the pulses to be encoded on the track; determining a distribution identifier for identifying the pulse distribution according to the pulse distribution; and generating a coding index that includes the distribution identifier. A decoding method includes: receiving a coding index; obtaining a distribution identifier from the coding index, wherein the distribution identifier is configured to identify the pulse distribution, on a track, of the pulses to be encoded on the track; determining the pulse distribution, on a track, of all the pulses to be encoded on the track according to the distribution identifier; and reconstructing the pulse order on the track according to the pulse distribution.
    Type: Grant
    Filed: February 9, 2015
    Date of Patent: December 29, 2015
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Fuwei Ma, Dejun Zhang
  • Patent number: 9117446
    Abstract: A method and system for achieving emotional text to speech. The method includes: receiving text data; generating emotion tag for the text data by a rhythm piece; and achieving TTS to the text data corresponding to the emotion tag, where the emotion tags are expressed as a set of emotion vectors; where each emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories. A system for the same includes: a text data receiving module; an emotion tag generating module; and a TTS module for achieving TTS, wherein the emotion tag is expressed as a set of emotion vectors; and wherein emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories.
    Type: Grant
    Filed: August 31, 2011
    Date of Patent: August 25, 2015
    Assignee: International Business Machines Corporation
    Inventors: Shenghua Bao, Jian Chen, Yong Qin, Qin Shi, Zhiwei Shuang, Zhong Su, Liu Wen, Shi Lei Zhang
  • Patent number: 9093066
    Abstract: In a coder, a method for producing forward aliasing cancellation (FAC) parameters for cancelling time-domain aliasing caused to a coded audio signal in a first transform-coded frame by a transition between the first transform-coded frame using a first coding mode with overlapping window and a second frame using a second coding mode with non-overlapping window, comprising: calculating a FAC target representative of a difference between the audio signal of the first frame prior to coding and a sum of synthesis of the coded audio signal of the first transform-coded frame and a time reversed version of last synthesis samples of the second frame as well as a zero point response of a synthesis filter used in the second frame; and weighting the FAC target to produce the FAC parameters. In a decoder, weighted forward aliasing cancellation (FAC) parameters are received and inverse weighted to produce a FAC synthesis.
    Type: Grant
    Filed: January 13, 2011
    Date of Patent: July 28, 2015
    Assignee: VOICEAGE CORPORATION
    Inventor: Bruno Bessette
  • Patent number: 9083954
    Abstract: A video processing method and system and a related device are disclosed, an encoder sends a method for compressing a pixel value of a video image to a decoder, the method includes a mapping between a pixel value of the video image before compression and a pixel value of the video image after the compression; compressing, by the encoder according to the method for compressing a pixel value, the pixel value of the video image to obtain a video image to be encoded; encoding, by the encoder, the video image to be encoded to obtain a video image encoded stream; and sending, by the encoder, the video image encoded stream to the decoder, the decoder receives and decodes the video image encoded stream to obtain a decoded video image, and decompresses a pixel value of the decoded video image to restore the pixel value of the video image before compression.
    Type: Grant
    Filed: November 14, 2013
    Date of Patent: July 14, 2015
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Pulin Wang, Yongning Qin
  • Patent number: 9082398
    Abstract: In accordance with an embodiment, a method of decoding an audio/speech signal includes decoding an excitation signal based on an incoming audio/speech information, determining a stability of a high frequency portion of the excitation signal, smoothing an energy of the high frequency portion of the excitation signal based on the stability of the high frequency portion of the excitation signal, and producing an audio signal based on smoothing the high frequency portion of the excitation signal.
    Type: Grant
    Filed: February 27, 2013
    Date of Patent: July 14, 2015
    Assignee: Huawei Technologies Co., Ltd.
    Inventor: Yang Gao
  • Patent number: 9075446
    Abstract: Certain aspects of the present disclosure relate to a method for quantizing signals and reconstructing signals, and/or encoding or decoding data for storage or transmission. Points of a signal may be determined as local extrema or points where an absolute rise of the signal is greater than a threshold. The tread and value of the points may be quantized, and certain of the quantizations may be discarded before the quantizations are transmitted. After being received, the signal may be reconstructed from the quantizations using an iterative process.
    Type: Grant
    Filed: December 17, 2010
    Date of Patent: July 7, 2015
    Assignee: QUALCOMM Incorporated
    Inventors: Harinath Garudadri, Pawan K. Baheti, Somdeb Majumdar
  • Patent number: 9053702
    Abstract: Compressibility-based reallocation of initial bit allocations for frames of an audio signal is described. Applications to redundancy-based retransmission of critical frames (e.g., for fixed-bit-rate modes of speech codec operation) are also described.
    Type: Grant
    Filed: December 19, 2012
    Date of Patent: June 9, 2015
    Assignee: QUALCOMM Incorporated
    Inventors: Vivek Rajendran, Venkatesh Krishnan, Daniel J. Sinder
  • Patent number: 9043214
    Abstract: A method of signal processing according to one embodiment includes calculating an envelope of a first signal that is based on a low-frequency portion of a speech signal, calculating an envelope of a second signal that is based on a high-frequency portion of the speech signal, and calculating a plurality of gain factor values according to a time-varying relation between the envelopes of the first and second signal. The method includes attenuating, based on a variation over time of a relation between the envelopes of the first and second signals, at least one of the plurality of gain factor values. In one example, the variation over time of a relation between the envelopes is indicated by at least one distance among the plurality of gain factor values.
    Type: Grant
    Filed: April 21, 2006
    Date of Patent: May 26, 2015
    Assignee: QUALCOMM INCORPORATED
    Inventors: Koen Bernard Vos, Ananthapadmanabhan Aasanipalai Kandhadai
  • Patent number: 9015038
    Abstract: A mixed time-domain/frequency-domain coding device and method for coding an input sound signal, wherein a time-domain excitation contribution is calculated in response to the input sound signal. A cut-off frequency for the time-domain excitation contribution is also calculated in response to the input sound signal, and a frequency extent of the time-domain excitation contribution is adjusted in relation to this cut-off frequency. Following calculation of a frequency-domain excitation contribution in response to the input sound signal, the adjusted time-domain excitation contribution and the frequency-domain excitation contribution are added to form a mixed time-domain/frequency-domain excitation constituting a coded version of the input sound signal. In the calculation of the time-domain excitation contribution, the input sound signal may be processed in successive frames of the input sound signal and a number of sub-frames to be used in a current frame may be calculated.
    Type: Grant
    Filed: October 25, 2011
    Date of Patent: April 21, 2015
    Assignee: VoiceAge Corporation
    Inventors: Tommy Vaillancourt, Milan Jelinek
  • Patent number: 9015040
    Abstract: An apparatus for encoding an audio signal having a stream of audio samples has: a windower for applying a prediction coding analysis window to the stream of audio samples to obtain windowed data for a prediction analysis and for applying a transform coding analysis window to the stream of audio samples to obtain windowed data for a transform analysis, wherein the transform coding analysis window is associated with audio samples within a current frame of audio samples and with audio samples of a predefined portion of a future frame of audio samples being a transform-coding look-ahead portion, wherein the prediction coding analysis window is associated with at least the portion of the audio samples of the current frame and with audio samples of a predefined portion of the future frame being a prediction coding look-ahead portion, wherein the transform coding look-ahead portion and the prediction coding look-ahead portion are identically to each other or are different from each other by less than 20%; and an enc
    Type: Grant
    Filed: August 14, 2013
    Date of Patent: April 21, 2015
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Emmanuel Ravelli, Ralf Geiger, Markus Schnell, Guillaume Fuchs, Vesa Ruoppila, Tom Baeckstroem, Bernhard Grill, Christian Helmrich
  • Patent number: 9008329
    Abstract: Provided are methods and systems for noise suppression within multiple time-frequency points of spectral representations. A multi-feature cluster tracker is used to track signal and noise sources and to predict signal versus noise dominance at each time-frequency point. Multiple features, such as binaural and monaural features, may be used for these purposes. A Gaussian mixture model (GMM) is developed and, in some embodiments, dynamically updated for distinguishing signal from noise and performing mask-based noise reduction. Each frequency band may use a different GMM or share a GMM with other frequency bands. A GMM may be combined from two models, with one trained to model time-frequency points in which the target dominates and another trained to model time-frequency points in which the noise dominates. Dynamic updates of a GMM may be performed using an expectation-maximization algorithm in an unsupervised fashion.
    Type: Grant
    Filed: June 8, 2012
    Date of Patent: April 14, 2015
    Assignee: Audience, Inc.
    Inventors: Michael Mandel, Carlos Avendano
  • Patent number: 9009037
    Abstract: Disclosed is an encoding device that improves the quality of a decoded signal in a hierarchical coding (scalable coding) method, wherein a band to be quantized is selected for every level (layer). The encoding device (101) is equipped with a second layer encoding unit (205) that selects a first band to be quantized of a first input signal from among a plurality of sub-bands, and that generates second layer encoding information containing first band information of said band; a second layer decoding unit (206) that generates a first decoded signal using the second layer encoding information; an addition unit (207) that generates a second input signal using the first input signal and the first decoded signal; and a third layer encoding unit (208) that selects a second band to be quantized of the second input signal using the first decoded signal, and that generates third layer encoding information.
    Type: Grant
    Filed: October 13, 2010
    Date of Patent: April 14, 2015
    Assignee: Panasonic Intellectual Property Corporation of America
    Inventors: Tomofumi Yamanashi, Toshiyuki Morii
  • Patent number: 8990094
    Abstract: An electronic device for coding a transient frame is described. The electronic device includes a processor and executable instructions stored in memory that is in electronic communication with the processor. The electronic device obtains a current transient frame. The electronic device also obtains a residual signal based on the current transient frame. Additionally, the electronic device determines a set of peak locations based on the residual signal. The electronic device further determines whether to use a first coding mode or a second coding mode for coding the current transient frame based on at least the set of peak locations. The electronic device also synthesizes an excitation based on the first coding mode if the first coding mode is determined. The electronic device also synthesizes an excitation based on the second coding mode if the second coding mode is determined.
    Type: Grant
    Filed: September 8, 2011
    Date of Patent: March 24, 2015
    Assignee: QUALCOMM Incorporated
    Inventors: Venkatesh Krishnan, Ananthapadmanabhan Arasanipalai Kandhadai
  • Patent number: 8988256
    Abstract: A coding method, a decoding method, a coder, and a decoder are disclosed herein. A coding method includes: obtaining the pulse distribution, on a track, of the pulses to be encoded on the track; determining a distribution identifier for identifying the pulse distribution according to the pulse distribution; and generating a coding index that includes the distribution identifier. A decoding method includes: receiving a coding index; obtaining a distribution identifier from the coding index, wherein the distribution identifier is configured to identify the pulse distribution, on a track, of the pulses to be encoded on the track; determining the pulse distribution, on a track, of all the pulses to be encoded on the track according to the distribution identifier; and reconstructing the pulse order on the track according to the pulse distribution.
    Type: Grant
    Filed: September 18, 2012
    Date of Patent: March 24, 2015
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Fuwei Ma, Dejun Zhang
  • Publication number: 20150073784
    Abstract: In one embodiment of the present invention, a method of decoding an encoded audio bitstream and generating frequency bandwidth extension includes decoding the audio bitstream to produce a decoded low band audio signal and generate a low band excitation spectrum corresponding to a low frequency band. A sub-band area is selected from within the low frequency band using a parameter which indicates energy information of a spectral envelope of the decoded low band audio signal. A high band excitation spectrum is generated for a high frequency band by copying a sub-band excitation spectrum from the selected sub-band area to a high sub-band area corresponding to the high frequency band. Using the generated high band excitation spectrum, an extended high band audio signal is generated by applying a high band spectral envelope. The extended high band audio signal is added to the decoded low band audio signal to generate an audio output signal having an extended frequency bandwidth.
    Type: Application
    Filed: September 5, 2014
    Publication date: March 12, 2015
    Inventor: Yang Gao
  • Patent number: 8977545
    Abstract: Described herein are multi-channel noise suppression systems and methods that are configured to detect and suppress wind and background noise using at least two spatially separated microphones: at least one primary speech microphone and at least one noise reference microphone. The multi-channel noise suppression systems and methods are configured, in at least one example, to first detect and suppress wind noise in the input speech signal picked up by the primary speech microphone and, potentially, the input speech signal picked up by the noise reference microphone. Following wind noise detection and suppression, the multi-channel noise suppression systems and methods are configured to perform further noise suppression in two stages: a first linear processing stage that includes a blocking matrix and an adaptive noise canceler, followed by a second non-linear processing stage.
    Type: Grant
    Filed: November 14, 2011
    Date of Patent: March 10, 2015
    Assignee: Broadcom Corporation
    Inventors: Huaiyu Zeng, Jes Thyssen, Nelson Sollenberger, Juin-Hwey Chen, Xianxian Zhang
  • Patent number: 8965773
    Abstract: A method is provided for hierarchical coding of a digital audio signal comprising, for a current frame of the input signal: a core coding, delivering a scalar quantization index for each sample of the current frame and at least one enhancement coding delivering indices of scalar quantization for each coded sample of an enhancement signal. The enhancement coding comprises a step of obtaining a filter for shaping the coding noise used to determine a target signal and in that the indices of scalar quantization of said enhancement signal are determined by minimizing the error between a set of possible values of scalar quantization and said target signal. The coding method can also comprise a shaping of the coding noise for the core bitrate coding. A coder implementing the coding method is also provided.
    Type: Grant
    Filed: November 17, 2009
    Date of Patent: February 24, 2015
    Assignee: Orange
    Inventors: Balazs Kovesi, St├ęphane Ragot, Alain Le Guyader
  • Patent number: 8930200
    Abstract: A vector joint encoding/decoding method and a vector joint encoder/decoder are provided, more than two vectors are jointly encoded, and an encoding index of at least one vector is split and then combined between different vectors, so that encoding idle spaces of different vectors can be recombined, thereby facilitating saving of encoding bits, and because an encoding index of a vector is split and then shorter split indexes are recombined, thereby facilitating reduction of requirements for the bit width of operating parts in encoding/decoding calculation.
    Type: Grant
    Filed: July 24, 2013
    Date of Patent: January 6, 2015
    Assignee: Huawei Technologies Co., Ltd
    Inventors: Fuwei Ma, Dejun Zhang, Lei Miao, Fengyan Qi