Analysis By Synthesis Patents (Class 704/220)
  • Patent number: 10418024
    Abstract: Systems and methods are provided for training an audio generation model for a first person using a first voice audio data and a first text transcript of the first voice audio data. Using a second voice audio data and a second text transcript of the second voice audio data, a plurality of pitch voice audio data for the second person may be generated with different pitches. The audio generation model may be trained for the second person using the generated plurality of pitch voice audio data with the different pitches for the second person. Output voice audio may be generated for the second person using received text and the model trained with the generated plurality of pitch voice audio data.
    Type: Grant
    Filed: July 16, 2018
    Date of Patent: September 17, 2019
    Assignee: salesforce.com, inc.
    Inventors: John Seymour, Azeem Aqil
  • Patent number: 10347274
    Abstract: An apparatus for decoding an encoded audio signal, includes: a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions being spectral prediction residual values; a frequency regenerator for generating a reconstructed second spectral portion using a first spectral portion of the first set of first spectral portions, wherein the reconstructed second spectral portion additionally includes spectral prediction residual values; and an inverse prediction filter for performing an inverse prediction over frequency using the spectral residual values for the first set of first spectral portions and the reconstructed second spectral portion using prediction filter information included in the encoded audio signal.
    Type: Grant
    Filed: February 13, 2017
    Date of Patent: July 9, 2019
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Frederik Nagel, Ralf Geiger, Balaji Nagendran Thoshkahna, Konstantin Schmidt, Stefan Bayer, Christian Neukam, Bernd Edler, Christian Helmrich
  • Patent number: 10277997
    Abstract: Example embodiments disclosed herein relate to audio signal processing. The audio signal has multiple audio objects. A method of processing an audio signal is disclosed. The method includes obtaining an object position for each of the audio objects; and determining cluster positions for grouping the audio objects into clusters based on the object positions, a plurality of object-to-cluster gains, and a set of metrics. The metrics indicate a quality of the cluster positions and a quality of the object-to-cluster gains, each of the cluster positions is a centroid of a respective one of the clusters, and one of the object-to-cluster gains defines a ratio of the respective audio object in one of the clusters. The method also includes determining the object-to-cluster gains based on the object positions, the cluster positions and the set of metrics; and generating a cluster signal based on the determined cluster positions and object-to-cluster gains.
    Type: Grant
    Filed: August 4, 2016
    Date of Patent: April 30, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
  • Patent number: 10176816
    Abstract: Provided are a vector quantization device, a voice coding device, a vector quantization method, and a voice coding method which enable a reduction in the calculation amount of voice codec without deterioration of voice quality. In the vector quantization device, a first reference vector calculation unit (201) calculates a first reference vector by multiplying a target vector (x) by an auditory weighting LPC synthesis filter (H), and a second reference vector calculation unit (202) calculates a second reference vector by multiplying an element of the first reference vector by a filter having a high pass characteristic. A polarity preliminary selection unit (205) generates a polar vector by disposing a unit pulse having a positive or negative polarity, which is selected on the basis of the polarity of an element of the second reference vector, in the position of said element.
    Type: Grant
    Filed: July 16, 2015
    Date of Patent: January 8, 2019
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventor: Toshiyuki Morii
  • Patent number: 10170136
    Abstract: A method which includes: detecting phrases in a transcript of an audiovisual file; applying a speech recognition algorithm to the audiovisual file and to a list of words of the phrase, to output a temporal location of each of the words that are uttered in the audio channel; compiling a list of sub-phrases of each of the phrases; creating a temporal sub-phrase map that comprises a temporal location of each of the sub-phrases; extracting the uttered sub-phrases from the audiovisual file, to create multiple sub-phrase audiovisual files; and constructing a database the multiple sub-phrase audiovisual files and of the sub-phrase uttered in each of the files. The method may also include: receiving a phrase; querying the database for audiovisual files which comprise uttered sub-phrases of the phrase; and splicing at least some of the audiovisual files to a compilation audiovisual file in which the phrase is uttered.
    Type: Grant
    Filed: May 6, 2015
    Date of Patent: January 1, 2019
    Assignee: AL LEVY TECHNOLOGIES LTD.
    Inventor: Alon Levi
  • Patent number: 10115408
    Abstract: A device and method for quantizing a gain of a fixed contribution of an excitation in a frame, including sub-frames, of a coded sound signal, wherein the gain of the fixed excitation contribution is estimated in a sub-frame using a parameter representative of a classification of the frame. The gain of the fixed excitation contribution is then quantized in the sub-frame using the estimated gain. The device and method is used in jointly quantizing gains of adaptive and fixed contributions of an excitation in a frame of a coded sound signal. For retrieving a quantized gain of a fixed contribution of an excitation in a sub-frame of a frame, the gain of the fixed excitation contribution is estimated using a parameter representative of a classification of the frame, a gain codebook supplies a correction factor in response to a received, gain codebook index, and a multiplier multiplies the estimated gain by the correction factor to provide a quantized gain of the fixed excitation contribution.
    Type: Grant
    Filed: May 31, 2017
    Date of Patent: October 30, 2018
    Assignee: VOICEAGE CORPORATION
    Inventor: Vladimir Malenovsky
  • Patent number: 10056089
    Abstract: An audio encoding method and a related apparatus are disclosed. The audio coding method includes: performing a time-frequency transformation on a current frame of a time-domain audio signal, to obtain spectral coefficients of the current audio frame; obtaining one or more reference coding parameters of the current frame; and determining whether the reference coding parameters satisfy a set of parameter conditions. If any one of the parameter conditions is satisfied, the spectral coefficients of the current frame are encoded by using a transform coded excitation (TCX) algorithm. If none of the parameter conditions is satisfied, the spectral coefficients of the current audio frame are encoded using a high quality transform coding (HQ) algorithm. The audio encoding method and the related apparatus help improve encoding quality or encoding efficiency in audio signal encoding.
    Type: Grant
    Filed: January 18, 2017
    Date of Patent: August 21, 2018
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Zexin Liu, Lei Miao
  • Patent number: 10003668
    Abstract: The present disclosure relates to a telecommunication method for operating a telecommunication device being coupled to a digital telecommunication network. The telecommunication device has a processor, a display device and a memory with an application. The telecommunication device is connectable via the network to a backend server with access to a database comprising a plurality of datasets. The method has: sending an initializing data request to the backend server requesting a first number of the datasets from the database; in response to receiving an initializing data packet with the requested first number of the datasets; and calculating a second number of the datasets to be requested by a second data request.
    Type: Grant
    Filed: December 2, 2015
    Date of Patent: June 19, 2018
    Assignee: SAP SE
    Inventor: Daniel Jakobs
  • Patent number: 9997153
    Abstract: An information processing method includes receiving a change instruction to change a voice parameter used in synthesizing a voice for a set of texts, changing the voice parameter in accordance with the change instruction to change the voice parameter, changing, in accordance with the change instruction, an image parameter used in synthesizing an image of a virtual object, the virtual object indicating a character that vocalizes the voice that has been synthesized, synthesizing the voice using the changed voice parameter, and synthesizing the image using the changed image parameter.
    Type: Grant
    Filed: August 19, 2016
    Date of Patent: June 12, 2018
    Assignee: Yamaha Corporation
    Inventors: Naoki Yamamoto, Yuki Murakami
  • Patent number: 9986352
    Abstract: An acoustic processing device comprises: a resonant band detecting means that detects a resonant band of sound output from a speaker based on a measurement result of a predetermined measurement signal reproduced through the speaker; an analyzing means that analyzes the measurement result of the predetermined measurement signal; a control parameter generating means that generates a control parameter for controlling the resonant band detected by the resonant band detecting means based on an analysis result by the analyzing means; and an audio signal controlling means that controls an audio signal input from a predetermined audio signal reproducing device based on the control parameter generated by the control parameter generating means such that a resonant band component of reproduced sound of the audio signal is suppressed to be short on a time axis.
    Type: Grant
    Filed: February 4, 2015
    Date of Patent: May 29, 2018
    Assignee: CLARION CO., LTD.
    Inventors: Yasuhiro Fujita, Takeshi Hashimoto, Tetsuo Watanabe, Kazutomo Fukue
  • Patent number: 9947340
    Abstract: A method of regenerating wideband speech from narrowband speech, the method comprising: receiving samples of a narrowband speech signal in a first range of frequencies; modulating received samples of the narrowband speech signal with a modulation signal having a modulating frequency adapted to upshift each frequency in the first range of frequencies by an amount determined by the modulating frequency wherein the modulating frequency is selected to translate into a target band a selected frequency band within the first range of signals; filtering the modulated samples using a target band filter to form a regenerated speech signal in the target band; and combining the narrow band speech signal with the regenerated speech signal in the target band to regenerate a wideband speech signal, the method comprising the step of controlling the modulated samples to lie in a second range of frequencies identified by determining a signal characteristic of frequencies in the first range of frequencies.
    Type: Grant
    Filed: December 10, 2009
    Date of Patent: April 17, 2018
    Assignee: SKYPE
    Inventors: Mattias Nilsson, Soren Vang Anderson, Koen Bernard Vos
  • Patent number: 9843859
    Abstract: Preprocessing speech signals from an indirect conduction microphone. One exemplary method preprocesses the speech signal in two stages. In stage one, an external speech sample is characterized using an auto regression model, and coefficients from the model are convolved with the internal speech signal from the indirect conduction microphone to produce a pre-conditioned internal speech signal. In stage two, a training sound is received by the indirect conduction microphone and filtered through a low-pass filter. The result is then modeled using auto regression, and inverted to produce an inverted filter model. The pre-conditioned internal speech signal is convolved with the inverted filter model to remove negative or undesirable acoustic characteristics and loss from the speech signal from the indirect conduction microphone.
    Type: Grant
    Filed: May 28, 2015
    Date of Patent: December 12, 2017
    Assignee: MOTOROLA SOLUTIONS, INC.
    Inventors: Cheah Heng Tan, Linus Francis, Robert J. Novorita
  • Patent number: 9812143
    Abstract: An audio coding method and apparatus, where the method includes, for each audio frame in audio, when a signal characteristic of the audio frame and a signal characteristic of a previous audio frame meet a preset modification condition, determining a first modification weight according to linear spectral frequency (LSF) differences of the audio frame and LSF differences of the previous audio frame, modifying a linear predictive parameter of the audio frame according to the determined first modification weight, and coding the audio frame according to a modified linear predictive parameter of the audio frame. According to the present disclosure, audio having a wider bandwidth can be coded while a bit rate remains unchanged or a bit rate slightly changes and a spectrum between audio frames is steadier.
    Type: Grant
    Filed: November 28, 2016
    Date of Patent: November 7, 2017
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Zexin Liu, Bin Wang, Lei Miao
  • Patent number: 9761236
    Abstract: A method performed in an audio decoder for reconstructing an original audio signal having a lowband portion and a highband portion is disclosed. The method includes receiving an encoded audio signal and extracting reconstruction parameters from the encoded audio signal. The method further includes decoding the encoded audio signal with a core audio decoder to obtain a decoded lowband portion and regenerating the highband portion based at least in part on a cross over frequency and the decoded lowband portion to obtain a regenerated highband portion. The method also includes creating a synthetic sinusoid with a level based at least in part on a spectral envelope value for the particular subband and a noise floor value for the particular subband and adding the synthetic sinusoid to the regenerated highband portion in the particular frequency band specified by the location information.
    Type: Grant
    Filed: March 8, 2017
    Date of Patent: September 12, 2017
    Assignee: Dolby International AB
    Inventors: Kristofer Kjoerling, Per Ekstrand, Holger Hoerich
  • Patent number: 9761237
    Abstract: A method performed in an audio decoder for reconstructing an original audio signal having a lowband portion and a highband portion is disclosed. The method includes receiving an encoded audio signal and extracting reconstruction parameters from the encoded audio signal. The method further includes decoding the encoded audio signal with a core audio decoder to obtain a decoded lowband portion and regenerating the highband portion based at least in part on a cross over frequency and the decoded lowband portion to obtain a regenerated highband portion. The method also includes creating a synthetic sinusoid with a level based at least in part on a spectral envelope value for the particular subband and a noise floor value for the particular subband and adding the synthetic sinusoid to the regenerated highband portion in the particular frequency band specified by the location information.
    Type: Grant
    Filed: March 8, 2017
    Date of Patent: September 12, 2017
    Assignee: Dolby International AB
    Inventors: Kristofer Kjoerling, Per Ekstrand, Holger Hoerich
  • Patent number: 9712939
    Abstract: A gain contribution of the audio signal for each of the N audio objects to at least one of M speakers may be determined. Determining the gain contribution may involve determining a center of loudness position that is a function of speaker (or cluster) positions and gains assigned to each speaker (or cluster). Determining the gain contribution also may involve determining a minimum value of a cost function. A first term of the cost function may represent a difference between the center of loudness position and an audio object position.
    Type: Grant
    Filed: June 17, 2014
    Date of Patent: July 18, 2017
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Antonio Mateos Sole, Giulio Cengarle, Dirk Jeroen Breebart, Nicolas R. Tsingos
  • Patent number: 9600824
    Abstract: A system functions to recommend equipment expansions, additions and/or substitutions; interconnections; supplemental capabilities; features; etc. based upon a knowledge of a consumer's existing audio and/or visual system configuration.
    Type: Grant
    Filed: May 31, 2011
    Date of Patent: March 21, 2017
    Assignee: Universal Electronics Inc.
    Inventors: Paul Ogaz, Arsham Hatambeiki, Patrick H. Hayes
  • Patent number: 9601104
    Abstract: Speech traits of an entity imbue an artificial intelligence system with idiomatic traits of persons from a particular category. Electronic units of speech are collected from an electronic stream of speech that is generated by a first entity. Tokens from the electronic stream of speech are identified, where each token identifies a particular electronic unit of speech from the electronic stream of speech, and where identification of the tokens is semantic-free. Nodes in a first speech graph are populated with the tokens to develop a first speech graph having a first shape. The first shape is matched to a second shape of a second speech graph from a second entity in a known category. The first entity is assigned to the known category, and synthetic speech generated by an artificial intelligence system is modified based on the first entity being assigned to the known category.
    Type: Grant
    Filed: August 2, 2016
    Date of Patent: March 21, 2017
    Assignee: International Business Machines Corporation
    Inventors: Guillermo A. Cecchi, James R. Kozloski, Clifford A. Pickover, Irina Rish
  • Patent number: 9595263
    Abstract: An apparatus for decoding an encoded audio signal is provided. The apparatus includes a pulse information decoder and a signal decoder. The pulse information decoder is adapted to decode a plurality of pulse positions, wherein each one of the pulse positions indicates a position of one of the pulses of the track, wherein the pulse information decoder is configured to decode the plurality of pulse positions by using a track positions number, a total pulses number, and one state number. The signal decoder is adapted to decode the encoded audio signal by generating a synthesized audio signal using the plurality of pulse positions and a plurality of predictive filter coefficients.
    Type: Grant
    Filed: August 14, 2013
    Date of Patent: March 14, 2017
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Tom Baeckstroem, Guillaume Fuchs
  • Patent number: 9554703
    Abstract: A diabetes care system for detection of an analyte and method for selective data transmission are disclosed. The diabetes care system has a mobile component and a base station, wherein a data transmission occurring between the mobile component and the base station within a time interval in which a wireless communication link exists, wherein is selectively performed in such a manner that within the time interval, a first partial set of the data is transmitted from the mobile component to the base station. The first partial set is selected using a processor-controlled selection algorithm in such a manner that the data transmitted in the time interval is representative of the entirety of the data stored in the mobile component.
    Type: Grant
    Filed: June 14, 2007
    Date of Patent: January 31, 2017
    Assignee: Roche Diabetes Care, Inc.
    Inventors: Jürgen Rasch-Menges, Paul Jansen, Hans-Peter Haar, Ulrich Haueter, Andreas Poredda
  • Patent number: 9524727
    Abstract: In a quantization method for quantizing a received excitation signal in a communication system performing the steps of re-shuffling S301 the elements of the received excitation signal to provide a re-shuffled excitation signal; coding S302 the re-shuffled excitation signal with a variable bit-rate algorithm to provide a coded excitation signal; and reassigning S303 codewords of the coded excitation signal if a number of used bits exceeds a predetermined fixed bit rate requirement to provide a quantized excitation signal.
    Type: Grant
    Filed: November 13, 2012
    Date of Patent: December 20, 2016
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Volodya Grancharov, Erik Norvell, Sigurdur Sverrisson
  • Patent number: 9466308
    Abstract: A method for coding and decoding an audio signal or speech signal and an apparatus adopting the method are provided.
    Type: Grant
    Filed: December 22, 2014
    Date of Patent: October 11, 2016
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ki Hyun Choo, Jung-Hoe Kim, Eun Mi Oh, Ho Sang Sung
  • Patent number: 9449606
    Abstract: An encoder for providing an audio stream on the basis of a transform-domain representation of an input audio signal includes a quantization error calculator configured to determine a multi-band quantization error over a plurality of frequency bands of the input audio signal for which separate band gain information is available. The encoder also includes an audio stream provider for providing the audio stream such that the audio stream includes information describing an audio content of the frequency bands and information describing the multi-band quantization error. A decoder for providing a decoded representation of an audio signal on the basis of an encoded audio stream representing spectral components of frequency bands of the audio signal includes a noise filler for introducing noise into spectral components of a plurality of frequency bands to which separate frequency band gain information is associated on the basis of a common multi-band noise intensity value.
    Type: Grant
    Filed: January 16, 2014
    Date of Patent: September 20, 2016
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Nikolaus Rettelbach, Bernhard Grill, Guillaume Fuchs, Stefan Geyersberger, Markus Multrus, Harald Popp, Juergen Herre, Stefan Wabnik, Gerald Schuller, Jens Hirschfeld
  • Patent number: 9431003
    Abstract: Speech traits of an entity imbue an artificial intelligence system with idiomatic traits of persons from a particular category. Electronic units of speech are collected from an electronic stream of speech that is generated by a first entity. Tokens from the electronic stream of speech are identified, where each token identifies a particular electronic unit of speech from the electronic stream of speech, and where identification of the tokens is semantic-free. Nodes in a first speech graph are populated with the tokens to develop a first speech graph having a first shape. The first shape is matched to a second shape of a second speech graph from a second entity in a known category. The first entity is assigned to the known category, and synthetic speech generated by an artificial intelligence system is modified based on the first entity being assigned to the known category.
    Type: Grant
    Filed: March 27, 2015
    Date of Patent: August 30, 2016
    Assignee: International Business Machines Corporation
    Inventors: Guillermo A. Cecchi, James R. Kozloski, Clifford A. Pickover, Irina Rish
  • Patent number: 9418152
    Abstract: A system and method for receiving an initial search entry to search text data. The text data may be, for example, an N word lattice, transcribed by a text to speech engine. The difference between the initial search entry and one or more entries in the dictionary may be measured. One or more similar entries may be selected from the dictionary that have the smallest measures of difference to the initial search entry. The text data may be searched for the one or more selected similar entries. Each of the searched similar entries found in the text data may be displayed as a search result.
    Type: Grant
    Filed: February 9, 2011
    Date of Patent: August 16, 2016
    Assignee: NICE-SYSTEMS LTD.
    Inventors: Maor Nissan, Moshe Wasserblat
  • Patent number: 9406307
    Abstract: A method, device, and apparatus provide the ability to predict a portion of a polyphonic audio signal for compression and networking applications. The solution involves a framework of a cascade of long term prediction filters, which by design is tailored to account for all periodic components present in a polyphonic signal. This framework is complemented with a design method to optimize the system parameters. Specialization may include specific techniques for coding and networking scenarios, where the potential of each enhanced prediction is realized to considerably improve the overall system performance for that application. One specific technique provides enhanced inter-frame prediction for the compression of polyphonic audio signals, particularly at low delay. Another specific technique provides improved frame loss concealment capabilities to combat packet loss in audio communications.
    Type: Grant
    Filed: August 19, 2013
    Date of Patent: August 2, 2016
    Assignee: The Regents of the University of California
    Inventors: Kenneth Rose, Tejaswi Nanjundaswamy
  • Patent number: 9396739
    Abstract: The invention discloses a method including: performing in a unit of first timeframe frame length, framing on a continuous voice sample to obtain a plurality of first timeframes, detecting energy of each of the first timeframes, and determining a target first timeframe including a potential abrupt exception of a voice signal by analyzing a relationship between the energy of the plurality of first timeframes; performing, in a unit of second timeframe frame length, framing on the continuous voice sample to obtain a plurality of second timeframes, and processing each of the second timeframes to acquire a tone feature, and determining, by analyzing a tone feature of at least one of the second timeframes including at least one target second timeframe, whether the potential abrupt exception of a voice signal included in the target first timeframe included in the target second timeframe is a real abrupt exception of a voice signal.
    Type: Grant
    Filed: June 23, 2015
    Date of Patent: July 19, 2016
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Lijing Xu
  • Patent number: 9320499
    Abstract: An ultrasonic diagnosing apparatus includes: a transducer array 1 composed of arrayed transducer elements T1 to T6 for transmitting ultrasound; driving circuits D1A to D6A each provided for transmission channels for driving each of the transducer elements; a transmission trigger generator 2 for generating a trigger pulse for controlling each of the driving circuits; a parallel reception beam former 3 for processing reception signals from the transducer elements; a signal processor 4 for processing an output signal of the parallel reception beam former; and a control unit 5 for controlling the transmission trigger generator, the parallel reception beam former and the signal processor.
    Type: Grant
    Filed: April 18, 2013
    Date of Patent: April 26, 2016
    Assignee: KONICA MINOLTA, INC.
    Inventors: Hiroshi Fukukita, Yoshihiko Itoh
  • Patent number: 9237008
    Abstract: An encryption device that, when voice or image data or the like being encoded is encrypted using a one-time pad (OTP) cipher and then transmitted, reduces a period of time in which a cipher key for the OTP cipher runs out. A first terminal device determines whether to encode transmission data by a first encoding scheme or a second encoding scheme having a lower bit rate than the first encoding scheme, depending on the number of remaining bits of an OTP cipher key, and encodes the transmission data according to the determined encoding scheme, thereby generating encoded data. The first terminal device encrypts the generated encoded data with the OTP cipher using the OTP cipher key, thereby generating encrypted communication data, and transmits the generated encrypted communication data to a second terminal device.
    Type: Grant
    Filed: July 25, 2011
    Date of Patent: January 12, 2016
    Assignee: Mitsubishi Electric Corporation
    Inventor: Yoichi Shibata
  • Patent number: 9178591
    Abstract: A method in a mobile communication terminal includes holding a definition of a sub-sampled codebook identifying precoding matrices to be used for providing precoding feedback by the terminal. The precoding matrices in the sub-sampled codebook are selected from a master codebook that is made-up of a long-term sub-codebook and a short-term sub-codebook. The definition defines a first subset of the long-term sub-codebook and a second subset of the short-term sub-codebook. A Multiple-Input Multiple-Output (MIMO) signal is received in the terminal via multiple receive antennas. Based on the received MIMO signal, a precoding matrix is selected from the sub-sampled codebook for precoding subsequent MIMO signals transmitted to the terminal. The precoding feedback indicating the selected precoding matrix is calculated.
    Type: Grant
    Filed: June 5, 2014
    Date of Patent: November 3, 2015
    Assignee: MARVELL WORLD TRADE LTD.
    Inventors: Krishna Srikanth Gomadam, Adoram Erell
  • Patent number: 9177545
    Abstract: A recognition dictionary creating device includes a user dictionary in which a phoneme label string of an inputted voice is registered and an interlanguage acoustic data mapping table in which a correspondence between phoneme labels in different languages is defined, and refers to the interlanguage acoustic data mapping table to convert the phoneme label string registered in the user dictionary and expressed in a language set at the time of creating the user dictionary into a phoneme label string expressed in another language which the recognition dictionary creating device has switched.
    Type: Grant
    Filed: January 22, 2010
    Date of Patent: November 3, 2015
    Assignee: Mitsubishi Electric Corporation
    Inventor: Yuzo Maruta
  • Patent number: 9153236
    Abstract: A parametric background noise estimate is continuously updated during an active or non-silence phase so that the noise generation may immediately be started with upon the entrance of an inactive phase following the active phase. In accordance with another aspect, a spectral domain is very efficiently used in order to parameterize the background noise thereby yielding a background noise synthesis which is more realistic and thus leads to a more transparent active to inactive phase switching.
    Type: Grant
    Filed: August 13, 2013
    Date of Patent: October 6, 2015
    Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
    Inventors: Panji Setiawan, Konstantin Schmidt, Stephan Wilde
  • Patent number: 9129590
    Abstract: Disclosed are an audio encoding device and an audio decoding device which reduce degradation of subjective quality of a decoding signal caused by power mismatch of a decoding signal which is generated by a concealing process upon disappearance of a frame. When a frame is lost, a past encoding parameter is used to obtain a concealed LPC of the current frame and a concealed sound source parameter. A normal CELP decoding is performed from the obtained concealed sound source parameter. Correction is performed by using a conceal parameter on the obtained concealed LPC and the concealed sound source signal. The power of the corrected concealed sound source signal is adjusted to match a reference sound source power. A filter gain of the synthesis filter is adjusted so as to adjust the power of a decoded sound signal to the power of a decoded sound signal during an error-free state.
    Type: Grant
    Filed: February 29, 2008
    Date of Patent: September 8, 2015
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya Kawashima, Hiroyuki Ehara, Koji Yoshida
  • Patent number: 9111536
    Abstract: A method and system for compressing an audio signal. The method includes receiving a segment of an audio signal and selectively disabling noise suppression for the received segment. The segment is filtered in a noise-suppression module if noise suppression is not disabled. The method also includes calculating an autocorrelation coefficient and an LSP coefficient, predicting a short-term coefficient and long-term coefficients according to the LSP coefficient and calculating one or more bandwidth-expanded correlation coefficients. Further, the method includes determining the type of packet in which to encode the segment. An encoding rate is selected from among a full rate encode, a half-rate encode, and an eight-rate encode if noise suppression is not disabled. An encoding rate is selected from among a full rate encode and a half-rate encode if noise suppression is disabled. Furthermore, the segment is formed into a packet of the determined type and selected rate.
    Type: Grant
    Filed: March 7, 2011
    Date of Patent: August 18, 2015
    Assignee: TEXAS INSTRUMENTS INCORPORATED
    Inventor: Mukund Kanyana Navada
  • Patent number: 9106241
    Abstract: In lossy data compression of a signal using ADPCM, an adaptive decorrelation or “prediction” filter is used to reduce the amplitude of the signal, the spectral dynamic range of the signal also being reduced. This latter reduction is effected in a nonuniform manner, if known techniques are used, with regions of high spectral density being compressed more than regions of low spectral density. The present invention recognises that using a uniform compression ratio results in a better tradeoff between compression and robustness to transmission channel errors. A method is described for obtaining a uniform compression ratio by adjusting coefficients of the decorrelation filter in dependence on coefficients of an adaptive training filter that is fed from the output of the decorrelation filter. A reverse method is also provided along with encoder, decoder and codec implementing the techniques.
    Type: Grant
    Filed: September 2, 2010
    Date of Patent: August 11, 2015
    Inventors: Peter Graham Craven, Malcolm Law
  • Patent number: 9100767
    Abstract: A converter and conversion method are disclosed for converting N channel audio input channels into M channel audio output channels, wherein a processor is used for applying a transfer function to a signal received on an input channel to obtain reverberation components of a calculated output channel, wherein said transfer function is a simplified transfer function matching a selected subset of a set of local maxima of a measured reverberation when applied to a corresponding stimulus.
    Type: Grant
    Filed: November 20, 2009
    Date of Patent: August 4, 2015
    Assignee: AURO TECHNOLOGIES
    Inventors: Wilfried Van Baelen, Ralph Kessler
  • Patent number: 9082397
    Abstract: An apparatus including at least one processor and at least one memory including computer program code the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to select at least two single frequency components; generate an indicator, the indicator being configured to represent the at least two single frequency components and is configured to be dependent on the frequency separation between the two single frequency components.
    Type: Grant
    Filed: November 6, 2007
    Date of Patent: July 14, 2015
    Assignee: Nokia Technologies Oy
    Inventors: Lasse Laaksonen, Mikko Tammi, Adriana Vasilache, Anssi Ramo
  • Patent number: 9047860
    Abstract: A method for concatenating a first frame of samples and a subsequent second frame of samples, the method comprising applying a phase filter adapted to minimizing a discontinuity at a boundary between the first and second frames of samples.
    Type: Grant
    Filed: January 31, 2006
    Date of Patent: June 2, 2015
    Assignee: SKYPE
    Inventor: Soren Andersen
  • Patent number: 9043202
    Abstract: An apparatus for decoding data segments representing a time-domain data stream, a data segment being encoded in the time domain or in the frequency domain, a data segment being encoded in the frequency domain having successive blocks of data representing successive and overlapping blocks of time-domain data samples. The apparatus includes a time-domain decoder for decoding a data segment being encoded in the time domain and a processor for processing the data segment being encoded in the frequency domain and output data of the time-domain decoder to obtain overlapping time-domain data blocks. The apparatus further includes an overlap/add-combiner for combining the overlapping time-domain data blocks to obtain a decoded data segment of the time-domain data stream.
    Type: Grant
    Filed: April 10, 2014
    Date of Patent: May 26, 2015
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Ralf Geiger, Max Neuendorf, Yoshikazu Yokotani, Nikolaus Rettelbach, Juergen Herre, Stefan Geyersberger
  • Patent number: 9015040
    Abstract: An apparatus for encoding an audio signal having a stream of audio samples has: a windower for applying a prediction coding analysis window to the stream of audio samples to obtain windowed data for a prediction analysis and for applying a transform coding analysis window to the stream of audio samples to obtain windowed data for a transform analysis, wherein the transform coding analysis window is associated with audio samples within a current frame of audio samples and with audio samples of a predefined portion of a future frame of audio samples being a transform-coding look-ahead portion, wherein the prediction coding analysis window is associated with at least the portion of the audio samples of the current frame and with audio samples of a predefined portion of the future frame being a prediction coding look-ahead portion, wherein the transform coding look-ahead portion and the prediction coding look-ahead portion are identically to each other or are different from each other by less than 20%; and an enc
    Type: Grant
    Filed: August 14, 2013
    Date of Patent: April 21, 2015
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Emmanuel Ravelli, Ralf Geiger, Markus Schnell, Guillaume Fuchs, Vesa Ruoppila, Tom Baeckstroem, Bernhard Grill, Christian Helmrich
  • Patent number: 8990074
    Abstract: A method of noise-robust speech classification is disclosed. Classification parameters are input to a speech classifier from external components. Internal classification parameters are generated in the speech classifier from at least one of the input parameters. A Normalized Auto-correlation Coefficient Function threshold is set. A parameter analyzer is selected according to a signal environment. A speech mode classification is determined based on a noise estimate of multiple frames of input speech.
    Type: Grant
    Filed: April 10, 2012
    Date of Patent: March 24, 2015
    Assignee: QUALCOMM Incorporated
    Inventors: Ethan Robert Duni, Vivek Rajendran
  • Patent number: 8988256
    Abstract: A coding method, a decoding method, a coder, and a decoder are disclosed herein. A coding method includes: obtaining the pulse distribution, on a track, of the pulses to be encoded on the track; determining a distribution identifier for identifying the pulse distribution according to the pulse distribution; and generating a coding index that includes the distribution identifier. A decoding method includes: receiving a coding index; obtaining a distribution identifier from the coding index, wherein the distribution identifier is configured to identify the pulse distribution, on a track, of the pulses to be encoded on the track; determining the pulse distribution, on a track, of all the pulses to be encoded on the track according to the distribution identifier; and reconstructing the pulse order on the track according to the pulse distribution.
    Type: Grant
    Filed: September 18, 2012
    Date of Patent: March 24, 2015
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Fuwei Ma, Dejun Zhang
  • Patent number: 8982971
    Abstract: A multi-carrier signal is typically comprised of many equidistant sub-carriers. This results in periodicity of spectrum within the bandwidth of such a multi-carrier signal. An unknown multi-carrier signal with equidistant sub-carriers can thus be sensed together with its sub-carrier spacing by finding a discernable local maximum in the cepstrum (Fourier transform of the log spectrum) of the multi-carrier signal.
    Type: Grant
    Filed: March 29, 2012
    Date of Patent: March 17, 2015
    Assignee: QRC, Inc.
    Inventors: Sinisa Peric, Thomas F. Callahan, III
  • Patent number: 8983852
    Abstract: The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal.
    Type: Grant
    Filed: May 25, 2010
    Date of Patent: March 17, 2015
    Assignee: Dolby International AB
    Inventors: Per Ekstrand, Lars Villemoes, Per Hedelin
  • Patent number: 8976906
    Abstract: A multi-carrier signal is typically comprised of many equidistant sub-carriers. This results in periodicity of spectrum within the bandwidth of such a multi-carrier signal. An unknown multi-carrier signal with equidistant sub-carriers can thus be sensed together with its sub-carrier spacing by finding a discernible local maximum in the cepstrum (Fourier transform of the log spectrum) of the multi-carrier signal.
    Type: Grant
    Filed: March 29, 2012
    Date of Patent: March 10, 2015
    Assignee: QRC, Inc.
    Inventors: Sinisa Peric, Thomas F. Callahan, III
  • Publication number: 20140379333
    Abstract: A wave resynthesis method and system comprises receiving input wave form, processing received data to create an enhanced wave form, identifying the enhanced wave form, transmitting the identified wave form to a receiving unit, identifying the received wave form, resynthesizing the received wave form and outputting the resynthesized wave form. Identifying the enhanced wave form includes sampling the waveform and measuring the angle of the samples at two or more points in the waveform. The enhancing of voice audio input includes the parallel processing the input audio by a module that is a low pass filter with dynamic offset, an envelope controlled band-pass filter, a high pass filter and adding an amount of dynamic synthesized sub bass to the audio. The four processed audio signals are combined in a summing mixer with the original audio. The receiving unit has a complete set of encrypted tables for accurate resynthesizing/reproduction.
    Type: Application
    Filed: February 19, 2014
    Publication date: December 25, 2014
    Applicant: Max Sound Corporation
    Inventor: Lloyd Trammell
  • Patent number: 8918324
    Abstract: A method for coding and decoding an audio signal or speech signal and an apparatus adopting the method are provided.
    Type: Grant
    Filed: January 27, 2010
    Date of Patent: December 23, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Ki Hyun Choo, Jung-Hoe Kim, Eun Mi Oh, Ho Sang Sung
  • Patent number: 8918315
    Abstract: An encoding apparatus includes a first layer encoder that encodes a signal, a first layer decoder that decodes first layer encoded data, a first layer error transform coefficient calculator that transforms a first layer error signal into a frequency domain and a second layer encoder that encodes the first layer error transform coefficient to acquire second layer encoded data. The second layer encoder includes a band determiner that determines a band to be encoded by the second layer encoder, and a first shape vector encoder that refers the first layer error transform coefficient included in the band to generate a first shape vector and first shape encoded information, a target gain calculator calculates target gain per subband, a gain vector generator generates a gain vector using a plurality of target gains, and a gain vector encoder encodes the gain vector to acquire gain encoded information.
    Type: Grant
    Filed: August 13, 2013
    Date of Patent: December 23, 2014
    Assignee: Panasonic Intellectual Property Corporation of America
    Inventors: Masahiro Oshikiri, Toshiyuki Morii, Tomofumi Yamanashi
  • Patent number: 8918314
    Abstract: An encoding apparatus includes a first layer encoder that encodes an input signal, a first layer decoder that decodes the first layer encoded data, a weighting filter that filters a first layer error signal to acquire a weighted first layer error signal, a first layer error transform coefficient calculator that transforms the weighted first layer error signal into a frequency domain, and a second layer encoder that encodes the first layer error transform coefficient. The second layer encoder includes a first shape vector encoder that refers the first layer error transform coefficient to generate a first shape vector and first shape encoded information. A target gain calculator calculates a target gain using the first layer error transform coefficient and the first shape vector, a gain vector generator generates a gain vector, and a gain vector encoder encodes the gain vector to acquire gain encoded information.
    Type: Grant
    Filed: August 13, 2013
    Date of Patent: December 23, 2014
    Assignee: Panasonic Intellectual Property Corporation of America
    Inventors: Masahiro Oshikiri, Toshiyuki Morii, Tomofumi Yamanashi
  • Patent number: 8914282
    Abstract: By monitoring the wind noise in a location in which a cellular telephone is operating and by applying noise reduction and/or cancellation protocols at the appropriate time via analog and/or digital signal processing, it is possible to significantly reduce wind noise entering into a communication system.
    Type: Grant
    Filed: August 14, 2012
    Date of Patent: December 16, 2014
    Inventor: Alon Konchitsky