Autocorrelation Patents (Class 704/217)
  • Patent number: 10692510
    Abstract: It is shown an encoder for encoding an audio signal with reduced background noise using linear predictive coding. The encoder includes a background noise estimator configured to estimate background noise of the audio signal, a background noise reducer configured to generate background noise reduced audio signal by subtracting the estimated background noise of the audio signal from the audio signal, and a predictor configured to subject the audio signal to linear prediction analysis to obtain a first set of linear prediction filter (LPC) coefficients and to subject the background noise reduced audio signal to linear prediction analysis to obtain a second set of linear prediction filter (LPC) coefficients. Furthermore, the encoder includes an analysis filter composed of a cascade of time-domain filters controlled by the obtained first set of LPC coefficients and the obtained second set of LPC coefficients.
    Type: Grant
    Filed: March 14, 2018
    Date of Patent: June 23, 2020
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Johannes Fischer, Tom Bäckström, Emma Jokinen
  • Patent number: 10666475
    Abstract: An electronic transmitter includes: a modulator to phase modulate a carrier signal with a baseband signal using a phase modulation sequence; and an emitter to emit the phase modulated signal. The phase modulated signal has poor autocorrelation, has a corresponding mismatched filter based on the phase modulation sequence, and is configured to demodulate into the baseband signal through poor cross-correlation with the mismatched filter. Sometimes, the transmitter is part of a sensing apparatus, where the emitter emits the phase modulated signal at a target and the emitted signal reflects off the target. The sensing apparatus includes a receiver that has a collector to collect the reflected signal, and a demodulator to demodulate the collected signal into the baseband signal through the poor cross-correlation with the mismatched filter. Sometimes, the transmitter is part of a communication system, where the emitter emits the phase modulated signal to an intended recipient.
    Type: Grant
    Filed: October 29, 2018
    Date of Patent: May 26, 2020
    Assignee: BAE Systems Information and Electronic Systems Integration Inc.
    Inventors: William D. Watson, Prabahan Basu, Jonathan P. Beaudeau, David J. Couto
  • Patent number: 10629214
    Abstract: An encoder and a decoder are provided that are capable of reproducing a frequency-domain envelope sequence that provides high approximation accuracy around peaks caused by the pitch period of an audio signal by using a small amount of code. An encoder of the present invention comprises a periodic-combined-envelope generating part and a variable-length coding part. The periodic-combined-envelope generating part generates a periodic combined envelope sequence which is a frequency-domain sequence based on a spectral envelope sequence which is a frequency-domain sequence corresponding to a linear predictive coefficient code obtained from an input audio signal and on a frequency-domain period. The variable-length coding part encodes a frequency-domain sequence derived from the input audio signal. A decoder of the present invention comprises a periodic-combined-envelope generating part and a variable-length decoding part.
    Type: Grant
    Filed: November 26, 2018
    Date of Patent: April 21, 2020
    Assignee: Nippon Telegraph and Telephone Corporation
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 10607616
    Abstract: An encoder and a decoder are provided that are capable of reproducing a frequency-domain envelope sequence that provides high approximation accuracy around peaks caused by the pitch period of an audio signal by using a small amount of code. An encoder of the present invention comprises a periodic-combined-envelope generating part and a variable-length coding part. The periodic-combined-envelope generating part generates a periodic combined envelope sequence which is a frequency-domain sequence based on a spectral envelope sequence which is a frequency-domain sequence corresponding to a linear predictive coefficient code obtained from an input audio signal and on a frequency-domain period. The variable-length coding part encodes a frequency-domain sequence derived from the input audio signal. A decoder of the present invention comprises a periodic-combined-envelope generating part and a variable-length decoding part.
    Type: Grant
    Filed: November 26, 2018
    Date of Patent: March 31, 2020
    Assignee: Nippon Telegraph and Telephone Corporation
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 10475462
    Abstract: A method includes generating, by a processor, an audio fingerprint representative of an audio signal. The audio fingerprint is based on a plurality of first intensity values corresponding to one or more segments of the audio signal. The plurality of first intensity values are based on a Fast Fourier Transform (FFT) performed on at least one sampled segment of the audio signal. The method also includes comparing a plurality of second intensity values based on a recorded sound to determine whether the second intensity values match the first intensity values. The method additionally includes causing a message to be communicated to a device used to record the sound based on a determination that the plurality of second intensity values match the plurality of first intensity values.
    Type: Grant
    Filed: November 8, 2017
    Date of Patent: November 12, 2019
    Assignee: PLAYFUSION LIMITED
    Inventors: Riaan Hodgson, David Gomberg, Mark Gerhard
  • Patent number: 10469907
    Abstract: Provided are a signal processing method for determining an audience rating of media, and an additional information inserting apparatus, a media reproducing apparatus and an audience rating determining apparatus for performing the same method. In detail, the signal processing method for determining an audience rating of media is a method that may determine an audience rating of media with respect to a whole section of an audio signal by inserting additional information into a silence section through a noise signal.
    Type: Grant
    Filed: July 2, 2018
    Date of Patent: November 5, 2019
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Young Ho Jeong, Seung Kwon Beack, Tae Jin Lee, Hui Yong Kim
  • Patent number: 10381011
    Abstract: An apparatus for determining an estimated pitch lag is provided. The apparatus includes an input interface for receiving a plurality of original pitch lag values, and a pitch lag estimator for estimating the estimated pitch lag. The pitch lag estimator is configured to estimate the estimated pitch lag depending on a plurality of original pitch lag values and depending on a plurality of information values, wherein for each original pitch lag value of the plurality of original pitch lag values, an information value of the plurality of information values is assigned to the original pitch lag value.
    Type: Grant
    Filed: December 21, 2015
    Date of Patent: August 13, 2019
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Jeremie Lecomte, Michael Schnabel, Goran Markovic, Martin Dietz, Bernhard Neugebauer
  • Patent number: 10255928
    Abstract: A method and apparatus to encoding or decoding an audio signal is provided. In the method and apparatus, a noise-floor level to use in encoding or decoding a high frequency signal is updated according to the degree of a voiced or unvoiced sound included in the signal.
    Type: Grant
    Filed: November 13, 2017
    Date of Patent: April 9, 2019
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ki-hyun Choo, Eun-mi Oh, Ho-sang Sung, Jung-hoe Kim, Mi-young Kim
  • Patent number: 10210880
    Abstract: An encoding method, a decoding method, an encoding apparatus, a decoding apparatus, a transmitter, a receiver, and a communications system. The encoding method includes: dividing a to-be-encoded time-domain signal into a low band signal and a high band signal; performing encoding on the low band signal to obtain a low frequency encoding parameter; performing encoding on the high band signal to obtain a high frequency encoding parameter, and obtaining a synthesized high band signal; performing short-time post-filtering processing on the synthesized high band signal to obtain a short-time filtering signal; and calculating a high frequency gain based on the high band signal and the short-time filtering signal. A technical solution according to the embodiments of the present application can improve an encoding and/or decoding effect.
    Type: Grant
    Filed: August 15, 2017
    Date of Patent: February 19, 2019
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Bin Wang, Zexin Liu, Lei Miao
  • Patent number: 10170129
    Abstract: An apparatus for encoding a speech signal by determining a codebook vector of a speech coding algorithm is provided. The apparatus includes a matrix determiner for determining an autocorrelation matrix R, and a codebook vector determiner for determining the codebook vector depending on the autocorrelation matrix R. The matrix determiner is configured to determine the autocorrelation matrix R by determining vector coefficients of a vector r, wherein the autocorrelation matrix R includes a plurality of rows and a plurality of columns, wherein the vector r indicates one of the columns or one of the rows of the autocorrelation matrix R, wherein R(i, j)=r(|i?j|), wherein R(i, j) indicates the coefficients of the autocorrelation matrix R, wherein i is a first index indicating one of a plurality of rows of the autocorrelation matrix R, and wherein j is a second index indicating one of the plurality of columns of the autocorrelation matrix R.
    Type: Grant
    Filed: April 3, 2015
    Date of Patent: January 1, 2019
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Tom Baeckstroem, Markus Multrus, Guillaume Fuchs, Christian Helmrich, Martin Dietz
  • Patent number: 10055490
    Abstract: The present invention relates to the continuous monitoring of an audio signal and identification of audio items within an audio signal. The technology disclosed utilizes predictive caching of fingerprints to improve efficiency. Fingerprints are cached for tracking an audio signal with known alignment and for watching an audio signal without known alignment, based on already identified fingerprints extracted from the audio signal. Software running on a smart phone or other battery-powered device cooperates with software running on an audio identification server.
    Type: Grant
    Filed: June 14, 2016
    Date of Patent: August 21, 2018
    Assignee: SoundHound, Inc.
    Inventors: Bernard Mont-Reynaud, Aaron Master, Timothy Stonehocker, Keyvan Mohajer
  • Patent number: 9978394
    Abstract: Provided is a method, non-transitory computer program product and system for an improved noise suppression technique for speech enhancement. It operates on speech signals from a single or multiple input sources. Background noise monitoring is performed with one or multiple input speech signals to determine if the input speech contains active voice. If the absence of active voice is detected, the spectrum of the input speech is used to update a long-term noise spectrum estimate. In addition, the input from one or more secondary microphones can be used to update a short-term noise spectrum estimate. The input speech spectrum is then compared to the long-term and/or short-term noise spectra, and a selective spectrum gain based shaping is applied to the input speech spectrum to reduce noise.
    Type: Grant
    Filed: February 24, 2015
    Date of Patent: May 22, 2018
    Assignee: QOSOUND, INC.
    Inventor: Huan-Yu Su
  • Patent number: 9916603
    Abstract: Systems and methods for transmitting content to a client via a communication network are provided. An insertion server, running within a firewall device associated with a private IP network, detects establishment of a transport communication protocol connection between a client associated with the network and a destination located external to the network by examining packets as they pass through the network and pass by the insertion server. A content request of an application protocol initiated by the client and directed to the destination is observed by the insertion server. The content request is negated by the insertion server by causing a canceling message of the transport communication protocol to be sent to the destination. Unsolicited content is caused to be selected for delivery to the client by the insertion server. The unsolicited content is sent by the insertion server to the client via the application protocol.
    Type: Grant
    Filed: August 24, 2016
    Date of Patent: March 13, 2018
    Assignee: Fortinet, Inc.
    Inventors: Kunhua Lin, Michael Xie
  • Patent number: 9842601
    Abstract: Vector Quantizer and method therein for efficient vector quantization, e.g. in a transform audio codec. The method comprises comparing an input target vector s with a plurality of centroids, each centroid representing a respective class of codevectors in a codebook. Further, a starting point for a search related to the input target vector in the codebook is determined, based on the result of the comparison. The codevectors in the codebook are sorted according to a distortion measure reflecting the distance between each codevector and the centroids of the classes. The Vector Quantizer and method enables that the class of codevectors comprising the most probable candidate codevectors in regard of the input vector s may be searched first.
    Type: Grant
    Filed: June 21, 2016
    Date of Patent: December 12, 2017
    Assignee: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)
    Inventors: Volodya Grancharov, Tomas Jansson Toftg{dot over (a)}rd
  • Patent number: 9773318
    Abstract: A method of detecting camera defect includes: obtaining an image by a processing unit, the processing unit having a surface fit module, a subtraction module, and a peak quantification module; determining a first autocorrelation map for a first sub-region in the image; determining, using the surface fit module, a first surface fit for first scene content in the first sub-region; subtracting, using the subtraction module, the first surface fit from the first autocorrelation map for the first sub-region in the image to obtain a first residual map; and quantifying, using the peak quantification module, a first noise in the first residual map.
    Type: Grant
    Filed: October 2, 2015
    Date of Patent: September 26, 2017
    Assignee: Varian Medical Systems, Inc.
    Inventor: Hassan Mostafavi
  • Patent number: 9767829
    Abstract: A speech signal processing apparatus and a speech signal processing method for enhancing speech intelligibility are provided. The speech signal processing apparatus includes an input signal gain determiner to determine a gain of an input signal based on a harmonic characteristic of a voiced speech, a voiced speech output unit to output a voiced speech in which a harmonic component is preserved by applying the gain to the input signal, a linear predictive coefficient determiner to determine a linear predictive coefficient based on the voiced speech, and an unvoiced speech preserver to preserve an unvoiced speech of the input signal based on the linear predictive coefficient.
    Type: Grant
    Filed: July 10, 2014
    Date of Patent: September 19, 2017
    Assignees: Samsung Electronics Co., Ltd., Yonsei University Wonju Industry-Academic Cooperation Foundation
    Inventors: Jun Il Sohn, Yun Seo Ku, Dong Wook Kim, Young Cheol Park
  • Patent number: 9613620
    Abstract: A device may receive data indicative of a plurality of speech sounds associated with first voice characteristics of a first voice. The device may receive an input indicative of speech associated with second voice characteristics of a second voice. The device may map at least one portion of the speech of the second voice to one or more speech sounds of the plurality of speech sounds of the first voice. The device may compare the first voice characteristics with the second voice characteristics based on the map. The comparison may include vocal tract characteristics, nasal cavity characteristics, and voicing characteristics. The device may determine a given representation configured to associate the first voice characteristics with the second voice characteristics. The device may provide an output indicative of pronunciations of the one or more speech sounds of the first voice according to the second voice characteristics based on the given representation.
    Type: Grant
    Filed: February 25, 2015
    Date of Patent: April 4, 2017
    Assignee: Google Inc.
    Inventors: Ioannis Agiomyrgiannakis, Zoi Roupakia
  • Patent number: 9589284
    Abstract: Systems and methods for transmitting content to a client via a communication network are provided. According to one embodiment, an insertion server running within a firewall device of a network observes a content request of an application protocol by monitoring or proxying transport communication protocol connections established through the firewall device. The content request is (i) originated by a client device coupled to the network, (ii) directed to a destination device coupled to the network and (iii) associated with one of the multiple transport communication protocol connections. Responsive to observing the content request, the insertion server determines whether one or more conditions are satisfied. If so, the content request is negated by causing a canceling message of the transport communication protocol to be sent to the destination device and unsolicited content is selected and delivered to the client device via the application protocol.
    Type: Grant
    Filed: March 12, 2016
    Date of Patent: March 7, 2017
    Assignee: Fortinet, Inc.
    Inventors: Kunhua Lin, Michael Xie
  • Patent number: 9576501
    Abstract: In one aspect, a device includes a processor, a display accessible to the processor, and memory accessible to the processor. The memory bears instructions executable by the processor to provide sound corresponding to a portion of text presented on the display with at least one portion of the sound being provided as if originating at least substantially from a location on the display at which the portion of text is presented on the display.
    Type: Grant
    Filed: March 12, 2015
    Date of Patent: February 21, 2017
    Assignee: Lenovo (Singapore) Pte. Ltd.
    Inventor: Lucio Mitsuru Seki
  • Patent number: 9514738
    Abstract: A speech is recognized using ACF factors extracted from running autocorrelation functions calculated from the speech. The extracted ACF factors are a W?(0) (width of ACF amplitude around zero-delay origin), a W?(0)max (maximum value of the W?(0)), a ?1 (pitch period), a ?1 (pitch strength), and a ??1/?t (rate of the pitch strength change). Syllables in the speech are identified by comparing the ACF factors with templates stored in a database.
    Type: Grant
    Filed: November 13, 2012
    Date of Patent: December 6, 2016
    Assignees: Yoshimasa Electronic Inc.
    Inventor: Yoichi Ando
  • Patent number: 9460201
    Abstract: A method includes receiving an audio fingerprint from a listening device. The method also includes, in response to determining that a portion of a stored audio fingerprint substantially matches a portion of the received audio fingerprint, identifying a longest unordered match between the received audio fingerprint and the stored audio fingerprint that satisfies a similarity threshold. The method further includes, in response to determining that the identified longest unordered match satisfies a length criterion, detecting a match between the received audio fingerprint and the stored audio fingerprint.
    Type: Grant
    Filed: May 6, 2013
    Date of Patent: October 4, 2016
    Assignee: IHEARTMEDIA MANAGEMENT SERVICES, INC.
    Inventor: Dyon Anniballi
  • Patent number: 9445189
    Abstract: A noise suppressing apparatus that calculates a suppression coefficient for suppressing noise of an input signal by using a frequency spectrum of the input signal includes a frequency converting section that converts the input signal into a frequency spectrum; a noise level estimating section that calculates an estimated noise level of the input signal; a weight coefficient calculating section that calculates N (N is 2 or more) weight coefficients at predetermined intervals; and a suppression coefficient calculating section that calculates a joint distribution model of sound by weighting N statistical distribution models with the N weight coefficients, derives an estimation expression for a sound spectrum of the input signal on the basis of posteriori probability using the calculated joint distribution model of sound as priori probability, and calculates the suppression coefficient on the basis of the derived estimation expression and level of the input signal.
    Type: Grant
    Filed: December 10, 2014
    Date of Patent: September 13, 2016
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventor: Shinichi Yuzuriha
  • Patent number: 9401155
    Abstract: Vector Quantizer and method therein for efficient vector quantization, e.g. in a transform audio codec. The method comprises comparing an input target vector s with a plurality of centroids, each centroid representing a respective class of codevectors in a codebook. Further, a starting point for a search related to the input target vector in the codebook is determined, based on the result of the comparison. The codevectors in the codebook are sorted according to a distortion measure reflecting the distance between each codevector and the centroids of the classes. The Vector Quantizer and method enables that the class of codevectors comprising the most probable candidate codevectors in regard of the input vector. s may be searched first.
    Type: Grant
    Filed: December 12, 2012
    Date of Patent: July 26, 2016
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Volodya Grancharov, Tomas Jansson Toftgård
  • Patent number: 9373336
    Abstract: A method and device for performing audio recognition, including: collecting a first audio document to be recognized; initiating calculation of first characteristic information of the first audio document, including: conducting time-frequency analysis for the first audio document to generate a first preset number of phase channels; and extracting at least one peak value characteristic point from each phase channel of the first preset number of phrase channels, where the at least one peak value characteristic point of each phase channel constitutes the peak value characteristic point sequence of said each phase channel; and obtaining a recognition result for the first audio document, wherein the recognition result is identified based on the first characteristic information, and wherein the first characteristic information is calculated based on the respective peak value characteristic point sequences of the preset number of phase channels.
    Type: Grant
    Filed: December 11, 2013
    Date of Patent: June 21, 2016
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Hailong Liu, Dadong Xie, Jie Hou, Bin Xiao, Xiao Liu, Bo Chen
  • Patent number: 9361896
    Abstract: A selected channel of a multi-channel signal which is represented by frames composed from sampling values having a high time resolution can be encoded with higher quality when a wave form parameter representation representing a wave form of an intermediate resolution representation of the selected channel is derived, the wave form parameter representation including a sequence of intermediate wave form parameters having a time resolution lower than the high time resolution of the sampling values and higher than a time resolution defined by a frame repetition rate. The wave form parameter representation with the intermediate resolution can be used to shape a reconstructed channel to retrieve a channel having a signal envelope close to that one of the selected original channel. The time scale on which the shaping is performed is shorter than the time scale of a framewise processing, thus enhancing the quality of the reconstructed channel.
    Type: Grant
    Filed: January 9, 2014
    Date of Patent: June 7, 2016
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Juergen Herre, Matthias Neusinger, Dirk Jeroen Breebaart, Gerard Hotho
  • Patent number: 9330400
    Abstract: Aspects of the present disclosure include systems and methods for injecting content into a webpage at or local to a network access gateway. For example, in an embodiment, a network access gateway is provided for accessing the internet. A user logs onto the internet through the network access gateway and request a webpage. The gateway requests the webpage from the webpage provider. Before the gateway delivers the webpage to the user, the gateway, or a content injection engine local to the gateway, injects content, such as, for example, advertisements or other useful information, into the webpage and then sends the altered webpage to the user.
    Type: Grant
    Filed: October 17, 2014
    Date of Patent: May 3, 2016
    Assignee: NOMADIX, INC.
    Inventors: Balaji Pitchaikani, Eric Christopher Brusseau, Vadim Olshansky, Peter Matthew Feldman, Charles S. Zumbahlen, Elyas Manzur Salem
  • Patent number: 9311929
    Abstract: A speech analysis system uses one or more digital processors to reconstruct a speech signal by accurately extracting speech formants from a digitized version of the speech signal. The system extracts the formants by determining an estimated instantaneous frequency and an estimated instantaneous bandwidth of speech resonances of the digital version of the speech signal in real time. The system digitally filters the digital speech signal using a plurality of complex digital filters in parallel having overlapping bandwidths to ensure that substantially all of the bandwidth of the speech signal is covered. This virtual chain of overlapping complex digital filters produces a corresponding plurality of complex filtered signals. A first estimated frequency and a first estimated bandwidth is generated for each of the filtered signals, and speech resonances of the input speech signal are identified therefrom.
    Type: Grant
    Filed: October 31, 2012
    Date of Patent: April 12, 2016
    Assignee: Eliza Corporation
    Inventors: John P. Kroeker, Janet Slifka, Richard S. McGowan
  • Patent number: 9257131
    Abstract: A speech signal processing apparatus includes an amplitude and phase signal generation section that, based on an analyzing signal expressed by a complex signal generated from a speech signal applied with pitch marks every 1 pitch cycle, generates an amplitude signal and a phase signal on the time axis of the speech signal, a phase signal conversion section that converts the phase signal into a phase signal of a target pitch cycle width for each section of the 1 pitch cycle width based on the pitch marks, and a pitch conversion speech signal generation section that generates a speech signal in which pitch cycle is converted to the target pitch cycle based on an amplitude signal of the target pitch cycle width of a section corresponding to the section of the amplitude signal and based on a phase signal of the target pitch cycle width.
    Type: Grant
    Filed: October 30, 2013
    Date of Patent: February 9, 2016
    Assignee: FUJITSU LIMITED
    Inventor: Kazuhiro Watanabe
  • Patent number: 9078077
    Abstract: An approach to forming output signals both permits flexible and temporally and/or frequency local processing of input signals while limiting or mitigating artifacts in such output signals. Generally, the approach involves first synthesizing prototype signals for the output signals, or equivalently characterizing such prototypes, for example, according to their statistical characteristics, and then forming the output signals as estimates of the prototype signals, for example, as weighted combinations of the input signals.
    Type: Grant
    Filed: October 21, 2011
    Date of Patent: July 7, 2015
    Assignee: Bose Corporation
    Inventors: Paul B. Hultz, Tobe Barksdale, Michael Dublin, Luke C. Walters
  • Patent number: 9026435
    Abstract: The invention provides a method for estimating a fundamental frequency of a speech signal comprising the steps of receiving a signal spectrum of the speech signal, filtering the signal spectrum to obtain a refined signal spectrum, determining a cross-power spectral density using the refined signal spectrum and the signal spectrum, transforming the cross-power spectral density into the time domain to obtain a cross-correlation function, and estimating the fundamental frequency of the speech signal based on the cross-correlation function.
    Type: Grant
    Filed: May 3, 2010
    Date of Patent: May 5, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Mohamed Krini, Gerhard Schmidt
  • Patent number: 8990094
    Abstract: An electronic device for coding a transient frame is described. The electronic device includes a processor and executable instructions stored in memory that is in electronic communication with the processor. The electronic device obtains a current transient frame. The electronic device also obtains a residual signal based on the current transient frame. Additionally, the electronic device determines a set of peak locations based on the residual signal. The electronic device further determines whether to use a first coding mode or a second coding mode for coding the current transient frame based on at least the set of peak locations. The electronic device also synthesizes an excitation based on the first coding mode if the first coding mode is determined. The electronic device also synthesizes an excitation based on the second coding mode if the second coding mode is determined.
    Type: Grant
    Filed: September 8, 2011
    Date of Patent: March 24, 2015
    Assignee: QUALCOMM Incorporated
    Inventors: Venkatesh Krishnan, Ananthapadmanabhan Arasanipalai Kandhadai
  • Patent number: 8954324
    Abstract: Voice activity detection using multiple microphones can be based on a relationship between an energy at each of a speech reference microphone and a noise reference microphone. The energy output from each of the speech reference microphone and the noise reference microphone can be determined. A speech to noise energy ratio can be determined and compared to a predetermined voice activity threshold. In another embodiment, the absolute value of the autocorrelation of the speech and noise reference signals are determined and a ratio based on autocorrelation values is determined. Ratios that exceed the predetermined threshold can indicate the presence of a voice signal. The speech and noise energies or autocorrelations can be determined using a weighted average or over a discrete frame size.
    Type: Grant
    Filed: September 28, 2007
    Date of Patent: February 10, 2015
    Assignee: QUALCOMM Incorporated
    Inventors: Song Wang, Samir Kumar Gupta, Eddie L. T. Choy
  • Patent number: 8935164
    Abstract: A non-spatial speech detection system includes a plurality of microphones whose output is supplied to a fixed beamformer. An adaptive beamformer is used for receiving the output of the plurality of microphones and one or more processors are used for processing an output from the fixed beamformer and identifying speech from noise though the use of an algorithm utilizing a covariance matrix.
    Type: Grant
    Filed: May 2, 2012
    Date of Patent: January 13, 2015
    Assignee: Gentex Corporation
    Inventors: Robert R. Turnbull, Michael A. Bryson
  • Patent number: 8930200
    Abstract: A vector joint encoding/decoding method and a vector joint encoder/decoder are provided, more than two vectors are jointly encoded, and an encoding index of at least one vector is split and then combined between different vectors, so that encoding idle spaces of different vectors can be recombined, thereby facilitating saving of encoding bits, and because an encoding index of a vector is split and then shorter split indexes are recombined, thereby facilitating reduction of requirements for the bit width of operating parts in encoding/decoding calculation.
    Type: Grant
    Filed: July 24, 2013
    Date of Patent: January 6, 2015
    Assignee: Huawei Technologies Co., Ltd
    Inventors: Fuwei Ma, Dejun Zhang, Lei Miao, Fengyan Qi
  • Patent number: 8924200
    Abstract: A method for decoding an audio signal in a decoder having a CELP-based decoder element including a fixed codebook component, at least one pitch period value, and a first decoder output, wherein a bandwidth of the audio signal extends beyond a bandwidth of the CELP-based decoder element. The method includes obtaining an up-sampled fixed codebook signal by up-sampling the fixed codebook component to a higher sample rate, obtaining an up-sampled excitation signal based on the up-sampled fixed codebook signal and an up-sampled pitch period value, and obtaining a composite output signal based on the up-sampled excitation signal and an output signal of the CELP-based decoder element, wherein the composite output signal includes a bandwidth portion that extends beyond a bandwidth of the CELP-based decoder element.
    Type: Grant
    Filed: September 28, 2011
    Date of Patent: December 30, 2014
    Assignee: Motorola Mobility LLC
    Inventors: Jonathan A. Gibbs, James P. Ashley, Udar Mittal
  • Patent number: 8868432
    Abstract: A method for decoding an audio signal having a bandwidth that extends beyond a bandwidth of a CELP excitation signal in an audio decoder including a CELP-based decoder element. The method includes obtaining a second excitation signal having an audio bandwidth extending beyond the audio bandwidth of the CELP excitation signal, obtaining a set of signals by filtering the second excitation signal with a set of bandpass filters, scaling the set of signals using a set of energy-based parameters, and obtaining a composite output signal by combining the scaled set of signals with a signal based on the audio signal decoded by the CELP-based decoder element.
    Type: Grant
    Filed: September 28, 2011
    Date of Patent: October 21, 2014
    Assignee: Motorola Mobility LLC
    Inventors: Jonathan A. Gibbs, James P. Ashley, Udar Mittal
  • Patent number: 8762158
    Abstract: A method and apparatus for generating synthesis audio signals are provided. The method includes decoding a bitstream; splitting the decoded bitstream into n sub-band signals; generating n transformed sub-band signals by transforming the n sub-band signals in a frequency domain; and generating synthesis audio signals by respectively multiplying the n transformed sub-band signals by values corresponding to synthesis filter bank coefficients.
    Type: Grant
    Filed: August 5, 2011
    Date of Patent: June 24, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Hyun-wook Kim, Han-gil Moon, Sang-hoon Lee
  • Patent number: 8738370
    Abstract: A speech analyzer includes a speech acquiring section, a frequency converting section, an autocorrelation section, and a pitch detection section. The frequency converting section converts the speech signal acquired by the speech acquiring section into a frequency spectrum. The autocorrelation section determines an autocorrelation waveform by shifting the frequency spectrum along the frequency axis. The pitch detection section determines the pitch frequency from the distance between two local crests or troughs of the autocorrelation waveform.
    Type: Grant
    Filed: June 2, 2006
    Date of Patent: May 27, 2014
    Assignees: AGI Inc.
    Inventors: Shunji Mitsuyoshi, Kaoru Ogata, Fumiaki Monma
  • Patent number: 8731913
    Abstract: A method for overlap-adding signals useful for performing frame loss concealment (FLC) in an audio decoder as well as in other applications. The method uses a dynamic mix of windows to overlap two signals whose normalized cross-correlation may vary from zero to one. If the overlapping signals are decomposed into a correlated component and an uncorrelated component, they are overlap-added separately using the appropriate window, and then added together. If the overlapping signals are not decomposed, a weighted mix of windows is used. The mix is determined by a measure estimating the amount of cross-correlation between overlapping signals, or the relative amount of correlated to uncorrelated signals.
    Type: Grant
    Filed: April 13, 2007
    Date of Patent: May 20, 2014
    Assignee: Broadcom Corporation
    Inventors: Robert W. Zopf, Juin-Hwey Chen
  • Patent number: 8723700
    Abstract: The present invention discloses a method and a device for pulse encoding, and a method and a device for pulse decoding. The method for pulse encoding includes: calculating an index value of an input pulse; selecting an adjustment threshold value according to the number of pulses, and comparing the index value of the pulse with the adjustment threshold value; if the index value is smaller than the adjustment threshold value, adopting the first number of encoding bits to encode the index value, if the index value is not smaller than the adjustment threshold value, adopting the second number of encoding bits to encode the index value plus an offset value, where the first number is smaller than the second number, the first number and the second number are both positive integers, and the offset value is greater than or equal to the adjustment threshold value.
    Type: Grant
    Filed: December 14, 2011
    Date of Patent: May 13, 2014
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Fuwei Ma, Dejun Zhang, Minjie Xie, Qing Zhang
  • Patent number: 8670968
    Abstract: A method for training a ranking application. The method includes ranking the help postings to create an initial ranking using initial parameter values, and storing user interactions with the help postings to obtain stored interactions. Simulations are performed using the stored interactions to generate revised parameter values for the ranking application. Performing the simulations includes calculating relevance values from the stored interactions, creating a test posting, assigning, to the test posting, an initial score and a relevance value randomly selected from the relevance values to generate a test ranking, and simulating user interactions with the test ranking to generate simulated rankings. The simulated rankings are analyzed to obtain revised parameter values. The method further includes ranking, using the revised parameter values, the help postings to generate a revised ranking, and displaying the help postings in the forum according to the revised ranking.
    Type: Grant
    Filed: August 31, 2012
    Date of Patent: March 11, 2014
    Assignee: Intuit Inc.
    Inventors: Igor A. Podgorny, Floyd J. Morgan, Derek Szydlowski
  • Patent number: 8666752
    Abstract: Provided are an encoding apparatus and a decoding apparatus of a multi-channel signal. The encoding apparatus of the multi-channel signal may process a phase parameter associated with phase information between a plurality of channels constituting the multi-channel signal, based on a characteristic of the multi-channel signal. The encoding apparatus may generate an encoded bitstream with respect to the multi-channel signal using the processed phase parameter and a mono signal extracted from the multi-channel signal.
    Type: Grant
    Filed: March 17, 2010
    Date of Patent: March 4, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jung-Hoe Kim, Eun Mi Oh
  • Patent number: 8660842
    Abstract: Speech recognition device uses visual information to narrow down the range of likely adaptation parameters even before a speaker makes an utterance. Images of the speaker and/or the environment are collected using an image capturing device, and then processed to extract biometric features and environmental features. The extracted features and environmental features are then used to estimate adaptation parameters. A voice sample may also be collected to refine the adaptation parameters for more accurate speech recognition.
    Type: Grant
    Filed: March 9, 2010
    Date of Patent: February 25, 2014
    Assignee: Honda Motor Co., Ltd.
    Inventor: Antoine R. Raux
  • Patent number: 8583425
    Abstract: Methods, systems, and computer readable media for fricatives and high frequencies detection are disclosed. According to one method, the method includes receiving a narrowband signal. The method also includes detecting, using one or more autocorrelation coefficients, a high frequency speech component associated with the narrowband signal.
    Type: Grant
    Filed: June 21, 2011
    Date of Patent: November 12, 2013
    Assignee: Genband US LLC
    Inventors: Emmanuel Rossignol Thepie Fapi, Eric Poulin
  • Patent number: 8583427
    Abstract: A signal processing system which discriminates between voice signals and data signals modulated by a voiceband carrier. The signal processing system includes a voice exchange, a data exchange and a call discriminator. The voice exchange is capable of exchanging voice signals between a switched circuit network and a packet based network. The signal processing system also includes a data exchange capable of exchanging data signals modulated by a voiceband carrier on the switched circuit network with unmodulated data signal packets on the packet based network. The data exchange is performed by demodulating data signals from the switched circuit network for transmission on the packet based network, and modulating data signal packets from the packet based network for transmission on the switched circuit network. The call discriminator is used to selectively enable the voice exchange and data exchange.
    Type: Grant
    Filed: January 25, 2010
    Date of Patent: November 12, 2013
    Assignee: Broadcom Corporation
    Inventors: Onur Tackin, Scott Branden
  • Patent number: 8577673
    Abstract: In one embodiment, a method of receiving a decoded audio signal that has a transmitted pitch lag is disclosed. The method includes estimating pitch correlations of possible short pitch lags that are smaller than a minimum pitch limitation and have an approximated multiple relationship with the transmitted pitch lag, checking if one of the pitch correlations of the possible short pitch lags is large enough compared to a pitch correlation estimated with the transmitted pitch lag, and selecting a short pitch lag as a corrected pitch lag if a corresponding pitch correlation is large enough. The postprocessing is performed using the corrected pitch lag. In another embodiment, when the existence of irregular harmonics or wrong pitch lag is detected, a coded-excited linear prediction (CELP) postfilter is made more aggressive.
    Type: Grant
    Filed: September 15, 2009
    Date of Patent: November 5, 2013
    Assignee: Huawei Technologies Co., Ltd.
    Inventor: Yang Gao
  • Patent number: 8577045
    Abstract: An encoding apparatus comprises a frame processor (105) which receives a multi channel audio signal comprising at least a first audio signal from a first microphone (101) and a second audio signal from a second microphone (103). An ITD processor 107 then determines an inter time difference between the first audio signal and the second audio signal and a set of delays (109, 111) generates a compensated multi channel audio signal from the multi channel audio signal by delaying at least one of the first and second audio signals in response to the inter time difference signal. A combiner (113) then generates a mono signal by combining channels of the compensated multi channel audio signal and a mono signal encoder (115) encodes the mono signal. The inter time difference may specifically be determined by an algorithm based on determining cross correlations between the first and second audio signals.
    Type: Grant
    Filed: September 9, 2008
    Date of Patent: November 5, 2013
    Assignee: Motorola Mobility LLC
    Inventor: Jonathan A. Gibbs
  • Patent number: 8566085
    Abstract: The present disclosure relates to coding and decoding technologies, and discloses a preprocessing method, a preprocessing apparatus, and a coding device. The preprocessing method includes: obtaining characteristic information of a current frame signal; identifying whether the current frame signal requires no coding operation of removing LTC according to the characteristic information of the current frame signal and preset information; and if identifying that the current frame signal requires no coding operation of removing LTC, performing the coding operation of removing STC for the current frame signal; and if identifying that the current frame signal requires the coding operation of removing LTC, performing the coding operations of removing both LTC and STC for the current frame signal. Through the technical solution provided herein, the coding operation of removing LTC is performed for only part of the input frame signals.
    Type: Grant
    Filed: March 15, 2010
    Date of Patent: October 22, 2013
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Lei Miao, Fengyan Qi, Jianfeng Xu, Dejun Zhang, Qing Zhang
  • Patent number: 8566106
    Abstract: A method and device for searching an algebraic codebook during encoding of a sound signal, wherein the algebraic codebook comprises a set of codevectors formed of a number of pulse positions and a number of pulses distributed over the pulse positions. In the algebraic codebook searching method and device, a reference signal for use in searching the algebraic codebook is calculated. In a first stage, a position of a first pulse is determined in relation with the reference signal and among the number of pulse positions. In each of a number of stages subsequent to the first stage, (a) an algebraic codebook gain is recomputed, (b) the reference signal is updated using the recomputed algebraic codebook gain and (c) a position of another pulse is determined in relation with the updated reference signal and among the number of pulse positions.
    Type: Grant
    Filed: September 11, 2008
    Date of Patent: October 22, 2013
    Assignee: Voiceage Corporation
    Inventors: Redwan Salami, Vaclav Eksler, Milan Jelinek
  • Patent number: 8560313
    Abstract: A method of and system for transient noise rejection for improved speech recognition. The method comprises the steps of (a) receiving audio including user speech and at least some transient noise associated with the speech, (b) converting the received audio into digital data, (c) segmenting the digital data into acoustic frames, and (d) extracting acoustic feature vectors from the acoustic frames. The method also comprises the steps of (e) evaluating the acoustic frames for transient noise on a frame-by-frame basis, (f) rejecting those acoustic frames having transient noise, (g) accepting as speech frames those acoustic frames having no transient noise and, thereafter, (h) recognizing the user speech using the speech frames.
    Type: Grant
    Filed: May 13, 2010
    Date of Patent: October 15, 2013
    Assignee: General Motors LLC
    Inventors: Gaurav Talwar, Rathinavelu Chengalvarayan