Autocorrelation Patents (Class 704/217)

Very short pitch detection and coding

Patent number: 11894007

Abstract: A method includes detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in a time domain and detecting a lack of low frequency energy in the speech or audio signal in a frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.

Type: Grant

Filed: February 9, 2022

Date of Patent: February 6, 2024

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Yang Gao, Fengyan Qi
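
The time-domain half of the detection described in the abstract above relies on pitch correlations. The following is a minimal sketch of a normalized pitch correlation and a lag search, not the patented method: the function names, the test signal, and the lag range are illustrative assumptions, and the frequency-domain check for missing low-frequency energy is not reproduced.

```python
import math


def normalized_pitch_correlation(x, lag):
    """Normalized correlation between the signal and itself delayed by `lag` samples."""
    a = x[lag:]
    b = x[:len(x) - lag]
    num = sum(p * q for p, q in zip(a, b))
    den = math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))
    return num / den if den > 0 else 0.0


def best_lag(x, min_lag, max_lag):
    """Return the lag in [min_lag, max_lag] with the highest normalized correlation."""
    return max(range(min_lag, max_lag + 1),
               key=lambda lag: normalized_pitch_correlation(x, lag))


# Hypothetical test signal: a sinusoid with a period of 20 samples.
signal = [math.sin(2 * math.pi * n / 20) for n in range(200)]
lag = best_lag(signal, 10, 30)
```

Lowering `min_lag` in such a search is what would let a very short pitch lag be considered at all; how the patent decides that the short lag is genuine is not shown here.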
System and method for optimization of audio fingerprint search

Patent number: 11294955

Abstract: A system and method are presented for optimization of audio fingerprint search. In an embodiment, the audio fingerprints are organized into a recursive tree with different branches containing fingerprint sets that are dissimilar to each other. The tree is constructed using a clustering algorithm based on a similarity measure. The similarity measure may comprise a Hamming distance for a binary fingerprint or a Euclidean distance for continuous valued fingerprints. In another embodiment, each fingerprint is stored at a plurality of resolutions and clustering is performed hierarchically. The recognition of an incoming fingerprint begins from the root of the tree and proceeds down its branches until a match or mismatch is declared. In yet another embodiment, a fingerprint definition is generalized to include more detailed audio information than in the previous definition.

Type: Grant

Filed: April 8, 2019

Date of Patent: April 5, 2022

Inventors: Srinath Cheluvaraja, Ananth Nagaraja Iyer, Felix Immanuel Wyss
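
The abstract above names two similarity measures: a Hamming distance for binary fingerprints and a Euclidean distance for continuous-valued ones. A minimal sketch of both, assuming binary fingerprints are held as Python integers; the helper `nearest` is a hypothetical flat search and does not implement the recursive tree or the clustering.

```python
import math


def hamming_distance(a: int, b: int) -> int:
    """Number of differing bits between two binary fingerprints stored as integers."""
    return bin(a ^ b).count("1")


def euclidean_distance(x, y) -> float:
    """Euclidean distance between two equal-length continuous-valued fingerprints."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(x, y)))


def nearest(query, candidates, distance):
    """Return the candidate closest to the query under the given distance."""
    return min(candidates, key=lambda c: distance(query, c))


closest = nearest(0b1011, [0b0000, 0b1010, 0b0100], hamming_distance)
```

A tree search as described in the abstract would apply the same distance at each node to choose a branch rather than scanning every stored fingerprint.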
Dynamic service injection

Patent number: 11023672

Abstract: Features are disclosed for injection services that allow a development team to quickly and easily include functionality developed by other teams. The main application server injects functionality into responses. The injected service content may include executable content (e.g., scripts) which may be retrieved from a content distribution network. This provides a framework for integrating various, decoupled features into a single main application.

Type: Grant

Filed: January 29, 2018

Date of Patent: June 1, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Bogdan Ciprian Pistol, Samuel Edward Creed, Marek Jan Dec, Ulrich Geilmann, Afshin Khashei Varnamkhasti, Shonn Oleg Lyga, Nick Obradovic, Erik Shadwick, Gurvinder Singh, Ganna Topol, Sheng-Yuan Wang
Error correction code structure

Patent number: 11016844

Abstract: Various implementations described herein relate to systems and methods for encoding data having input bits to be stored in a non-volatile storage device, including mapping the input bits to a plurality of component codes of an error correction code (ECC) and encoding the input bits as the plurality of component codes, wherein first input bits of the input bits encoded by any of the plurality of component codes are encoded by every other component code of the plurality of component codes in a non-overlapping manner.

Type: Grant

Filed: March 15, 2019

Date of Patent: May 25, 2021

Assignee: Toshiba Memory Corporation

Inventors: Avi Steiner, Hanan Weingarten, Meir Nadam-Olegnowicz, Ofir Kanter, Amir Nassie
Training apparatus, speech synthesis system, and speech synthesis method

Patent number: 10957303

Abstract: A training apparatus includes an autoregressive model configured to estimate a current signal from a past signal sequence and a current context label, a vocal tract feature analyzer configured to analyze an input speech signal to determine a vocal tract filter coefficient representing a vocal tract feature, a residual signal generator configured to output a residual signal, a quantization unit configured to quantize the residual signal output from the residual signal generator to generate a quantized residual signal, and a training controller configured to provide as a condition, a context label of an already known input text for the input speech signal corresponding to the already known input text to the autoregressive model and to train the autoregressive model by bringing a past sequence of the quantized residual signals for the input speech signal and the current context label into correspondence with a current signal of the quantized residual signal.

Type: Grant

Filed: February 21, 2018

Date of Patent: March 23, 2021

Assignee: NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY

Inventors: Kentaro Tachibana, Tomoki Toda
Identification of audio signals in surrounding sounds and guidance of an autonomous vehicle in response to the same

Patent number: 10747231

Abstract: Embodiments include apparatuses, systems, and methods for a computer-aided or autonomous driving (CA/AD) system to identify and respond to an audio signal, e.g., an emergency alarm signal. In embodiments, the CA/AD driving system may include a plurality of microphones disposed to capture the audio signal included in surrounding sounds to a semi-autonomous or autonomous (SA/AD) vehicle. In embodiments, an audio analysis unit may receive the audio signal to extract audio features from the audio signal. In embodiments, a neural network such as a Deep Neural Network (DNN) may receive the extracted audio features from the audio analysis unit and to generate a probability score to allow identification of the audio signal. In embodiments, the CA/AD driving system may control driving elements of the SA/AD vehicle to autonomously or semi-autonomously drive the SA/AD vehicle in response to the identification. Other embodiments may also be described and claimed.

Type: Grant

Filed: November 17, 2017

Date of Patent: August 18, 2020

Assignee: Intel Corporation

Inventors: Sarang Akotkar, Mithil Ramteke, Tobias Bocklet, Sivasubramanian Sundaram
Encoder and method for encoding an audio signal with reduced background noise using linear predictive coding

Patent number: 10692510

Abstract: It is shown an encoder for encoding an audio signal with reduced background noise using linear predictive coding. The encoder includes a background noise estimator configured to estimate background noise of the audio signal, a background noise reducer configured to generate background noise reduced audio signal by subtracting the estimated background noise of the audio signal from the audio signal, and a predictor configured to subject the audio signal to linear prediction analysis to obtain a first set of linear prediction filter (LPC) coefficients and to subject the background noise reduced audio signal to linear prediction analysis to obtain a second set of linear prediction filter (LPC) coefficients. Furthermore, the encoder includes an analysis filter composed of a cascade of time-domain filters controlled by the obtained first set of LPC coefficients and the obtained second set of LPC coefficients.

Type: Grant

Filed: March 14, 2018

Date of Patent: June 23, 2020

Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Johannes Fischer, Tom Bäckström, Emma Jokinen
Techniques for phase modulated signals having poor autocorrelation

Patent number: 10666475

Abstract: An electronic transmitter includes: a modulator to phase modulate a carrier signal with a baseband signal using a phase modulation sequence; and an emitter to emit the phase modulated signal. The phase modulated signal has poor autocorrelation, has a corresponding mismatched filter based on the phase modulation sequence, and is configured to demodulate into the baseband signal through poor cross-correlation with the mismatched filter. Sometimes, the transmitter is part of a sensing apparatus, where the emitter emits the phase modulated signal at a target and the emitted signal reflects off the target. The sensing apparatus includes a receiver that has a collector to collect the reflected signal, and a demodulator to demodulate the collected signal into the baseband signal through the poor cross-correlation with the mismatched filter. Sometimes, the transmitter is part of a communication system, where the emitter emits the phase modulated signal to an intended recipient.

Type: Grant

Filed: October 29, 2018

Date of Patent: May 26, 2020

Assignee: BAE Systems Information and Electronic Systems Integration Inc.

Inventors: William D. Watson, Prabahan Basu, Jonathan P. Beaudeau, David J. Couto
Encoder, decoder, coding method, decoding method, coding program, decoding program and recording medium

Patent number: 10629214

Abstract: An encoder and a decoder are provided that are capable of reproducing a frequency-domain envelope sequence that provides high approximation accuracy around peaks caused by the pitch period of an audio signal by using a small amount of code. An encoder of the present invention comprises a periodic-combined-envelope generating part and a variable-length coding part. The periodic-combined-envelope generating part generates a periodic combined envelope sequence which is a frequency-domain sequence based on a spectral envelope sequence which is a frequency-domain sequence corresponding to a linear predictive coefficient code obtained from an input audio signal and on a frequency-domain period. The variable-length coding part encodes a frequency-domain sequence derived from the input audio signal. A decoder of the present invention comprises a periodic-combined-envelope generating part and a variable-length decoding part.

Type: Grant

Filed: November 26, 2018

Date of Patent: April 21, 2020

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
Encoder, decoder, coding method, decoding method, coding program, decoding program and recording medium

Patent number: 10607616

Abstract: An encoder and a decoder are provided that are capable of reproducing a frequency-domain envelope sequence that provides high approximation accuracy around peaks caused by the pitch period of an audio signal by using a small amount of code. An encoder of the present invention comprises a periodic-combined-envelope generating part and a variable-length coding part. The periodic-combined-envelope generating part generates a periodic combined envelope sequence which is a frequency-domain sequence based on a spectral envelope sequence which is a frequency-domain sequence corresponding to a linear predictive coefficient code obtained from an input audio signal and on a frequency-domain period. The variable-length coding part encodes a frequency-domain sequence derived from the input audio signal. A decoder of the present invention comprises a periodic-combined-envelope generating part and a variable-length decoding part.

Type: Grant

Filed: November 26, 2018

Date of Patent: March 31, 2020

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
Audio recognition apparatus and method

Patent number: 10475462

Abstract: A method includes generating, by a processor, an audio fingerprint representative of an audio signal. The audio fingerprint is based on a plurality of first intensity values corresponding to one or more segments of the audio signal. The plurality of first intensity values are based on a Fast Fourier Transform (FFT) performed on at least one sampled segment of the audio signal. The method also includes comparing a plurality of second intensity values based on a recorded sound to determine whether the second intensity values match the first intensity values. The method additionally includes causing a message to be communicated to a device used to record the sound based on a determination that the plurality of second intensity values match the plurality of first intensity values.

Type: Grant

Filed: November 8, 2017

Date of Patent: November 12, 2019

Assignee: PLAYFUSION LIMITED

Inventors: Riaan Hodgson, David Gomberg, Mark Gerhard
Signal processing method for determining audience rating of media, and additional information inserting apparatus, media reproducing apparatus and audience rating determining apparatus for performing the same method

Patent number: 10469907

Abstract: Provided are a signal processing method for determining an audience rating of media, and an additional information inserting apparatus, a media reproducing apparatus and an audience rating determining apparatus for performing the same method. In detail, the signal processing method for determining an audience rating of media is a method that may determine an audience rating of media with respect to a whole section of an audio signal by inserting additional information into a silence section through a noise signal.

Type: Grant

Filed: July 2, 2018

Date of Patent: November 5, 2019

Assignee: Electronics and Telecommunications Research Institute

Inventors: Young Ho Jeong, Seung Kwon Beack, Tae Jin Lee, Hui Yong Kim
Apparatus and method for improved concealment of the adaptive codebook in a CELP-like concealment employing improved pitch lag estimation

Patent number: 10381011

Abstract: An apparatus for determining an estimated pitch lag is provided. The apparatus includes an input interface for receiving a plurality of original pitch lag values, and a pitch lag estimator for estimating the estimated pitch lag. The pitch lag estimator is configured to estimate the estimated pitch lag depending on a plurality of original pitch lag values and depending on a plurality of information values, wherein for each original pitch lag value of the plurality of original pitch lag values, an information value of the plurality of information values is assigned to the original pitch lag value.

Type: Grant

Filed: December 21, 2015

Date of Patent: August 13, 2019

Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Jeremie Lecomte, Michael Schnabel, Goran Markovic, Martin Dietz, Bernhard Neugebauer
Apparatus, medium and method to encode and decode high frequency signal

Patent number: 10255928

Abstract: A method and apparatus for encoding or decoding an audio signal is provided. In the method and apparatus, a noise-floor level to use in encoding or decoding a high frequency signal is updated according to the degree of a voiced or unvoiced sound included in the signal.

Type: Grant

Filed: November 13, 2017

Date of Patent: April 9, 2019

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Ki-hyun Choo, Eun-mi Oh, Ho-sang Sung, Jung-hoe Kim, Mi-young Kim
Encoding method, decoding method, encoding apparatus, and decoding apparatus

Patent number: 10210880

Abstract: An encoding method, a decoding method, an encoding apparatus, a decoding apparatus, a transmitter, a receiver, and a communications system. The encoding method includes: dividing a to-be-encoded time-domain signal into a low band signal and a high band signal; performing encoding on the low band signal to obtain a low frequency encoding parameter; performing encoding on the high band signal to obtain a high frequency encoding parameter, and obtaining a synthesized high band signal; performing short-time post-filtering processing on the synthesized high band signal to obtain a short-time filtering signal; and calculating a high frequency gain based on the high band signal and the short-time filtering signal. A technical solution according to the embodiments of the present application can improve an encoding and/or decoding effect.

Type: Grant

Filed: August 15, 2017

Date of Patent: February 19, 2019

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Bin Wang, Zexin Liu, Lei Miao
Apparatus for encoding a speech signal employing ACELP in the autocorrelation domain

Patent number: 10170129

Abstract: An apparatus for encoding a speech signal by determining a codebook vector of a speech coding algorithm is provided. The apparatus includes a matrix determiner for determining an autocorrelation matrix R, and a codebook vector determiner for determining the codebook vector depending on the autocorrelation matrix R. The matrix determiner is configured to determine the autocorrelation matrix R by determining vector coefficients of a vector r, wherein the autocorrelation matrix R includes a plurality of rows and a plurality of columns, wherein the vector r indicates one of the columns or one of the rows of the autocorrelation matrix R, wherein R(i, j)=r(|i−j|), wherein R(i, j) indicates the coefficients of the autocorrelation matrix R, wherein i is a first index indicating one of a plurality of rows of the autocorrelation matrix R, and wherein j is a second index indicating one of the plurality of columns of the autocorrelation matrix R.

Type: Grant

Filed: April 3, 2015

Date of Patent: January 1, 2019

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Tom Baeckstroem, Markus Multrus, Guillaume Fuchs, Christian Helmrich, Martin Dietz
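
The abstract above defines the autocorrelation matrix R entirely through a single vector r, with each entry depending only on the absolute difference of its row and column indices (a symmetric Toeplitz matrix). A minimal sketch of that construction follows; the function names and the example sequence `h` are assumptions for illustration, and the codebook search itself is not shown.

```python
def autocorrelation_vector(h, n_lags):
    """r[k] = sum over n of h[n] * h[n + k], for k = 0 .. n_lags - 1."""
    return [sum(h[n] * h[n + k] for n in range(len(h) - k)) for k in range(n_lags)]


def toeplitz_from_vector(r):
    """Build R such that R[i][j] = r[|i - j|]."""
    n = len(r)
    return [[r[abs(i - j)] for j in range(n)] for i in range(n)]


# Hypothetical short sequence used only to exercise the construction.
h = [1.0, 0.5, 0.25]
r = autocorrelation_vector(h, 3)
R = toeplitz_from_vector(r)
```

Because every row of R is a shifted copy of r, only the vector r needs to be computed and stored.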
System and methods for continuous audio matching

Patent number: 10055490

Abstract: The present invention relates to the continuous monitoring of an audio signal and identification of audio items within an audio signal. The technology disclosed utilizes predictive caching of fingerprints to improve efficiency. Fingerprints are cached for tracking an audio signal with known alignment and for watching an audio signal without known alignment, based on already identified fingerprints extracted from the audio signal. Software running on a smart phone or other battery-powered device cooperates with software running on an audio identification server.

Type: Grant

Filed: June 14, 2016

Date of Patent: August 21, 2018

Assignee: SoundHound, Inc.

Inventors: Bernard Mont-Reynaud, Aaron Master, Timothy Stonehocker, Keyvan Mohajer
Noise suppressor

Patent number: 9978394

Abstract: Provided is a method, non-transitory computer program product and system for an improved noise suppression technique for speech enhancement. It operates on speech signals from a single or multiple input sources. Background noise monitoring is performed with one or multiple input speech signals to determine if the input speech contains active voice. If the absence of active voice is detected, the spectrum of the input speech is used to update a long-term noise spectrum estimate. In addition, the input from one or more secondary microphones can be used to update a short-term noise spectrum estimate. The input speech spectrum is then compared to the long-term and/or short-term noise spectra, and a selective spectrum gain based shaping is applied to the input speech spectrum to reduce noise.

Type: Grant

Filed: February 24, 2015

Date of Patent: May 22, 2018

Assignee: QOSOUND, INC.

Inventor: Huan-Yu Su
Network advertising system

Patent number: 9916603

Abstract: Systems and methods for transmitting content to a client via a communication network are provided. An insertion server, running within a firewall device associated with a private IP network, detects establishment of a transport communication protocol connection between a client associated with the network and a destination located external to the network by examining packets as they pass through the network and pass by the insertion server. A content request of an application protocol initiated by the client and directed to the destination is observed by the insertion server. The content request is negated by the insertion server by causing a canceling message of the transport communication protocol to be sent to the destination. Unsolicited content is caused to be selected for delivery to the client by the insertion server. The unsolicited content is sent by the insertion server to the client via the application protocol.

Type: Grant

Filed: August 24, 2016

Date of Patent: March 13, 2018

Assignee: Fortinet, Inc.

Inventors: Kunhua Lin, Michael Xie
Vector quantizer

Patent number: 9842601

Abstract: Vector Quantizer and method therein for efficient vector quantization, e.g. in a transform audio codec. The method comprises comparing an input target vector s with a plurality of centroids, each centroid representing a respective class of codevectors in a codebook. Further, a starting point for a search related to the input target vector in the codebook is determined, based on the result of the comparison. The codevectors in the codebook are sorted according to a distortion measure reflecting the distance between each codevector and the centroids of the classes. The Vector Quantizer and method enables that the class of codevectors comprising the most probable candidate codevectors in regard of the input vector s may be searched first.

Type: Grant

Filed: June 21, 2016

Date of Patent: December 12, 2017

Assignee: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)

Inventors: Volodya Grancharov, Tomas Jansson Toftgård
Systems and methods for detecting camera defect caused by exposure to radiation

Patent number: 9773318

Abstract: A method of detecting camera defect includes: obtaining an image by a processing unit, the processing unit having a surface fit module, a subtraction module, and a peak quantification module; determining a first autocorrelation map for a first sub-region in the image; determining, using the surface fit module, a first surface fit for first scene content in the first sub-region; subtracting, using the subtraction module, the first surface fit from the first autocorrelation map for the first sub-region in the image to obtain a first residual map; and quantifying, using the peak quantification module, a first noise in the first residual map.

Type: Grant

Filed: October 2, 2015

Date of Patent: September 26, 2017

Assignee: Varian Medical Systems, Inc.

Inventor: Hassan Mostafavi
Speech signal processing apparatus and method for enhancing speech intelligibility

Patent number: 9767829

Abstract: A speech signal processing apparatus and a speech signal processing method for enhancing speech intelligibility are provided. The speech signal processing apparatus includes an input signal gain determiner to determine a gain of an input signal based on a harmonic characteristic of a voiced speech, a voiced speech output unit to output a voiced speech in which a harmonic component is preserved by applying the gain to the input signal, a linear predictive coefficient determiner to determine a linear predictive coefficient based on the voiced speech, and an unvoiced speech preserver to preserve an unvoiced speech of the input signal based on the linear predictive coefficient.

Type: Grant

Filed: July 10, 2014

Date of Patent: September 19, 2017

Assignees: Samsung Electronics Co., Ltd., Yonsei University Wonju Industry-Academic Cooperation Foundation

Inventors: Jun Il Sohn, Yun Seo Ku, Dong Wook Kim, Young Cheol Park
Methods and systems for voice conversion

Patent number: 9613620

Abstract: A device may receive data indicative of a plurality of speech sounds associated with first voice characteristics of a first voice. The device may receive an input indicative of speech associated with second voice characteristics of a second voice. The device may map at least one portion of the speech of the second voice to one or more speech sounds of the plurality of speech sounds of the first voice. The device may compare the first voice characteristics with the second voice characteristics based on the map. The comparison may include vocal tract characteristics, nasal cavity characteristics, and voicing characteristics. The device may determine a given representation configured to associate the first voice characteristics with the second voice characteristics. The device may provide an output indicative of pronunciations of the one or more speech sounds of the first voice according to the second voice characteristics based on the given representation.

Type: Grant

Filed: February 25, 2015

Date of Patent: April 4, 2017

Assignee: Google Inc.

Inventors: Ioannis Agiomyrgiannakis, Zoi Roupakia
Network advertising system

Patent number: 9589284

Abstract: Systems and methods for transmitting content to a client via a communication network are provided. According to one embodiment, an insertion server running within a firewall device of a network observes a content request of an application protocol by monitoring or proxying transport communication protocol connections established through the firewall device. The content request is (i) originated by a client device coupled to the network, (ii) directed to a destination device coupled to the network and (iii) associated with one of the multiple transport communication protocol connections. Responsive to observing the content request, the insertion server determines whether one or more conditions are satisfied. If so, the content request is negated by causing a canceling message of the transport communication protocol to be sent to the destination device and unsolicited content is selected and delivered to the client device via the application protocol.

Type: Grant

Filed: March 12, 2016

Date of Patent: March 7, 2017

Assignee: Fortinet, Inc.

Inventors: Kunhua Lin, Michael Xie
Providing sound as originating from location of display at which corresponding text is presented

Patent number: 9576501

Abstract: In one aspect, a device includes a processor, a display accessible to the processor, and memory accessible to the processor. The memory bears instructions executable by the processor to provide sound corresponding to a portion of text presented on the display with at least one portion of the sound being provided as if originating at least substantially from a location on the display at which the portion of text is presented on the display.

Type: Grant

Filed: March 12, 2015

Date of Patent: February 21, 2017

Assignee: Lenovo (Singapore) Pte. Ltd.

Inventor: Lucio Mitsuru Seki
Method and device for recognizing speech

Patent number: 9514738

Abstract: A speech is recognized using ACF factors extracted from running autocorrelation functions calculated from the speech. The extracted ACF factors are a Wφ(0) (width of ACF amplitude around zero-delay origin), a Wφ(0)max (maximum value of the Wφ(0)), a τ1 (pitch period), a φ1 (pitch strength), and a Δφ1/Δt (rate of the pitch strength change). Syllables in the speech are identified by comparing the ACF factors with templates stored in a database.

Type: Grant

Filed: November 13, 2012

Date of Patent: December 6, 2016

Assignees: Yoshimasa Electronic Inc.

Inventor: Yoichi Ando
Unordered matching of audio fingerprints

Patent number: 9460201

Abstract: A method includes receiving an audio fingerprint from a listening device. The method also includes, in response to determining that a portion of a stored audio fingerprint substantially matches a portion of the received audio fingerprint, identifying a longest unordered match between the received audio fingerprint and the stored audio fingerprint that satisfies a similarity threshold. The method further includes, in response to determining that the identified longest unordered match satisfies a length criterion, detecting a match between the received audio fingerprint and the stored audio fingerprint.

Type: Grant

Filed: May 6, 2013

Date of Patent: October 4, 2016

Assignee: IHEARTMEDIA MANAGEMENT SERVICES, INC.

Inventor: Dyon Anniballi
Noise suppressing apparatus and noise suppressing method

Patent number: 9445189

Abstract: A noise suppressing apparatus that calculates a suppression coefficient for suppressing noise of an input signal by using a frequency spectrum of the input signal includes a frequency converting section that converts the input signal into a frequency spectrum; a noise level estimating section that calculates an estimated noise level of the input signal; a weight coefficient calculating section that calculates N (N is 2 or more) weight coefficients at predetermined intervals; and a suppression coefficient calculating section that calculates a joint distribution model of sound by weighting N statistical distribution models with the N weight coefficients, derives an estimation expression for a sound spectrum of the input signal on the basis of posteriori probability using the calculated joint distribution model of sound as priori probability, and calculates the suppression coefficient on the basis of the derived estimation expression and level of the input signal.

Type: Grant

Filed: December 10, 2014

Date of Patent: September 13, 2016

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventor: Shinichi Yuzuriha
Vector quantizer

Patent number: 9401155

Abstract: Vector Quantizer and method therein for efficient vector quantization, e.g. in a transform audio codec. The method comprises comparing an input target vector s with a plurality of centroids, each centroid representing a respective class of codevectors in a codebook. Further, a starting point for a search related to the input target vector in the codebook is determined, based on the result of the comparison. The codevectors in the codebook are sorted according to a distortion measure reflecting the distance between each codevector and the centroids of the classes. The Vector Quantizer and method enables that the class of codevectors comprising the most probable candidate codevectors in regard of the input vector s may be searched first.

Type: Grant

Filed: December 12, 2012

Date of Patent: July 26, 2016

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Volodya Grancharov, Tomas Jansson Toftgård
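
The abstract above describes comparing the target vector with class centroids so that the most probable class of codevectors is searched first. The following is a simplified sketch under stated assumptions: classes are held as separate lists rather than as one sorted codebook, squared Euclidean distance stands in for the distortion measure, and `max_classes` is a hypothetical parameter limiting how many classes are searched.

```python
def squared_distance(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def search_nearest_classes_first(target, classes, centroids, max_classes=1):
    """Search codevectors class by class, starting with the class whose centroid is closest.

    Returns (distortion, class_index, codevector) for the best match found.
    """
    order = sorted(range(len(centroids)),
                   key=lambda k: squared_distance(target, centroids[k]))
    best = None
    for k in order[:max_classes]:
        for codevector in classes[k]:
            d = squared_distance(target, codevector)
            if best is None or d < best[0]:
                best = (d, k, codevector)
    return best


# Hypothetical two-class codebook.
centroids = [(0, 0), (10, 10)]
classes = [[(0, 1), (1, 0), (-1, -1)], [(9, 10), (11, 10)]]
match = search_nearest_classes_first((0.9, 0.2), classes, centroids)
```

Restricting the search to the nearest class trades a small risk of missing the global best codevector for fewer distance computations.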
Method and device for audio recognition

Patent number: 9373336

Abstract: A method and device for performing audio recognition, including: collecting a first audio document to be recognized; initiating calculation of first characteristic information of the first audio document, including: conducting time-frequency analysis for the first audio document to generate a first preset number of phase channels; and extracting at least one peak value characteristic point from each phase channel of the first preset number of phase channels, where the at least one peak value characteristic point of each phase channel constitutes the peak value characteristic point sequence of said each phase channel; and obtaining a recognition result for the first audio document, wherein the recognition result is identified based on the first characteristic information, and wherein the first characteristic information is calculated based on the respective peak value characteristic point sequences of the preset number of phase channels.

Type: Grant

Filed: December 11, 2013

Date of Patent: June 21, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Hailong Liu, Dadong Xie, Jie Hou, Bin Xiao, Xiao Liu, Bo Chen
Temporal and spatial shaping of multi-channel audio signal

Patent number: 9361896

Abstract: A selected channel of a multi-channel signal which is represented by frames composed from sampling values having a high time resolution can be encoded with higher quality when a wave form parameter representation representing a wave form of an intermediate resolution representation of the selected channel is derived, the wave form parameter representation including a sequence of intermediate wave form parameters having a time resolution lower than the high time resolution of the sampling values and higher than a time resolution defined by a frame repetition rate. The wave form parameter representation with the intermediate resolution can be used to shape a reconstructed channel to retrieve a channel having a signal envelope close to that one of the selected original channel. The time scale on which the shaping is performed is shorter than the time scale of a framewise processing, thus enhancing the quality of the reconstructed channel.

Type: Grant

Filed: January 9, 2014

Date of Patent: June 7, 2016

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Disch, Juergen Herre, Matthias Neusinger, Dirk Jeroen Breebaart, Gerard Hotho
Systems and methods for injecting content

Patent number: 9330400

Abstract: Aspects of the present disclosure include systems and methods for injecting content into a webpage at or local to a network access gateway. For example, in an embodiment, a network access gateway is provided for accessing the internet. A user logs onto the internet through the network access gateway and requests a webpage. The gateway requests the webpage from the webpage provider. Before the gateway delivers the webpage to the user, the gateway, or a content injection engine local to the gateway, injects content, such as, for example, advertisements or other useful information, into the webpage and then sends the altered webpage to the user.

Type: Grant

Filed: October 17, 2014

Date of Patent: May 3, 2016

Assignee: NOMADIX, INC.

Inventors: Balaji Pitchaikani, Eric Christopher Brusseau, Vadim Olshansky, Peter Matthew Feldman, Charles S. Zumbahlen, Elyas Manzur Salem
Digital processor based complex acoustic resonance digital speech analysis system

Patent number: 9311929

Abstract: A speech analysis system uses one or more digital processors to reconstruct a speech signal by accurately extracting speech formants from a digitized version of the speech signal. The system extracts the formants by determining an estimated instantaneous frequency and an estimated instantaneous bandwidth of speech resonances of the digital version of the speech signal in real time. The system digitally filters the digital speech signal using a plurality of complex digital filters in parallel having overlapping bandwidths to ensure that substantially all of the bandwidth of the speech signal is covered. This virtual chain of overlapping complex digital filters produces a corresponding plurality of complex filtered signals. A first estimated frequency and a first estimated bandwidth are generated for each of the filtered signals, and speech resonances of the input speech signal are identified therefrom.

Type: Grant

Filed: October 31, 2012

Date of Patent: April 12, 2016

Assignee: Eliza Corporation

Inventors: John P. Kroeker, Janet Slifka, Richard S. McGowan
Speech signal processing apparatus and method

Patent number: 9257131

Abstract: A speech signal processing apparatus includes an amplitude and phase signal generation section that, based on an analyzing signal expressed by a complex signal generated from a speech signal applied with pitch marks every 1 pitch cycle, generates an amplitude signal and a phase signal on the time axis of the speech signal, a phase signal conversion section that converts the phase signal into a phase signal of a target pitch cycle width for each section of the 1 pitch cycle width based on the pitch marks, and a pitch conversion speech signal generation section that generates a speech signal in which pitch cycle is converted to the target pitch cycle based on an amplitude signal of the target pitch cycle width of a section corresponding to the section of the amplitude signal and based on a phase signal of the target pitch cycle width.

Type: Grant

Filed: October 30, 2013

Date of Patent: February 9, 2016

Assignee: FUJITSU LIMITED

Inventor: Kazuhiro Watanabe
Estimation of synthetic audio prototypes with frequency-based input signal decomposition

Patent number: 9078077

Abstract: An approach to forming output signals both permits flexible and temporally and/or frequency local processing of input signals while limiting or mitigating artifacts in such output signals. Generally, the approach involves first synthesizing prototype signals for the output signals, or equivalently characterizing such prototypes, for example, according to their statistical characteristics, and then forming the output signals as estimates of the prototype signals, for example, as weighted combinations of the input signals.

Type: Grant

Filed: October 21, 2011

Date of Patent: July 7, 2015

Assignee: Bose Corporation

Inventors: Paul B. Hultz, Tobe Barksdale, Michael Dublin, Luke C. Walters
Method for estimating a fundamental frequency of a speech signal

Patent number: 9026435

Abstract: The invention provides a method for estimating a fundamental frequency of a speech signal comprising the steps of receiving a signal spectrum of the speech signal, filtering the signal spectrum to obtain a refined signal spectrum, determining a cross-power spectral density using the refined signal spectrum and the signal spectrum, transforming the cross-power spectral density into the time domain to obtain a cross-correlation function, and estimating the fundamental frequency of the speech signal based on the cross-correlation function.

Type: Grant

Filed: May 3, 2010

Date of Patent: May 5, 2015

Assignee: Nuance Communications, Inc.

Inventors: Mohamed Krini, Gerhard Schmidt
Coding and decoding a transient frame

Patent number: 8990094

Abstract: An electronic device for coding a transient frame is described. The electronic device includes a processor and executable instructions stored in memory that is in electronic communication with the processor. The electronic device obtains a current transient frame. The electronic device also obtains a residual signal based on the current transient frame. Additionally, the electronic device determines a set of peak locations based on the residual signal. The electronic device further determines whether to use a first coding mode or a second coding mode for coding the current transient frame based on at least the set of peak locations. The electronic device also synthesizes an excitation based on the first coding mode if the first coding mode is determined. The electronic device also synthesizes an excitation based on the second coding mode if the second coding mode is determined.

Type: Grant

Filed: September 8, 2011

Date of Patent: March 24, 2015

Assignee: QUALCOMM Incorporated

Inventors: Venkatesh Krishnan, Ananthapadmanabhan Arasanipalai Kandhadai
Multiple microphone voice activity detector

Patent number: 8954324

Abstract: Voice activity detection using multiple microphones can be based on a relationship between an energy at each of a speech reference microphone and a noise reference microphone. The energy output from each of the speech reference microphone and the noise reference microphone can be determined. A speech to noise energy ratio can be determined and compared to a predetermined voice activity threshold. In another embodiment, the absolute values of the autocorrelations of the speech and noise reference signals are determined and a ratio based on the autocorrelation values is determined. Ratios that exceed the predetermined threshold can indicate the presence of a voice signal. The speech and noise energies or autocorrelations can be determined using a weighted average or over a discrete frame size.

Type: Grant

Filed: September 28, 2007

Date of Patent: February 10, 2015

Assignee: QUALCOMM Incorporated

Inventors: Song Wang, Samir Kumar Gupta, Eddie L. T. Choy
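
The energy-ratio test described in the abstract of patent 8954324 can be illustrated with a minimal sketch. This is not the patented implementation: the function names, the mean-square energy measure, and the default threshold of 2.0 are assumptions chosen for illustration, and the frames are assumed to be time-aligned samples from a speech reference microphone and a noise reference microphone.

```python
import numpy as np


def frame_energy(frame):
    """Mean squared amplitude of one frame of samples."""
    frame = np.asarray(frame, dtype=float)
    return float(np.mean(frame ** 2))


def is_voice(speech_frame, noise_frame, threshold=2.0, eps=1e-12):
    """Return True when the speech-to-noise energy ratio exceeds the threshold.

    `threshold` and `eps` are illustrative values, not taken from the patent.
    """
    ratio = frame_energy(speech_frame) / (frame_energy(noise_frame) + eps)
    return ratio > threshold
```

A frame pair in which the speech microphone carries far more energy than the noise microphone, for example `is_voice(np.ones(160), 0.1 * np.ones(160))`, is flagged as voice, while two frames of equal energy are not.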
Non-spatial speech detection system and method of using same

Patent number: 8935164

Abstract: A non-spatial speech detection system includes a plurality of microphones whose output is supplied to a fixed beamformer. An adaptive beamformer is used for receiving the output of the plurality of microphones and one or more processors are used for processing an output from the fixed beamformer and identifying speech from noise through the use of an algorithm utilizing a covariance matrix.

Type: Grant

Filed: May 2, 2012

Date of Patent: January 13, 2015

Assignee: Gentex Corporation

Inventors: Robert R. Turnbull, Michael A. Bryson
Vector joint encoding/decoding method and vector joint encoder/decoder

Patent number: 8930200

Abstract: A vector joint encoding/decoding method and a vector joint encoder/decoder are provided. More than two vectors are jointly encoded, and an encoding index of at least one vector is split and then combined between different vectors, so that encoding idle spaces of different vectors can be recombined, thereby facilitating saving of encoding bits. Because an encoding index of a vector is split and the shorter split indexes are then recombined, the requirements for the bit width of operating parts in encoding/decoding calculation are also reduced.

Type: Grant

Filed: July 24, 2013

Date of Patent: January 6, 2015

Assignee: Huawei Technologies Co., Ltd

Inventors: Fuwei Ma, Dejun Zhang, Lei Miao, Fengyan Qi
Audio signal bandwidth extension in CELP-based speech coder

Patent number: 8924200

Abstract: A method for decoding an audio signal in a decoder having a CELP-based decoder element including a fixed codebook component, at least one pitch period value, and a first decoder output, wherein a bandwidth of the audio signal extends beyond a bandwidth of the CELP-based decoder element. The method includes obtaining an up-sampled fixed codebook signal by up-sampling the fixed codebook component to a higher sample rate, obtaining an up-sampled excitation signal based on the up-sampled fixed codebook signal and an up-sampled pitch period value, and obtaining a composite output signal based on the up-sampled excitation signal and an output signal of the CELP-based decoder element, wherein the composite output signal includes a bandwidth portion that extends beyond a bandwidth of the CELP-based decoder element.

Type: Grant

Filed: September 28, 2011

Date of Patent: December 30, 2014

Assignee: Motorola Mobility LLC

Inventors: Jonathan A. Gibbs, James P. Ashley, Udar Mittal
Audio signal bandwidth extension in CELP-based speech coder

Patent number: 8868432

Abstract: A method for decoding an audio signal having a bandwidth that extends beyond a bandwidth of a CELP excitation signal in an audio decoder including a CELP-based decoder element. The method includes obtaining a second excitation signal having an audio bandwidth extending beyond the audio bandwidth of the CELP excitation signal, obtaining a set of signals by filtering the second excitation signal with a set of bandpass filters, scaling the set of signals using a set of energy-based parameters, and obtaining a composite output signal by combining the scaled set of signals with a signal based on the audio signal decoded by the CELP-based decoder element.

Type: Grant

Filed: September 28, 2011

Date of Patent: October 21, 2014

Assignee: Motorola Mobility LLC

Inventors: Jonathan A. Gibbs, James P. Ashley, Udar Mittal
Decoding method and decoding apparatus therefor

Patent number: 8762158

Abstract: A method and apparatus for generating synthesis audio signals are provided. The method includes decoding a bitstream; splitting the decoded bitstream into n sub-band signals; generating n transformed sub-band signals by transforming the n sub-band signals in a frequency domain; and generating synthesis audio signals by respectively multiplying the n transformed sub-band signals by values corresponding to synthesis filter bank coefficients.

Type: Grant

Filed: August 5, 2011

Date of Patent: June 24, 2014

Assignee: Samsung Electronics Co., Ltd.

Inventors: Hyun-wook Kim, Han-gil Moon, Sang-hoon Lee
Speech analyzer detecting pitch frequency, speech analyzing method, and speech analyzing program

Patent number: 8738370

Abstract: A speech analyzer includes a speech acquiring section, a frequency converting section, an autocorrelation section, and a pitch detection section. The frequency converting section converts the speech signal acquired by the speech acquiring section into a frequency spectrum. The autocorrelation section determines an autocorrelation waveform by shifting the frequency spectrum along the frequency axis. The pitch detection section determines the pitch frequency from the distance between two local crests or troughs of the autocorrelation waveform.

Type: Grant

Filed: June 2, 2006

Date of Patent: May 27, 2014

Assignees: AGI Inc.

Inventors: Shunji Mitsuyoshi, Kaoru Ogata, Fumiaki Monma
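
The idea in the abstract of patent 8738370, autocorrelating the frequency spectrum along the frequency axis so that the harmonic spacing shows up as a peak, can be sketched as follows. This is a simplified illustration rather than the patented method: instead of measuring the distance between two local crests or troughs, it picks the strongest autocorrelation peak inside an assumed search range. The function name, the Hann window, and the `fmin`/`fmax` defaults are all assumptions.

```python
import numpy as np


def estimate_pitch_from_spectrum(signal, sample_rate, fmin=60.0, fmax=500.0):
    """Estimate pitch (Hz) from the autocorrelation of the magnitude spectrum."""
    signal = np.asarray(signal, dtype=float)
    n = len(signal)

    # Frequency conversion: windowed magnitude spectrum, mean removed.
    spectrum = np.abs(np.fft.rfft(signal * np.hanning(n)))
    spectrum = spectrum - spectrum.mean()

    # Autocorrelation obtained by shifting the spectrum along the frequency axis.
    full = np.correlate(spectrum, spectrum, mode="full")
    autocorr = full[len(spectrum) - 1:]  # keep non-negative lags only

    # Search for the strongest peak among lags matching plausible pitches.
    bin_hz = sample_rate / n
    lo = max(1, int(np.floor(fmin / bin_hz)))
    hi = min(len(autocorr) - 1, int(np.ceil(fmax / bin_hz)))
    lag = lo + int(np.argmax(autocorr[lo:hi + 1]))
    return lag * bin_hz
```

Because the harmonics of a voiced sound are spaced by the fundamental frequency, the spectrum lines up with itself when shifted by that spacing, which is why the peak lag (converted from bins to Hz) approximates the pitch.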
Scaled window overlap add for mixed signals

Patent number: 8731913

Abstract: A method for overlap-adding signals useful for performing frame loss concealment (FLC) in an audio decoder as well as in other applications. The method uses a dynamic mix of windows to overlap two signals whose normalized cross-correlation may vary from zero to one. If the overlapping signals are decomposed into a correlated component and an uncorrelated component, they are overlap-added separately using the appropriate window, and then added together. If the overlapping signals are not decomposed, a weighted mix of windows is used. The mix is determined by a measure estimating the amount of cross-correlation between overlapping signals, or the relative amount of correlated to uncorrelated signals.

Type: Grant

Filed: April 13, 2007

Date of Patent: May 20, 2014

Assignee: Broadcom Corporation

Inventors: Robert W. Zopf, Juin-Hwey Chen
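
The weighted mix of windows mentioned in the abstract of patent 8731913 can be sketched under some assumptions. The sketch blends a linear cross-fade (amplitudes sum to one, suited to fully correlated signals) with a sine/cosine cross-fade (squared amplitudes sum to one, suited to uncorrelated signals), weighted by a correlation estimate between 0 and 1. The specific window shapes, the linear blending rule, and the function names are illustrative choices, not details taken from the patent.

```python
import numpy as np


def normalized_cross_correlation(a, b, eps=1e-12):
    """Normalized cross-correlation of two equal-length segments."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.dot(a, b) / (np.sqrt(np.dot(a, a) * np.dot(b, b)) + eps))


def mixed_overlap_add(tail, head, correlation):
    """Cross-fade from `tail` to `head` with a correlation-dependent window mix."""
    tail = np.asarray(tail, dtype=float)
    head = np.asarray(head, dtype=float)
    if tail.shape != head.shape:
        raise ValueError("tail and head must have the same length")
    c = float(np.clip(correlation, 0.0, 1.0))
    t = (np.arange(len(tail)) + 0.5) / len(tail)

    # Amplitude-complementary pair: fade_in + fade_out == 1.
    in_amp, out_amp = t, 1.0 - t
    # Power-complementary pair: fade_in**2 + fade_out**2 == 1.
    in_pow, out_pow = np.sin(0.5 * np.pi * t), np.cos(0.5 * np.pi * t)

    fade_in = c * in_amp + (1.0 - c) * in_pow
    fade_out = c * out_amp + (1.0 - c) * out_pow
    return fade_out * tail + fade_in * head
```

With `correlation=1.0` the cross-fade of a signal with itself returns the signal unchanged; with `correlation=0.0` the windows preserve power rather than amplitude across the overlap.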
Method and device for pulse encoding, method and device for pulse decoding

Patent number: 8723700

Abstract: The present invention discloses a method and a device for pulse encoding, and a method and a device for pulse decoding. The method for pulse encoding includes: calculating an index value of an input pulse; selecting an adjustment threshold value according to the number of pulses, and comparing the index value of the pulse with the adjustment threshold value; if the index value is smaller than the adjustment threshold value, adopting the first number of encoding bits to encode the index value, if the index value is not smaller than the adjustment threshold value, adopting the second number of encoding bits to encode the index value plus an offset value, where the first number is smaller than the second number, the first number and the second number are both positive integers, and the offset value is greater than or equal to the adjustment threshold value.

Type: Grant

Filed: December 14, 2011

Date of Patent: May 13, 2014

Assignee: Huawei Technologies Co., Ltd.

Inventors: Fuwei Ma, Dejun Zhang, Minjie Xie, Qing Zhang
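
The threshold-and-offset scheme in the abstract of patent 8723700 resembles truncated binary coding, which the sketch below uses as a stand-in. The patent selects its adjustment threshold according to the number of pulses; here, as an assumption, the threshold is derived from the total number of possible index values (`total`), and the offset is set equal to the threshold. The function names are hypothetical.

```python
def _parameters(total):
    """Return (short_bits, long_bits, threshold) for `total` index values."""
    if total < 2:
        raise ValueError("total must be at least 2")
    long_bits = (total - 1).bit_length()   # the "second number" of bits
    short_bits = long_bits - 1             # the "first number" of bits
    threshold = (1 << long_bits) - total   # indices below this use short codes
    return short_bits, long_bits, threshold


def encode_index(index, total):
    """Encode `index` in range(total) as a bit string."""
    if not 0 <= index < total:
        raise ValueError("index out of range")
    short_bits, long_bits, threshold = _parameters(total)
    if index < threshold:
        return format(index, "0{}b".format(short_bits))
    # Offset equals the threshold here, so long codes never collide
    # with the short codes when read as a prefix.
    return format(index + threshold, "0{}b".format(long_bits))


def decode_index(bits, total):
    """Decode one index from the front of `bits`; return (index, bits_used)."""
    short_bits, long_bits, threshold = _parameters(total)
    if threshold > 0:
        prefix = int(bits[:short_bits], 2)
        if prefix < threshold:
            return prefix, short_bits
    return int(bits[:long_bits], 2) - threshold, long_bits
```

For `total = 5`, indices 0 to 2 take two bits and indices 3 and 4 take three bits, which saves bits compared with a fixed three-bit code whenever the small indices occur.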
System and method for ranking a posting

Patent number: 8670968

Abstract: A method for training a ranking application. The method includes ranking the help postings to create an initial ranking using initial parameter values, and storing user interactions with the help postings to obtain stored interactions. Simulations are performed using the stored interactions to generate revised parameter values for the ranking application. Performing the simulations includes calculating relevance values from the stored interactions, creating a test posting, assigning, to the test posting, an initial score and a relevance value randomly selected from the relevance values to generate a test ranking, and simulating user interactions with the test ranking to generate simulated rankings. The simulated rankings are analyzed to obtain revised parameter values. The method further includes ranking, using the revised parameter values, the help postings to generate a revised ranking, and displaying the help postings in the forum according to the revised ranking.

Type: Grant

Filed: August 31, 2012

Date of Patent: March 11, 2014

Assignee: Intuit Inc.

Inventors: Igor A. Podgorny, Floyd J. Morgan, Derek Szydlowski
Apparatus and method for encoding and decoding multi-channel signal

Patent number: 8666752

Abstract: Provided are an encoding apparatus and a decoding apparatus of a multi-channel signal. The encoding apparatus of the multi-channel signal may process a phase parameter associated with phase information between a plurality of channels constituting the multi-channel signal, based on a characteristic of the multi-channel signal. The encoding apparatus may generate an encoded bitstream with respect to the multi-channel signal using the processed phase parameter and a mono signal extracted from the multi-channel signal.

Type: Grant

Filed: March 17, 2010

Date of Patent: March 4, 2014

Assignee: Samsung Electronics Co., Ltd.

Inventors: Jung-Hoe Kim, Eun Mi Oh
Enhancing speech recognition using visual information

Patent number: 8660842

Abstract: A speech recognition device uses visual information to narrow down the range of likely adaptation parameters even before a speaker makes an utterance. Images of the speaker and/or the environment are collected using an image capturing device, and then processed to extract biometric features and environmental features. The extracted biometric features and environmental features are then used to estimate adaptation parameters. A voice sample may also be collected to refine the adaptation parameters for more accurate speech recognition.

Type: Grant

Filed: March 9, 2010

Date of Patent: February 25, 2014

Assignee: Honda Motor Co., Ltd.

Inventor: Antoine R. Raux
Methods, systems, and computer readable media for fricatives and high frequencies detection

Patent number: 8583425

Abstract: Methods, systems, and computer readable media for fricatives and high frequencies detection are disclosed. According to one method, the method includes receiving a narrowband signal. The method also includes detecting, using one or more autocorrelation coefficients, a high frequency speech component associated with the narrowband signal.

Type: Grant

Filed: June 21, 2011

Date of Patent: November 12, 2013

Assignee: Genband US LLC

Inventors: Emmanuel Rossignol Thepie Fapi, Eric Poulin
