Patents by Inventor Jan Skoglund

Jan Skoglund has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

CODING OF A SOUNDFIELD REPRESENTATION

Publication number: 20180218740

Abstract: A method includes: receiving a representation of a soundfield, the representation characterizing the soundfield around a point in space; decomposing the received representation into independent signals; and encoding the independent signals, wherein a quantization noise for any of the independent signals has a common spatial profile with the independent signal.

Type: Application

Filed: January 27, 2017

Publication date: August 2, 2018

Inventors: Willem Bastiaan Kleijn, Jan Skoglund, Sze Chie Lim
Incoherent idempotent ambisonics rendering

Patent number: 10015618

Abstract: Techniques of rendering sound for a listener involve producing, as the amplitude of each of the source driving signals, a sum of two terms: a first term based on a solution s† to the equation b=A·s, and a second term based on a projection of a specified vector ? onto the nullspace of A, ? not being a solution to the equation b=A·s. Along these lines, in one example, the first term is equivalent to a Moore-Penrose pseudoinverse, e.g., AH(AAH)?1·b. In general, any solution to the equation b=A·s is satisfactory. The specified vector that is projected onto the nullspace of A is defined to reduce the coherence of the net sound field. Advantageously, the resulting operator is both linear time-invariant and idempotent so that the sound field may be faithfully reproduce both inside the RSF and at a sufficient range outside the RSF to cover a human head.

Type: Grant

Filed: August 1, 2017

Date of Patent: July 3, 2018

Assignee: GOOGLE LLC

Inventors: Willem Bastiaan Kleijn, Andrew Allen, Jan Skoglund, Sze Chie Lim
ECHO CANCELLATION FOR KEYWORD SPOTTING

Publication number: 20180174598

Abstract: Techniques of performing linear acoustic echo cancellation performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.

Type: Application

Filed: December 18, 2017

Publication date: June 21, 2018

Inventors: Turaj Zakizadeh Shabestary, Willem Bastiaan Kleijn, Jan Skoglund
PROJECTION-BASED AUDIO CODING

Publication number: 20180124540

Abstract: Techniques of performing ambisonic coding involve coupling channels of a high-order ambisonics (HOA) signal using a projection matrix based on positions of a set of loudspeakers on a unit sphere to form a projected HOA signal. Each pair of components of the projected HOA signal may then be encoded into a stereo format. In some arrangements, the projection matrix may be based on a decoding or demixing matrix that is in turn based on spherical harmonics evaluated at specified loudspeaker positions. In this way, the encoding efficiency (e.g., bitrate for a given sound quality) is improved over the conventional approaches to performing ambisonic coding.

Type: Application

Filed: October 31, 2017

Publication date: May 3, 2018

Inventors: Jan SKOGLUND, Michael GRACZYK
Acoustic keystroke transient canceler for speech communication terminals using a semi-blind adaptive filter model

Patent number: 9881630

Abstract: Provided are methods and systems for acoustic keystroke transient cancellation/suppression for user communication devices using a semi-blind adaptive filter model. The methods and systems are designed to overcome existing problems in transient noise suppression by taking into account some less-defective signal as side information on the transients and also accounting for acoustic signal propagation, including the reverberation effects, using dynamic models. The methods and systems take advantage of a synchronous reference microphone embedded in the keyboard of the user device, and utilize an adaptive filtering approach exploiting the knowledge of this keybed microphone signal.

Type: Grant

Filed: December 30, 2015

Date of Patent: January 30, 2018

Assignee: GOOGLE LLC

Inventors: Herbert Buchner, Simon J. Godsill, Jan Skoglund
BI-MAGNITUDE PROCESSING FRAMEWORK FOR NONLINEAR ECHO CANCELLATION IN MOBILE DEVICES

Publication number: 20180007482

Abstract: Techniques of performing acoustic echo cancellation involve providing a bi-magnitude filtering operation that performs a first filtering operation when a magnitude of an incoming audio signal to be output from a loudspeaker is less than a specified threshold and a second filtering operation when the magnitude of the incoming audio signal is greater than the threshold. The first filtering operation may take the form of a convolution between the incoming audio signal and a first impulse response function. The second filtering operation may take the form of a convolution between a nonlinear function of the incoming audio signal and a second impulse response function. For such a convolution, the bi-magnitude filtering operation involves providing, as the incoming audio signal, samples of the incoming audio signal over a specified window of time. The first and second impulse response functions may be determined from an input signal input into a microphone.

Type: Application

Filed: June 30, 2017

Publication date: January 4, 2018

Inventors: Jan Skoglund, Yiteng Huang, Alejandro Luebs
Reverberation estimator

Patent number: 9799322

Abstract: Provided are methods and systems for generating Direct-to-Reverberant Ratio (DRR) estimates. The methods and systems use a null-steered beamformer to produce accurate DRR estimates across a variety of room sizes, reverberation times, and source-receiver distances. The DRR estimation algorithm uses spatial selectivity to separate direct and reverberant energy and account for noise separately. The formulation considers the response of the beamformer to reverberant sound and the effect of noise. The DRR estimation algorithm is more robust to background noise than existing approaches, and is applicable where a signal is recorded with two or more microphones, such as with mobile communications devices, laptop computers, and the like.

Type: Grant

Filed: October 22, 2014

Date of Patent: October 24, 2017

Assignee: Google Inc.

Inventors: D. James Eaton, Alastair H. Moore, Patrick A. Naylor, Jan Skoglund
GLOBALLY OPTIMIZED LEAST-SQUARES POST-FILTERING FOR SPEECH ENHANCEMENT

Publication number: 20170221502

Abstract: Existing post-filtering methods for microphone array speech enhancement have two common deficiencies. First, they assume that noise is either white or diffuse and cannot deal with point interferers. Second, they estimate the post-filter coefficients using only two microphones at a time, performing averaging over all the microphones pairs, yielding a suboptimal solution. The provided method describes a post-filtering solution that implements signal models which handle white noise, diffuse noise, and point interferers. The method also implements a globally optimized least-squares approach of microphones in a microphone array, providing a more optimal solution than existing conventional methods. Experimental results demonstrate the described method outperforming conventional methods in various acoustic scenarios.

Type: Application

Filed: February 3, 2016

Publication date: August 3, 2017

Applicant: Google Inc.

Inventors: Yiteng HUANG, Alejandro LUEBS, Jan SKOGLUND, Willem Bastiaan KLEIJN
Globally optimized least-squares post-filtering for speech enhancement

Patent number: 9721582

Abstract: Existing post-filtering methods for microphone array speech enhancement have two common deficiencies. First, they assume that noise is either white or diffuse and cannot deal with point interferers. Second, they estimate the post-filter coefficients using only two microphones at a time, performing averaging over all the microphones pairs, yielding a suboptimal solution. The provided method describes a post-filtering solution that implements signal models which handle white noise, diffuse noise, and point interferers. The method also implements a globally optimized least-squares approach of microphones in a microphone array, providing a more optimal solution than existing conventional methods. Experimental results demonstrate the described method outperforming conventional methods in various acoustic scenarios.

Type: Grant

Filed: February 3, 2016

Date of Patent: August 1, 2017

Assignee: GOOGLE INC.

Inventors: Yiteng Huang, Alejandro Luebs, Jan Skoglund, Willem Bastiaan Kleijn
Situation dependent transient suppression

Patent number: 9721580

Abstract: Provided are methods and systems for providing situation-dependent transient noise suppression for audio signals. Different strategies (e.g., levels of aggressiveness) of transient suppression and signal restoration are applied to audio signals associated with participants in a video/audio conference depending on whether or not each participant is speaking (e.g., whether a voiced segment or an unvoiced/non-speech segment of audio is present). If no participants are speaking or there is an unvoiced/non-speech sound present, a more aggressive strategy for transient suppression and signal restoration is utilized. On the other hand, where voiced audio is detected (e.g., a participant is speaking), the methods and systems apply a softer, less aggressive suppression and restoration process.

Type: Grant

Filed: March 31, 2014

Date of Patent: August 1, 2017

Assignee: Google Inc.

Inventors: Jan Skoglund, Alejandro Luebs
ACOUSTIC KEYSTROKE TRANSIENT CANCELER FOR SPEECH COMMUNICATION TERMINALS USING A SEMI-BLIND ADAPTIVE FILTER MODEL

Publication number: 20170194015

Abstract: Provided are methods and systems for acoustic keystroke transient cancellation/suppression for user communication devices using a semi-blind adaptive filter model. The methods and systems are designed to overcome existing problems in transient noise suppression by taking into account some less-defective signal as side information on the transients and also accounting for acoustic signal propagation, including the reverberation effects, using dynamic models. The methods and systems take advantage of a synchronous reference microphone embedded in the keyboard of the user device, and utilize an adaptive filtering approach exploiting the knowledge of this keybed microphone signal.

Type: Application

Filed: December 30, 2015

Publication date: July 6, 2017

Applicant: GOOGLE INC.

Inventors: Herbert BUCHNER, Simon J. GODSILL, Jan SKOGLUND
Objective speech quality metric

Patent number: 9524733

Abstract: Methods and systems are provided for using a model of human speech quality perception to provide an objective measure for predicting subjective quality assessments. A Virtual Speech Quality Objective Listener (ViSQOL) model is a signal-based full-reference metric that uses a spectro-temporal measure of similarity between a reference signal and test speech signal. Specifically, the model provides for the ability to detect and predict the level of clock drift, and determine whether such clock drift will impact a listener's quality of experience.

Type: Grant

Filed: May 10, 2013

Date of Patent: December 20, 2016

Assignee: Google Inc.

Inventors: Jan Skoglund, Andrew J. Hines, Noami A. Harte, Anil Kokaram
Keyboard typing detection and suppression

Patent number: 9520141

Abstract: Provided are methods and systems for detecting the presence of a transient noise event in an audio stream using primarily or exclusively the incoming audio data. Such an approach offers improved temporal resolution and is computationally efficient. The methods and systems presented utilize some time-frequency representation of an audio signal as the basis in a predictive model in an attempt to find outlying transient noise events and interpret the true detection state as a Hidden Markov Model (HMM) to model temporal and frequency cohesion common amongst transient noise events.

Type: Grant

Filed: February 28, 2013

Date of Patent: December 13, 2016

Assignee: GOOGLE INC.

Inventors: Jens Enzo Nyby Christensen, Simon J. Godsill, Jan Skoglund
HIERARCHICAL DECORRELATION OF MULTICHANNEL AUDIO

Publication number: 20160293176

Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.

Type: Application

Filed: June 15, 2016

Publication date: October 6, 2016

Applicant: GOOGLE INC.

Inventors: Minyue LI, Jan SKOGLUND, Willem Bastiaan KLEIJN
Audio clipping detection

Patent number: 9426592

Abstract: Methods and systems for detecting the presence and frequency of clipping in an audio signal are provided. A clipping detection algorithm detects the presence of hard and soft clipping using histograms with intervals of samples, rather than attempting to identify the clipping value. Therefore, it is not essential to the algorithm that there be a large number of bins. Furthermore, the bins may be non-uniformly distributed since the number of samples belonging to lower amplitudes is of little importance. The detection algorithm is also configured to determine the severity and/or perceptual effect of any clipping found to be present in the signal by calculating the ratio of clipped samples to non-clipped samples. Temporal information on the occurrence of clipping in the signal is also used to evaluate perceptual effect.

Type: Grant

Filed: February 14, 2013

Date of Patent: August 23, 2016

Assignee: GOOGLE INC.

Inventors: Jan Skoglund, Jan Thomas Linden
Hierarchical deccorelation of multichannel audio

Patent number: 9396732

Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.

Type: Grant

Filed: October 18, 2012

Date of Patent: July 19, 2016

Assignee: GOOGLE INC.

Inventors: Minyue Li, Willem Bastiaan Kleijn, Jan Skoglund
DETECTION AND SUPPRESSION OF KEYBOARD TRANSIENT NOISE IN AUDIO STREAMS WITH AUXILIARY KEYBED MICROPHONE

Publication number: 20160196833

Abstract: Provided are methods and systems for enhancing speech when corrupted by transient noise (e.g., keyboard typing noise). The methods and systems utilize a reference microphone input signal for the transient noise in a signal restoration process used for the voice part of the signal. A robust Bayesian statistical model is used to regress the voice microphone on the reference microphone, which allows for direct inference about the desired voice signal while marginalizing the unwanted power spectral values of the voice and transient noise. Also provided is a straightforward and efficient Expectation-maximization (EM) procedure for fast enhancement of the corrupted signal. The methods and systems are designed to operate easily in real-time on standard hardware, and have very low latency so that there is no irritating delay in speaker response.

Type: Application

Filed: January 7, 2015

Publication date: July 7, 2016

Applicant: GOOGLE INC.

Inventors: Simon J. GODSILL, Herbert BUCHNER, Jan SKOGLUND
Rearrangement and rate allocation for compressing multichannel audio

Patent number: 9336791

Abstract: Provided are methods and systems for rearranging a multichannel audio signal into sub-signals and allocating bit rates among them, such that compressing the sub-signals with a set of audio codecs at the allocated bit rates yields an optimal fidelity with respect to the original multichannel audio signal. Rearranging the multichannel audio signal into sub-signals and assigning each sub-signal a bit rate may be optimized according to a criterion. Existing audio codecs may be used to quantize the sub-signals at the assigned bit rates and the compressed sub-signals may be combined into the original format according to the manner in which the original multichannel audio signal is rearranged.

Type: Grant

Filed: January 24, 2013

Date of Patent: May 10, 2016

Assignee: GOOGLE INC.

Inventors: Minyue Li, Jan Skoglund, Willem Bastiaan Kleijn
REVERBERATION ESTIMATOR

Publication number: 20160118038

Abstract: Provided are methods and systems for generating Direct-to-Reverberant Ratio (DRR) estimates. The methods and systems use a null-steered beamformer to produce accurate DRR estimates across a variety of room sizes, reverberation times, and source-receiver distances. The DRR estimation algorithm uses spatial selectivity to separate direct and reverberant energy and account for noise separately. The formulation considers the response of the beamformer to reverberant sound and the effect of noise. The DRR estimation algorithm is more robust to background noise than existing approaches, and is applicable where a signal is recorded with two or more microphones, such as with mobile communications devices, laptop computers, and the like.

Type: Application

Filed: October 22, 2014

Publication date: April 28, 2016

Applicant: GOOGLE INC.

Inventors: D. James EATON, Alastair H. MOORE, Patrick A. NAYLOR, Jan SKOGLUND
Detection of chopped speech

Patent number: 9263061

Abstract: Methods and systems are provided for detecting chop in an audio signal. A time-frequency representation, such as a spectrogram, is created for an audio signal and used to calculate a gradient of mean power per frame of the audio signal. Positive and negative gradients are defined for the signal based on the gradient of mean power, and a maximum overlap offset between the positive and negative gradients is determined by calculating a value that maximizes the cross-correlation of the positive and negative gradients. The negative gradient values may be combined (e.g., summed) with the overlap offset, and the combined values then compared with a threshold to estimate the amount of chop present in the audio signal. The chop detection model provided is low-complexity and is applicable to narrowband, wideband, and superwideband speech.

Type: Grant

Filed: May 21, 2013

Date of Patent: February 16, 2016

Assignee: GOOGLE INC.

Inventors: Andrew J. Hines, Jan Skoglund, Naomi Harte, Anil Kokaram

prev 1 2 3 4 next