Patents by Inventor Jan Skoglund

Jan Skoglund has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20180218740
    Abstract: A method includes: receiving a representation of a soundfield, the representation characterizing the soundfield around a point in space; decomposing the received representation into independent signals; and encoding the independent signals, wherein a quantization noise for any of the independent signals has a common spatial profile with the independent signal.
    Type: Application
    Filed: January 27, 2017
    Publication date: August 2, 2018
    Inventors: Willem Bastiaan Kleijn, Jan Skoglund, Sze Chie Lim
  • Patent number: 10015618
    Abstract: Techniques of rendering sound for a listener involve producing, as the amplitude of each of the source driving signals, a sum of two terms: a first term based on a solution s† to the equation b=A·s, and a second term based on a projection of a specified vector ? onto the nullspace of A, ? not being a solution to the equation b=A·s. Along these lines, in one example, the first term is equivalent to a Moore-Penrose pseudoinverse, e.g., AH(AAH)?1·b. In general, any solution to the equation b=A·s is satisfactory. The specified vector that is projected onto the nullspace of A is defined to reduce the coherence of the net sound field. Advantageously, the resulting operator is both linear time-invariant and idempotent so that the sound field may be faithfully reproduce both inside the RSF and at a sufficient range outside the RSF to cover a human head.
    Type: Grant
    Filed: August 1, 2017
    Date of Patent: July 3, 2018
    Assignee: GOOGLE LLC
    Inventors: Willem Bastiaan Kleijn, Andrew Allen, Jan Skoglund, Sze Chie Lim
  • Publication number: 20180174598
    Abstract: Techniques of performing linear acoustic echo cancellation performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.
    Type: Application
    Filed: December 18, 2017
    Publication date: June 21, 2018
    Inventors: Turaj Zakizadeh Shabestary, Willem Bastiaan Kleijn, Jan Skoglund
  • Publication number: 20180124540
    Abstract: Techniques of performing ambisonic coding involve coupling channels of a high-order ambisonics (HOA) signal using a projection matrix based on positions of a set of loudspeakers on a unit sphere to form a projected HOA signal. Each pair of components of the projected HOA signal may then be encoded into a stereo format. In some arrangements, the projection matrix may be based on a decoding or demixing matrix that is in turn based on spherical harmonics evaluated at specified loudspeaker positions. In this way, the encoding efficiency (e.g., bitrate for a given sound quality) is improved over the conventional approaches to performing ambisonic coding.
    Type: Application
    Filed: October 31, 2017
    Publication date: May 3, 2018
    Inventors: Jan SKOGLUND, Michael GRACZYK
  • Patent number: 9881630
    Abstract: Provided are methods and systems for acoustic keystroke transient cancellation/suppression for user communication devices using a semi-blind adaptive filter model. The methods and systems are designed to overcome existing problems in transient noise suppression by taking into account some less-defective signal as side information on the transients and also accounting for acoustic signal propagation, including the reverberation effects, using dynamic models. The methods and systems take advantage of a synchronous reference microphone embedded in the keyboard of the user device, and utilize an adaptive filtering approach exploiting the knowledge of this keybed microphone signal.
    Type: Grant
    Filed: December 30, 2015
    Date of Patent: January 30, 2018
    Assignee: GOOGLE LLC
    Inventors: Herbert Buchner, Simon J. Godsill, Jan Skoglund
  • Publication number: 20180007482
    Abstract: Techniques of performing acoustic echo cancellation involve providing a bi-magnitude filtering operation that performs a first filtering operation when a magnitude of an incoming audio signal to be output from a loudspeaker is less than a specified threshold and a second filtering operation when the magnitude of the incoming audio signal is greater than the threshold. The first filtering operation may take the form of a convolution between the incoming audio signal and a first impulse response function. The second filtering operation may take the form of a convolution between a nonlinear function of the incoming audio signal and a second impulse response function. For such a convolution, the bi-magnitude filtering operation involves providing, as the incoming audio signal, samples of the incoming audio signal over a specified window of time. The first and second impulse response functions may be determined from an input signal input into a microphone.
    Type: Application
    Filed: June 30, 2017
    Publication date: January 4, 2018
    Inventors: Jan Skoglund, Yiteng Huang, Alejandro Luebs
  • Patent number: 9799322
    Abstract: Provided are methods and systems for generating Direct-to-Reverberant Ratio (DRR) estimates. The methods and systems use a null-steered beamformer to produce accurate DRR estimates across a variety of room sizes, reverberation times, and source-receiver distances. The DRR estimation algorithm uses spatial selectivity to separate direct and reverberant energy and account for noise separately. The formulation considers the response of the beamformer to reverberant sound and the effect of noise. The DRR estimation algorithm is more robust to background noise than existing approaches, and is applicable where a signal is recorded with two or more microphones, such as with mobile communications devices, laptop computers, and the like.
    Type: Grant
    Filed: October 22, 2014
    Date of Patent: October 24, 2017
    Assignee: Google Inc.
    Inventors: D. James Eaton, Alastair H. Moore, Patrick A. Naylor, Jan Skoglund
  • Publication number: 20170221502
    Abstract: Existing post-filtering methods for microphone array speech enhancement have two common deficiencies. First, they assume that noise is either white or diffuse and cannot deal with point interferers. Second, they estimate the post-filter coefficients using only two microphones at a time, performing averaging over all the microphones pairs, yielding a suboptimal solution. The provided method describes a post-filtering solution that implements signal models which handle white noise, diffuse noise, and point interferers. The method also implements a globally optimized least-squares approach of microphones in a microphone array, providing a more optimal solution than existing conventional methods. Experimental results demonstrate the described method outperforming conventional methods in various acoustic scenarios.
    Type: Application
    Filed: February 3, 2016
    Publication date: August 3, 2017
    Applicant: Google Inc.
    Inventors: Yiteng HUANG, Alejandro LUEBS, Jan SKOGLUND, Willem Bastiaan KLEIJN
  • Patent number: 9721582
    Abstract: Existing post-filtering methods for microphone array speech enhancement have two common deficiencies. First, they assume that noise is either white or diffuse and cannot deal with point interferers. Second, they estimate the post-filter coefficients using only two microphones at a time, performing averaging over all the microphones pairs, yielding a suboptimal solution. The provided method describes a post-filtering solution that implements signal models which handle white noise, diffuse noise, and point interferers. The method also implements a globally optimized least-squares approach of microphones in a microphone array, providing a more optimal solution than existing conventional methods. Experimental results demonstrate the described method outperforming conventional methods in various acoustic scenarios.
    Type: Grant
    Filed: February 3, 2016
    Date of Patent: August 1, 2017
    Assignee: GOOGLE INC.
    Inventors: Yiteng Huang, Alejandro Luebs, Jan Skoglund, Willem Bastiaan Kleijn
  • Patent number: 9721580
    Abstract: Provided are methods and systems for providing situation-dependent transient noise suppression for audio signals. Different strategies (e.g., levels of aggressiveness) of transient suppression and signal restoration are applied to audio signals associated with participants in a video/audio conference depending on whether or not each participant is speaking (e.g., whether a voiced segment or an unvoiced/non-speech segment of audio is present). If no participants are speaking or there is an unvoiced/non-speech sound present, a more aggressive strategy for transient suppression and signal restoration is utilized. On the other hand, where voiced audio is detected (e.g., a participant is speaking), the methods and systems apply a softer, less aggressive suppression and restoration process.
    Type: Grant
    Filed: March 31, 2014
    Date of Patent: August 1, 2017
    Assignee: Google Inc.
    Inventors: Jan Skoglund, Alejandro Luebs
  • Publication number: 20170194015
    Abstract: Provided are methods and systems for acoustic keystroke transient cancellation/suppression for user communication devices using a semi-blind adaptive filter model. The methods and systems are designed to overcome existing problems in transient noise suppression by taking into account some less-defective signal as side information on the transients and also accounting for acoustic signal propagation, including the reverberation effects, using dynamic models. The methods and systems take advantage of a synchronous reference microphone embedded in the keyboard of the user device, and utilize an adaptive filtering approach exploiting the knowledge of this keybed microphone signal.
    Type: Application
    Filed: December 30, 2015
    Publication date: July 6, 2017
    Applicant: GOOGLE INC.
    Inventors: Herbert BUCHNER, Simon J. GODSILL, Jan SKOGLUND
  • Patent number: 9524733
    Abstract: Methods and systems are provided for using a model of human speech quality perception to provide an objective measure for predicting subjective quality assessments. A Virtual Speech Quality Objective Listener (ViSQOL) model is a signal-based full-reference metric that uses a spectro-temporal measure of similarity between a reference signal and test speech signal. Specifically, the model provides for the ability to detect and predict the level of clock drift, and determine whether such clock drift will impact a listener's quality of experience.
    Type: Grant
    Filed: May 10, 2013
    Date of Patent: December 20, 2016
    Assignee: Google Inc.
    Inventors: Jan Skoglund, Andrew J. Hines, Noami A. Harte, Anil Kokaram
  • Patent number: 9520141
    Abstract: Provided are methods and systems for detecting the presence of a transient noise event in an audio stream using primarily or exclusively the incoming audio data. Such an approach offers improved temporal resolution and is computationally efficient. The methods and systems presented utilize some time-frequency representation of an audio signal as the basis in a predictive model in an attempt to find outlying transient noise events and interpret the true detection state as a Hidden Markov Model (HMM) to model temporal and frequency cohesion common amongst transient noise events.
    Type: Grant
    Filed: February 28, 2013
    Date of Patent: December 13, 2016
    Assignee: GOOGLE INC.
    Inventors: Jens Enzo Nyby Christensen, Simon J. Godsill, Jan Skoglund
  • Publication number: 20160293176
    Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.
    Type: Application
    Filed: June 15, 2016
    Publication date: October 6, 2016
    Applicant: GOOGLE INC.
    Inventors: Minyue LI, Jan SKOGLUND, Willem Bastiaan KLEIJN
  • Patent number: 9426592
    Abstract: Methods and systems for detecting the presence and frequency of clipping in an audio signal are provided. A clipping detection algorithm detects the presence of hard and soft clipping using histograms with intervals of samples, rather than attempting to identify the clipping value. Therefore, it is not essential to the algorithm that there be a large number of bins. Furthermore, the bins may be non-uniformly distributed since the number of samples belonging to lower amplitudes is of little importance. The detection algorithm is also configured to determine the severity and/or perceptual effect of any clipping found to be present in the signal by calculating the ratio of clipped samples to non-clipped samples. Temporal information on the occurrence of clipping in the signal is also used to evaluate perceptual effect.
    Type: Grant
    Filed: February 14, 2013
    Date of Patent: August 23, 2016
    Assignee: GOOGLE INC.
    Inventors: Jan Skoglund, Jan Thomas Linden
  • Patent number: 9396732
    Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.
    Type: Grant
    Filed: October 18, 2012
    Date of Patent: July 19, 2016
    Assignee: GOOGLE INC.
    Inventors: Minyue Li, Willem Bastiaan Kleijn, Jan Skoglund
  • Publication number: 20160196833
    Abstract: Provided are methods and systems for enhancing speech when corrupted by transient noise (e.g., keyboard typing noise). The methods and systems utilize a reference microphone input signal for the transient noise in a signal restoration process used for the voice part of the signal. A robust Bayesian statistical model is used to regress the voice microphone on the reference microphone, which allows for direct inference about the desired voice signal while marginalizing the unwanted power spectral values of the voice and transient noise. Also provided is a straightforward and efficient Expectation-maximization (EM) procedure for fast enhancement of the corrupted signal. The methods and systems are designed to operate easily in real-time on standard hardware, and have very low latency so that there is no irritating delay in speaker response.
    Type: Application
    Filed: January 7, 2015
    Publication date: July 7, 2016
    Applicant: GOOGLE INC.
    Inventors: Simon J. GODSILL, Herbert BUCHNER, Jan SKOGLUND
  • Patent number: 9336791
    Abstract: Provided are methods and systems for rearranging a multichannel audio signal into sub-signals and allocating bit rates among them, such that compressing the sub-signals with a set of audio codecs at the allocated bit rates yields an optimal fidelity with respect to the original multichannel audio signal. Rearranging the multichannel audio signal into sub-signals and assigning each sub-signal a bit rate may be optimized according to a criterion. Existing audio codecs may be used to quantize the sub-signals at the assigned bit rates and the compressed sub-signals may be combined into the original format according to the manner in which the original multichannel audio signal is rearranged.
    Type: Grant
    Filed: January 24, 2013
    Date of Patent: May 10, 2016
    Assignee: GOOGLE INC.
    Inventors: Minyue Li, Jan Skoglund, Willem Bastiaan Kleijn
  • Publication number: 20160118038
    Abstract: Provided are methods and systems for generating Direct-to-Reverberant Ratio (DRR) estimates. The methods and systems use a null-steered beamformer to produce accurate DRR estimates across a variety of room sizes, reverberation times, and source-receiver distances. The DRR estimation algorithm uses spatial selectivity to separate direct and reverberant energy and account for noise separately. The formulation considers the response of the beamformer to reverberant sound and the effect of noise. The DRR estimation algorithm is more robust to background noise than existing approaches, and is applicable where a signal is recorded with two or more microphones, such as with mobile communications devices, laptop computers, and the like.
    Type: Application
    Filed: October 22, 2014
    Publication date: April 28, 2016
    Applicant: GOOGLE INC.
    Inventors: D. James EATON, Alastair H. MOORE, Patrick A. NAYLOR, Jan SKOGLUND
  • Patent number: 9263061
    Abstract: Methods and systems are provided for detecting chop in an audio signal. A time-frequency representation, such as a spectrogram, is created for an audio signal and used to calculate a gradient of mean power per frame of the audio signal. Positive and negative gradients are defined for the signal based on the gradient of mean power, and a maximum overlap offset between the positive and negative gradients is determined by calculating a value that maximizes the cross-correlation of the positive and negative gradients. The negative gradient values may be combined (e.g., summed) with the overlap offset, and the combined values then compared with a threshold to estimate the amount of chop present in the audio signal. The chop detection model provided is low-complexity and is applicable to narrowband, wideband, and superwideband speech.
    Type: Grant
    Filed: May 21, 2013
    Date of Patent: February 16, 2016
    Assignee: GOOGLE INC.
    Inventors: Andrew J. Hines, Jan Skoglund, Naomi Harte, Anil Kokaram