Patents by Inventor Bastiaan Kleijn

Bastiaan Kleijn has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SPEECH CODING USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS

Publication number: 20200176004

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for coding speech using neural networks. One of the methods includes obtaining a bitstream of parametric coder parameters characterizing spoken speech; generating, from the parametric coder parameters, a conditioning sequence; generating a reconstruction of the spoken speech that includes a respective speech sample at each of a plurality of decoder time steps, comprising, at each decoder time step: processing a current reconstruction sequence using an auto-regressive generative neural network, wherein the auto-regressive generative neural network is configured to process the current reconstruction to compute a score distribution over possible speech sample values, and wherein the processing comprises conditioning the auto-regressive generative neural network on at least a portion of the conditioning sequence; and sampling a speech sample from the possible speech sample values.

Type: Application

Filed: November 30, 2018

Publication date: June 4, 2020

Inventors: Willem Bastiaan Kleijn, Jan K. Skoglund, Alejandro Luebs, Sze Chie Lim
ECHO CANCELLATION FOR KEYWORD SPOTTING

Publication number: 20200152220

Abstract: Techniques of performing linear acoustic echo cancellation performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.

Type: Application

Filed: October 10, 2019

Publication date: May 14, 2020

Inventors: Turaj Zakizadeh Shabestary, Willem Bastiaan Kleijn, Jan Skoglund
JOINT WIDEBAND SOURCE LOCALIZATION AND ACQUISITION BASED ON A GRID-SHIFT APPROACH

Publication number: 20200120419

Abstract: Techniques of source localization and acquisition involve a wide-band joint acoustic source localization and acquisition approach in light of sparse optimization framework based on an orthogonal matching pursuit-based grid-shift procedure. Along these lines, a specific grid structure is constructed with the same number of grid points as compared to the on-grid case, but which is “shifted” across the acoustic scene. More specifically, it is expected that each source will be located close to a grid point in at least one of the set of shifted grids. The sparse solutions corresponding to the set of shifted grids are combined to obtain the source location estimates. The estimated source positions are used as side information to obtain the original source signals.

Type: Application

Filed: October 10, 2018

Publication date: April 16, 2020

Inventors: Willem Bastiaan Kleijn, Jan Skoglund, Christos Tzagkarakis
Hierarchical decorrelation of multichannel audio

Patent number: 10553234

Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.

Type: Grant

Filed: November 21, 2018

Date of Patent: February 4, 2020

Assignee: GOOGLE LLC

Inventors: Minyue Li, Willem Bastiaan Kleijn, Jan Skoglund
MULTI-CHANNEL ECHO CANCELLATION WITH SCENARIO MEMORY

Publication number: 20190392853

Abstract: According to an aspect, a method for multi-channel echo cancellation includes receiving a microphone signal and a multi-channel loudspeaker driving signal. The multi-channel loudspeaker driving signal includes a first driving signal that drives a first loudspeaker, and a second driving signal that drives a second loudspeaker. The first driving signal is substantially the same as second driving signal. The microphone signal includes a near-end signal with echo. The method includes determining a unique solution for acoustic transfer functions for a present acoustic scenario based on the microphone signal and the multi-channel loudspeaker driving signal. The acoustic transfer functions include first and second acoustic transfer function. The unique solution is determined based on time-frequency transforms of observations from the present acoustic scenario and at least one previous acoustic scenario.

Type: Application

Filed: June 26, 2018

Publication date: December 26, 2019

Inventors: Willem Bastiaan Kleijn, Turaj Zakizadeh Shabestary
Echo cancellation for keyword spotting

Patent number: 10490203

Abstract: Techniques of performing linear acoustic echo cancellation performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.

Type: Grant

Filed: December 18, 2017

Date of Patent: November 26, 2019

Assignee: GOOGLE LLC

Inventors: Turaj Zakizadeh Shabestary, Willem Bastiaan Kleijn, Jan Skoglund
SOUND PROCESSING NODE OF AN ARRANGEMENT OF SOUND PROCESSING NODES

Publication number: 20190273987

Abstract: The invention relates to a sound processing node for an arrangement of sound processing nodes, the sound processing nodes being configured to receive a plurality of sound signals, wherein the sound processing node comprises a processor configured to generate an output signal on the basis of the plurality of sound signals weighted by a plurality of beamforming weights, wherein the processor is configured to adaptively determine the plurality of beamforming weights on the basis of an adaptive linearly constrained minimum variance beamformer using a transformed version of a least mean squares formulation of a constrained gradient descent approach, wherein the transformed version of the least mean squares formulation of the constrained gradient descent approach is based on a transformation of the least mean squares formulation of the constrained gradient descent approach to the dual domain.

Type: Application

Filed: May 21, 2019

Publication date: September 5, 2019

Inventors: Wenyu JIN, Thomas SHERSON, Willem Bastiaan KLEIJN, Richard HEUSDENS, Yue LANG
Mutual information based intelligibility enhancement

Patent number: 10396906

Abstract: Provided are methods and systems for improving the intelligibility of speech in a noisy environment. A communication model is developed that includes noise inherent in the message production and message interpretation processes, and considers that these noises have fixed signal-to-noise ratios. The communication model forms the basis of an algorithm designed to optimize the intelligibility of speech in a noisy environment. The intelligibility optimization algorithm only does something (e.g., manipulates the audio signal) when needed, and thus if no noise is present the algorithm does not alter or otherwise interfere with the audio signals, thereby preventing any speech distortion. The algorithm is also very fast and efficient in comparison to most existing approaches for speech intelligibility enhancement, and therefore the algorithm lends itself to easy implementation in an appropriate device (e.g., cellular phone or smartphone).

Type: Grant

Filed: March 20, 2018

Date of Patent: August 27, 2019

Assignee: GOOGLE LLC

Inventors: Willem Bastiaan Kleijn, Richard C. Hendriks
CODING OF A SOUNDFIELD REPRESENTATION

Publication number: 20190259397

Abstract: A method includes: receiving a representation of a soundfield, the representation characterizing the soundfield around a point in space; decomposing the received representation into independent signals; and encoding the independent signals, wherein a quantization noise for any of the independent signals has a common spatial profile with the independent signal.

Type: Application

Filed: May 6, 2019

Publication date: August 22, 2019

Inventors: Willem Bastiaan Kleijn, Jan Skoglund, Sze Chie Lim
Coding of a soundfield representation

Patent number: 10332530

Abstract: A method includes: receiving a representation of a soundfield, the representation characterizing the soundfield around a point in space; decomposing the received representation into independent signals; and encoding the independent signals, wherein a quantization noise for any of the independent signals has a common spatial profile with the independent signal.

Type: Grant

Filed: January 27, 2017

Date of Patent: June 25, 2019

Assignee: GOOGLE LLC

Inventors: Willem Bastiaan Kleijn, Jan Skoglund, Sze Chie Lim
DEVICES AND METHODS FOR EVALUATING SPEECH QUALITY

Publication number: 20190172479

Abstract: The disclosure relates to an apparatus for determining a quality score (MOS) for an audio signal sample, the apparatus comprising: an extractor configured to extract a feature vector from the audio signal sample, wherein the feature vector comprises a plurality of feature values and wherein each feature value is associated to a different feature of the feature vector; a pre-processor configured to pre-process a feature value of the feature vector based on a cumulative distribution function associated to the feature represented by the feature value to obtain a pre-processed feature value; and a processor configured to implement a neural network and to determine the quality score (MOS) for the audio signal sample based on the pre-processed feature value and a set of neural network parameters for the neural network associated to the cumulative distribution function.

Type: Application

Filed: February 7, 2019

Publication date: June 6, 2019

Inventors: Wei XIAO, Mona HAKAMI, Willem Bastiaan KLEIJN
Sound processing node of an arrangement of sound processing nodes

Patent number: 10313785

Abstract: A sound processing node for an arrangement of sound processing nodes is disclosed. The sound processing nodes being configured to receive a plurality of sound signals, wherein the sound processing node comprises a processor configured to determine a beamforming signal on the basis of the plurality of sound signals weighted by a plurality of weights, wherein the processor is configured to determine the plurality of weights using a transformed version of a linearly constrained minimum variance approach, the transformed version of the linearly constrained minimum variance approach being obtained by applying a convex relaxation to the linearly constrained minimum variance approach.

Type: Grant

Filed: March 29, 2018

Date of Patent: June 4, 2019

Assignee: Huawei Technologies Co., Ltd.

Inventors: Yue Lang, Wenyu Jin, Thomas Sherson, Richard Heusdens, Willem Bastiaan Kleijn
Independent temporally concurrent Video stream coding

Patent number: 10291917

Abstract: Implementations of independent temporally concurrent video stream coding may include encoding a plurality of input frames from an input video sequence, wherein the plurality of input frames includes a first input frame. Encoding the plurality of input frames may include generating a first plurality of encoded frames based on the plurality of input frames such that the first plurality of encoded frames includes a first encoded I-frame corresponding to the first input frame, and generating a second plurality of encoded frames based on the plurality of input frames such that the second plurality of encoded frames includes a first encoded P-frame corresponding to the first input frame. Implementations of independent temporally concurrent video stream coding may include including the first plurality of encoded frames and the second plurality of encoded frames in an output, and transmitting the output to a decoder.

Type: Grant

Filed: August 25, 2015

Date of Patent: May 14, 2019

Assignee: Google LLC

Inventors: Ermin Kozica, Dave Zachariah, Willem Bastiaan Kleijn
Directional emphasis in ambisonics

Patent number: 10264386

Abstract: Techniques of rendering high-order ambisonics (HOAs) involve adjusting the weights of a spherical harmonic (SH) expansion of a sound field based on weights of a SH expansion of a direction emphasis function that multiplies a monopole density that, when its product with a Green's function is integrated over the unit sphere, produces the sound field. An advantage of the improved techniques lies in the ability to better reproduce directionality of a given sound field in a computationally manner, whether the sound field is a temporal function or a time-frequency function.

Type: Grant

Filed: February 9, 2018

Date of Patent: April 16, 2019

Assignee: GOOGLE LLC

Inventor: Willem Bastiaan Kleijn
HIERARCHICAL DECORRELATION OF MULTICHANNEL AUDIO

Publication number: 20190096418

Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.

Type: Application

Filed: November 21, 2018

Publication date: March 28, 2019

Inventors: Minyue Li, Willem Bastiaan Kleijn, Jan Skoglund
Hierarchical decorrelation of multichannel audio

Patent number: 10141000

Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.

Type: Grant

Filed: June 15, 2016

Date of Patent: November 27, 2018

Assignee: GOOGLE LLC

Inventors: Minyue Li, Jan Skoglund, Willem Bastiaan Kleijn
SOUND PROCESSING NODE OF AN ARRANGEMENT OF SOUND PROCESSING NODES

Publication number: 20180270573

Abstract: A sound processing node for an arrangement of sound processing nodes is disclosed. The sound processing nodes being configured to receive a plurality of sound signals, wherein the sound processing node comprises a processor configured to determine a beamforming signal on the basis of the plurality of sound signals weighted by a plurality of weights, wherein the processor is configured to determine the plurality of weights using a transformed version of a linearly constrained minimum variance approach, the transformed version of the linearly constrained minimum variance approach being obtained by applying a convex relaxation to the linearly constrained minimum variance approach.

Type: Application

Filed: March 29, 2018

Publication date: September 20, 2018

Inventors: Yue LANG, Wenyu JIN, Thomas SHERSON, Richard HEUSDENS, Willem Bastiaan KLEIJN
CODING OF A SOUNDFIELD REPRESENTATION

Publication number: 20180218740

Abstract: A method includes: receiving a representation of a soundfield, the representation characterizing the soundfield around a point in space; decomposing the received representation into independent signals; and encoding the independent signals, wherein a quantization noise for any of the independent signals has a common spatial profile with the independent signal.

Type: Application

Filed: January 27, 2017

Publication date: August 2, 2018

Inventors: Willem Bastiaan Kleijn, Jan Skoglund, Sze Chie Lim
MUTUAL INFORMATION BASED INTELLIGIBILITY ENHANCEMENT

Publication number: 20180212690

Abstract: Provided are methods and systems for improving the intelligibility of speech in a noisy environment. A communication model is developed that includes noise inherent in the message production and message interpretation processes, and considers that these noises have fixed signal-to-noise ratios. The communication model forms the basis of an algorithm designed to optimize the intelligibility of speech in a noisy environment. The intelligibility optimization algorithm only does something (e.g., manipulates the audio signal) when needed, and thus if no noise is present the algorithm does not alter or otherwise interfere with the audio signals, thereby preventing any speech distortion. The algorithm is also very fast and efficient in comparison to most existing approaches for speech intelligibility enhancement, and therefore the algorithm lends itself to easy implementation in an appropriate device (e.g., cellular phone or smartphone).

Type: Application

Filed: March 20, 2018

Publication date: July 26, 2018

Inventors: Willem Bastiaan Kleijn, Andrew Allen
Mutual information based intelligibility enhancement

Patent number: 10014961

Abstract: Provided are methods and systems for improving the intelligibility of speech in a noisy environment. A communication model is developed that includes noise inherent in the message production and message interpretation processes, and considers that these noises have fixed signal-to-noise ratios. The communication model forms the basis of an algorithm designed to optimize the intelligibility of speech in a noisy environment. The intelligibility optimization algorithm only does something (e.g., manipulates the audio signal) when needed, and thus if no noise is present the algorithm does not alter or otherwise interfere with the audio signals, thereby preventing any speech distortion. The algorithm is also very fast and efficient in comparison to most existing approaches for speech intelligibility enhancement, and therefore the algorithm lends itself to easy implementation in an appropriate device (e.g., cellular phone or smartphone).

Type: Grant

Filed: April 10, 2014

Date of Patent: July 3, 2018

Assignee: Google LLC

Inventors: Willem Bastiaan Kleijn, Richard C. Hendriks

prev 1 2 3 4 5 6 … next