Patents by Inventor Bastiaan Kleijn

Bastiaan Kleijn has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20200176004
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for coding speech using neural networks. One of the methods includes obtaining a bitstream of parametric coder parameters characterizing spoken speech; generating, from the parametric coder parameters, a conditioning sequence; generating a reconstruction of the spoken speech that includes a respective speech sample at each of a plurality of decoder time steps, comprising, at each decoder time step: processing a current reconstruction sequence using an auto-regressive generative neural network, wherein the auto-regressive generative neural network is configured to process the current reconstruction to compute a score distribution over possible speech sample values, and wherein the processing comprises conditioning the auto-regressive generative neural network on at least a portion of the conditioning sequence; and sampling a speech sample from the possible speech sample values.
    Type: Application
    Filed: November 30, 2018
    Publication date: June 4, 2020
    Inventors: Willem Bastiaan Kleijn, Jan K. Skoglund, Alejandro Luebs, Sze Chie Lim
  • Publication number: 20200152220
    Abstract: Techniques of performing linear acoustic echo cancellation (LAEC) involve performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.
    Type: Application
    Filed: October 10, 2019
    Publication date: May 14, 2020
    Inventors: Turaj Zakizadeh Shabestary, Willem Bastiaan Kleijn, Jan Skoglund
  • Publication number: 20200120419
    Abstract: Techniques of source localization and acquisition involve a wide-band joint acoustic source localization and acquisition approach in light of a sparse optimization framework based on an orthogonal matching pursuit-based grid-shift procedure. Along these lines, a specific grid structure is constructed with the same number of grid points as compared to the on-grid case, but which is “shifted” across the acoustic scene. More specifically, it is expected that each source will be located close to a grid point in at least one of the set of shifted grids. The sparse solutions corresponding to the set of shifted grids are combined to obtain the source location estimates. The estimated source positions are used as side information to obtain the original source signals.
    Type: Application
    Filed: October 10, 2018
    Publication date: April 16, 2020
    Inventors: Willem Bastiaan Kleijn, Jan Skoglund, Christos Tzagkarakis
  • Patent number: 10553234
    Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.
    Type: Grant
    Filed: November 21, 2018
    Date of Patent: February 4, 2020
    Assignee: GOOGLE LLC
    Inventors: Minyue Li, Willem Bastiaan Kleijn, Jan Skoglund
  • Publication number: 20190392853
    Abstract: According to an aspect, a method for multi-channel echo cancellation includes receiving a microphone signal and a multi-channel loudspeaker driving signal. The multi-channel loudspeaker driving signal includes a first driving signal that drives a first loudspeaker, and a second driving signal that drives a second loudspeaker. The first driving signal is substantially the same as the second driving signal. The microphone signal includes a near-end signal with echo. The method includes determining a unique solution for acoustic transfer functions for a present acoustic scenario based on the microphone signal and the multi-channel loudspeaker driving signal. The acoustic transfer functions include first and second acoustic transfer functions. The unique solution is determined based on time-frequency transforms of observations from the present acoustic scenario and at least one previous acoustic scenario.
    Type: Application
    Filed: June 26, 2018
    Publication date: December 26, 2019
    Inventors: Willem Bastiaan Kleijn, Turaj Zakizadeh Shabestary
  • Patent number: 10490203
    Abstract: Techniques of performing linear acoustic echo cancellation (LAEC) involve performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.
    Type: Grant
    Filed: December 18, 2017
    Date of Patent: November 26, 2019
    Assignee: GOOGLE LLC
    Inventors: Turaj Zakizadeh Shabestary, Willem Bastiaan Kleijn, Jan Skoglund
  • Publication number: 20190273987
    Abstract: The invention relates to a sound processing node for an arrangement of sound processing nodes, the sound processing nodes being configured to receive a plurality of sound signals, wherein the sound processing node comprises a processor configured to generate an output signal on the basis of the plurality of sound signals weighted by a plurality of beamforming weights, wherein the processor is configured to adaptively determine the plurality of beamforming weights on the basis of an adaptive linearly constrained minimum variance beamformer using a transformed version of a least mean squares formulation of a constrained gradient descent approach, wherein the transformed version of the least mean squares formulation of the constrained gradient descent approach is based on a transformation of the least mean squares formulation of the constrained gradient descent approach to the dual domain.
    Type: Application
    Filed: May 21, 2019
    Publication date: September 5, 2019
    Inventors: Wenyu Jin, Thomas Sherson, Willem Bastiaan Kleijn, Richard Heusdens, Yue Lang
  • Patent number: 10396906
    Abstract: Provided are methods and systems for improving the intelligibility of speech in a noisy environment. A communication model is developed that includes noise inherent in the message production and message interpretation processes, and considers that these noises have fixed signal-to-noise ratios. The communication model forms the basis of an algorithm designed to optimize the intelligibility of speech in a noisy environment. The intelligibility optimization algorithm only does something (e.g., manipulates the audio signal) when needed, and thus if no noise is present the algorithm does not alter or otherwise interfere with the audio signals, thereby preventing any speech distortion. The algorithm is also very fast and efficient in comparison to most existing approaches for speech intelligibility enhancement, and therefore the algorithm lends itself to easy implementation in an appropriate device (e.g., cellular phone or smartphone).
    Type: Grant
    Filed: March 20, 2018
    Date of Patent: August 27, 2019
    Assignee: GOOGLE LLC
    Inventors: Willem Bastiaan Kleijn, Richard C. Hendriks
  • Publication number: 20190259397
    Abstract: A method includes: receiving a representation of a soundfield, the representation characterizing the soundfield around a point in space; decomposing the received representation into independent signals; and encoding the independent signals, wherein a quantization noise for any of the independent signals has a common spatial profile with the independent signal.
    Type: Application
    Filed: May 6, 2019
    Publication date: August 22, 2019
    Inventors: Willem Bastiaan Kleijn, Jan Skoglund, Sze Chie Lim
  • Patent number: 10332530
    Abstract: A method includes: receiving a representation of a soundfield, the representation characterizing the soundfield around a point in space; decomposing the received representation into independent signals; and encoding the independent signals, wherein a quantization noise for any of the independent signals has a common spatial profile with the independent signal.
    Type: Grant
    Filed: January 27, 2017
    Date of Patent: June 25, 2019
    Assignee: GOOGLE LLC
    Inventors: Willem Bastiaan Kleijn, Jan Skoglund, Sze Chie Lim
  • Publication number: 20190172479
    Abstract: The disclosure relates to an apparatus for determining a quality score (MOS) for an audio signal sample, the apparatus comprising: an extractor configured to extract a feature vector from the audio signal sample, wherein the feature vector comprises a plurality of feature values and wherein each feature value is associated to a different feature of the feature vector; a pre-processor configured to pre-process a feature value of the feature vector based on a cumulative distribution function associated to the feature represented by the feature value to obtain a pre-processed feature value; and a processor configured to implement a neural network and to determine the quality score (MOS) for the audio signal sample based on the pre-processed feature value and a set of neural network parameters for the neural network associated to the cumulative distribution function.
    Type: Application
    Filed: February 7, 2019
    Publication date: June 6, 2019
    Inventors: Wei Xiao, Mona Hakami, Willem Bastiaan Kleijn
  • Patent number: 10313785
    Abstract: A sound processing node for an arrangement of sound processing nodes is disclosed, the sound processing nodes being configured to receive a plurality of sound signals, wherein the sound processing node comprises a processor configured to determine a beamforming signal on the basis of the plurality of sound signals weighted by a plurality of weights, wherein the processor is configured to determine the plurality of weights using a transformed version of a linearly constrained minimum variance approach, the transformed version of the linearly constrained minimum variance approach being obtained by applying a convex relaxation to the linearly constrained minimum variance approach.
    Type: Grant
    Filed: March 29, 2018
    Date of Patent: June 4, 2019
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Yue Lang, Wenyu Jin, Thomas Sherson, Richard Heusdens, Willem Bastiaan Kleijn
  • Patent number: 10291917
    Abstract: Implementations of independent temporally concurrent video stream coding may include encoding a plurality of input frames from an input video sequence, wherein the plurality of input frames includes a first input frame. Encoding the plurality of input frames may include generating a first plurality of encoded frames based on the plurality of input frames such that the first plurality of encoded frames includes a first encoded I-frame corresponding to the first input frame, and generating a second plurality of encoded frames based on the plurality of input frames such that the second plurality of encoded frames includes a first encoded P-frame corresponding to the first input frame. Implementations of independent temporally concurrent video stream coding may include including the first plurality of encoded frames and the second plurality of encoded frames in an output, and transmitting the output to a decoder.
    Type: Grant
    Filed: August 25, 2015
    Date of Patent: May 14, 2019
    Assignee: Google LLC
    Inventors: Ermin Kozica, Dave Zachariah, Willem Bastiaan Kleijn
  • Patent number: 10264386
    Abstract: Techniques of rendering high-order ambisonics (HOAs) involve adjusting the weights of a spherical harmonic (SH) expansion of a sound field based on weights of a SH expansion of a direction emphasis function that multiplies a monopole density that, when its product with a Green's function is integrated over the unit sphere, produces the sound field. An advantage of the improved techniques lies in the ability to better reproduce directionality of a given sound field in a computationally efficient manner, whether the sound field is a temporal function or a time-frequency function.
    Type: Grant
    Filed: February 9, 2018
    Date of Patent: April 16, 2019
    Assignee: GOOGLE LLC
    Inventor: Willem Bastiaan Kleijn
  • Publication number: 20190096418
    Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.
    Type: Application
    Filed: November 21, 2018
    Publication date: March 28, 2019
    Inventors: Minyue Li, Willem Bastiaan Kleijn, Jan Skoglund
  • Patent number: 10141000
    Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.
    Type: Grant
    Filed: June 15, 2016
    Date of Patent: November 27, 2018
    Assignee: GOOGLE LLC
    Inventors: Minyue Li, Jan Skoglund, Willem Bastiaan Kleijn
  • Publication number: 20180270573
    Abstract: A sound processing node for an arrangement of sound processing nodes is disclosed, the sound processing nodes being configured to receive a plurality of sound signals, wherein the sound processing node comprises a processor configured to determine a beamforming signal on the basis of the plurality of sound signals weighted by a plurality of weights, wherein the processor is configured to determine the plurality of weights using a transformed version of a linearly constrained minimum variance approach, the transformed version of the linearly constrained minimum variance approach being obtained by applying a convex relaxation to the linearly constrained minimum variance approach.
    Type: Application
    Filed: March 29, 2018
    Publication date: September 20, 2018
    Inventors: Yue Lang, Wenyu Jin, Thomas Sherson, Richard Heusdens, Willem Bastiaan Kleijn
  • Publication number: 20180218740
    Abstract: A method includes: receiving a representation of a soundfield, the representation characterizing the soundfield around a point in space; decomposing the received representation into independent signals; and encoding the independent signals, wherein a quantization noise for any of the independent signals has a common spatial profile with the independent signal.
    Type: Application
    Filed: January 27, 2017
    Publication date: August 2, 2018
    Inventors: Willem Bastiaan Kleijn, Jan Skoglund, Sze Chie Lim
  • Publication number: 20180212690
    Abstract: Provided are methods and systems for improving the intelligibility of speech in a noisy environment. A communication model is developed that includes noise inherent in the message production and message interpretation processes, and considers that these noises have fixed signal-to-noise ratios. The communication model forms the basis of an algorithm designed to optimize the intelligibility of speech in a noisy environment. The intelligibility optimization algorithm only does something (e.g., manipulates the audio signal) when needed, and thus if no noise is present the algorithm does not alter or otherwise interfere with the audio signals, thereby preventing any speech distortion. The algorithm is also very fast and efficient in comparison to most existing approaches for speech intelligibility enhancement, and therefore the algorithm lends itself to easy implementation in an appropriate device (e.g., cellular phone or smartphone).
    Type: Application
    Filed: March 20, 2018
    Publication date: July 26, 2018
    Inventors: Willem Bastiaan Kleijn, Andrew Allen
  • Patent number: 10014961
    Abstract: Provided are methods and systems for improving the intelligibility of speech in a noisy environment. A communication model is developed that includes noise inherent in the message production and message interpretation processes, and considers that these noises have fixed signal-to-noise ratios. The communication model forms the basis of an algorithm designed to optimize the intelligibility of speech in a noisy environment. The intelligibility optimization algorithm only does something (e.g., manipulates the audio signal) when needed, and thus if no noise is present the algorithm does not alter or otherwise interfere with the audio signals, thereby preventing any speech distortion. The algorithm is also very fast and efficient in comparison to most existing approaches for speech intelligibility enhancement, and therefore the algorithm lends itself to easy implementation in an appropriate device (e.g., cellular phone or smartphone).
    Type: Grant
    Filed: April 10, 2014
    Date of Patent: July 3, 2018
    Assignee: Google LLC
    Inventors: Willem Bastiaan Kleijn, Richard C. Hendriks