Patents by Inventor Jean-Marc Valin

Jean-Marc Valin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11924367
    Abstract: Joint noise and echo suppression may be performed for enhancing two-way audio communications. Audio data is captured at a communication device and audio data transmitted to the communication device from another communication device are used as input features to a trained machine learning model that uses the transmitted audio data as a reference signal to eliminate residual echo in the captured audio data when also suppressing noise in the captured audio data.
    Type: Grant
    Filed: February 9, 2022
    Date of Patent: March 5, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Jean-Marc Valin, Karim Helwani, Srikanth Venkata Tenneti, Erfan Soltanmohammadi, Mehmet Umut Isik, Richard Newman, Michael Mark Goodwin, Arvindh Krishnaswamy
  • Publication number: 20240007817
    Abstract: Real-time low-complexity stereo speech enhancement with spatial cue preservation may be performed. A stereo speech enhancement system receives a stereo input signal (e.g., a left and right input signal). The stereo speech enhancement system estimates spatial cues for a target speaker and downmixes the stereo input signal into a monaural signal. A low-complexity model may then process the monaural signal to generate an enhanced monaural signal. The stereo speech enhancement system upmixes the enhanced monaural signal based on the estimated spatial cues for the target speaker, to generate an enhanced stereo output signal.
    Type: Application
    Filed: June 30, 2022
    Publication date: January 4, 2024
    Applicant: Amazon Technologies, Inc.
    Inventors: Masahito Togami, Karim Helwani, Jean-Marc Valin, Michael Mark Goodwin
  • Patent number: 11521637
    Abstract: Post-filtering may be performed for ratio masks as part of audio enhancement. Audio data may be received. A machine learning model may be applied to generate gain values for different spectrum bands of the audio data. The gain values may then be modified using an envelope post-filter according to a monotonically increasing function applied to the gain values to produce modified gain values used to generate an enhanced version of the audio data.
    Type: Grant
    Filed: September 29, 2020
    Date of Patent: December 6, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Jean-Marc Valin, Mehmet Umut Isik, Neerad Dilip Phansalkar, Ritwik Giri, Karim Helwani, Arvindh Krishnaswamy
  • Patent number: 10432932
    Abstract: Systems and methods are described for removing ringing artifacts from a coded image. For each block of coded image data, a direction is identified by selecting a directional block divided into a plurality of pixel lines in one predetermined direction of a set of at least four predetermined directions. The directional block may have a constant value across each pixel line. A parameter related to a sum of a mean-square difference between a pixel value and the pixel average of pixels falling on a pixel line in the directional block that includes the location of the pixel may be calculated for each direction of the set. The direction of the set having a minimum summed mean-square difference is selected as the direction for the block based on the calculated parameter values. A non-linear filter is then applied to each block based on the identified direction for the block.
    Type: Grant
    Filed: July 1, 2016
    Date of Patent: October 1, 2019
    Assignee: Mozilla Corporation
    Inventors: Jean-Marc Valin, Timothy B. Terriberry
  • Patent number: 9967594
    Abstract: Systems and methods are described for predicting a mode value of a block of an image. Mode values are received for a predetermined number of neighboring blocks of the image. Each mode value may within a predetermined range of integer values. Using a probability table, a probability of each mode value in the range of integers being selected may be computed. The probability table may have a first axis that includes every integer value within the range of integer values, and second axis that includes combinations of the neighboring blocks, each combination of the neighboring blocks representing a binary determination whether or not each neighboring block is equal to a corresponding integer value on the first axis. Based on the computed probabilities, a mode value for the block may be selected and coded by an entropy coder.
    Type: Grant
    Filed: June 27, 2014
    Date of Patent: May 8, 2018
    Assignee: Mozilla Corporation
    Inventors: Jean-Marc Valin, Timothy B. Terriberry
  • Patent number: 9560386
    Abstract: An apparatus and corresponding method are provided for pyramid vector quantization of video data, including receiving the video data in the frequency domain; and pyramid vector quantizing at least one group of pictures (GOP) from the received video data in accordance with a potentially varying bandwidth, the GOP comprising key frames and predicted frames.
    Type: Grant
    Filed: February 21, 2014
    Date of Patent: January 31, 2017
    Assignee: Mozilla Corporation
    Inventors: Jean-Marc Valin, Timothy B. Terriberry
  • Publication number: 20170013260
    Abstract: Systems and methods are described for removing ringing artifacts from a coded image. For each block of coded image data, a direction is identified by selecting a directional block divided into a plurality of pixel lines in one predetermined direction of a set of at least four predetermined directions. The directional block may have a constant value across each pixel line. A parameter related to a sum of a mean-square difference between a pixel value and the pixel average of pixels falling on a pixel line in the directional block that includes the location of the pixel may be calculated for each direction of the set. The direction of the set having a minimum summed mean-square difference is selected as the direction for the block based on the calculated parameter values. A non-linear filter is then applied to each block based on the identified direction for the block.
    Type: Application
    Filed: July 1, 2016
    Publication date: January 12, 2017
    Inventors: Jean-Marc Valin, Timothy B. Terriberry
  • Patent number: 9425820
    Abstract: Systems and methods are described for encoding quantized vector parameters in a bitstream are described. An exemplary method may include receiving a vector of integers used in a data compression codebook, the sum of the integers equaling a pulse sum. An initial expected magnitude may be determined for a first integer, the initial expected magnitude being based on the pulse sum, a distribution parameter, and a value corresponding to a number of integers in the vector. The actual magnitude of the first integer may be encoded based on the initial expected magnitude of the first integer. The pulse sum may be adjusted using the encoded actual magnitude. Also, the value corresponding to the number of integers in the vector may be reduced by one. Expected magnitudes for each of the remaining integers of the vector may then be calculated recursively.
    Type: Grant
    Filed: June 3, 2014
    Date of Patent: August 23, 2016
    Assignee: Mozilla Corporation
    Inventors: Jean-Marc Valin, Timothy B. Terriberry
  • Patent number: 9015042
    Abstract: Embodiments are described of a multi-block coding scheme for an audio signal to prevent partial collapse conditions from causing pre-echo compression artifacts. An audio codec includes a segmentation component partitioning the audio signal into a plurality of tiles, wherein each tile comprises data from a particular segment of time and a particular set of frequencies of the audio signal; a band energy component determining an energy value for each tile corresponding to a signal component in a respective tile; an encoder flag tracking component marking a tile as not collapsed or collapsed based on the energy value in that tile; and a decoder flag tracking component filling all tiles marked as collapsed with pseudorandom noise at an estimated energy level.
    Type: Grant
    Filed: March 7, 2012
    Date of Patent: April 21, 2015
    Inventors: Jean-Marc Valin, Timothy B. Terriberry
  • Patent number: 9009036
    Abstract: Embodiments are generally directed to systems and methods for bit allocation and band partitioning for gain-shape vector quantization in an audio codec. An audio codec implements a method that uses an implicit, dynamic scheme to allow an encoder and decoder to recreate a series of bit allocation decisions for gain and shape without transmitting additional side information for each decision, based on the number of bits that are left remaining and available in a given packet. For implementation in practical codecs, the band comprising the allocation of bits for the shape is recursively split into equal partitions until the number of bits allocated to each partition is less than the maximum codebook size.
    Type: Grant
    Filed: March 7, 2012
    Date of Patent: April 14, 2015
    Inventors: Jean-Marc Valin, Timothy B. Terriberry
  • Patent number: 9008811
    Abstract: Embodiments are described for a system and method for implementing an adaptive time-frequency resolution in audio and video coding systems. A method of adaptively transforming the time-frequency resolution for a defined spectrum comprises dividing the spectrum of the input signal into a into plurality of bands; determining, for each band of the plurality of bands, a characteristic of the content (e.g., tonal or transient content); modifying the time-frequency resolution value to one or more bands of the plurality of bands to increase either a time resolution of the band or a frequency resolution of the band depending on the characteristic of the content; determining a cost associated with modifying the time-frequency resolution value of the one more bands based on an entropy measure of the bands, and altering the modified time-frequency resolution values in a manner that accounts for the coding cost.
    Type: Grant
    Filed: September 16, 2011
    Date of Patent: April 14, 2015
    Inventors: Jean-Marc Valin, Timothy B. Terriberry
  • Publication number: 20150003523
    Abstract: Systems and methods are described for predicting a mode value of a block of an image. Mode values are received for a predetermined number of neighboring blocks of the image. Each mode value may within a predetermined range of integer values. Using a probability table, a probability of each mode value in the range of integers being selected may be computed. The probability table may have a first axis that includes every integer value within the range of integer values, and second axis that includes combinations of the neighboring blocks, each combination of the neighboring blocks representing a binary determination whether or not each neighboring block is equal to a corresponding integer value on the first axis. Based on the computed probabilities, a mode value for the block may be selected and coded by an entropy coder.
    Type: Application
    Filed: June 27, 2014
    Publication date: January 1, 2015
    Inventors: Jean-Marc Valin, Timothy B. TERRIBERRY
  • Publication number: 20140358978
    Abstract: Systems and methods are described for encoding quantized vector parameters in a bitstream are described. An exemplary method may include receiving a vector of integers used in a data compression codebook, the sum of the integers equaling a pulse sum. An initial expected magnitude may be determined for a first integer, the initial expected magnitude being based on the pulse sum, a distribution parameter, and a value corresponding to a number of integers in the vector. The actual magnitude of the first integer may be encoded based on the initial expected magnitude of the first integer. The pulse sum may be adjusted using the encoded actual magnitude. Also, the value corresponding to the number of integers in the vector may be reduced by one. Expected magnitudes for each of the remaining integers of the vector may then be calculated recursively.
    Type: Application
    Filed: June 3, 2014
    Publication date: December 4, 2014
    Inventors: Jean-Marc VALIN, Timothy B. TERRIBERRY
  • Publication number: 20140286399
    Abstract: An apparatus and corresponding method are provided for pyramid vector quantization of video data, including receiving the video data in the frequency domain; and pyramid vector quantizing at least one group of pictures (GOP) from the received video data in accordance with a potentially varying bandwidth, the GOP comprising key frames and predicted frames.
    Type: Application
    Filed: February 21, 2014
    Publication date: September 25, 2014
    Inventors: Jean-Marc VALIN, Timothy B. TERRIBERRY
  • Patent number: 8838442
    Abstract: Embodiments are directed to an audio coding scheme implemented in a codec that eliminates birdie artifacts generated by transform coding methods. A frequency coefficient spreading method invertibly rotates a spectrum of coefficient values based on a defined rotation angle, The rotated spectrum is then quantized, and the rotation operation is then reversed so that a previously sparse spectrum (i.e., one with few non-zero values) becomes one that has many non-zero values. The method arranges the coefficients for a particular partition into a linear array and computes a gain factor for the partition. A rotation angle of between 0 and ?/4 for successive pairs of coefficients of the linear array based on the gain factor is then derived. One or more rotation operations are then applied to successive pairs of coefficients in the linear array using a specific rotation angle and a stride length for each rotation operation.
    Type: Grant
    Filed: March 7, 2012
    Date of Patent: September 16, 2014
    Inventors: Timothy B. Terriberry, Jean-Marc Valin
  • Patent number: 8396163
    Abstract: Disclosed is a method of communicating a complex vector, using one or more index bits, the method comprising quantizing the complex vector using a codebook, the codebook comprising a plurality of complex vectors mapped from real vectors that lie on a reflected simplex where the simplex is of dimension one less than twice the length of the complex vector; indexing the quantized vector to form the one or more index bits; and transmitting the index bits to the transmitter, thereby communicating the complex vector.
    Type: Grant
    Filed: September 18, 2009
    Date of Patent: March 12, 2013
    Assignee: Commonwealth Scientific and Industrial Research Organization
    Inventors: Iain Bruce Collings, Daniel Ryan, Jean-Marc Valin
  • Publication number: 20120232908
    Abstract: Embodiments are described of a multi-block coding scheme for an audio signal to prevent partial collapse conditions from causing pre-echo compression artifacts. An audio codec includes a segmentation component partitioning the audio signal into a plurality of tiles, wherein each tile comprises data from a particular segment of time and a particular set of frequencies of the audio signal; a band energy component determining an energy value for each tile corresponding to a signal component in a respective tile; an encoder flag tracking component marking a tile as not collapsed or collapsed based on the energy value in that tile; and a decoder flag tracking component filling all tiles marked as collapsed with pseudorandom noise at an estimated energy level.
    Type: Application
    Filed: March 7, 2012
    Publication date: September 13, 2012
    Inventors: Timothy B. Terriberry, Jean-Marc Valin
  • Publication number: 20120232909
    Abstract: Embodiments are directed to an audio coding scheme implemented in a codec that eliminates birdie artifacts generated by transform coding methods. A frequency coefficient spreading method invertibly rotates a spectrum of coefficient values based on a defined rotation angle, The rotated spectrum is then quantized, and the rotation operation is then reversed so that a previously sparse spectrum (i.e., one with few non-zero values) becomes one that has many non-zero values. The method arranges the coefficients for a particular partition into a linear array and computes a gain factor for the partition. A rotation angle of between 0 and ?/4 for successive pairs of coefficients of the linear array based on the gain factor is then derived. One or more rotation operations are then applied to successive pairs of coefficients in the linear array using a specific rotation angle and a stride length for each rotation operation.
    Type: Application
    Filed: March 7, 2012
    Publication date: September 13, 2012
    Inventors: Timothy B. Terriberry, Jean-Marc Valin
  • Publication number: 20120232913
    Abstract: Embodiments are generally directed to systems and methods for bit allocation and band partitioning for gain-shape vector quantization in an audio codec. An audio codec implements a method that uses an implicit, dynamic scheme to allow an encoder and decoder to recreate a series of bit allocation decisions for gain and shape without transmitting additional side information for each decision, based on the number of bits that are left remaining and available in a given packet. For implementation in practical codecs, the band comprising the allocation of bits for the shape is recursively split into equal partitions until the number of bits allocated to each partition is less than the maximum codebook size.
    Type: Application
    Filed: March 7, 2012
    Publication date: September 13, 2012
    Inventors: Timothy B. Terriberry, Jean-Marc Valin
  • Publication number: 20120069898
    Abstract: Embodiments are described for a system and method for implementing an adaptive time-frequency resolution in audio and video coding systems. A method of adaptively transforming the time-frequency resolution for a defined spectrum comprises dividing the spectrum of the input signal into a into plurality of bands; determining, for each band of the plurality of bands, a characteristic of the content (e.g., tonal or transient content); modifying the time-frequency resolution value to one or more bands of the plurality of bands to increase either a time resolution of the band or a frequency resolution of the band depending on the characteristic of the content; determining a cost associated with modifying the time-frequency resolution value of the one more bands based on an entropy measure of the bands, and altering the modified time-frequency resolution values in a manner that accounts for the coding cost.
    Type: Application
    Filed: September 16, 2011
    Publication date: March 22, 2012
    Inventors: Jean-Marc Valin, Timothy B. Terriberry