Patents by Inventor Jean-Marc Valin
Jean-Marc Valin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11924367
Abstract: Joint noise and echo suppression may be performed for enhancing two-way audio communications. Audio data captured at a communication device and audio data transmitted to the communication device from another communication device are used as input features to a trained machine learning model that uses the transmitted audio data as a reference signal to eliminate residual echo in the captured audio data while also suppressing noise in the captured audio data.
Type: Grant
Filed: February 9, 2022
Date of Patent: March 5, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Jean-Marc Valin, Karim Helwani, Srikanth Venkata Tenneti, Erfan Soltanmohammadi, Mehmet Umut Isik, Richard Newman, Michael Mark Goodwin, Arvindh Krishnaswamy
-
Publication number: 20240007817
Abstract: Real-time low-complexity stereo speech enhancement with spatial cue preservation may be performed. A stereo speech enhancement system receives a stereo input signal (e.g., a left and right input signal). The stereo speech enhancement system estimates spatial cues for a target speaker and downmixes the stereo input signal into a monaural signal. A low-complexity model may then process the monaural signal to generate an enhanced monaural signal. The stereo speech enhancement system upmixes the enhanced monaural signal based on the estimated spatial cues for the target speaker, to generate an enhanced stereo output signal.
Type: Application
Filed: June 30, 2022
Publication date: January 4, 2024
Applicant: Amazon Technologies, Inc.
Inventors: Masahito Togami, Karim Helwani, Jean-Marc Valin, Michael Mark Goodwin
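The downmix, enhance, upmix flow in this abstract can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the claimed method: the function name `enhance_stereo` is hypothetical, the spatial cue is reduced to one least-squares gain per channel estimated from the whole input (the abstract speaks of cues for a target speaker), and the enhancement model is passed in as an arbitrary callable.

```python
def enhance_stereo(left, right, enhance_mono):
    """Downmix to mono, enhance, then upmix with per-channel gains.

    Assumption: the "spatial cue" is a single gain per channel, obtained by
    projecting each input channel onto the mono downmix.
    """
    # Downmix: plain average of the two channels.
    mono = [(l + r) / 2.0 for l, r in zip(left, right)]
    energy = sum(m * m for m in mono)

    if energy > 0.0:
        # Least-squares gain that best maps the downmix back to each channel.
        gain_left = sum(l * m for l, m in zip(left, mono)) / energy
        gain_right = sum(r * m for r, m in zip(right, mono)) / energy
    else:
        gain_left = gain_right = 1.0

    # Any mono enhancement model can be plugged in here.
    enhanced = enhance_mono(mono)

    # Upmix: reapply the estimated gains so the level difference survives.
    return ([gain_left * v for v in enhanced],
            [gain_right * v for v in enhanced])
```

With an identity function as the enhancer and a right channel that is a scaled copy of the left, this sketch returns the input unchanged, which is a quick way to check that the level cue is preserved.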
-
Patent number: 11521637
Abstract: Post-filtering may be performed for ratio masks as part of audio enhancement. Audio data may be received. A machine learning model may be applied to generate gain values for different spectrum bands of the audio data. The gain values may then be modified using an envelope post-filter according to a monotonically increasing function applied to the gain values to produce modified gain values used to generate an enhanced version of the audio data.
Type: Grant
Filed: September 29, 2020
Date of Patent: December 6, 2022
Assignee: Amazon Technologies, Inc.
Inventors: Jean-Marc Valin, Mehmet Umut Isik, Neerad Dilip Phansalkar, Ritwik Giri, Karim Helwani, Arvindh Krishnaswamy
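A minimal sketch of the gain-modification step described in this abstract follows. The abstract only requires a monotonically increasing function; the particular warp used here, g * sin(pi/2 * g), is an illustrative assumption, as are the function name `post_filter_gains` and the convention that gains lie in [0, 1].

```python
import math


def post_filter_gains(gains):
    """Warp per-band gains with a monotonically increasing function.

    Assumption: gains are ratio-mask values in [0, 1]; the warp
    g * sin(pi/2 * g) is one possible choice, not the patented one.
    """
    modified = []
    for g in gains:
        g = min(max(g, 0.0), 1.0)  # clamp to the assumed valid range
        modified.append(g * math.sin(0.5 * math.pi * g))
    return modified
```

This warp maps 0 to 0 and 1 to 1 and pushes intermediate gains down, so bands the model already attenuates are attenuated further while the ordering of the gains is kept.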
-
Patent number: 10432932
Abstract: Systems and methods are described for removing ringing artifacts from a coded image. For each block of coded image data, a direction is identified by selecting a directional block divided into a plurality of pixel lines in one predetermined direction of a set of at least four predetermined directions. The directional block may have a constant value across each pixel line. A parameter related to a sum of a mean-square difference between a pixel value and the pixel average of pixels falling on a pixel line in the directional block that includes the location of the pixel may be calculated for each direction of the set. The direction of the set having a minimum summed mean-square difference is selected as the direction for the block based on the calculated parameter values. A non-linear filter is then applied to each block based on the identified direction for the block.
Type: Grant
Filed: July 1, 2016
Date of Patent: October 1, 2019
Assignee: Mozilla Corporation
Inventors: Jean-Marc Valin, Timothy B. Terriberry
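The direction search in this abstract can be sketched as follows. This is a simplified illustration under stated assumptions: the names `DIRECTIONS`, `direction_cost`, and `find_direction` are hypothetical, only four directions are used, and each direction is defined by a simple rule assigning a pixel at (row, column) to a line. The non-linear filter that follows the search is not shown.

```python
# Assumed set of four directions. Each function maps (row, col) to the
# index of the pixel line that the pixel falls on for that direction.
DIRECTIONS = {
    "horizontal": lambda r, c: r,        # lines run along rows
    "vertical": lambda r, c: c,          # lines run along columns
    "diagonal": lambda r, c: r + c,      # lines of constant r + c
    "anti_diagonal": lambda r, c: r - c  # lines of constant r - c
}


def direction_cost(block, line_of):
    """Sum of squared differences between each pixel and its line's mean."""
    lines = {}
    for r, row in enumerate(block):
        for c, value in enumerate(row):
            lines.setdefault(line_of(r, c), []).append(value)
    cost = 0.0
    for values in lines.values():
        mean = sum(values) / len(values)
        cost += sum((v - mean) ** 2 for v in values)
    return cost


def find_direction(block):
    """Return the direction with the smallest cost, plus all costs."""
    costs = {name: direction_cost(block, fn) for name, fn in DIRECTIONS.items()}
    return min(costs, key=costs.get), costs
```

A block whose columns are each constant gives a cost of zero for the "vertical" entry, since replacing every pixel by its line average reproduces the block exactly.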
-
Patent number: 9967594
Abstract: Systems and methods are described for predicting a mode value of a block of an image. Mode values are received for a predetermined number of neighboring blocks of the image. Each mode value may be within a predetermined range of integer values. Using a probability table, a probability of each mode value in the range of integers being selected may be computed. The probability table may have a first axis that includes every integer value within the range of integer values, and a second axis that includes combinations of the neighboring blocks, each combination of the neighboring blocks representing a binary determination whether or not each neighboring block is equal to a corresponding integer value on the first axis. Based on the computed probabilities, a mode value for the block may be selected and coded by an entropy coder.
Type: Grant
Filed: June 27, 2014
Date of Patent: May 8, 2018
Assignee: Mozilla Corporation
Inventors: Jean-Marc Valin, Timothy B. Terriberry
-
Patent number: 9560386
Abstract: An apparatus and corresponding method are provided for pyramid vector quantization of video data, including receiving the video data in the frequency domain; and pyramid vector quantizing at least one group of pictures (GOP) from the received video data in accordance with a potentially varying bandwidth, the GOP comprising key frames and predicted frames.
Type: Grant
Filed: February 21, 2014
Date of Patent: January 31, 2017
Assignee: Mozilla Corporation
Inventors: Jean-Marc Valin, Timothy B. Terriberry
-
Publication number: 20170013260
Abstract: Systems and methods are described for removing ringing artifacts from a coded image. For each block of coded image data, a direction is identified by selecting a directional block divided into a plurality of pixel lines in one predetermined direction of a set of at least four predetermined directions. The directional block may have a constant value across each pixel line. A parameter related to a sum of a mean-square difference between a pixel value and the pixel average of pixels falling on a pixel line in the directional block that includes the location of the pixel may be calculated for each direction of the set. The direction of the set having a minimum summed mean-square difference is selected as the direction for the block based on the calculated parameter values. A non-linear filter is then applied to each block based on the identified direction for the block.
Type: Application
Filed: July 1, 2016
Publication date: January 12, 2017
Inventors: Jean-Marc Valin, Timothy B. Terriberry
-
Patent number: 9425820
Abstract: Systems and methods are described for encoding quantized vector parameters in a bitstream. An exemplary method may include receiving a vector of integers used in a data compression codebook, the sum of the integers equaling a pulse sum. An initial expected magnitude may be determined for a first integer, the initial expected magnitude being based on the pulse sum, a distribution parameter, and a value corresponding to a number of integers in the vector. The actual magnitude of the first integer may be encoded based on the initial expected magnitude of the first integer. The pulse sum may be adjusted using the encoded actual magnitude. Also, the value corresponding to the number of integers in the vector may be reduced by one. Expected magnitudes for each of the remaining integers of the vector may then be calculated recursively.
Type: Grant
Filed: June 3, 2014
Date of Patent: August 23, 2016
Assignee: Mozilla Corporation
Inventors: Jean-Marc Valin, Timothy B. Terriberry
-
Patent number: 9015042
Abstract: Embodiments are described of a multi-block coding scheme for an audio signal to prevent partial collapse conditions from causing pre-echo compression artifacts. An audio codec includes a segmentation component partitioning the audio signal into a plurality of tiles, wherein each tile comprises data from a particular segment of time and a particular set of frequencies of the audio signal; a band energy component determining an energy value for each tile corresponding to a signal component in a respective tile; an encoder flag tracking component marking a tile as not collapsed or collapsed based on the energy value in that tile; and a decoder flag tracking component filling all tiles marked as collapsed with pseudorandom noise at an estimated energy level.
Type: Grant
Filed: March 7, 2012
Date of Patent: April 21, 2015
Inventors: Jean-Marc Valin, Timothy B. Terriberry
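The decoder-side step in this abstract, filling collapsed tiles with pseudorandom noise at an estimated energy level, can be sketched as below. The function name `fill_collapsed`, the representation of a tile as a list of coefficients, the uniform noise source, and the convention that the estimated energy is a sum of squares are all assumptions made for illustration.

```python
import math
import random


def fill_collapsed(tiles, collapsed, target_energy, seed=0):
    """Replace tiles flagged as collapsed with scaled pseudorandom noise.

    tiles: list of coefficient lists, one per time-frequency tile.
    collapsed: list of booleans, True where the tile is marked collapsed.
    target_energy: estimated energy (sum of squares) for each tile.
    """
    rng = random.Random(seed)  # fixed seed keeps the output reproducible
    filled = []
    for tile, is_collapsed, energy in zip(tiles, collapsed, target_energy):
        if not is_collapsed:
            filled.append(list(tile))  # leave decoded content untouched
            continue
        noise = [rng.uniform(-1.0, 1.0) for _ in tile]
        norm = math.sqrt(sum(v * v for v in noise)) or 1.0
        scale = math.sqrt(energy) / norm
        filled.append([v * scale for v in noise])
    return filled
```

Tiles that are not flagged pass through unchanged, so the noise only ever replaces content that was already lost in quantization.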
-
Patent number: 9009036
Abstract: Embodiments are generally directed to systems and methods for bit allocation and band partitioning for gain-shape vector quantization in an audio codec. An audio codec implements a method that uses an implicit, dynamic scheme to allow an encoder and decoder to recreate a series of bit allocation decisions for gain and shape without transmitting additional side information for each decision, based on the number of bits that are left remaining and available in a given packet. For implementation in practical codecs, the band comprising the allocation of bits for the shape is recursively split into equal partitions until the number of bits allocated to each partition is less than the maximum codebook size.
Type: Grant
Filed: March 7, 2012
Date of Patent: April 14, 2015
Inventors: Jean-Marc Valin, Timothy B. Terriberry
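The recursive splitting rule in the last sentence of this abstract can be sketched as follows. This is a simplified illustration: the function name `split_band` is hypothetical, the band is always split in two, and the bits are divided evenly between the halves, which is an assumption (the abstract does not say how bits are shared between partitions).

```python
def split_band(n_coeffs, bits, max_codebook_bits):
    """Recursively split a band until each partition fits the codebook.

    Returns a list of (partition_size, partition_bits) tuples. A partition
    is final once its bit budget is below max_codebook_bits, or once it
    cannot be split any further.
    """
    if bits < max_codebook_bits or n_coeffs < 2:
        return [(n_coeffs, bits)]
    half = n_coeffs // 2
    half_bits = bits // 2  # assumed even split of the bit budget
    return (split_band(half, half_bits, max_codebook_bits)
            + split_band(n_coeffs - half, bits - half_bits, max_codebook_bits))
```

Because the rule depends only on values that both sides already know (band size, bit budget, codebook limit), an encoder and a decoder running it arrive at the same partitions without any extra signalling.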
-
Patent number: 9008811
Abstract: Embodiments are described for a system and method for implementing an adaptive time-frequency resolution in audio and video coding systems. A method of adaptively transforming the time-frequency resolution for a defined spectrum comprises dividing the spectrum of the input signal into a plurality of bands; determining, for each band of the plurality of bands, a characteristic of the content (e.g., tonal or transient content); modifying the time-frequency resolution value of one or more bands of the plurality of bands to increase either a time resolution of the band or a frequency resolution of the band depending on the characteristic of the content; determining a cost associated with modifying the time-frequency resolution value of the one or more bands based on an entropy measure of the bands; and altering the modified time-frequency resolution values in a manner that accounts for the coding cost.
Type: Grant
Filed: September 16, 2011
Date of Patent: April 14, 2015
Inventors: Jean-Marc Valin, Timothy B. Terriberry
-
Publication number: 20150003523
Abstract: Systems and methods are described for predicting a mode value of a block of an image. Mode values are received for a predetermined number of neighboring blocks of the image. Each mode value may be within a predetermined range of integer values. Using a probability table, a probability of each mode value in the range of integers being selected may be computed. The probability table may have a first axis that includes every integer value within the range of integer values, and a second axis that includes combinations of the neighboring blocks, each combination of the neighboring blocks representing a binary determination whether or not each neighboring block is equal to a corresponding integer value on the first axis. Based on the computed probabilities, a mode value for the block may be selected and coded by an entropy coder.
Type: Application
Filed: June 27, 2014
Publication date: January 1, 2015
Inventors: Jean-Marc Valin, Timothy B. Terriberry
-
Publication number: 20140358978
Abstract: Systems and methods are described for encoding quantized vector parameters in a bitstream. An exemplary method may include receiving a vector of integers used in a data compression codebook, the sum of the integers equaling a pulse sum. An initial expected magnitude may be determined for a first integer, the initial expected magnitude being based on the pulse sum, a distribution parameter, and a value corresponding to a number of integers in the vector. The actual magnitude of the first integer may be encoded based on the initial expected magnitude of the first integer. The pulse sum may be adjusted using the encoded actual magnitude. Also, the value corresponding to the number of integers in the vector may be reduced by one. Expected magnitudes for each of the remaining integers of the vector may then be calculated recursively.
Type: Application
Filed: June 3, 2014
Publication date: December 4, 2014
Inventors: Jean-Marc Valin, Timothy B. Terriberry
-
Publication number: 20140286399
Abstract: An apparatus and corresponding method are provided for pyramid vector quantization of video data, including receiving the video data in the frequency domain; and pyramid vector quantizing at least one group of pictures (GOP) from the received video data in accordance with a potentially varying bandwidth, the GOP comprising key frames and predicted frames.
Type: Application
Filed: February 21, 2014
Publication date: September 25, 2014
Inventors: Jean-Marc Valin, Timothy B. Terriberry
-
Patent number: 8838442
Abstract: Embodiments are directed to an audio coding scheme implemented in a codec that eliminates birdie artifacts generated by transform coding methods. A frequency coefficient spreading method invertibly rotates a spectrum of coefficient values based on a defined rotation angle. The rotated spectrum is then quantized, and the rotation operation is then reversed so that a previously sparse spectrum (i.e., one with few non-zero values) becomes one that has many non-zero values. The method arranges the coefficients for a particular partition into a linear array and computes a gain factor for the partition. A rotation angle of between 0 and π/4 for successive pairs of coefficients of the linear array based on the gain factor is then derived. One or more rotation operations are then applied to successive pairs of coefficients in the linear array using a specific rotation angle and a stride length for each rotation operation.
Type: Grant
Filed: March 7, 2012
Date of Patent: September 16, 2014
Inventors: Timothy B. Terriberry, Jean-Marc Valin
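The pairwise rotation described in this abstract can be sketched as a sequence of two-dimensional rotations applied to coefficients a fixed stride apart. This is a simplified illustration, not the claimed procedure: the function name `rotate_pairs` is hypothetical, a single forward sweep is used, and the derivation of the angle from the gain factor is left out, so the angle is simply passed in.

```python
import math


def rotate_pairs(x, angle, stride=1, inverse=False):
    """Rotate successive coefficient pairs (i, i + stride) by a fixed angle.

    The forward pass sweeps left to right. The inverse pass applies the
    opposite rotations in reverse order, which undoes the forward pass.
    """
    c = math.cos(angle)
    s = -math.sin(angle) if inverse else math.sin(angle)
    y = list(x)
    indices = range(len(y) - stride)
    if inverse:
        indices = reversed(indices)
    for i in indices:
        a, b = y[i], y[i + stride]
        y[i] = c * a - s * b
        y[i + stride] = s * a + c * b
    return y
```

Each step is an orthogonal rotation, so the total energy of the array is preserved while a single non-zero coefficient is spread across its neighbours; calling the function again with `inverse=True` recovers the original array up to rounding.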
-
Patent number: 8396163
Abstract: Disclosed is a method of communicating a complex vector, using one or more index bits, the method comprising quantizing the complex vector using a codebook, the codebook comprising a plurality of complex vectors mapped from real vectors that lie on a reflected simplex where the simplex is of dimension one less than twice the length of the complex vector; indexing the quantized vector to form the one or more index bits; and transmitting the index bits to the transmitter, thereby communicating the complex vector.
Type: Grant
Filed: September 18, 2009
Date of Patent: March 12, 2013
Assignee: Commonwealth Scientific and Industrial Research Organization
Inventors: Iain Bruce Collings, Daniel Ryan, Jean-Marc Valin
-
Publication number: 20120232908
Abstract: Embodiments are described of a multi-block coding scheme for an audio signal to prevent partial collapse conditions from causing pre-echo compression artifacts. An audio codec includes a segmentation component partitioning the audio signal into a plurality of tiles, wherein each tile comprises data from a particular segment of time and a particular set of frequencies of the audio signal; a band energy component determining an energy value for each tile corresponding to a signal component in a respective tile; an encoder flag tracking component marking a tile as not collapsed or collapsed based on the energy value in that tile; and a decoder flag tracking component filling all tiles marked as collapsed with pseudorandom noise at an estimated energy level.
Type: Application
Filed: March 7, 2012
Publication date: September 13, 2012
Inventors: Timothy B. Terriberry, Jean-Marc Valin
-
Publication number: 20120232909
Abstract: Embodiments are directed to an audio coding scheme implemented in a codec that eliminates birdie artifacts generated by transform coding methods. A frequency coefficient spreading method invertibly rotates a spectrum of coefficient values based on a defined rotation angle. The rotated spectrum is then quantized, and the rotation operation is then reversed so that a previously sparse spectrum (i.e., one with few non-zero values) becomes one that has many non-zero values. The method arranges the coefficients for a particular partition into a linear array and computes a gain factor for the partition. A rotation angle of between 0 and π/4 for successive pairs of coefficients of the linear array based on the gain factor is then derived. One or more rotation operations are then applied to successive pairs of coefficients in the linear array using a specific rotation angle and a stride length for each rotation operation.
Type: Application
Filed: March 7, 2012
Publication date: September 13, 2012
Inventors: Timothy B. Terriberry, Jean-Marc Valin
-
Publication number: 20120232913
Abstract: Embodiments are generally directed to systems and methods for bit allocation and band partitioning for gain-shape vector quantization in an audio codec. An audio codec implements a method that uses an implicit, dynamic scheme to allow an encoder and decoder to recreate a series of bit allocation decisions for gain and shape without transmitting additional side information for each decision, based on the number of bits that are left remaining and available in a given packet. For implementation in practical codecs, the band comprising the allocation of bits for the shape is recursively split into equal partitions until the number of bits allocated to each partition is less than the maximum codebook size.
Type: Application
Filed: March 7, 2012
Publication date: September 13, 2012
Inventors: Timothy B. Terriberry, Jean-Marc Valin
-
Publication number: 20120069898
Abstract: Embodiments are described for a system and method for implementing an adaptive time-frequency resolution in audio and video coding systems. A method of adaptively transforming the time-frequency resolution for a defined spectrum comprises dividing the spectrum of the input signal into a plurality of bands; determining, for each band of the plurality of bands, a characteristic of the content (e.g., tonal or transient content); modifying the time-frequency resolution value of one or more bands of the plurality of bands to increase either a time resolution of the band or a frequency resolution of the band depending on the characteristic of the content; determining a cost associated with modifying the time-frequency resolution value of the one or more bands based on an entropy measure of the bands; and altering the modified time-frequency resolution values in a manner that accounts for the coding cost.
Type: Application
Filed: September 16, 2011
Publication date: March 22, 2012
Inventors: Jean-Marc Valin, Timothy B. Terriberry