Patents by Inventor Felix Torres

Felix Torres has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Bitrate distribution in immersive voice and audio services

Patent number: 12283281

Abstract: Embodiments are disclosed for bitrate distribution in immersive voice and audio services. In an embodiment, a method of encoding an IVAS bitstream comprises: receiving an input audio signal; downmixing the input audio signal into one or more downmix channels and spatial metadata; reading a set of one or more bitrates for the downmix channels and a set of quantization levels for the spatial metadata from a bitrate distribution control table; determining a combination of the one or more bitrates for the downmix channels; determining a metadata quantization level from the set of metadata quantization levels using a bitrate distribution process; quantizing and coding the spatial metadata using the metadata quantization level; generating, using the combination of one or more bitrates, a downmix bitstream for the one or more downmix channels; and combining the downmix bitstream, the quantized and coded spatial metadata and the set of quantization levels into the IVAS bitstream.

Type: Grant

Filed: October 28, 2020

Date of Patent: April 22, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Rishabh Tyagi, Juan Felix Torres, Stefanie Brown
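
The selection step described in this abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the table layout, the bitrate values, the per-level metadata costs, and the rule of picking the finest quantization level that fits the budget are hypothetical and are not taken from the patent.

```python
# Hypothetical control table: total bitrate (bits/s) -> per-channel downmix
# bitrates and candidate metadata quantization levels. Level 0 is the finest.
# All numbers are invented for illustration only.
CONTROL_TABLE = {
    32000: {"downmix_bps": [24400],
            "metadata_levels": [(0, 9000), (1, 7000), (2, 5000)]},
    64000: {"downmix_bps": [32000, 16400],
            "metadata_levels": [(0, 9000), (1, 7000), (2, 5000)]},
}


def distribute_bitrate(total_bps, table=CONTROL_TABLE):
    """Return (downmix bitrates, metadata level) that fit within total_bps."""
    entry = table[total_bps]
    downmix_total = sum(entry["downmix_bps"])
    # Assumed rule: try levels from finest to coarsest, keep the first that fits.
    for level, metadata_bps in entry["metadata_levels"]:
        if downmix_total + metadata_bps <= total_bps:
            return entry["downmix_bps"], level
    raise ValueError("no metadata quantization level fits the bitrate budget")
```

With this made-up table, `distribute_bitrate(32000)` falls back to level 1 because level 0 would exceed the 32000 bits/s budget.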

AUDIO PROCESSING IN IMMERSIVE AUDIO SERVICES

Publication number: 20250088816

Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.

Type: Application

Filed: November 26, 2024

Publication date: March 13, 2025

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian B. Lee

Audio processing in immersive audio services

Patent number: 12167219

Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.

Type: Grant

Filed: November 12, 2019

Date of Patent: December 10, 2024

Assignees: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian B. Lee

RENDERING OF IMMERSIVE AUDIO CONTENT

Publication number: 20240305952

Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.

Type: Application

Filed: March 15, 2024

Publication date: September 12, 2024

Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Michael William Mason, Juan Felix Torres, Antonio Mateos Sole, Daniel Arteaga, Adam J. Mills, Mark David de Burgh, Andrew Robert Owen
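
The divergence idea in this abstract (one object replaced by itself plus two evenly spaced companions, each with a weight) can be sketched in Python. This is a minimal illustration under stated assumptions, not the patented method: positions are reduced to a single azimuth angle, `spread_deg` is a hypothetical parameter, and the power-preserving sine/cosine weight law is an assumed choice.

```python
import math


def apply_divergence(azimuth_deg, divergence, spread_deg=30.0):
    """Split one object into three (left, center, right) with weights.

    divergence is in [0, 1]: 0 keeps all energy in the original object,
    1 moves all energy to the two additional objects.
    Returns a list of (azimuth_deg, gain) pairs.
    """
    if not 0.0 <= divergence <= 1.0:
        raise ValueError("divergence must be in [0, 1]")
    # Assumed weight law: total power (sum of squared gains) stays at 1.
    center_gain = math.cos(divergence * math.pi / 2)
    side_gain = math.sin(divergence * math.pi / 2) / math.sqrt(2)
    return [
        (azimuth_deg - spread_deg, side_gain),  # additional object, one side
        (azimuth_deg, center_gain),             # original object
        (azimuth_deg + spread_deg, side_gain),  # additional object, other side
    ]
```

Each returned pair would then be handed to whatever renderer produces the speaker feeds; that stage is outside this sketch.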

SYSTEMS AND METHODS FOR COVARIANCE SMOOTHING

Publication number: 20240265927

Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.

Type: Application

Filed: March 14, 2024

Publication date: August 8, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: David S. McGrath, Stefanie Brown, Juan Felix Torres
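
A minimal Python sketch of the smoothing described in this abstract follows. It assumes first-order recursive smoothing and a hypothetical rule in which bands with fewer bins receive a smaller forgetting factor (that is, heavier smoothing); the `target_bins` parameter and the exact formula are illustrative assumptions, not details from the application.

```python
def forgetting_factor(num_bins, target_bins=8):
    """Hypothetical rule: narrower bands (fewer bins) are smoothed more."""
    return min(1.0, num_bins / target_bins)


def smooth_covariance(previous, current, num_bins, transient=False,
                      target_bins=8):
    """Recursively smooth a covariance matrix given as nested lists.

    previous: smoothed matrix from the last frame, or None on the first frame.
    current: covariance estimate for this frame.
    transient: when True, the smoothing state is reset to the current estimate.
    """
    if previous is None or transient:
        return [row[:] for row in current]
    a = forgetting_factor(num_bins, target_bins)
    return [
        [(1.0 - a) * p + a * c for p, c in zip(prev_row, cur_row)]
        for prev_row, cur_row in zip(previous, current)
    ]
```

Calling `smooth_covariance` once per frame and per band, and feeding the result back in as `previous`, gives the running smoothed estimate.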

Methods and devices for generating or decoding a bitstream comprising immersive audio signals

Patent number: 12020718

Abstract: The present document describes a method (500) for generating a bitstream (101), wherein the bitstream (101) comprises a sequence of superframes (400) for a sequence of frames of an immersive audio signal (111). The method (500) comprises, repeatedly for the sequence of superframes (400), inserting (501) coded audio data (206) for one or more frames of one or more downmix channel signals (203) derived from the immersive audio signal (111), into data fields (411, 421, 412, 422) of a superframe (400); and inserting (502) metadata (202, 205) for reconstructing one or more frames of the immersive audio signal (111) from the coded audio data (206), into a metadata field (403) of the superframe (400).

Type: Grant

Filed: July 2, 2019

Date of Patent: June 25, 2024

Assignees: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Stefan Bruhn, Juan Felix Torres
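
The superframe structure in this abstract (coded audio frames placed in data fields, plus a metadata field) can be illustrated with a small Python sketch. The byte layout below is entirely hypothetical: a one-byte frame count, two-byte big-endian length prefixes, and the field order are assumptions chosen to keep the example short, not the format claimed in the patent.

```python
import struct


def pack_superframe(metadata, coded_frames):
    """Pack a metadata field and coded-audio data fields into one superframe.

    Hypothetical layout: [frame count: 1 byte][metadata length: 2 bytes]
    [metadata][frame length: 2 bytes][frame] ... repeated per frame.
    """
    out = struct.pack(">BH", len(coded_frames), len(metadata)) + metadata
    for frame in coded_frames:
        out += struct.pack(">H", len(frame)) + frame
    return out


def unpack_superframe(blob):
    """Inverse of pack_superframe: return (metadata, list of coded frames)."""
    count, metadata_len = struct.unpack_from(">BH", blob, 0)
    pos = struct.calcsize(">BH")
    metadata = blob[pos:pos + metadata_len]
    pos += metadata_len
    frames = []
    for _ in range(count):
        (frame_len,) = struct.unpack_from(">H", blob, pos)
        pos += 2
        frames.append(blob[pos:pos + frame_len])
        pos += frame_len
    return metadata, frames
```

A bitstream in this sketch would simply be a sequence of such superframes written one after another.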

Transforming audio signals captured in different formats into a reduced number of formats for simplifying encoding and decoding operations

Patent number: 12014745

Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of an audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported by an encoding unit of the audio device. Based on the determining, the simplification unit converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding unit.

Type: Grant

Filed: August 8, 2022

Date of Patent: June 18, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Stefan Bruhn, Michael Eckert, Juan Felix Torres, Stefanie Brown, David S. McGrath

ADAPTIVE PANNER OF AUDIO OBJECTS

Publication number: 20240179485

Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.

Type: Application

Filed: December 11, 2023

Publication date: May 30, 2024

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
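
The three stages named in this abstract (initial gains, speaker selection, non-negative gains for the selected set) can be sketched in Python as follows. This is an illustration under stated assumptions, not the claimed algorithm: positions are 2-D azimuths, the initial gain is a clipped cosine similarity, two speakers are selected, and the final gains come from a VBAP-style pairwise solve that is swapped in here as a stand-in for the optimization.

```python
import math


def _unit(deg):
    rad = math.radians(deg)
    return (math.cos(rad), math.sin(rad))


def pan(object_deg, speaker_degs):
    """Return one non-negative gain per speaker (needs at least two speakers)."""
    p = _unit(object_deg)
    speakers = [_unit(d) for d in speaker_degs]
    # Stage 1 (assumed): initial gain = cosine similarity, clipped at zero.
    initial = [max(0.0, p[0] * s[0] + p[1] * s[1]) for s in speakers]
    # Stage 2: keep the two speakers with the largest initial gains.
    order = sorted(range(len(speakers)), key=lambda k: initial[k], reverse=True)
    i, j = order[0], order[1]
    (ax, ay), (bx, by) = speakers[i], speakers[j]
    gains = [0.0] * len(speakers)
    # Stage 3 (stand-in): solve gi*a + gj*b = p by Cramer's rule, clip, normalize.
    det = ax * by - ay * bx
    if abs(det) < 1e-9:  # selected speakers are collinear; use the closest one
        gains[i] = 1.0
        return gains
    gi = max(0.0, (p[0] * by - p[1] * bx) / det)
    gj = max(0.0, (ax * p[1] - ay * p[0]) / det)
    norm = math.hypot(gi, gj)
    if norm == 0.0:
        gains[i] = 1.0
    else:
        gains[i], gains[j] = gi / norm, gj / norm
    return gains
```

For example, an object at 0 degrees between speakers at -30 and 30 degrees receives equal gains on those two speakers and zero on all others.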

Systems and methods for covariance smoothing

Patent number: 11972767

Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.

Type: Grant

Filed: July 31, 2020

Date of Patent: April 30, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: David S. McGrath, Stefanie Brown, Juan Felix Torres

Rendering of immersive audio content

Patent number: 11937074

Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.

Type: Grant

Filed: January 28, 2021

Date of Patent: March 19, 2024

Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Michael William Mason, Juan Felix Torres, Antonio Mateos Sole, Daniel Arteaga, Adam J. Mills, Mark David de Burgh, Andrew Robert Owen

Adaptive panner of audio objects

Patent number: 11843930

Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.

Type: Grant

Filed: June 6, 2022

Date of Patent: December 12, 2023

Assignees: DOLBY LABORATORIES LICENSING CORPORATION, Dolby International AB

Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga

QUANTIZATION AND ENTROPY CODING OF PARAMETERS FOR A LOW LATENCY AUDIO CODEC

Publication number: 20230343346

Abstract: Described is a method of frame-wise encoding metadata for an input signal, the metadata comprising a plurality of at least partially interrelated parameters calculable from the input signal. The method comprises, for each frame: iteratively performing, by using a looping process, steps of: determining a processing strategy from a plurality of processing strategies for calculating and quantizing the parameters; calculating and quantizing the parameters based on the determined processing strategy to obtain quantized parameters; and encoding the quantized parameters. In particular, each of the plurality of processing strategies comprises a respective first indication indicative of an ordering related to the calculation and quantization of individual parameters; and the processing strategy is determined based on at least one bitrate threshold.

Type: Application

Filed: June 10, 2021

Publication date: October 26, 2023

Applicant: Dolby Laboratories Licensing Corporation

Inventors: David S. McGrath, Rishabh Tyagi, Stefanie Brown, Juan Felix Torres

Audio processing in adaptive intermediate spatial format

Patent number: 11594232

Abstract: Systems, methods, and computer program products of audio processing based on Adaptive Intermediate Spatial Format (AISF) are described. The AISF is an extension to ISF that allows spatial resolution around an ISF ring to be adjusted dynamically with respect to content of incoming audio objects. An AISF encoder device adaptively warps each ISF ring during ISF encoding to adjust angular distance between objects, resulting in increase in uniformity of energy distribution around the ISF ring. At an AISF decoder device, matrices that decode sound positions to the output speaker take into account the warping that was performed at the AISF encoder device to reproduce the true positions of sound sources.

Type: Grant

Filed: November 11, 2020

Date of Patent: February 28, 2023

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Juan Felix Torres, David S. McGrath, Michael William Mason

ADAPTIVE PANNER OF AUDIO OBJECTS

Publication number: 20220386053

Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.

Type: Application

Filed: June 6, 2022

Publication date: December 1, 2022

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, Dolby International AB

Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga

ENCODING AND DECODING IVAS BITSTREAMS

Publication number: 20220284910

Abstract: Encoding/decoding an immersive voice and audio services (IVAS) bitstream comprises: encoding/decoding a coding mode indicator in a common header (CH) section of an IVAS bitstream, encoding/decoding a mode header or tool header in the tool header (TH) section of the bitstream, the TH section following the CH section, encoding/decoding a metadata payload in a metadata payload (MDP) section of the bitstream, the MDP section following the CH section, encoding/decoding an enhanced voice services (EVS) payload in an EVS payload (EP) section of the bitstream, the EP section following the CH section, and on the encoder side, storing or streaming the encoded bitstream, and on the decoder side, controlling an audio decoder based on the coding mode, the tool header, the EVS payload, and the metadata payload or storing a representation of same.

Type: Application

Filed: July 30, 2020

Publication date: September 8, 2022

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Rishabh Tyagi, Juan Felix Torres

Transforming audio signals captured in different formats into a reduced number of formats for simplifying encoding and decoding operations

Patent number: 11410666

Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of an audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported by an encoding unit of the audio device. Based on the determining, the simplification unit converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding unit.

Type: Grant

Filed: October 7, 2019

Date of Patent: August 9, 2022

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Stefan Bruhn, Michael Eckert, Juan Felix Torres, Stefanie Brown, David S. McGrath

Adaptive panner of audio objects

Patent number: 11356787

Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.

Type: Grant

Filed: January 14, 2021

Date of Patent: June 7, 2022

Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga

AUDIO PROCESSING IN IMMERSIVE AUDIO SERVICES

Publication number: 20220022000

Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.

Type: Application

Filed: November 12, 2019

Publication date: January 20, 2022

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian Lee

Loudspeaker excursion protection

Patent number: 11184706

Abstract: An apparatus and method of excursion protection of a loudspeaker. The method includes attenuating selected bands in a transform domain, controlled by a feedback signal resulting from an excursion transfer function that has been modified according to the real-time operational characteristics of the loudspeaker. In this manner, the system reduces the amount of wideband attenuation needed to address the predicted excursion, resulting in a better listening experience.

Type: Grant

Filed: May 14, 2019

Date of Patent: November 23, 2021

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Brian George Arnott, Nicholas Luke Appleton, Juan Felix Torres, William Thomas Rowley, Ho Young Sung, Michael J. Smithers

Rendering of immersive audio content

Patent number: 11128978

Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.

Type: Grant

Filed: November 18, 2016

Date of Patent: September 21, 2021

Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Michael William Mason, Juan Felix Torres, Antonio Mateos Sole, Daniel Arteaga, Adam J. Mills, Mark David de Burgh, Andrew Robert Owen
