Patents by Inventor Felix Torres
Felix Torres has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11972767
Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.
Type: Grant
Filed: July 31, 2020
Date of Patent: April 30, 2024
Assignee: Dolby Laboratories Licensing Corporation
Inventors: David S. McGrath, Stefanie Brown, Juan Felix Torres
-
Patent number: 11937074
Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.
Type: Grant
Filed: January 28, 2021
Date of Patent: March 19, 2024
Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Michael William Mason, Juan Felix Torres, Antonio Mateos Sole, Daniel Arteaga, Adam J. Mills, Mark David de Burgh, Andrew Robert Owen
-
Patent number: 11843930
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
Type: Grant
Filed: June 6, 2022
Date of Patent: December 12, 2023
Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
-
Publication number: 20230343346
Abstract: Described is a method of frame-wise encoding metadata for an input signal, the metadata comprising a plurality of at least partially interrelated parameters calculable from the input signal. The method comprises, for each frame: iteratively performing, by using a looping process, steps of: determining a processing strategy from a plurality of processing strategies for calculating and quantizing the parameters; calculating and quantizing the parameters based on the determined processing strategy to obtain quantized parameters; and encoding the quantized parameters. In particular, each of the plurality of processing strategies comprises a respective first indication indicative of an ordering related to the calculation and quantization of individual parameters; and the processing strategy is determined based on at least one bitrate threshold.
Type: Application
Filed: June 10, 2021
Publication date: October 26, 2023
Applicant: Dolby Laboratories Licensing Corporation
Inventors: David S. McGrath, Rishabh Tyagi, Stefanie Brown, Juan Felix Torres
-
Patent number: 11594232
Abstract: Systems, methods, and computer program products of audio processing based on Adaptive Intermediate Spatial Format (AISF) are described. The AISF is an extension to ISF that allows spatial resolution around an ISF ring to be adjusted dynamically with respect to content of incoming audio objects. An AISF encoder device adaptively warps each ISF ring during ISF encoding to adjust angular distance between objects, resulting in increase in uniformity of energy distribution around the ISF ring. At an AISF decoder device, matrices that decode sound positions to the output speaker take into account the warping that was performed at the AISF encoder device to reproduce the true positions of sound sources.
Type: Grant
Filed: November 11, 2020
Date of Patent: February 28, 2023
Assignee: Dolby Laboratories Licensing Corporation
Inventors: Juan Felix Torres, David S. McGrath, Michael William Mason
-
Publication number: 20220406318
Abstract: Embodiments are disclosed for bitrate distribution in immersive voice and audio services. In an embodiment, a method of encoding an IVAS bitstream comprises: receiving an input audio signal; downmixing the input audio signal into one or more downmix channels and spatial metadata; reading a set of one or more bitrates for the downmix channels and a set of quantization levels for the spatial metadata from a bitrate distribution control table; determining a combination of the one or more bitrates for the downmix channels; determining a metadata quantization level from the set of metadata quantization levels using a bitrate distribution process; quantizing and coding the spatial metadata using the metadata quantization level; generating, using the combination of one or more bitrates, a downmix bitstream for the one or more downmix channels; combining the downmix bitstream, the quantized and coded spatial metadata and the set of quantization levels into the IVAS bitstream.
Type: Application
Filed: October 28, 2020
Publication date: December 22, 2022
Applicant: Dolby Laboratories Licensing Corporation
Inventors: Rishabh Tyagi, Juan Felix Torres, Stefanie Brown
-
Publication number: 20220386053
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
Type: Application
Filed: June 6, 2022
Publication date: December 1, 2022
Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
-
Publication number: 20220375482
Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
Type: Application
Filed: August 8, 2022
Publication date: November 24, 2022
Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Stefan Bruhn, Michael Eckert, Juan Felix Torres, Stefanie Brown, David S. McGrath
-
Publication number: 20220284910
Abstract: Encoding/decoding an immersive voice and audio services (IVAS) bitstream comprises: encoding/decoding a coding mode indicator in a common header (CH) section of an IVAS bitstream, encoding/decoding a mode header or tool header in the tool header (TH) section of the bitstream, the TH section following the CH section, encoding/decoding a metadata payload in a metadata payload (MDP) section of the bitstream, the MDP section following the CH section, encoding/decoding an enhanced voice services (EVS) payload in an EVS payload (EP) section of the bitstream, the EP section following the CH section, and on the encoder side, storing or streaming the encoded bitstream, and on the decoder side, controlling an audio decoder based on the coding mode, the tool header, the EVS payload, and the metadata payload or storing a representation of same.
Type: Application
Filed: July 30, 2020
Publication date: September 8, 2022
Applicant: Dolby Laboratories Licensing Corporation
Inventors: Rishabh Tyagi, Juan Felix Torres
-
Publication number: 20220277757
Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.
Type: Application
Filed: July 31, 2020
Publication date: September 1, 2022
Applicant: Dolby Laboratories Licensing Corporation
Inventors: David S. McGrath, Stefanie Brown, Juan Felix Torres
-
Patent number: 11410666
Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
Type: Grant
Filed: October 7, 2019
Date of Patent: August 9, 2022
Assignees: Dolby Laboratories Licensing Corporation
Inventors: Stefan Bruhn, Michael Eckert, Juan Felix Torres, Stefanie Brown, David S. McGrath
-
Patent number: 11356787
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
Type: Grant
Filed: January 14, 2021
Date of Patent: June 7, 2022
Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
-
Publication number: 20220022000
Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.
Type: Application
Filed: November 12, 2019
Publication date: January 20, 2022
Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian Lee
-
Publication number: 20210375297
Abstract: The present document describes a method (500) for generating a bitstream (101), wherein the bitstream (101) comprises a sequence of superframes (400) for a sequence of frames of an immersive audio signal (111). The method (500) comprises, repeatedly for the sequence of superframes (400), inserting (501) coded audio data (206) for one or more frames of one or more downmix channel signals (203) derived from the immersive audio signal (111), into data fields (411, 421, 412, 422) of a superframe (400); and inserting (502) metadata (202, 205) for reconstructing one or more frames of the immersive audio signal (111) from the coded audio data (206), into a metadata field (403) of the superframe (400).
Type: Application
Filed: July 2, 2019
Publication date: December 2, 2021
Applicants: Dolby International AB, Dolby Laboratories Licensing Corporation
Inventors: Stefan Bruhn, Juan Felix Torres
-
Patent number: 11184706
Abstract: An apparatus and method of excursion protection of a loudspeaker. The method includes attenuating selected bands in a transform domain, controlled by a feedback signal resulting from an excursion transfer function that has been modified according to the real-time operational characteristics of the loudspeaker. In this manner, the system reduces the amount of wideband attenuation needed to address the predicted excursion, resulting in a better listening experience.
Type: Grant
Filed: May 14, 2019
Date of Patent: November 23, 2021
Assignee: Dolby Laboratories Licensing Corporation
Inventors: Brian George Arnott, Nicholas Luke Appleton, Juan Felix Torres, William Thomas Rowley, Ho Young Sung, Michael J. Smithers
-
Patent number: 11128978
Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.
Type: Grant
Filed: November 18, 2016
Date of Patent: September 21, 2021
Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Michael William Mason, Juan Felix Torres, Antonio Mateos Sole, Daniel Arteaga, Adam J. Mills, Mark David deBurgh, Andrew Robert Owen
-
Publication number: 20210272574
Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
Type: Application
Filed: October 7, 2019
Publication date: September 2, 2021
Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Stefan Bruhn, Michael Eckert, Juan Felix Torres, Stefanie Brown, David S. McGrath
-
Publication number: 20210235215
Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.
Type: Application
Filed: January 28, 2021
Publication date: July 29, 2021
Applicants: Dolby International AB, Dolby Laboratories Licensing Corporation
Inventors: Michael William Mason, Juan Felix Torres, Antonio Mateos Sole, Daniel Arteaga, Adam J. Mills, Mark David de Burgh, Andrew Robert Owen
-
Publication number: 20210219083
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
Type: Application
Filed: January 14, 2021
Publication date: July 15, 2021
Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
-
Publication number: 20210211802
Abstract: An apparatus and method of excursion protection of a loudspeaker. The method includes attenuating selected bands in a transform domain, controlled by a feedback signal resulting from an excursion transfer function that has been modified according to the real-time operational characteristics of the loudspeaker. In this manner, the system reduces the amount of wideband attenuation needed to address the predicted excursion, resulting in a better listening experience.
Type: Application
Filed: May 14, 2019
Publication date: July 8, 2021
Applicant: Dolby Laboratories Licensing Corporation
Inventors: Brian George Arnott, Nicholas Luke Appleton, Juan Felix Torres, William Thomas Rowley, Ho Young Sung, Michael J. Smithers