Patents by Inventor Juan Felix TORRES

Juan Felix TORRES has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11972767
    Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.
    Type: Grant
    Filed: July 31, 2020
    Date of Patent: April 30, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: David S. McGrath, Stefanie Brown, Juan Felix Torres
  • Patent number: 11937074
    Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.
    Type: Grant
    Filed: January 28, 2021
    Date of Patent: March 19, 2024
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Michael William Mason, Juan Felix Torres, Antonio Mateos Sole, Daniel Arteaga, Adam J. Mills, Mark David de Burgh, Andrew Robert Owen
  • Patent number: 11843930
    Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
    Type: Grant
    Filed: June 6, 2022
    Date of Patent: December 12, 2023
    Assignees: DOLBY LABORATORIES LICENSING CORPORATION, Dolby International AB
    Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
  • Publication number: 20230343346
    Abstract: Described is a method of frame-wise encoding metadata for an input signal, the metadata comprising a plurality of at least partially interrelated parameters calculable from the input signal. The method comprises, for each frame: iteratively performing, by using a looping process, steps of: determining a processing strategy from a plurality of processing strategies for calculating and quantizing the parameters; calculating and quantizing the parameters based on the determined processing strategy to obtain quantized parameters; and encoding the quantized parameters. In particular, each of the plurality of processing strategies comprises a respective first indication indicative of an ordering related to the calculation and quantization of individual parameters; and the processing strategy is determined based on at least one bitrate threshold.
    Type: Application
    Filed: June 10, 2021
    Publication date: October 26, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: David S. MCGRATH, Rishabh TYAGI, Stefanie BROWN, Juan Felix Torres
  • Patent number: 11594232
    Abstract: Systems, methods, and computer program products of audio processing based on Adaptive Intermediate Spatial Format (AISF) are described. The AISF is an extension to ISF that allows spatial resolution around an ISF ring to be adjusted dynamically with respect to content of incoming audio objects. An AISF encoder device adaptively warps each ISF ring during ISF encoding to adjust angular distance between objects, resulting in increase in uniformity of energy distribution around the ISF ring. At an AISF decoder device, matrices that decode sound positions to the output speaker take into account the warping that was performed at the AISF encoder device to reproduce the true positions of sound sources.
    Type: Grant
    Filed: November 11, 2020
    Date of Patent: February 28, 2023
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Juan Felix Torres, David S. McGrath, Michael William Mason
  • Publication number: 20220406318
    Abstract: Embodiments are disclosed for bitrate distribution in immersive voice and audio services. In an embodiment, a method of encoding an IVAS bitstream comprises: receiving an input audio signal; downmixing the input audio signal into one or more downmix channels and spatial metadata; reading a set of one or more bitrates for the downmix channels and a set of quantization levels for the spatial metadata from a bitrate distribution control table; determining a combination of the one or more bitrates for the downmix channels; determining a metadata quantization level from the set of metadata quantization levels using a bitrate distribution process; quantizing and coding the spatial metadata using the metadata quantization level; generating, using the combination of one or more bitrates, a downmix bitstream for the one or more downmix channels; combining the downmix bitstream, the quantized and coded spatial metadata and the set of quantization levels into the IVAS bitstream.
    Type: Application
    Filed: October 28, 2020
    Publication date: December 22, 2022
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Rishabh TYAGI, Juan Felix TORRES, Stefanie BROWN
  • Publication number: 20220386053
    Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
    Type: Application
    Filed: June 6, 2022
    Publication date: December 1, 2022
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, Dolby International AB
    Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
  • Publication number: 20220375482
    Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
    Type: Application
    Filed: August 8, 2022
    Publication date: November 24, 2022
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Stefan BRUHN, Michael ECKERT, Juan Felix TORRES, Stefanie BROWN, David S. MCGRATH
  • Publication number: 20220284910
    Abstract: Encoding/decoding an immersive voice and audio services (IVAS) bitstream comprises: encoding/decoding a coding mode indicator in a common header (CH) section of an IVAS bitstream, encoding/decoding a mode header or tool header in the tool header (TH) section of the bitstream, the TH section following the CH section, encoding/decoding a metadata payload in a metadata payload (MDP) section of the bitstream, the MDP section following the CH section, encoding/decoding an enhanced voice services (EVS) payload in an EVS payload (EP) section of the bitstream, the EP section following the CH section, and on the encoder side, storing or streaming the encoded bitstream, and on the decoder side, controlling an audio decoder based on the coding mode, the tool header, the EVS payload, and the metadata payload or storing a representation of same.
    Type: Application
    Filed: July 30, 2020
    Publication date: September 8, 2022
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Rishabh Tyagi, Juan Felix Torres
  • Publication number: 20220277757
    Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.
    Type: Application
    Filed: July 31, 2020
    Publication date: September 1, 2022
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: David S. MCGRATH, Stefanie BROWN, Juan Felix TORRES
  • Patent number: 11410666
    Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
    Type: Grant
    Filed: October 7, 2019
    Date of Patent: August 9, 2022
    Assignees: Dolby Laboratories Licensing Corporation
    Inventors: Stefan Bruhn, Michael Eckert, Juan Felix Torres, Stefanie Brown, David S. McGrath
  • Patent number: 11356787
    Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
    Type: Grant
    Filed: January 14, 2021
    Date of Patent: June 7, 2022
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
  • Publication number: 20220022000
    Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.
    Type: Application
    Filed: November 12, 2019
    Publication date: January 20, 2022
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian Lee
  • Publication number: 20210375297
    Abstract: The present document describes a method (500) for generating a bitstream (101), wherein the bitstream (101) comprises a sequence of superframes (400) for a sequence of frames of an immersive audio signal (111). The method (500) comprises, repeatedly for the sequence of superframes (400), inserting (501) coded audio data (206) for one or more frames of one or more downmix channel signals (203) derived from the immersive audio signal (111), into data fields (411, 421, 412, 422) of a superframe (400); and inserting (502) metadata (202, 205) for reconstructing one or more frames of the immersive audio signal (111) from the coded audio data (206), into a metadata field (403) of the superframe (400).
    Type: Application
    Filed: July 2, 2019
    Publication date: December 2, 2021
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Stefan BRUHN, Juan Felix TORRES
  • Patent number: 11184706
    Abstract: An apparatus and method of excursion protection of a loudspeaker. The method includes attenuating selected bands in a transform domain, controlled by a feedback signal resulting from an excursion transfer function that has been modified according to the real-time operational characteristics of the loudspeaker. In this manner, the system reduces the amount of wideband attenuation needed to address the predicted excursion, resulting in a better listening experience.
    Type: Grant
    Filed: May 14, 2019
    Date of Patent: November 23, 2021
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Brian George Arnott, Nicholas Luke Appleton, Juan Felix Torres, William Thomas Rowley, Ho Young Sung, Michael J. Smithers
  • Patent number: 11128978
    Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.
    Type: Grant
    Filed: November 18, 2016
    Date of Patent: September 21, 2021
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Michael William Mason, Juan Felix Torres, Antonio Mateos Sole, Daniel Arteaga, Adam J. Mills, Mark David deBurgh, Andrew Robert Owen
  • Publication number: 20210272574
    Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
    Type: Application
    Filed: October 7, 2019
    Publication date: September 2, 2021
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Stefan BRUHN, Michael ECKERT, Juan Felix TORRES, Stefanie BROWN, David S. MCGRATH
  • Publication number: 20210235215
    Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.
    Type: Application
    Filed: January 28, 2021
    Publication date: July 29, 2021
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Michael William Mason, Juan Felix Torres, Antonio Mateos Sole, Daniel Arteaga, Adam J. Mills, Mark David de Burgh, Andrew Robert Owen
  • Publication number: 20210219083
    Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
    Type: Application
    Filed: January 14, 2021
    Publication date: July 15, 2021
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, Dolby International AB
    Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
  • Publication number: 20210211802
    Abstract: An apparatus and method of excursion protection of a loudspeaker. The method includes attenuating selected bands in a transform domain, controlled by a feedback signal resulting from an excursion transfer function that has been modified according to the real-time operational characteristics of the loudspeaker. In this manner, the system reduces the amount of wideband attenuation needed to address the predicted excursion, resulting in a better listening experience.
    Type: Application
    Filed: May 14, 2019
    Publication date: July 8, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Brian George ARNOTT, Nicholas Luke APPLETON, Juan Felix TORRES, William Thomas ROWLEY, Ho Young SUNG, Michael J. SMITHERS