Patents by Inventor Isaac Garcia Munoz

Isaac Garcia Munoz has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240129681
    Abstract: In general, various aspects of the techniques are directed to rescaling audio element for extended reality scene playback. A device comprising a memory and processing circuitry may be configured to perform the techniques. The memory may store an audio bitstream representative of an audio element in an extended reality scene. The processing circuitry may obtain a playback dimension associated with a physical space in which playback of the audio bitstream is to occur, and obtain a source dimension associated with a source space for the extended reality scene. The processing circuitry may modify, based on the playback dimension and the source dimension, a location of the audio element to obtain a modified location for the audio element, and render, based on the modified location for the audio element, the audio element to one or more speaker feeds. The processing circuitry may output the one or more speaker feeds.
    Type: Application
    Filed: October 12, 2022
    Publication date: April 18, 2024
    Inventors: Isaac Garcia Munoz, Alex Tung, Graham Bradley Davis, Andre Schevciw
  • Publication number: 20240114312
    Abstract: A device configured to process a bitstream may implement the techniques. The device comprises a memory configured to store the bitstream representative of at least one audio element in an extended reality scene, and audio descriptive information associated with the at least one audio element. The device also comprises processing circuitry coupled to the memory and configured to execute a scene manager and an audio unit. The scene manager is configured to construct, based on the at least one audio element, a scene graph that includes at least one node that represents the at least one audio element, and modify, based on the scene graph, the audio descriptive information to obtain modified audio descriptive information. The audio unit is configured to render, based on the modified audio descriptive information, the at least one audio element to one or more speaker feeds, and output the one or more speaker feeds.
    Type: Application
    Filed: September 15, 2023
    Publication date: April 4, 2024
    Inventors: Imed Bouazizi, Thomas Stockhammer, Isaac Garcia Munoz, Nikolai Konrad Leung, Andre Schevciw, Graham Bradley Davis
  • Patent number: 11937065
    Abstract: Systems and methods for determining parameter adjustments for a capture of audio are disclosed. The systems and methods includes processing circuitry configured to access at least one energy map that corresponds to one or more audio streams. The processing circuitry may then determine, from the at least one energy map, a parameter adjustment with respect to at least one audio element. The parameter adjustment may be configured to adjust the capture of audio by the at least one audio element. In addition, the process circuitry may be configured to output an indication indicating the parameter adjustment with respect to the at least one audio element.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: March 19, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters
  • Publication number: 20240031765
    Abstract: A device includes a processor configured to perform signal enhancement of an input audio signal to generate an enhanced mono audio signal. The processor is also configured to mix a first audio signal and a second audio signal to generate a stereo audio signal. The first audio signal is based on the enhanced mono audio signal.
    Type: Application
    Filed: July 25, 2022
    Publication date: January 25, 2024
    Inventors: Isaac Garcia MUNOZ, Shankar THAGADUR SHIVAPPA, Mason DAVIS, Alex TUNG, Dinesh RAMAKRISHNAN, Andre SCHEVCIW
  • Patent number: 11812252
    Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, select, based on the indication, at least one audio stream of the plurality of audio streams, and output, for a display and in response to obtaining the indication representing the desired listening position, a graphical user interface element suggesting an alternative listening position.
    Type: Grant
    Filed: August 2, 2022
    Date of Patent: November 7, 2023
    Assignee: QUALCOMM Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
  • Patent number: 11750998
    Abstract: Example devices, systems and methods for processing audio data are disclosed. An example device includes a memory configured to store one or more speaker feeds and one or more processors implemented in circuitry and communicatively coupled to the memory. The one or more processors are configured to determine whether a boundary separating an interior area from an exterior area exists, and based on the boundary existing, determine a transition distance value, the transition distance value being indicative of a size of a transition zone. The one or more processors are configured to obtain a listener location indicative of a virtual location of the device relative to the interior area and obtain, based at least in part on the boundary and the listener location, a current renderer. The one or more processors are configured to apply, to the audio data, the current renderer to obtain the one or more speaker feeds.
    Type: Grant
    Filed: September 8, 2021
    Date of Patent: September 5, 2023
    Assignee: Qualcomm Incorporated
    Inventor: Isaac Garcia Munoz
  • Patent number: 11743670
    Abstract: An example device includes a memory configured to store audio data and location data associated with a plurality of audio streams and one or more processors coupled to the memory. The one or more processors are configured to obtain a first location of a first audio stream that includes an audio source and obtain a second location of a second audio stream that includes the audio source. The one or more processors are configured to generate direction vectors originating at the first location and the second location, based on a location of the audio source and the first location, and the location of the audio source and the second location, respectively. The one or more processors are also configured to determine parameters that describe a vector field based on the first direction vector and the second direction vector.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: August 29, 2023
    Assignee: Qualcomm Incorporated
    Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
  • Patent number: 11646046
    Abstract: A device includes a memory configured to store directivity data of one or more audio sources corresponding to one or more input audio signals. The device also includes one or more processors configured to determine one or more equalizer settings based at least in part on the directivity data. The one or more processors are also configured to generate, based on the equalizer settings, one or more output audio signals that correspond to a psychoacoustic enhanced version of the one or more input audio signals.
    Type: Grant
    Filed: January 29, 2021
    Date of Patent: May 9, 2023
    Assignee: Qualcomm Incorporated
    Inventor: Isaac Garcia Munoz
  • Patent number: 11601776
    Abstract: An example device for processing one or more audio streams includes a memory configured to store the one or more audio streams and one or more processors implemented in circuitry coupled to the memory. The one or more processors are configured to determine a listener position. The one or more processors are also configured to determine one or more clusters of the one or more audio streams. The one or more processors are also configured to determine a rendering mode based on the listener position and the one or more clusters. The device also includes a renderer configured to render at least one of the one or more clusters of audio streams based on the rendering mode.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: March 7, 2023
    Assignee: Qualcomm Incorporated
    Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters, Isaac Garcia Munoz
  • Patent number: 11580213
    Abstract: A method and device for processing one or more audio streams based on password-based privacy restrictions is described. A device may be configured to receive unrestricted audio streams of the one or more audio streams based on privacy restrictions associated with a password, wherein the one or more audio streams are from audio elements represented in an acoustic environment that comprises one or more sub-acoustic spaces, each of the one or more audio streams representative of a respective soundfield, and generate the respective soundfields of the unrestricted audio streams.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: February 14, 2023
    Assignee: Qualcomm Incorporated
    Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
  • Patent number: 11558707
    Abstract: A device includes one or more processors configured to receive, via wireless transmission from a streaming device, encoded ambisonics audio data representing a sound field. The one or more processors are also configured to perform decoding of the ambisonics audio data to generate decoded ambisonics audio data. The decoding of the ambisonics audio data includes base layer decoding of a base layer of the encoded ambisonics audio data and selectively includes enhancement layer decoding in response to an amount of movement of the device. The one or more processors are further configured to adjust the decoded ambisonics audio data to alter the sound field based on data associated with at least one of a translation or an orientation associated with the movement of the device. The one or more processors are also configured to output the adjusted decoded ambisonics audio data to two or more loudspeakers for playback.
    Type: Grant
    Filed: June 28, 2021
    Date of Patent: January 17, 2023
    Assignee: Qualcomm Incorporated
    Inventors: Andre Schevciw, Vinay Melkote Krishnaprasad, Nils Gunther Peters, Isaac Garcia Munoz
  • Publication number: 20220377490
    Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, select, based on the indication, at least one audio stream of the plurality of audio streams, and output, for a display and in response to obtaining the indication representing the desired listening position, a graphical user interface element suggesting an alternative listening position.
    Type: Application
    Filed: August 2, 2022
    Publication date: November 24, 2022
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
  • Patent number: 11432097
    Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from a user via the user interface representing a desired listening position; and select, based on the indication, at least one audio stream of the plurality of audio streams.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: August 30, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
  • Patent number: 11429340
    Abstract: In some examples, a content consumer device configured to play one or more of a plurality of audio streams includes a memory configured to store the plurality of audio streams and audio location information associated with the plurality of audio streams and representative of audio stream coordinates in an acoustical space where an audio stream was captured or synthesized or both. Each of the audio streams is representative of a soundfield. The content consumer device also includes one or more processors coupled to the memory, and configured to determine device location information representative of device coordinates of the content consumer device in the acoustical space. The one or more processors are configured to select, based on the device location information and the audio location information, a subset of the plurality of audio streams, and output, based on the subset of the plurality of audio streams, one or more speaker feeds.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: August 30, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters
  • Patent number: 11425497
    Abstract: In an aspect, a lens is zoomed in to create a zoomed lens. Lens data associated with the lens includes a direction of the lens relative to an object in a field-of-view of the zoomed lens and a magnification of the object resulting from the zoomed lens. An array of microphones capture audio signals including audio produced by the object and interference produced by other objects. The audio signals are processed to identify a directional component associated with the audio produced by the object and three orthogonal components associated with the interference produced by the other objects. Stereo beamforming is used to increase a magnitude of the directional component (relative to the interference) while retaining a binaural nature of the audio signals. The increase in magnitude of the directional component is based on an amount of the magnification provided by the zoomed lens to the object.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: August 23, 2022
    Assignee: QUALCOMM Incorporated
    Inventors: S M Akramus Salehin, Lae-Hoon Kim, Vasudev Nayak, Shankar Thagadur Shivappa, Isaac Garcia Munoz, Sanghyun Chi, Erik Visser
  • Publication number: 20220246160
    Abstract: A device includes a memory configured to store directivity data of one or more audio sources corresponding to one or more input audio signals. The device also includes one or more processors configured to determine one or more equalizer settings based at least in part on the directivity data. The one or more processors are also configured to generate, based on the equalizer settings, one or more output audio signals that correspond to a psychoacoustic enhanced version of the one or more input audio signals.
    Type: Application
    Filed: January 29, 2021
    Publication date: August 4, 2022
    Inventor: Isaac Garcia MUNOZ
  • Publication number: 20220201419
    Abstract: An example device for processing one or more audio streams includes a memory configured to store the one or more audio streams and one or more processors implemented in circuitry coupled to the memory. The one or more processors are configured to determine a listener position. The one or more processors are also configured to determine one or more clusters of the one or more audio streams. The one or more processors are also configured to determine a rendering mode based on the listener position and the one or more clusters. The device also includes a renderer configured to render at least one of the one or more clusters of audio streams based on the rendering mode.
    Type: Application
    Filed: December 18, 2020
    Publication date: June 23, 2022
    Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters, Isaac Garcia Munoz
  • Publication number: 20220201418
    Abstract: An example device includes a memory configured to store audio data and location data associated with a plurality of audio streams and one or more processors coupled to the memory. The one or more processors are configured to obtain a first location of a first audio stream that includes an audio source and obtain a second location of a second audio stream that includes the audio source. The one or more processors are configured to generate direction vectors originating at the first location and the second location, based on a location of the audio source and the first location, and the location of the audio source and the second location, respectively. The one or more processors are also configured to determine parameters that describe a vector field based on the first direction vector and the second direction vector.
    Type: Application
    Filed: December 18, 2020
    Publication date: June 23, 2022
    Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
  • Publication number: 20220201395
    Abstract: In an aspect, a lens is zoomed in to create a zoomed lens. Lens data associated with the lens includes a direction of the lens relative to an object in a field-of-view of the zoomed lens and a magnification of the object resulting from the zoomed lens. An array of microphones capture audio signals including audio produced by the object and interference produced by other objects. The audio signals are processed to identify a directional component associated with the audio produced by the object and three orthogonal components associated with the interference produced by the other objects. Stereo beamforming is used to increase a magnitude of the directional component (relative to the interference) while retaining a binaural nature of the audio signals. The increase in magnitude of the directional component is based on an amount of the magnification provided by the zoomed lens to the object.
    Type: Application
    Filed: December 18, 2020
    Publication date: June 23, 2022
    Inventors: S M Akramus SALEHIN, Lae-Hoon KIM, Vasudev NAYAK, Shankar THAGADUR SHIVAPPA, Isaac Garcia MUNOZ, Sanghyun CHI, Erik VISSER
  • Patent number: 11354085
    Abstract: Example devices and methods are disclosed. An example device includes a memory configured to store a plurality of audio streams and an associated level of authorization for each of the plurality of audio streams. The device also includes one or more processors implemented in circuitry and communicatively coupled to the memory. The one or more processors are configured to select, based on the associated levels of authorization, a subset of the plurality of audio streams, the subset of the plurality of audio streams excluding at least one of the plurality of audio streams.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: June 7, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, S M Akramus Salehin, Nils Günther Peters