Patents by Inventor Nils Günther Peters

Nils Günther Peters has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12142285
    Abstract: In general, techniques are described for quantizing spatial components based on bit allocations determined for psychoacoustic audio coding. A device comprising a memory and one or more processors may perform the techniques. The memory may store a bitstream including an encoded foreground audio signal and a corresponding quantized spatial component. The one or more processors may perform psychoacoustic audio decoding with respect to the encoded foreground audio signal to obtain a foreground audio signal, and determine, when performing the psychoacoustic audio decoding, a first bit allocation for the encoded foreground audio signal. The one or more processors may also determine, based on the first bit allocation, a second bit allocation, and dequantize, based on the second bit allocation, the quantized spatial component to obtain a spatial component. The one or more processors may reconstruct, based on the foreground audio signal and the spatial component, scene-based audio data.
    Type: Grant
    Filed: June 22, 2020
    Date of Patent: November 12, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
  • Patent number: 12073842
    Abstract: In general, techniques are described for psychoacoustic audio coding of ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the bitstream that includes an encoded foreground audio signal and a corresponding spatial component that defines spatial characteristics of the encoded foreground audio signal. The encoded foreground audio signal may include a coded gain and a coded shape. The one or more processors may perform a gain and shape synthesis with respect to the coded gain and the coded shape to obtain a foreground audio signal, and reconstruct, based on the foreground audio signal and the spatial component, the ambisonic audio data.
    Type: Grant
    Filed: June 22, 2020
    Date of Patent: August 27, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
  • Publication number: 20240274141
    Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.
    Type: Application
    Filed: April 19, 2024
    Publication date: August 15, 2024
    Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Jason Filos
  • Patent number: 12047764
    Abstract: An example device includes a memory configured to store a plurality of representations of a soundfield, each representation of the soundfield comprising a different set of ambisonic coefficients representative of the same soundfield at concurrent periods of time. The device also includes a processor, coupled to the memory, and the processor is configured to perform audio playback based on a field of view and on a particular representation of the soundfield from the plurality of representations.
    Type: Grant
    Filed: August 30, 2019
    Date of Patent: July 23, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Nils Günther Peters, Dipanjan Sen, Thomas Stockhammer
  • Patent number: 11967329
    Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.
    Type: Grant
    Filed: February 19, 2021
    Date of Patent: April 23, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Jason Filos
  • Patent number: 11937065
    Abstract: Systems and methods for determining parameter adjustments for a capture of audio are disclosed. The systems and methods include processing circuitry configured to access at least one energy map that corresponds to one or more audio streams. The processing circuitry may then determine, from the at least one energy map, a parameter adjustment with respect to at least one audio element. The parameter adjustment may be configured to adjust the capture of audio by the at least one audio element. In addition, the processing circuitry may be configured to output an indication of the parameter adjustment with respect to the at least one audio element.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: March 19, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters
  • Patent number: 11843932
    Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.
    Type: Grant
    Filed: May 24, 2021
    Date of Patent: December 12, 2023
    Assignee: QUALCOMM Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
  • Patent number: 11812252
    Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, select, based on the indication, at least one audio stream of the plurality of audio streams, and output, for a display and in response to obtaining the indication representing the desired listening position, a graphical user interface element suggesting an alternative listening position.
    Type: Grant
    Filed: August 2, 2022
    Date of Patent: November 7, 2023
    Assignee: QUALCOMM Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
  • Patent number: 11798569
    Abstract: In general, techniques are described for obtaining audio rendering information from a bitstream. A method of rendering audio data includes receiving, at an interface of a device, an encoded audio bitstream, storing, to a memory of the device, encoded audio data of the encoded audio bitstream, parsing, by one or more processors of the device, a portion of the encoded audio data stored to the memory to select a renderer for the encoded audio data, the selected renderer comprising one of an object-based renderer or an ambisonic renderer, rendering, by the one or more processors of the device, the encoded audio data using the selected renderer to generate one or more rendered speaker feeds, and outputting, by one or more loudspeakers of the device, the one or more rendered speaker feeds.
    Type: Grant
    Filed: September 25, 2019
    Date of Patent: October 24, 2023
    Assignee: QUALCOMM Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters
  • Patent number: 11743670
    Abstract: An example device includes a memory configured to store audio data and location data associated with a plurality of audio streams and one or more processors coupled to the memory. The one or more processors are configured to obtain a first location of a first audio stream that includes an audio source and obtain a second location of a second audio stream that includes the audio source. The one or more processors are configured to generate a first direction vector originating at the first location and a second direction vector originating at the second location, based on a location of the audio source and the first location, and the location of the audio source and the second location, respectively. The one or more processors are also configured to determine parameters that describe a vector field based on the first direction vector and the second direction vector.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: August 29, 2023
    Assignee: Qualcomm Incorporated
    Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
  • Patent number: 11664035
    Abstract: A device configured to decode a bitstream, where the device includes a memory configured to store a temporally encoded representation of spatial audio signals. The device is also configured to receive the bitstream that includes an indication of a spatial transformation, and includes a temporal decoding unit, coupled to the memory, configured to decode one or more spatial audio signals represented in a spatial domain, where the one or more spatial audio signals are associated with different angles in the spatial domain. In addition, the device includes an inverse spatial transformation unit, coupled to the temporal decoding unit, that is configured to convert the one or more spatial audio signals represented in the spatial domain into at least three ambisonic coefficients that, in part, represent a soundfield in an ambisonics domain, and perform a spatial transformation of the soundfield based on the indication of the spatial transformation received in the bitstream.
    Type: Grant
    Filed: October 4, 2021
    Date of Patent: May 30, 2023
    Assignee: Qualcomm Incorporated
    Inventors: Nils Günther Peters, Moo Young Kim, Dipanjan Sen
  • Patent number: 11601776
    Abstract: An example device for processing one or more audio streams includes a memory configured to store the one or more audio streams and one or more processors implemented in circuitry coupled to the memory. The one or more processors are configured to determine a listener position. The one or more processors are also configured to determine one or more clusters of the one or more audio streams. The one or more processors are also configured to determine a rendering mode based on the listener position and the one or more clusters. The device also includes a renderer configured to render at least one of the one or more clusters of audio streams based on the rendering mode.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: March 7, 2023
    Assignee: Qualcomm Incorporated
    Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters, Isaac Garcia Munoz
  • Patent number: 11538489
    Abstract: In general, techniques are described by which to correlate scene-based audio data for psychoacoustic audio coding. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream including a plurality of encoded correlated components of a soundfield represented by scene-based audio data. The one or more processors may perform psychoacoustic audio decoding with respect to one or more of the plurality of encoded correlated components to obtain a plurality of correlated components, and obtain, from the bitstream, an indication representative of how the one or more of the plurality of correlated components were reordered in the bitstream. The one or more processors may reorder, based on the indication, the plurality of correlated components to obtain a plurality of reordered components, and reconstruct, based on the plurality of reordered components, the scene-based audio data.
    Type: Grant
    Filed: June 22, 2020
    Date of Patent: December 27, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
  • Publication number: 20220377490
    Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, select, based on the indication, at least one audio stream of the plurality of audio streams, and output, for a display and in response to obtaining the indication representing the desired listening position, a graphical user interface element suggesting an alternative listening position.
    Type: Application
    Filed: August 2, 2022
    Publication date: November 24, 2022
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
  • Patent number: 11432097
    Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, and select, based on the indication, at least one audio stream of the plurality of audio streams.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: August 30, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
  • Patent number: 11429340
    Abstract: In some examples, a content consumer device configured to play one or more of a plurality of audio streams includes a memory configured to store the plurality of audio streams and audio location information associated with the plurality of audio streams and representative of audio stream coordinates in an acoustical space where an audio stream was captured or synthesized or both. Each of the audio streams is representative of a soundfield. The content consumer device also includes one or more processors coupled to the memory, and configured to determine device location information representative of device coordinates of the content consumer device in the acoustical space. The one or more processors are configured to select, based on the device location information and the audio location information, a subset of the plurality of audio streams, and output, based on the subset of the plurality of audio streams, one or more speaker feeds.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: August 30, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters
  • Patent number: 11405699
    Abstract: An example device for accessing media data includes a memory configured to store media data; and one or more processors implemented in circuitry and configured to: receive a scene description of a GL Transmission Format 2.0 (glTF2) bitstream including a timed media object; determine a position of the timed media object in a presentation environment using the scene description; retrieve current timed media data for the timed media object for a current presentation time; and present the current timed media data according to the position of the timed media object at the current presentation time.
    Type: Grant
    Filed: September 30, 2020
    Date of Patent: August 2, 2022
    Assignee: QUALCOMM Incorporated
    Inventors: Imed Bouazizi, Thomas Stockhammer, Nils Günther Peters
  • Patent number: 11395083
    Abstract: In general, techniques are described by which to support scalable unified audio rendering. A device comprising an audio decoder, a memory, and a processor may be configured to perform various aspects of the techniques. The audio decoder may decode, from a bitstream, first audio data and second audio data. The memory may store the first audio data and the second audio data. The processor may render the first audio data into first spatial domain audio data for playback by virtual speakers at a set of virtual speaker locations, and render the second audio data into second spatial domain audio data for playback by the virtual speakers at the set of virtual speaker locations. The processor may also mix the first spatial domain audio data and the second spatial domain audio data to obtain mixed spatial domain audio data, and convert the mixed spatial domain audio data to scene-based audio data.
    Type: Grant
    Filed: January 31, 2019
    Date of Patent: July 19, 2022
    Assignee: QUALCOMM Incorporated
    Inventors: Andre Gustavo P. Schevciw, Nils Günther Peters
  • Publication number: 20220201419
    Abstract: An example device for processing one or more audio streams includes a memory configured to store the one or more audio streams and one or more processors implemented in circuitry coupled to the memory. The one or more processors are configured to determine a listener position. The one or more processors are also configured to determine one or more clusters of the one or more audio streams. The one or more processors are also configured to determine a rendering mode based on the listener position and the one or more clusters. The device also includes a renderer configured to render at least one of the one or more clusters of audio streams based on the rendering mode.
    Type: Application
    Filed: December 18, 2020
    Publication date: June 23, 2022
    Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters, Isaac Garcia Munoz
  • Publication number: 20220201418
    Abstract: An example device includes a memory configured to store audio data and location data associated with a plurality of audio streams and one or more processors coupled to the memory. The one or more processors are configured to obtain a first location of a first audio stream that includes an audio source and obtain a second location of a second audio stream that includes the audio source. The one or more processors are configured to generate a first direction vector originating at the first location and a second direction vector originating at the second location, based on a location of the audio source and the first location, and the location of the audio source and the second location, respectively. The one or more processors are also configured to determine parameters that describe a vector field based on the first direction vector and the second direction vector.
    Type: Application
    Filed: December 18, 2020
    Publication date: June 23, 2022
    Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz