Patents by Inventor Nils Günther Peters

Nils Günther Peters has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding

Patent number: 12142285

Abstract: In general, techniques are described for quantizing spatial components based on bit allocations determined for psychoacoustic audio coding. A device comprising a memory and one or more processors may perform the techniques. The memory may store a bitstream including an encoded foreground audio signal and a corresponding quantized spatial component. The one or more processors may perform psychoacoustic audio decoding with respect to the encoded foreground audio signal to obtain a foreground audio signal, and determine, when performing the psychoacoustic audio decoding, a first bit allocation for the encoded foreground audio signal. The one or more processors may also determine, based on the first bit allocation, a second bit allocation, and dequantize, based on the second bit allocation, the quantized spatial component to obtain a spatial component. The one or more processors may reconstruct, based on the foreground audio signal and the spatial component, scene-based audio data.

Type: Grant

Filed: June 22, 2020

Date of Patent: November 12, 2024

Assignee: QUALCOMM Incorporated

Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters

Psychoacoustic audio coding of ambisonic audio data

Patent number: 12073842

Abstract: In general, techniques are described for psychoacoustic audio coding of ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream that includes an encoded foreground audio signal and a corresponding spatial component that defines spatial characteristics of the encoded foreground audio signal. The encoded foreground audio signal may include a coded gain and a coded shape. The one or more processors may perform a gain and shape synthesis with respect to the coded gain and the coded shape to obtain a foreground audio signal, and reconstruct, based on the foreground audio signal and the spatial component, the ambisonic audio data.

Type: Grant

Filed: June 22, 2020

Date of Patent: August 27, 2024

Assignee: QUALCOMM Incorporated

Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters

SIGNALING FOR RENDERING TOOLS

Publication number: 20240274141

Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.

Type: Application

Filed: April 19, 2024

Publication date: August 15, 2024

Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Jason Filos

Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems

Patent number: 12047764

Abstract: An example device includes a memory configured to store a plurality of representations of a soundfield, each representation of the soundfield comprising a different set of ambisonic coefficients representative of the same soundfield at concurrent periods of time. The device also includes a processor, coupled to the memory, and the processor is configured to perform audio playback based on a field of view and on a particular representation of the soundfield from the plurality of representations.

Type: Grant

Filed: August 30, 2019

Date of Patent: July 23, 2024

Assignee: QUALCOMM Incorporated

Inventors: Nils Günther Peters, Dipanjan Sen, Thomas Stockhammer

Signaling for rendering tools

Patent number: 11967329

Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.

Type: Grant

Filed: February 19, 2021

Date of Patent: April 23, 2024

Assignee: QUALCOMM Incorporated

Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Jason Filos

Adjustment of parameter settings for extended reality experiences

Patent number: 11937065

Abstract: Systems and methods for determining parameter adjustments for a capture of audio are disclosed. The systems and methods include processing circuitry configured to access at least one energy map that corresponds to one or more audio streams. The processing circuitry may then determine, from the at least one energy map, a parameter adjustment with respect to at least one audio element. The parameter adjustment may be configured to adjust the capture of audio by the at least one audio element. In addition, the processing circuitry may be configured to output an indication of the parameter adjustment with respect to the at least one audio element.

Type: Grant

Filed: July 1, 2020

Date of Patent: March 19, 2024

Assignee: QUALCOMM Incorporated

Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters

Six degrees of freedom and three degrees of freedom backward compatibility

Patent number: 11843932

Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.

Type: Grant

Filed: May 24, 2021

Date of Patent: December 12, 2023

Assignee: QUALCOMM Incorporated

Inventors: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen

User interface feedback for controlling audio rendering for extended reality experiences

Patent number: 11812252

Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, select, based on the indication, at least one audio stream of the plurality of audio streams, and output, for a display and in response to obtaining the indication representing the desired listening position, a graphical user interface element suggesting an alternative listening position.

Type: Grant

Filed: August 2, 2022

Date of Patent: November 7, 2023

Assignee: QUALCOMM Incorporated

Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin

Flexible rendering of audio data

Patent number: 11798569

Abstract: In general, techniques are described for obtaining audio rendering information from a bitstream. A method of rendering audio data includes receiving, at an interface of a device, an encoded audio bitstream, storing, to a memory of the device, encoded audio data of the encoded audio bitstream, parsing, by one or more processors of the device, a portion of the encoded audio data stored to the memory to select a renderer for the encoded audio data, the selected renderer comprising one of an object-based renderer or an ambisonic renderer, rendering, by the one or more processors of the device, the encoded audio data using the selected renderer to generate one or more rendered speaker feeds, and outputting, by one or more loudspeakers of the device, the one or more rendered speaker feeds.

Type: Grant

Filed: September 25, 2019

Date of Patent: October 24, 2023

Assignee: QUALCOMM Incorporated

Inventors: Moo Young Kim, Nils Günther Peters

Correlation-based rendering with multiple distributed streams accounting for an occlusion for six degree of freedom applications

Patent number: 11743670

Abstract: An example device includes a memory configured to store audio data and location data associated with a plurality of audio streams and one or more processors coupled to the memory. The one or more processors are configured to obtain a first location of a first audio stream that includes an audio source and obtain a second location of a second audio stream that includes the audio source. The one or more processors are configured to generate direction vectors originating at the first location and the second location, based on a location of the audio source and the first location, and the location of the audio source and the second location, respectively. The one or more processors are also configured to determine parameters that describe a vector field based on the first direction vector and the second direction vector.

Type: Grant

Filed: December 18, 2020

Date of Patent: August 29, 2023

Assignee: Qualcomm Incorporated

Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz

Spatial transformation of ambisonic audio data

Patent number: 11664035

Abstract: A device is configured to decode a bitstream, where the device includes a memory configured to store a temporally encoded representation of spatial audio signals. The device is also configured to receive the bitstream that includes an indication of a spatial transformation, and includes a temporal decoding unit, coupled to the memory, configured to decode one or more spatial audio signals represented in a spatial domain, where the one or more spatial audio signals are associated with different angles in the spatial domain. In addition, the device includes an inverse spatial transformation unit, coupled to the temporal decoding unit, that is configured to convert the one or more spatial audio signals represented in the spatial domain into at least three ambisonic coefficients that, in part, represent a soundfield in an ambisonics domain, and perform a spatial transformation of the soundfield based on the indication of the spatial transformation received in the bitstream.

Type: Grant

Filed: October 4, 2021

Date of Patent: May 30, 2023

Assignee: Qualcomm Incorporated

Inventors: Nils Günther Peters, Moo Young Kim, Dipanjan Sen

Smart hybrid rendering for augmented reality/virtual reality audio

Patent number: 11601776

Abstract: An example device for processing one or more audio streams includes a memory configured to store the one or more audio streams and one or more processors implemented in circuitry coupled to the memory. The one or more processors are configured to determine a listener position. The one or more processors are also configured to determine one or more clusters of the one or more audio streams. The one or more processors are also configured to determine a rendering mode based on the listener position and the one or more clusters. The device also includes a renderer configured to render at least one of the one or more clusters of audio streams based on the rendering mode.

Type: Grant

Filed: December 18, 2020

Date of Patent: March 7, 2023

Assignee: Qualcomm Incorporated

Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters, Isaac Garcia Munoz

Correlating scene-based audio data for psychoacoustic audio coding

Patent number: 11538489

Abstract: In general, techniques are described by which to correlate scene-based audio data for psychoacoustic audio coding. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream including a plurality of encoded correlated components of a soundfield represented by scene-based audio data. The one or more processors may perform psychoacoustic audio decoding with respect to one or more of the plurality of encoded correlated components to obtain a plurality of correlated components, and obtain, from the bitstream, an indication representative of how the one or more of the plurality of correlated components were reordered in the bitstream. The one or more processors may reorder, based on the indication, the plurality of correlated components to obtain a plurality of reordered components, and reconstruct, based on the plurality of reordered components, the scene-based audio data.

Type: Grant

Filed: June 22, 2020

Date of Patent: December 27, 2022

Assignee: Qualcomm Incorporated

Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters

USER INTERFACE FEEDBACK FOR CONTROLLING AUDIO RENDERING FOR EXTENDED REALITY EXPERIENCES

Publication number: 20220377490

Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, select, based on the indication, at least one audio stream of the plurality of audio streams, and output, for a display and in response to obtaining the indication representing the desired listening position, a graphical user interface element suggesting an alternative listening position.

Type: Application

Filed: August 2, 2022

Publication date: November 24, 2022

Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin

User interface for controlling audio rendering for extended reality experiences

Patent number: 11432097

Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, and select, based on the indication, at least one audio stream of the plurality of audio streams.

Type: Grant

Filed: July 1, 2020

Date of Patent: August 30, 2022

Assignee: Qualcomm Incorporated

Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin

Audio capture and rendering for extended reality experiences

Patent number: 11429340

Abstract: In some examples, a content consumer device configured to play one or more of a plurality of audio streams includes a memory configured to store the plurality of audio streams and audio location information associated with the plurality of audio streams and representative of audio stream coordinates in an acoustical space where an audio stream was captured or synthesized or both. Each of the audio streams is representative of a soundfield. The content consumer device also includes one or more processors coupled to the memory, and configured to determine device location information representative of device coordinates of the content consumer device in the acoustical space. The one or more processors are configured to select, based on the device location information and the audio location information, a subset of the plurality of audio streams, and output, based on the subset of the plurality of audio streams, one or more speaker feeds.

Type: Grant

Filed: July 1, 2020

Date of Patent: August 30, 2022

Assignee: Qualcomm Incorporated

Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters

Using GLTF2 extensions to support video and audio data

Patent number: 11405699

Abstract: An example device for accessing media data includes a memory configured to store media data; and one or more processors implemented in circuitry and configured to: receive a scene description of a GL Transmission Format 2.0 (glTF2) bitstream including a timed media object; determine a position of the timed media object in a presentation environment using the scene description; retrieve current timed media data for the timed media object for a current presentation time; and present the current timed media data according to the position of the timed media object at the current presentation time.

Type: Grant

Filed: September 30, 2020

Date of Patent: August 2, 2022

Assignee: QUALCOMM Incorporated

Inventors: Imed Bouazizi, Thomas Stockhammer, Nils Günther Peters

Scalable unified audio renderer

Patent number: 11395083

Abstract: In general, techniques are described by which to support scalable unified audio rendering. A device comprising an audio decoder, a memory, and a processor may be configured to perform various aspects of the techniques. The audio decoder may decode, from a bitstream, first audio data and second audio data. The memory may store the first audio data and the second audio data. The processor may render the first audio data into first spatial domain audio data for playback by virtual speakers at a set of virtual speaker locations, and render the second audio data into second spatial domain audio data for playback by the virtual speakers at the set of virtual speaker locations. The processor may also mix the first spatial domain audio data and the second spatial domain audio data to obtain mixed spatial domain audio data, and convert the mixed spatial domain audio data to scene-based audio data.

Type: Grant

Filed: January 31, 2019

Date of Patent: July 19, 2022

Assignee: QUALCOMM Incorporated

Inventors: Andre Gustavo P. Schevciw, Nils Günther Peters

SMART HYBRID RENDERING FOR AUGMENTED REALITY/VIRTUAL REALITY AUDIO

Publication number: 20220201419

Abstract: An example device for processing one or more audio streams includes a memory configured to store the one or more audio streams and one or more processors implemented in circuitry coupled to the memory. The one or more processors are configured to determine a listener position. The one or more processors are also configured to determine one or more clusters of the one or more audio streams. The one or more processors are also configured to determine a rendering mode based on the listener position and the one or more clusters. The device also includes a renderer configured to render at least one of the one or more clusters of audio streams based on the rendering mode.

Type: Application

Filed: December 18, 2020

Publication date: June 23, 2022

Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters, Isaac Garcia Munoz

CORRELATION-BASED RENDERING WITH MULTIPLE DISTRIBUTED STREAMS FOR SIX DEGREE OF FREEDOM APPLICATIONS

Publication number: 20220201418

Abstract: An example device includes a memory configured to store audio data and location data associated with a plurality of audio streams and one or more processors coupled to the memory. The one or more processors are configured to obtain a first location of a first audio stream that includes an audio source and obtain a second location of a second audio stream that includes the audio source. The one or more processors are configured to generate direction vectors originating at the first location and the second location, based on a location of the audio source and the first location, and the location of the audio source and the second location, respectively. The one or more processors are also configured to determine parameters that describe a vector field based on the first direction vector and the second direction vector.

Type: Application

Filed: December 18, 2020

Publication date: June 23, 2022

Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
