Patents by Inventor S M Akramus Salehin

S M Akramus Salehin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Signaling for rendering tools

Patent number: 11967329

Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.

Type: Grant

Filed: February 19, 2021

Date of Patent: April 23, 2024

Assignee: QUALCOMM Incorporated

Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Jason Filos
Six degrees of freedom and three degrees of freedom backward compatibility

Patent number: 11843932

Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.

Type: Grant

Filed: May 24, 2021

Date of Patent: December 12, 2023

Assignee: QUALCOMM Incorporated

Inventors: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
User interface feedback for controlling audio rendering for extended reality experiences

Patent number: 11812252

Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, select, based on the indication, at least one audio stream of the plurality of audio streams, and output, for a display and in response to obtaining the indication representing the desired listening position, a graphical user interface element suggesting an alternative listening position.

Type: Grant

Filed: August 2, 2022

Date of Patent: November 7, 2023

Assignee: QUALCOMM Incorporated

Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
Augmented audio for communications

Patent number: 11805380

Abstract: A device includes one or more processors configured to determine, based on data descriptive of two or more audio environments, a geometry of a mutual audio environment. The one or more processors are also configured to process audio data, based on the geometry of the mutual audio environment, for output at an audio device disposed in a first audio environment of the two or more audio environments.

Type: Grant

Filed: August 31, 2021

Date of Patent: October 31, 2023

Assignee: QUALCOMM Incorporated

Inventors: S M Akramus Salehin, Lae-Hoon Kim, Xiaoxin Zhang, Erik Visser
Correlation-based rendering with multiple distributed streams accounting for an occlusion for six degree of freedom applications

Patent number: 11743670

Abstract: An example device includes a memory configured to store audio data and location data associated with a plurality of audio streams and one or more processors coupled to the memory. The one or more processors are configured to obtain a first location of a first audio stream that includes an audio source and obtain a second location of a second audio stream that includes the audio source. The one or more processors are configured to generate direction vectors originating at the first location and the second location, based on a location of the audio source and the first location, and the location of the audio source and the second location, respectively. The one or more processors are also configured to determine parameters that describe a vector field based on the first direction vector and the second direction vector.

Type: Grant

Filed: December 18, 2020

Date of Patent: August 29, 2023

Assignee: Qualcomm Incorporated

Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
TRANSFORM AMBISONIC COEFFICIENTS USING AN ADAPTIVE NETWORK FOR PRESERVING SPATIAL DIRECTION

Publication number: 20230260525

Abstract: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are configured to apply one adaptive network, based on a constraint that includes preservation of a spatial direction of one or more audio sources in the soundfield at the different time segments, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint.

Type: Application

Filed: April 24, 2023

Publication date: August 17, 2023

Inventors: Lae-Hoon KIM, Shankar THAGADUR SHIVAPPA, S M Akramus SALEHIN, Shuhua ZHANG, Erik VISSER
Spatial audio wind noise detection

Patent number: 11721353

Abstract: A device includes one or more processors configured to obtain audio signals representing sound captured by at least three microphones and determine spatial audio data based on the audio signals. The one or more processors are further configured to determine a metric indicative of wind noise in the audio signals. The metric is based on a comparison of a first value and a second value. The first value corresponds to an aggregate signal based on the spatial audio data, and the second value corresponds to a differential signal based on the spatial audio data.

Type: Grant

Filed: December 21, 2020

Date of Patent: August 8, 2023

Assignee: Qualcomm Incorporated

Inventors: S M Akramus Salehin, Lae-Hoon Kim, Hannes Pessentheiner, Shuhua Zhang, Sanghyun Chi, Erik Visser, Shankar Thagadur Shivappa
Transform ambisonic coefficients using an adaptive network

Patent number: 11636866

Abstract: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device also includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are also configured to apply one adaptive network, based on a constraint, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint.

Type: Grant

Filed: March 23, 2021

Date of Patent: April 25, 2023

Assignee: Qualcomm Incorporated

Inventors: Lae-Hoon Kim, Shankar Thagadur Shivappa, S M Akramus Salehin, Shuhua Zhang, Erik Visser
Smart hybrid rendering for augmented reality/virtual reality audio

Patent number: 11601776

Abstract: An example device for processing one or more audio streams includes a memory configured to store the one or more audio streams and one or more processors implemented in circuitry coupled to the memory. The one or more processors are configured to determine a listener position. The one or more processors are also configured to determine one or more clusters of the one or more audio streams. The one or more processors are also configured to determine a rendering mode based on the listener position and the one or more clusters. The device also includes a renderer configured to render at least one of the one or more clusters of audio streams based on the rendering mode.

Type: Grant

Filed: December 18, 2020

Date of Patent: March 7, 2023

Assignee: Qualcomm Incorporated

Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters, Isaac Garcia Munoz
AUGMENTED AUDIO FOR COMMUNICATIONS

Publication number: 20230060774

Abstract: A device includes one or more processors configured to determine, based on data descriptive of two or more audio environments, a geometry of a mutual audio environment. The one or more processors are also configured to process audio data, based on the geometry of the mutual audio environment, for output at an audio device disposed in a first audio environment of the two or more audio environments.

Type: Application

Filed: August 31, 2021

Publication date: March 2, 2023

Inventors: S M Akramus SALEHIN, Lae-Hoon KIM, Xiaoxin ZHANG, Erik VISSER
SIGNALLING OF AUDIO EFFECT METADATA IN A BITSTREAM

Publication number: 20220386060

Abstract: Methods, systems, computer-readable media, and apparatuses for manipulating a soundfield are presented. Some configurations include receiving a bitstream that comprises metadata and a soundfield description; parsing the metadata to obtain an effect identifier and at least one effect parameter value; and applying, to the soundfield description, an effect identified by the effect identifier. The applying may include using the at least one effect parameter value to apply the identified effect to the soundfield description.

Type: Application

Filed: October 29, 2020

Publication date: December 1, 2022

Inventors: Nils Gunther PETERS, Shankar THAGADUR SHIVAPPA, S M Akramus SALEHIN, Jason FILOS, Siddhartha Goutham SWAMINATHAN, Ferdinando OLIVIERI
USER INTERFACE FEEDBACK FOR CONTROLLING AUDIO RENDERING FOR EXTENDED REALITY EXPERIENCES

Publication number: 20220377490

Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, select, based on the indication, at least one audio stream of the plurality of audio streams, and output, for a display and in response to obtaining the indication representing the desired listening position, a graphical user interface element suggesting an alternative listening position.

Type: Application

Filed: August 2, 2022

Publication date: November 24, 2022

Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
User interface for controlling audio rendering for extended reality experiences

Patent number: 11432097

Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from a user via the user interface representing a desired listening position; and select, based on the indication, at least one audio stream of the plurality of audio streams.

Type: Grant

Filed: July 1, 2020

Date of Patent: August 30, 2022

Assignee: Qualcomm Incorporated

Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
Audio capture and rendering for extended reality experiences

Patent number: 11429340

Abstract: In some examples, a content consumer device configured to play one or more of a plurality of audio streams includes a memory configured to store the plurality of audio streams and audio location information associated with the plurality of audio streams and representative of audio stream coordinates in an acoustical space where an audio stream was captured or synthesized or both. Each of the audio streams is representative of a soundfield. The content consumer device also includes one or more processors coupled to the memory, and configured to determine device location information representative of device coordinates of the content consumer device in the acoustical space. The one or more processors are configured to select, based on the device location information and the audio location information, a subset of the plurality of audio streams, and output, based on the subset of the plurality of audio streams, one or more speaker feeds.

Type: Grant

Filed: July 1, 2020

Date of Patent: August 30, 2022

Assignee: Qualcomm Incorporated

Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters
Spatial audio zoom

Patent number: 11425497

Abstract: In an aspect, a lens is zoomed in to create a zoomed lens. Lens data associated with the lens includes a direction of the lens relative to an object in a field-of-view of the zoomed lens and a magnification of the object resulting from the zoomed lens. An array of microphones capture audio signals including audio produced by the object and interference produced by other objects. The audio signals are processed to identify a directional component associated with the audio produced by the object and three orthogonal components associated with the interference produced by the other objects. Stereo beamforming is used to increase a magnitude of the directional component (relative to the interference) while retaining a binaural nature of the audio signals. The increase in magnitude of the directional component is based on an amount of the magnification provided by the zoomed lens to the object.

Type: Grant

Filed: December 18, 2020

Date of Patent: August 23, 2022

Assignee: QUALCOMM Incorporated

Inventors: S M Akramus Salehin, Lae-Hoon Kim, Vasudev Nayak, Shankar Thagadur Shivappa, Isaac Garcia Munoz, Sanghyun Chi, Erik Visser
SPATIAL AUDIO ZOOM

Publication number: 20220201395

Abstract: In an aspect, a lens is zoomed in to create a zoomed lens. Lens data associated with the lens includes a direction of the lens relative to an object in a field-of-view of the zoomed lens and a magnification of the object resulting from the zoomed lens. An array of microphones capture audio signals including audio produced by the object and interference produced by other objects. The audio signals are processed to identify a directional component associated with the audio produced by the object and three orthogonal components associated with the interference produced by the other objects. Stereo beamforming is used to increase a magnitude of the directional component (relative to the interference) while retaining a binaural nature of the audio signals. The increase in magnitude of the directional component is based on an amount of the magnification provided by the zoomed lens to the object.

Type: Application

Filed: December 18, 2020

Publication date: June 23, 2022

Inventors: S M Akramus SALEHIN, Lae-Hoon KIM, Vasudev NAYAK, Shankar THAGADUR SHIVAPPA, Isaac Garcia MUNOZ, Sanghyun CHI, Erik VISSER
SPATIAL AUDIO WIND NOISE DETECTION

Publication number: 20220199100

Abstract: A device includes one or more processors configured to obtain audio signals representing sound captured by at least three microphones and determine spatial audio data based on the audio signals. The one or more processors are further configured to determine a metric indicative of wind noise in the audio signals. The metric is based on a comparison of a first value and a second value. The first value corresponds to an aggregate signal based on the spatial audio data, and the second value corresponds to a differential signal based on the spatial audio data.

Type: Application

Filed: December 21, 2020

Publication date: June 23, 2022

Inventors: S M Akramus SALEHIN, Lae-Hoon KIM, Hannes PESSENTHEINER, Shuhua ZHANG, Sanghyun CHI, Erik VISSER, Shankar THAGADUR SHIVAPPA
SMART HYBRID RENDERING FOR AUGMENTED REALITY/VIRTUAL REALITY AUDIO

Publication number: 20220201419

Abstract: An example device for processing one or more audio streams includes a memory configured to store the one or more audio streams and one or more processors implemented in circuitry coupled to the memory. The one or more processors are configured to determine a listener position. The one or more processors are also configured to determine one or more clusters of the one or more audio streams. The one or more processors are also configured to determine a rendering mode based on the listener position and the one or more clusters. The device also includes a renderer configured to render at least one of the one or more clusters of audio streams based on the rendering mode.

Type: Application

Filed: December 18, 2020

Publication date: June 23, 2022

Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters, Isaac Garcia Munoz
CORRELATION-BASED RENDERING WITH MULTIPLE DISTRIBUTED STREAMS FOR SIX DEGREE OF FREEDOM APPLICATIONS

Publication number: 20220201418

Abstract: An example device includes a memory configured to store audio data and location data associated with a plurality of audio streams and one or more processors coupled to the memory. The one or more processors are configured to obtain a first location of a first audio stream that includes an audio source and obtain a second location of a second audio stream that includes the audio source. The one or more processors are configured to generate direction vectors originating at the first location and the second location, based on a location of the audio source and the first location, and the location of the audio source and the second location, respectively. The one or more processors are also configured to determine parameters that describe a vector field based on the first direction vector and the second direction vector.

Type: Application

Filed: December 18, 2020

Publication date: June 23, 2022

Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
Privacy zoning and authorization for audio rendering

Patent number: 11354085

Abstract: Example devices and methods are disclosed. An example device includes a memory configured to store a plurality of audio streams and an associated level of authorization for each of the plurality of audio streams. The device also includes one or more processors implemented in circuitry and communicatively coupled to the memory. The one or more processors are configured to select, based on the associated levels of authorization, a subset of the plurality of audio streams, the subset of the plurality of audio streams excluding at least one of the plurality of audio streams.

Type: Grant

Filed: July 1, 2020

Date of Patent: June 7, 2022

Assignee: Qualcomm Incorporated

Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, S M Akramus Salehin, Nils Günther Peters

1 2 3 next