Patents by Inventor S M Akramus Salehin

S M Akramus Salehin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11967329
    Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.
    Type: Grant
    Filed: February 19, 2021
    Date of Patent: April 23, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Jason Filos
  • Patent number: 11843932
    Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.
    Type: Grant
    Filed: May 24, 2021
    Date of Patent: December 12, 2023
    Assignee: QUALCOMM Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
  • Patent number: 11812252
    Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, select, based on the indication, at least one audio stream of the plurality of audio streams, and output, for a display and in response to obtaining the indication representing the desired listening position, a graphical user interface element suggesting an alternative listening position.
    Type: Grant
    Filed: August 2, 2022
    Date of Patent: November 7, 2023
    Assignee: QUALCOMM Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
  • Patent number: 11805380
    Abstract: A device includes one or more processors configured to determine, based on data descriptive of two or more audio environments, a geometry of a mutual audio environment. The one or more processors are also configured to process audio data, based on the geometry of the mutual audio environment, for output at an audio device disposed in a first audio environment of the two or more audio environments.
    Type: Grant
    Filed: August 31, 2021
    Date of Patent: October 31, 2023
    Assignee: QUALCOMM Incorporated
    Inventors: S M Akramus Salehin, Lae-Hoon Kim, Xiaoxin Zhang, Erik Visser
  • Patent number: 11743670
    Abstract: An example device includes a memory configured to store audio data and location data associated with a plurality of audio streams and one or more processors coupled to the memory. The one or more processors are configured to obtain a first location of a first audio stream that includes an audio source and obtain a second location of a second audio stream that includes the audio source. The one or more processors are configured to generate direction vectors originating at the first location and the second location, based on a location of the audio source and the first location, and the location of the audio source and the second location, respectively. The one or more processors are also configured to determine parameters that describe a vector field based on the first direction vector and the second direction vector.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: August 29, 2023
    Assignee: Qualcomm Incorporated
    Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
  • Publication number: 20230260525
    Abstract: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are configured to apply one adaptive network, based on a constraint that includes preservation of a spatial direction of one or more audio sources in the soundfield at the different time segments, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint.
    Type: Application
    Filed: April 24, 2023
    Publication date: August 17, 2023
    Inventors: Lae-Hoon KIM, Shankar THAGADUR SHIVAPPA, S M Akramus SALEHIN, Shuhua ZHANG, Erik VISSER
  • Patent number: 11721353
    Abstract: A device includes one or more processors configured to obtain audio signals representing sound captured by at least three microphones and determine spatial audio data based on the audio signals. The one or more processors are further configured to determine a metric indicative of wind noise in the audio signals. The metric is based on a comparison of a first value and a second value. The first value corresponds to an aggregate signal based on the spatial audio data, and the second value corresponds to a differential signal based on the spatial audio data.
    Type: Grant
    Filed: December 21, 2020
    Date of Patent: August 8, 2023
    Assignee: Qualcomm Incorporated
    Inventors: S M Akramus Salehin, Lae-Hoon Kim, Hannes Pessentheiner, Shuhua Zhang, Sanghyun Chi, Erik Visser, Shankar Thagadur Shivappa
  • Patent number: 11636866
    Abstract: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device also includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are also configured to apply one adaptive network, based on a constraint, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint.
    Type: Grant
    Filed: March 23, 2021
    Date of Patent: April 25, 2023
    Assignee: Qualcomm Incorporated
    Inventors: Lae-Hoon Kim, Shankar Thagadur Shivappa, S M Akramus Salehin, Shuhua Zhang, Erik Visser
  • Patent number: 11601776
    Abstract: An example device for processing one or more audio streams includes a memory configured to store the one or more audio streams and one or more processors implemented in circuitry coupled to the memory. The one or more processors are configured to determine a listener position. The one or more processors are also configured to determine one or more clusters of the one or more audio streams. The one or more processors are also configured to determine a rendering mode based on the listener position and the one or more clusters. The device also includes a renderer configured to render at least one of the one or more clusters of audio streams based on the rendering mode.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: March 7, 2023
    Assignee: Qualcomm Incorporated
    Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters, Isaac Garcia Munoz
  • Publication number: 20230060774
    Abstract: A device includes one or more processors configured to determine, based on data descriptive of two or more audio environments, a geometry of a mutual audio environment. The one or more processors are also configured to process audio data, based on the geometry of the mutual audio environment, for output at an audio device disposed in a first audio environment of the two or more audio environments.
    Type: Application
    Filed: August 31, 2021
    Publication date: March 2, 2023
    Inventors: S M Akramus SALEHIN, Lae-Hoon KIM, Xiaoxin ZHANG, Erik VISSER
  • Publication number: 20220386060
    Abstract: Methods, systems, computer-readable media, and apparatuses for manipulating a soundfield are presented. Some configurations include receiving a bitstream that comprises metadata and a soundfield description; parsing the metadata to obtain an effect identifier and at least one effect parameter value; and applying, to the soundfield description, an effect identified by the effect identifier. The applying may include using the at least one effect parameter value to apply the identified effect to the soundfield description.
    Type: Application
    Filed: October 29, 2020
    Publication date: December 1, 2022
    Inventors: Nils Gunther PETERS, Shankar THAGADUR SHIVAPPA, S M Akramus SALEHIN, Jason FILOS, Siddhartha Goutham SWAMINATHAN, Ferdinando OLIVIERI
  • Publication number: 20220377490
    Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from the user via the user interface representing a desired listening position, select, based on the indication, at least one audio stream of the plurality of audio streams, and output, for a display and in response to obtaining the indication representing the desired listening position, a graphical user interface element suggesting an alternative listening position.
    Type: Application
    Filed: August 2, 2022
    Publication date: November 24, 2022
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
  • Patent number: 11432097
    Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from a user via the user interface representing a desired listening position; and select, based on the indication, at least one audio stream of the plurality of audio streams.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: August 30, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
  • Patent number: 11429340
    Abstract: In some examples, a content consumer device configured to play one or more of a plurality of audio streams includes a memory configured to store the plurality of audio streams and audio location information associated with the plurality of audio streams and representative of audio stream coordinates in an acoustical space where an audio stream was captured or synthesized or both. Each of the audio streams is representative of a soundfield. The content consumer device also includes one or more processors coupled to the memory, and configured to determine device location information representative of device coordinates of the content consumer device in the acoustical space. The one or more processors are configured to select, based on the device location information and the audio location information, a subset of the plurality of audio streams, and output, based on the subset of the plurality of audio streams, one or more speaker feeds.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: August 30, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters
  • Patent number: 11425497
    Abstract: In an aspect, a lens is zoomed in to create a zoomed lens. Lens data associated with the lens includes a direction of the lens relative to an object in a field-of-view of the zoomed lens and a magnification of the object resulting from the zoomed lens. An array of microphones capture audio signals including audio produced by the object and interference produced by other objects. The audio signals are processed to identify a directional component associated with the audio produced by the object and three orthogonal components associated with the interference produced by the other objects. Stereo beamforming is used to increase a magnitude of the directional component (relative to the interference) while retaining a binaural nature of the audio signals. The increase in magnitude of the directional component is based on an amount of the magnification provided by the zoomed lens to the object.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: August 23, 2022
    Assignee: QUALCOMM Incorporated
    Inventors: S M Akramus Salehin, Lae-Hoon Kim, Vasudev Nayak, Shankar Thagadur Shivappa, Isaac Garcia Munoz, Sanghyun Chi, Erik Visser
  • Publication number: 20220201395
    Abstract: In an aspect, a lens is zoomed in to create a zoomed lens. Lens data associated with the lens includes a direction of the lens relative to an object in a field-of-view of the zoomed lens and a magnification of the object resulting from the zoomed lens. An array of microphones capture audio signals including audio produced by the object and interference produced by other objects. The audio signals are processed to identify a directional component associated with the audio produced by the object and three orthogonal components associated with the interference produced by the other objects. Stereo beamforming is used to increase a magnitude of the directional component (relative to the interference) while retaining a binaural nature of the audio signals. The increase in magnitude of the directional component is based on an amount of the magnification provided by the zoomed lens to the object.
    Type: Application
    Filed: December 18, 2020
    Publication date: June 23, 2022
    Inventors: S M Akramus SALEHIN, Lae-Hoon KIM, Vasudev NAYAK, Shankar THAGADUR SHIVAPPA, Isaac Garcia MUNOZ, Sanghyun CHI, Erik VISSER
  • Publication number: 20220199100
    Abstract: A device includes one or more processors configured to obtain audio signals representing sound captured by at least three microphones and determine spatial audio data based on the audio signals. The one or more processors are further configured to determine a metric indicative of wind noise in the audio signals. The metric is based on a comparison of a first value and a second value. The first value corresponds to an aggregate signal based on the spatial audio data, and the second value corresponds to a differential signal based on the spatial audio data.
    Type: Application
    Filed: December 21, 2020
    Publication date: June 23, 2022
    Inventors: S M Akramus SALEHIN, Lae-Hoon KIM, Hannes PESSENTHEINER, Shuhua ZHANG, Sanghyun CHI, Erik VISSER, Shankar THAGADUR SHIVAPPA
  • Publication number: 20220201419
    Abstract: An example device for processing one or more audio streams includes a memory configured to store the one or more audio streams and one or more processors implemented in circuitry coupled to the memory. The one or more processors are configured to determine a listener position. The one or more processors are also configured to determine one or more clusters of the one or more audio streams. The one or more processors are also configured to determine a rendering mode based on the listener position and the one or more clusters. The device also includes a renderer configured to render at least one of the one or more clusters of audio streams based on the rendering mode.
    Type: Application
    Filed: December 18, 2020
    Publication date: June 23, 2022
    Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters, Isaac Garcia Munoz
  • Publication number: 20220201418
    Abstract: An example device includes a memory configured to store audio data and location data associated with a plurality of audio streams and one or more processors coupled to the memory. The one or more processors are configured to obtain a first location of a first audio stream that includes an audio source and obtain a second location of a second audio stream that includes the audio source. The one or more processors are configured to generate direction vectors originating at the first location and the second location, based on a location of the audio source and the first location, and the location of the audio source and the second location, respectively. The one or more processors are also configured to determine parameters that describe a vector field based on the first direction vector and the second direction vector.
    Type: Application
    Filed: December 18, 2020
    Publication date: June 23, 2022
    Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
  • Patent number: 11354085
    Abstract: Example devices and methods are disclosed. An example device includes a memory configured to store a plurality of audio streams and an associated level of authorization for each of the plurality of audio streams. The device also includes one or more processors implemented in circuitry and communicatively coupled to the memory. The one or more processors are configured to select, based on the associated levels of authorization, a subset of the plurality of audio streams, the subset of the plurality of audio streams excluding at least one of the plurality of audio streams.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: June 7, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, S M Akramus Salehin, Nils Günther Peters