Patents by Inventor Siddhartha Goutham

Siddhartha Goutham has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210281967
    Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.
    Type: Application
    Filed: May 24, 2021
    Publication date: September 9, 2021
    Inventors: Moo Young KIM, Nils Günther PETERS, S M Akramus SALEHIN, Siddhartha Goutham SWAMINATHAN, Dipanjan SEN
  • Publication number: 20210264927
    Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.
    Type: Application
    Filed: February 19, 2021
    Publication date: August 26, 2021
    Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Jason Filos
  • Patent number: 11089428
    Abstract: In general, various aspects of the techniques are described for selecting audio streams based on motion. A device comprising a processor and a memory may be configured to perform the techniques. The processor may be configured to obtain a current location of the device, and obtain capture locations. Each of the capture locations may identify a location at which a respective one of audio streams is captured. The processor may also be configured to select, based on the current location and the capture locations, a subset of the audio streams, where the subset of the audio streams have less audio streams than the audio streams. The processor may further be configured to reproduce, based on the subset of the audio streams, a soundfield. The memory may be configured to store the subset of the plurality of audio streams.
    Type: Grant
    Filed: December 13, 2019
    Date of Patent: August 10, 2021
    Assignee: QUALCOMM Incorporated
    Inventors: S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
  • Publication number: 20210185470
    Abstract: In general, various aspects of the techniques are described for selecting audio streams based on motion. A device comprising a processor and a memory may be configured to perform the techniques. The processor may be configured to obtain a current location of the device, and obtain capture locations. Each of the capture locations may identify a location at which a respective one of audio streams is captured. The processor may also be configured to select, based on the current location and the capture locations, a subset of the audio streams, where the subset of the audio streams have less audio streams than the audio streams. The processor may further be configured to reproduce, based on the subset of the audio streams, a soundfield. The memory may be configured to store the subset of the plurality of audio streams.
    Type: Application
    Filed: December 13, 2019
    Publication date: June 17, 2021
    Inventors: S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
  • Publication number: 20210157543
    Abstract: Methods, systems, and devices for processing of multiple audio streams based on available bandwidth are described. Described techniques provide for receiving, at a device, one or more audio streams, identifying an available bandwidth for processing the one or more audio streams, locating (based on the available bandwidth) a first set of one or more objects contributing to the one or more audio streams that are located within a threshold radius from the device, and generating an object-based audio stream. The described techniques further provide for extracting a contribution of the first number of objects from the one or more audio streams, generating an HOA audio stream, and outputting an audio feed that includes the HOA audio stream and the object-based audio stream.
    Type: Application
    Filed: November 26, 2019
    Publication date: May 27, 2021
    Inventors: S M Akramus SALEHIN, Siddhartha Goutham SWAMINATHAN
  • Patent number: 11019449
    Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.
    Type: Grant
    Filed: September 11, 2019
    Date of Patent: May 25, 2021
    Assignee: QUALCOMM Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
  • Patent number: 10972852
    Abstract: In general, techniques are described for adapting audio streams for rendering. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a plurality of audio streams that include one or more sub-streams. The one or more processors may determine, based on the plurality of audio streams, a total number of the one or more sub-streams for all of the plurality of audio streams, and adapt, when the total number of the sub-streams is greater than a render threshold, the plurality of audio streams to decrease the number of the one or more sub-streams and obtain an adapted plurality of audio streams. The one or more processors may also apply the renderer to the adapted plurality of audio streams to obtain the one or more speaker feeds, and output the one or more speaker feeds to one or more speakers.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: April 6, 2021
    Assignee: Qualcomm Incorporated
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters
  • Publication number: 20210099825
    Abstract: A device may be configured to process one or more audio streams in accordance with the techniques described herein. The device may comprise: one or more processors and a memory. The one or more processors may be configured to obtain an indication of a boundary separating an interior area from an exterior area, and obtain a listener location indicative of a location of the device relative to the interior area. The one or more processors may be configured to obtain, based on the boundary and the listener location, a current renderer as either an interior renderer configured to render audio data for the interior area or an exterior renderer configured to render the audio data for the exterior area, and apply, to the audio data, the current renderer to obtain one or more speaker feeds. The memory may be configured to store the one or more speaker feeds.
    Type: Application
    Filed: September 30, 2020
    Publication date: April 1, 2021
    Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
  • Patent number: 10924876
    Abstract: In general, various aspects of the techniques are described for interpolating audio streams. A device comprising a memory and a processor may be configured to perform the techniques. The memory may store the one or more audio streams. The processor may obtain one or more microphone locations, each of the one or more microphone locations identifying a location of a respective one or more microphones that captured each of the corresponding one or more audio streams. The processor may also obtain a listener location identifying a location of a listener, and perform interpolation, based on the one or more microphone locations and the listener location, with respect to the audio streams to obtain an interpolated audio stream. The processor may next obtain, based on the interpolated audio stream, one or more speaker feeds, and output the one or more speaker feeds.
    Type: Grant
    Filed: July 16, 2019
    Date of Patent: February 16, 2021
    Assignee: Qualcomm Incorporated
    Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Dipanjan Sen
  • Publication number: 20210004200
    Abstract: Example devices and methods are disclosed. An example device includes a memory configured to store a plurality of audio streams and an associated level of authorization for each of the plurality of audio streams. The device also includes one or more processors implemented in circuitry and communicatively coupled to the memory. The one or more processors are configured to select, based on the associated levels of authorization, a subset of the plurality of audio streams, the subset of the plurality of audio streams excluding at least one of the plurality of audio streams.
    Type: Application
    Filed: July 1, 2020
    Publication date: January 7, 2021
    Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, S M Akramus Salehin, Nils Günther Peters
  • Publication number: 20210006922
    Abstract: Example devices and methods are presented for timer-based access for audio streaming and rendering. For example, a device configured to play one or more of a plurality of audio streams includes a memory configured to store timing information and the plurality of audio streams. The device also includes one or more processors coupled to the memory. The one or more processors are configured to control access to at least one of the plurality of audio streams based on the timing information.
    Type: Application
    Filed: July 1, 2020
    Publication date: January 7, 2021
    Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, Nils Günther Peters
  • Publication number: 20210006976
    Abstract: A method and device for processing one or more audio streams based on privacy restrictions is described. A device may be configured to receive the one or more audio streams from audio elements represented in an acoustic environment that comprises one or more sub-acoustic spaces, each of the one or more audio streams representative of a respective soundfield, determine unrestricted audio streams of the one or more audio streams based on privacy restrictions associated with the one or more audio streams, determine restricted audio streams of the one or more audio streams based on the privacy restrictions associated with the one or more audio streams, generate the corresponding respective soundfields of the unrestricted audio streams, and restrict playback of the corresponding respective soundfields of the restricted audio streams.
    Type: Application
    Filed: July 1, 2020
    Publication date: January 7, 2021
    Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, Nils Günther Peters, S M Akramus Salehin
  • Publication number: 20210006921
    Abstract: Systems and methods for determining parameter adjustments for a capture of audio are disclosed. The systems and methods includes processing circuitry configured to access at least one energy map that corresponds to one or more audio streams. The processing circuitry may then determine, from the at least one energy map, a parameter adjustment with respect to at least one audio element. The parameter adjustment may be configured to adjust the capture of audio by the at least one audio element. In addition, the process circuitry may be configured to output an indication indicating the parameter adjustment with respect to the at least one audio element.
    Type: Application
    Filed: July 1, 2020
    Publication date: January 7, 2021
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters
  • Publication number: 20210006918
    Abstract: In general, techniques are described for adapting audio streams for rendering. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a plurality of audio streams that include one or more sub-streams. The one or more processors may determine, based on the plurality of audio streams, a total number of the one or more sub-streams for all of the plurality of audio streams, and adapt, when the total number of the sub-streams is greater than a render threshold, the plurality of audio streams to decrease the number of the one or more sub-streams and obtain an adapted plurality of audio streams. The one or more processors may also apply the renderer to the adapted plurality of audio streams to obtain the one or more speaker feeds, and output the one or more speaker feeds to one or more speakers.
    Type: Application
    Filed: July 1, 2020
    Publication date: January 7, 2021
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters
  • Publication number: 20210004452
    Abstract: A method and device for processing one or more audio streams based on password-based privacy restrictions is described. A device may be configured to receive unrestricted audio streams of the one or more audio streams based on privacy restrictions associated with a password, wherein the one or more audio streams are from audio elements represented in an acoustic environment that comprises one or more sub-acoustic spaces, each of the one or more audio streams representative of a respective soundfield, and generate the respective soundfields of the unrestricted audio streams.
    Type: Application
    Filed: July 1, 2020
    Publication date: January 7, 2021
    Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
  • Publication number: 20210004201
    Abstract: In some examples, a content consumer device configured to play one or more of a plurality of audio streams includes a memory configured to store the plurality of audio streams and audio location information associated with the plurality of audio streams and representative of audio stream coordinates in an acoustical space where an audio stream was captured or synthesized or both. Each of the audio streams is representative of a soundfield. The content consumer device also includes one or more processors coupled to the memory, and configured to determine device location information representative of device coordinates of the content consumer device in the acoustical space. The one or more processors are configured to select, based on the device location information and the audio location information, a subset of the plurality of audio streams, and output, based on the subset of the plurality of audio streams, one or more speaker feeds.
    Type: Application
    Filed: July 1, 2020
    Publication date: January 7, 2021
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters
  • Publication number: 20210006925
    Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from a user via the user interface representing a desired listening position; and select, based on the indication, at least one audio stream of the plurality of audio streams.
    Type: Application
    Filed: July 1, 2020
    Publication date: January 7, 2021
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
  • Publication number: 20200304935
    Abstract: In general, techniques are described for rendering metadata to control user movement based audio rendering. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store audio data representative of a soundfield. The one or more processors may be coupled to the memory, and configured to obtain rendering metadata indicative of controls for enabling or disabling adaptations, based on an indication of a movement of a user of the device, of a renderer used to render audio data representative of a soundfield, specify, in a bitstream representative of the audio data, the rendering metadata, and output the bitstream.
    Type: Application
    Filed: March 18, 2020
    Publication date: September 24, 2020
    Inventors: Nils Günther Peters, Moo Young Kim, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, Dipanjan Sen
  • Patent number: 10728689
    Abstract: Methods, systems, computer-readable media, and apparatuses for characterizing portions of a soundfield are presented. Some configurations include estimating a total energy of a soundfield associated with a scene space and, for each of at least some of a plurality of regions of the scene space, estimating an energy of a portion of the soundfield that corresponds to the region and creating a corresponding metadata field that indicates a location of the region within the space and a relation between the estimated total energy and the estimated energy that corresponds to the region.
    Type: Grant
    Filed: December 13, 2018
    Date of Patent: July 28, 2020
    Assignee: Qualcomm Incorporated
    Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Dipanjan Sen, Michael Ericson
  • Publication number: 20200196086
    Abstract: Methods, systems, computer-readable media, and apparatuses for characterizing portions of a soundfield are presented. Some configurations include estimating a total energy of a soundfield associated with a scene space and, for each of at least some of a plurality of regions of the scene space, estimating an energy of a portion of the soundfield that corresponds to the region and creating a corresponding metadata field that indicates a location of the region within the space and a relation between the estimated total energy and the estimated energy that corresponds to the region.
    Type: Application
    Filed: December 13, 2018
    Publication date: June 18, 2020
    Inventors: Siddhartha Goutham SWAMINATHAN, S M Akramus SALEHIN, Dipanjan SEN, Michael ERICSON