Patents by Inventor Siddhartha Goutham
Siddhartha Goutham has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20210281967
Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.
Type: Application
Filed: May 24, 2021
Publication date: September 9, 2021
Inventors: Moo Young KIM, Nils Günther PETERS, S M Akramus SALEHIN, Siddhartha Goutham SWAMINATHAN, Dipanjan SEN
-
Publication number: 20210264927
Abstract: An example audio decoding device includes a memory configured to store at least a portion of a coded audio bitstream; and one or more processors configured to: decode, based on the coded audio bitstream, a representation of a soundfield; decode, based on the coded audio bitstream, a syntax element indicating a selection of either a head-related transfer function (HRTF) or a binaural room impulse response (BRIR); and render, using the selected HRTF or BRIR, speaker feeds from the soundfield.
Type: Application
Filed: February 19, 2021
Publication date: August 26, 2021
Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Jason Filos
-
Patent number: 11089428
Abstract: In general, various aspects of the techniques are described for selecting audio streams based on motion. A device comprising a processor and a memory may be configured to perform the techniques. The processor may be configured to obtain a current location of the device, and obtain capture locations. Each of the capture locations may identify a location at which a respective one of audio streams is captured. The processor may also be configured to select, based on the current location and the capture locations, a subset of the audio streams, where the subset of the audio streams have less audio streams than the audio streams. The processor may further be configured to reproduce, based on the subset of the audio streams, a soundfield. The memory may be configured to store the subset of the plurality of audio streams.
Type: Grant
Filed: December 13, 2019
Date of Patent: August 10, 2021
Assignee: QUALCOMM Incorporated
Inventors: S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
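Illustration (not part of the patent record): the selection step in the abstract above can be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed method. The abstract says only that a smaller subset of streams is selected from the current location and the capture locations; the nearest-neighbor rule, the Euclidean distance, the function name `select_nearest_streams`, and the parameter `k` are all hypothetical choices made for this example.

```python
import math


def select_nearest_streams(current_location, capture_locations, k):
    """Return the indices of the k streams captured closest to the device.

    Assumption: "closest" means smallest Euclidean distance. The abstract
    does not state which selection rule is actually used.
    """
    if not 0 < k < len(capture_locations):
        # The subset must be non-empty and strictly smaller than the full set.
        raise ValueError("k must be positive and smaller than the number of streams")
    ranked = sorted(
        range(len(capture_locations)),
        key=lambda i: math.dist(current_location, capture_locations[i]),
    )
    return ranked[:k]


# Hypothetical capture locations for four audio streams (x, y in meters).
capture_locations = [(0.0, 0.0), (4.0, 0.0), (0.0, 5.0), (10.0, 10.0)]
subset = select_nearest_streams((1.0, 1.0), capture_locations, k=2)
```

Here `subset` holds the indices of the two streams whose capture locations lie nearest the device; a renderer would then reproduce the soundfield from those streams only.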
-
Publication number: 20210185470
Abstract: In general, various aspects of the techniques are described for selecting audio streams based on motion. A device comprising a processor and a memory may be configured to perform the techniques. The processor may be configured to obtain a current location of the device, and obtain capture locations. Each of the capture locations may identify a location at which a respective one of audio streams is captured. The processor may also be configured to select, based on the current location and the capture locations, a subset of the audio streams, where the subset of the audio streams have less audio streams than the audio streams. The processor may further be configured to reproduce, based on the subset of the audio streams, a soundfield. The memory may be configured to store the subset of the plurality of audio streams.
Type: Application
Filed: December 13, 2019
Publication date: June 17, 2021
Inventors: S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
-
Publication number: 20210157543
Abstract: Methods, systems, and devices for processing of multiple audio streams based on available bandwidth are described. Described techniques provide for receiving, at a device, one or more audio streams, identifying an available bandwidth for processing the one or more audio streams, locating (based on the available bandwidth) a first set of one or more objects contributing to the one or more audio streams that are located within a threshold radius from the device, and generating an object-based audio stream. The described techniques further provide for extracting a contribution of the first number of objects from the one or more audio streams, generating an HOA audio stream, and outputting an audio feed that includes the HOA audio stream and the object-based audio stream.
Type: Application
Filed: November 26, 2019
Publication date: May 27, 2021
Inventors: S M Akramus SALEHIN, Siddhartha Goutham SWAMINATHAN
-
Patent number: 11019449
Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.
Type: Grant
Filed: September 11, 2019
Date of Patent: May 25, 2021
Assignee: QUALCOMM Incorporated
Inventors: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
-
Patent number: 10972852
Abstract: In general, techniques are described for adapting audio streams for rendering. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a plurality of audio streams that include one or more sub-streams. The one or more processors may determine, based on the plurality of audio streams, a total number of the one or more sub-streams for all of the plurality of audio streams, and adapt, when the total number of the sub-streams is greater than a render threshold, the plurality of audio streams to decrease the number of the one or more sub-streams and obtain an adapted plurality of audio streams. The one or more processors may also apply the renderer to the adapted plurality of audio streams to obtain the one or more speaker feeds, and output the one or more speaker feeds to one or more speakers.
Type: Grant
Filed: July 1, 2020
Date of Patent: April 6, 2021
Assignee: Qualcomm Incorporated
Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters
-
Publication number: 20210099825
Abstract: A device may be configured to process one or more audio streams in accordance with the techniques described herein. The device may comprise: one or more processors and a memory. The one or more processors may be configured to obtain an indication of a boundary separating an interior area from an exterior area, and obtain a listener location indicative of a location of the device relative to the interior area. The one or more processors may be configured to obtain, based on the boundary and the listener location, a current renderer as either an interior renderer configured to render audio data for the interior area or an exterior renderer configured to render the audio data for the exterior area, and apply, to the audio data, the current renderer to obtain one or more speaker feeds. The memory may be configured to store the one or more speaker feeds.
Type: Application
Filed: September 30, 2020
Publication date: April 1, 2021
Inventors: S M Akramus Salehin, Nils Günther Peters, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
-
Patent number: 10924876
Abstract: In general, various aspects of the techniques are described for interpolating audio streams. A device comprising a memory and a processor may be configured to perform the techniques. The memory may store the one or more audio streams. The processor may obtain one or more microphone locations, each of the one or more microphone locations identifying a location of a respective one or more microphones that captured each of the corresponding one or more audio streams. The processor may also obtain a listener location identifying a location of a listener, and perform interpolation, based on the one or more microphone locations and the listener location, with respect to the audio streams to obtain an interpolated audio stream. The processor may next obtain, based on the interpolated audio stream, one or more speaker feeds, and output the one or more speaker feeds.
Type: Grant
Filed: July 16, 2019
Date of Patent: February 16, 2021
Assignee: Qualcomm Incorporated
Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Dipanjan Sen
-
Publication number: 20210004200
Abstract: Example devices and methods are disclosed. An example device includes a memory configured to store a plurality of audio streams and an associated level of authorization for each of the plurality of audio streams. The device also includes one or more processors implemented in circuitry and communicatively coupled to the memory. The one or more processors are configured to select, based on the associated levels of authorization, a subset of the plurality of audio streams, the subset of the plurality of audio streams excluding at least one of the plurality of audio streams.
Type: Application
Filed: July 1, 2020
Publication date: January 7, 2021
Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, S M Akramus Salehin, Nils Günther Peters
-
Publication number: 20210006922
Abstract: Example devices and methods are presented for timer-based access for audio streaming and rendering. For example, a device configured to play one or more of a plurality of audio streams includes a memory configured to store timing information and the plurality of audio streams. The device also includes one or more processors coupled to the memory. The one or more processors are configured to control access to at least one of the plurality of audio streams based on the timing information.
Type: Application
Filed: July 1, 2020
Publication date: January 7, 2021
Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, Nils Günther Peters
-
Publication number: 20210006976
Abstract: A method and device for processing one or more audio streams based on privacy restrictions is described. A device may be configured to receive the one or more audio streams from audio elements represented in an acoustic environment that comprises one or more sub-acoustic spaces, each of the one or more audio streams representative of a respective soundfield, determine unrestricted audio streams of the one or more audio streams based on privacy restrictions associated with the one or more audio streams, determine restricted audio streams of the one or more audio streams based on the privacy restrictions associated with the one or more audio streams, generate the corresponding respective soundfields of the unrestricted audio streams, and restrict playback of the corresponding respective soundfields of the restricted audio streams.
Type: Application
Filed: July 1, 2020
Publication date: January 7, 2021
Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, Nils Günther Peters, S M Akramus Salehin
-
Publication number: 20210006921
Abstract: Systems and methods for determining parameter adjustments for a capture of audio are disclosed. The systems and methods includes processing circuitry configured to access at least one energy map that corresponds to one or more audio streams. The processing circuitry may then determine, from the at least one energy map, a parameter adjustment with respect to at least one audio element. The parameter adjustment may be configured to adjust the capture of audio by the at least one audio element. In addition, the process circuitry may be configured to output an indication indicating the parameter adjustment with respect to the at least one audio element.
Type: Application
Filed: July 1, 2020
Publication date: January 7, 2021
Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters
-
Publication number: 20210006918
Abstract: In general, techniques are described for adapting audio streams for rendering. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a plurality of audio streams that include one or more sub-streams. The one or more processors may determine, based on the plurality of audio streams, a total number of the one or more sub-streams for all of the plurality of audio streams, and adapt, when the total number of the sub-streams is greater than a render threshold, the plurality of audio streams to decrease the number of the one or more sub-streams and obtain an adapted plurality of audio streams. The one or more processors may also apply the renderer to the adapted plurality of audio streams to obtain the one or more speaker feeds, and output the one or more speaker feeds to one or more speakers.
Type: Application
Filed: July 1, 2020
Publication date: January 7, 2021
Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters
-
Publication number: 20210004452
Abstract: A method and device for processing one or more audio streams based on password-based privacy restrictions is described. A device may be configured to receive unrestricted audio streams of the one or more audio streams based on privacy restrictions associated with a password, wherein the one or more audio streams are from audio elements represented in an acoustic environment that comprises one or more sub-acoustic spaces, each of the one or more audio streams representative of a respective soundfield, and generate the respective soundfields of the unrestricted audio streams.
Type: Application
Filed: July 1, 2020
Publication date: January 7, 2021
Inventors: Siddhartha Goutham Swaminathan, Isaac Garcia Munoz
-
Publication number: 20210004201
Abstract: In some examples, a content consumer device configured to play one or more of a plurality of audio streams includes a memory configured to store the plurality of audio streams and audio location information associated with the plurality of audio streams and representative of audio stream coordinates in an acoustical space where an audio stream was captured or synthesized or both. Each of the audio streams is representative of a soundfield. The content consumer device also includes one or more processors coupled to the memory, and configured to determine device location information representative of device coordinates of the content consumer device in the acoustical space. The one or more processors are configured to select, based on the device location information and the audio location information, a subset of the plurality of audio streams, and output, based on the subset of the plurality of audio streams, one or more speaker feeds.
Type: Application
Filed: July 1, 2020
Publication date: January 7, 2021
Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Nils Günther Peters
-
Publication number: 20210006925
Abstract: A device may be configured to play one or more of a plurality of audio streams. The device may include a memory configured to store the plurality of audio streams, each of the audio streams representative of a soundfield. The device also may include one or more processors coupled to the memory, and configured to present a user interface to a user, obtain an indication from a user via the user interface representing a desired listening position; and select, based on the indication, at least one audio stream of the plurality of audio streams.
Type: Application
Filed: July 1, 2020
Publication date: January 7, 2021
Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, Nils Günther Peters, S M Akramus Salehin
-
Publication number: 20200304935
Abstract: In general, techniques are described for rendering metadata to control user movement based audio rendering. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store audio data representative of a soundfield. The one or more processors may be coupled to the memory, and configured to obtain rendering metadata indicative of controls for enabling or disabling adaptations, based on an indication of a movement of a user of the device, of a renderer used to render audio data representative of a soundfield, specify, in a bitstream representative of the audio data, the rendering metadata, and output the bitstream.
Type: Application
Filed: March 18, 2020
Publication date: September 24, 2020
Inventors: Nils Günther Peters, Moo Young Kim, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, Dipanjan Sen
-
Patent number: 10728689
Abstract: Methods, systems, computer-readable media, and apparatuses for characterizing portions of a soundfield are presented. Some configurations include estimating a total energy of a soundfield associated with a scene space and, for each of at least some of a plurality of regions of the scene space, estimating an energy of a portion of the soundfield that corresponds to the region and creating a corresponding metadata field that indicates a location of the region within the space and a relation between the estimated total energy and the estimated energy that corresponds to the region.
Type: Grant
Filed: December 13, 2018
Date of Patent: July 28, 2020
Assignee: Qualcomm Incorporated
Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Dipanjan Sen, Michael Ericson
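Illustration (not part of the patent record): the per-region metadata described in the abstract above can be sketched as follows. This is a rough sketch under stated assumptions rather than the patented procedure. It assumes the "relation" between region energy and total energy is a simple ratio, that energy is the sum of squared samples, and that the total energy equals the sum of the region energies. The function name `region_energy_metadata`, the field names, and the grid-cell locations are hypothetical.

```python
def region_energy_metadata(region_samples):
    """Build one metadata record per region: its location and energy share.

    region_samples maps a region location (a hypothetical (x, y) grid cell)
    to the list of soundfield samples attributed to that region.
    """
    # Assumption: energy of a region is the sum of its squared samples.
    energies = {
        location: sum(sample * sample for sample in samples)
        for location, samples in region_samples.items()
    }
    # Assumption: total soundfield energy is the sum over all regions.
    total = sum(energies.values())
    return [
        {
            "location": location,
            # Assumption: the "relation" is the region-to-total energy ratio.
            "energy_ratio": energy / total if total > 0 else 0.0,
        }
        for location, energy in energies.items()
    ]


# Hypothetical samples for three regions of a scene space.
metadata = region_energy_metadata(
    {(0, 0): [1.0, 1.0], (0, 1): [2.0, 0.0], (1, 0): [0.0, 0.0]}
)
```

Each record in `metadata` pairs a region location with that region's share of the total energy, which a renderer could use, for example, to prioritize the most energetic regions.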
-
Publication number: 20200196086
Abstract: Methods, systems, computer-readable media, and apparatuses for characterizing portions of a soundfield are presented. Some configurations include estimating a total energy of a soundfield associated with a scene space and, for each of at least some of a plurality of regions of the scene space, estimating an energy of a portion of the soundfield that corresponds to the region and creating a corresponding metadata field that indicates a location of the region within the space and a relation between the estimated total energy and the estimated energy that corresponds to the region.
Type: Application
Filed: December 13, 2018
Publication date: June 18, 2020
Inventors: Siddhartha Goutham SWAMINATHAN, S M Akramus SALEHIN, Dipanjan SEN, Michael ERICSON