Patents by Inventor Dipanjan Sen

Dipanjan Sen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

LAYERED CODING OF AUDIO WITH DISCRETE OBJECTS

Publication number: 20210098004

Abstract: A first layer of data having a first set of Ambisonic audio components can be decoded where the first set of Ambisonic audio components is generated based on ambience and one or more object-based audio signals. A second layer of data is decoded having at least one of the one or more object-based audio signals. One of the object-based audio signals is subtracted from the first set of Ambisonic audio components. The resulting Ambisonic audio components are rendered to generate a first set of audio channels. The one or more object-based audio signals are spatially rendered to generate a second set of audio channels. Other aspects are described and claimed.

Type: Application

Filed: September 26, 2019

Publication date: April 1, 2021

Inventors: Dipanjan Sen, Frank Baumgarte, Juha O. Merimaa
Audio parallax for virtual reality, augmented reality, and mixed reality

Patent number: 10952009

Abstract: An example audio decoding device includes processing circuitry and a memory device coupled to the processing circuitry. The processing circuitry is configured to receive, in a bitstream, encoded representations of audio objects of a three-dimensional (3D) soundfield, to receive metadata associated with the bitstream, to obtain, from the received metadata, one or more transmission factors associated with one or more of the audio objects, and to apply the transmission factors to the one or more audio objects to obtain parallax-adjusted audio objects of the 3D soundfield. The memory device is configured to store at least a portion of the received bitstream, the received metadata, or the parallax-adjusted audio objects of the 3D soundfield.

Type: Grant

Filed: April 30, 2020

Date of Patent: March 16, 2021

Assignee: Qualcomm Incorporated

Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen
Interpolating audio streams

Patent number: 10924876

Abstract: In general, various aspects of the techniques are described for interpolating audio streams. A device comprising a memory and a processor may be configured to perform the techniques. The memory may store the one or more audio streams. The processor may obtain one or more microphone locations, each of the one or more microphone locations identifying a location of a respective one or more microphones that captured each of the corresponding one or more audio streams. The processor may also obtain a listener location identifying a location of a listener, and perform interpolation, based on the one or more microphone locations and the listener location, with respect to the audio streams to obtain an interpolated audio stream. The processor may next obtain, based on the interpolated audio stream, one or more speaker feeds, and output the one or more speaker feeds.

Type: Grant

Filed: July 16, 2019

Date of Patent: February 16, 2021

Assignee: Qualcomm Incorporated

Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Dipanjan Sen
Graphical user interface for a database system

Patent number: 10909130

Abstract: The system includes interactive user interfaces that allow a user to select attributes, entities, and statistical measures to query the combined data sets. The system allows users to visually construct queries of the database. The system may automatically generate multiple queries and/or query the database multiple times in response to user interface selections. The query parameters and results can be stored and shared with other users.

Type: Grant

Filed: July 1, 2016

Date of Patent: February 2, 2021

Assignee: Palantir Technologies Inc.

Inventors: Shannon Scott, Walker Burgin, Hem Wadhar, Grace Wang, Christopher Li, Michael Tuer, Dipanjan Sen, Stephen Klapper
SPATIAL AUDIO FILE FORMAT FOR STORING CAPTURE METADATA

Publication number: 20200409995

Abstract: A device with microphones can generate microphone signals during an audio recording. The device can store, in an electronic audio data file, the microphone signals, and metadata that includes impulse responses of the microphones. Other aspects are described and claimed.

Type: Application

Filed: June 11, 2020

Publication date: December 31, 2020

Inventors: Jonathan D. Sheaffer, Symeon Delikaris Manias, Gaetan R. Lorho, Peter A. Raffensperger, Eric A. Allamanche, Frank Baumgarte, Dipanjan Sen, Joshua D. Atkins, Juha O. Merimaa
HIGHER ORDER AMBISONIC AUDIO DATA

Publication number: 20200335113

Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.

Type: Application

Filed: May 6, 2020

Publication date: October 22, 2020

Inventors: Moo Young Kim, Nils Günther Peters, Shankar Thagadur Shivappa, Dipanjan Sen
RENDERING METADATA TO CONTROL USER MOVEMENT BASED AUDIO RENDERING

Publication number: 20200304935

Abstract: In general, techniques are described for rendering metadata to control user movement based audio rendering. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store audio data representative of a soundfield. The one or more processors may be coupled to the memory, and configured to obtain rendering metadata indicative of controls for enabling or disabling adaptations, based on an indication of a movement of a user of the device, of a renderer used to render audio data representative of a soundfield, specify, in a bitstream representative of the audio data, the rendering metadata, and output the bitstream.

Type: Application

Filed: March 18, 2020

Publication date: September 24, 2020

Inventors: Nils Günther Peters, Moo Young Kim, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, Dipanjan Sen
Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals

Patent number: 10770087

Abstract: In general, techniques are described for performing codebook selection when coding vectors decomposed from higher-order ambisonic coefficients. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store a plurality of codebooks to use when performing vector dequantization with respect to a vector quantized spatial component of a soundfield. The vector quantized spatial component may be obtained through application of a decomposition to a plurality of higher order ambisonic coefficients. The processor may be configured to select one of the plurality of codebooks.

Type: Grant

Filed: May 14, 2015

Date of Patent: September 8, 2020

Assignee: Qualcomm Incorporated

Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen
AUDIO PARALLAX FOR VIRTUAL REALITY, AUGMENTED REALITY, AND MIXED REALITY

Publication number: 20200260210

Abstract: An example audio decoding device includes processing circuitry and a memory device coupled to the processing circuitry. The processing circuitry is configured to receive, in a bitstream, encoded representations of audio objects of a three-dimensional (3D) soundfield, to receive metadata associated with the bitstream, to obtain, from the received metadata, one or more transmission factors associated with one or more of the audio objects, and to apply the transmission factors to the one or more audio objects to obtain parallax-adjusted audio objects of the 3D soundfield. The memory device is configured to store at least a portion of the received bitstream, the received metadata, or the parallax-adjusted audio objects of the 3D soundfield.

Type: Application

Filed: April 30, 2020

Publication date: August 13, 2020

Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen
Soundfield modeling for efficient encoding and/or retrieval

Patent number: 10728689

Abstract: Methods, systems, computer-readable media, and apparatuses for characterizing portions of a soundfield are presented. Some configurations include estimating a total energy of a soundfield associated with a scene space and, for each of at least some of a plurality of regions of the scene space, estimating an energy of a portion of the soundfield that corresponds to the region and creating a corresponding metadata field that indicates a location of the region within the space and a relation between the estimated total energy and the estimated energy that corresponds to the region.

Type: Grant

Filed: December 13, 2018

Date of Patent: July 28, 2020

Assignee: Qualcomm Incorporated

Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Dipanjan Sen, Michael Ericson
SIGNALLING BEAM PATTERN WITH OBJECTS

Publication number: 20200204939

Abstract: A device for processing coded audio is disclosed. The device is configured to store an audio object and audio object metadata associated with the audio object. The audio object metadata includes frequency dependent beam pattern metadata. The device may apply, based on the frequency dependent beam pattern metadata, a renderer to the audio object to obtain one or more speaker feeds and output the one or more speaker feeds.

Type: Application

Filed: December 18, 2019

Publication date: June 25, 2020

Inventors: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Dipanjan Sen
Transporting coded audio data

Patent number: 10693936

Abstract: In one example, a device for retrieving audio data includes one or more processors configured to receive availability data representative of a plurality of available adaptation sets, the available adaptation sets including a scene-based audio adaptation set and one or more object-based audio adaptation sets, receive selection data identifying which of the scene-based audio adaptation set and the one or more object-based audio adaptation sets are to be retrieved, and provide instruction data to a streaming client to cause the streaming client to retrieve data for each of the adaptation sets identified by the selection data, and a memory configured to store the retrieved data for the audio adaptation sets.

Type: Grant

Filed: August 24, 2016

Date of Patent: June 23, 2020

Assignee: QUALCOMM Incorporated

Inventors: Thomas Stockhammer, Dipanjan Sen, Nils Günther Peters, Moo Young Kim
SOUNDFIELD MODELING FOR EFFICIENT ENCODING AND/OR RETRIEVAL

Publication number: 20200196086

Abstract: Methods, systems, computer-readable media, and apparatuses for characterizing portions of a soundfield are presented. Some configurations include estimating a total energy of a soundfield associated with a scene space and, for each of at least some of a plurality of regions of the scene space, estimating an energy of a portion of the soundfield that corresponds to the region and creating a corresponding metadata field that indicates a location of the region within the space and a relation between the estimated total energy and the estimated energy that corresponds to the region.

Type: Application

Filed: December 13, 2018

Publication date: June 18, 2020

Inventors: Siddhartha Goutham SWAMINATHAN, S M Akramus SALEHIN, Dipanjan SEN, Michael ERICSON
Audio parallax for virtual reality, augmented reality, and mixed reality

Patent number: 10659906

Abstract: An example audio decoding device includes processing circuitry and a memory device coupled to the processing circuitry. The processing circuitry is configured to receive, in a bitstream, encoded representations of audio objects of a three-dimensional (3D) soundfield, to receive metadata associated with the bitstream, to obtain, from the received metadata, one or more transmission factors associated with one or more of the audio objects, and to apply the transmission factors to the one or more audio objects to obtain parallax-adjusted audio objects of the 3D soundfield. The memory device is configured to store at least a portion of the received bitstream, the received metadata, or the parallax-adjusted audio objects of the 3D soundfield.

Type: Grant

Filed: January 11, 2018

Date of Patent: May 19, 2020

Assignee: Qualcomm Incorporated

Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen
Priority information for higher order ambisonic audio data

Patent number: 10657974

Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.

Type: Grant

Filed: December 20, 2018

Date of Patent: May 19, 2020

Assignee: Qualcomm Incorporated

Inventors: Moo Young Kim, Nils Günther Peters, Shankar Thagadur Shivappa, Dipanjan Sen
RECURSIVELY DEFINED AUDIO METADATA

Publication number: 20200120438

Abstract: In general, techniques are described for recursively defined audio metadata. A device comprising one or more memories and one or more processors may be configured to perform various aspects of the techniques. The one or more memories may store at least a portion of the bitstream. The one or more processors may obtain, from the bitstream, recursively defined audio metadata, and obtain, from the bitstream, a representation of the audio data. The one or more processors may process, based on the recursively defined audio metadata, the representation of the audio data to obtain one or more speaker feeds, and output the one or more speaker feeds to one or more speakers.

Type: Application

Filed: September 26, 2019

Publication date: April 16, 2020

Inventors: Moo Young Kim, Shankar Thagadur Shivappa, Dipanjan Sen, Ferdinando Olivieri
SIX DEGREES OF FREEDOM AND THREE DEGREES OF FREEDOM BACKWARD COMPATIBILITY

Publication number: 20200112814

Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.

Type: Application

Filed: September 11, 2019

Publication date: April 9, 2020

Inventors: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
AMBISONIC SIGNAL NOISE REDUCTION FOR MICROPHONE ARRAYS

Publication number: 20200107118

Abstract: A device to apply noise reduction to ambisonic signals includes a memory configured to store noise data corresponding to microphones in a microphone array. A processor is configured to perform signal processing operations on signals captured by microphones in the microphone array to generate multiple sets of ambisonic signals including a first set corresponding to a first particular ambisonic order and a second set corresponding to a second particular ambisonic order. The processor is configured to perform a first noise reduction operation that includes applying a first gain factor to each ambisonic signal in the first set and to perform a second noise reduction operation that includes applying a second gain factor to each ambisonic signal in the second set. The first gain factor and the second gain factor are based on the noise data, and the second gain factor is distinct from the first gain factor.

Type: Application

Filed: March 13, 2019

Publication date: April 2, 2020

Inventors: S M Akramus SALEHIN, Dipanjan SEN
REPRESENTING OCCLUSION WHEN RENDERING FOR COMPUTER-MEDIATED REALITY SYSTEMS

Publication number: 20200107147

Abstract: In general, techniques are described for modeling occlusions when rendering audio data. A device comprising a memory and one or more processors may perform the techniques. The memory may store audio data representative of a soundfield. The one or more processors may obtain occlusion metadata representative of an occlusion within the soundfield in terms of propagation of sound through the occlusion, the occlusion separating the soundfield into two or more sound spaces. The one or more processors may obtain a location of the device, and obtain, based on the occlusion metadata and the location, a renderer by which to render the audio data into one or more speaker feeds that account for propagation of the sound in one of the two or more sound spaces in which the device resides. The one or more processors may apply the renderer to the audio data to generate the speaker feeds.

Type: Application

Filed: September 26, 2019

Publication date: April 2, 2020

Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Moo Young Kim, Nils Günther Peters, Dipanjan Sen
RENDERING AUDIO DATA FROM INDEPENDENTLY CONTROLLED AUDIO ZONES

Publication number: 20200053505

Abstract: One or more processors may obtain a first distance between a first audio zone of the two or more audio zones associated with the one or more interest points within the first audio zone, and a first device position of a device, obtain a second distance between a second audio zone of the two or more audio zones associated with the one or more interest points within the second audio zone, and the first device position of the device, and obtain an updated first distance and updated second distance after movement of the device has changed from the first device position to a second device position. The one or more processor(s) may independently control the first audio zone and the second audio zone, such that the audio data within the first audio zone and the second audio zone are adjusted based on the updated first distance and updated second distance.

Type: Application

Filed: August 8, 2018

Publication date: February 13, 2020

Inventors: Nils Gunther PETERS, S M Akramus SALEHIN, Shankar THAGADUR SHIVAPPA, Moo Young KIM, Dipanjan SEN

prev 1 2 3 4 5 6 7 … next