Patents by Inventor Dipanjan Sen

Dipanjan Sen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210098004
    Abstract: A first layer of data having a first set of Ambisonic audio components can be decoded where the first set of Ambisonic audio components is generated based on ambience and one or more object-based audio signals. A second layer of data is decoded having at least one of the one or more object-based audio signals. One of the object-based audio signals is subtracted from the first set of Ambisonic audio components. The resulting Ambisonic audio components are rendered to generate a first set of audio channels. The one or more object-based audio signals are spatially rendered to generate a second set of audio channels. Other aspects are described and claimed.
    Type: Application
    Filed: September 26, 2019
    Publication date: April 1, 2021
    Inventors: Dipanjan Sen, Frank Baumgarte, Juha O. Merimaa
  • Patent number: 10952009
    Abstract: An example audio decoding device includes processing circuitry and a memory device coupled to the processing circuitry. The processing circuitry is configured to receive, in a bitstream, encoded representations of audio objects of a three-dimensional (3D) soundfield, to receive metadata associated with the bitstream, to obtain, from the received metadata, one or more transmission factors associated with one or more of the audio objects, and to apply the transmission factors to the one or more audio objects to obtain parallax-adjusted audio objects of the 3D soundfield. The memory device is configured to store at least a portion of the received bitstream, the received metadata, or the parallax-adjusted audio objects of the 3D soundfield.
    Type: Grant
    Filed: April 30, 2020
    Date of Patent: March 16, 2021
    Assignee: Qualcomm Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen
  • Patent number: 10924876
    Abstract: In general, various aspects of the techniques are described for interpolating audio streams. A device comprising a memory and a processor may be configured to perform the techniques. The memory may store the one or more audio streams. The processor may obtain one or more microphone locations, each of the one or more microphone locations identifying a location of a respective one or more microphones that captured each of the corresponding one or more audio streams. The processor may also obtain a listener location identifying a location of a listener, and perform interpolation, based on the one or more microphone locations and the listener location, with respect to the audio streams to obtain an interpolated audio stream. The processor may next obtain, based on the interpolated audio stream, one or more speaker feeds, and output the one or more speaker feeds.
    Type: Grant
    Filed: July 16, 2019
    Date of Patent: February 16, 2021
    Assignee: Qualcomm Incorporated
    Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Dipanjan Sen
  • Patent number: 10909130
    Abstract: The system includes interactive user interfaces that allow a user to select attributes, entities, and statistical measures to query the combined data sets. The system allows users to visually construct queries of the database. The system may automatically generate multiple queries and/or query the database multiple times in response to user interface selections. The query parameters and results can be stored and shared with other users.
    Type: Grant
    Filed: July 1, 2016
    Date of Patent: February 2, 2021
    Assignee: Palantir Technologies Inc.
    Inventors: Shannon Scott, Walker Burgin, Hem Wadhar, Grace Wang, Christopher Li, Michael Tuer, Dipanjan Sen, Stephen Klapper
  • Publication number: 20200409995
    Abstract: A device with microphones can generate microphone signals during an audio recording. The device can store, in an electronic audio data file, the microphone signals, and metadata that includes impulse responses of the microphones. Other aspects are described and claimed.
    Type: Application
    Filed: June 11, 2020
    Publication date: December 31, 2020
    Inventors: Jonathan D. Sheaffer, Symeon Delikaris Manias, Gaetan R. Lorho, Peter A. Raffensperger, Eric A. Allamanche, Frank Baumgarte, Dipanjan Sen, Joshua D. Atkins, Juha O. Merimaa
  • Publication number: 20200335113
    Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.
    Type: Application
    Filed: May 6, 2020
    Publication date: October 22, 2020
    Inventors: Moo Young Kim, Nils Günther Peters, Shankar Thagadur Shivappa, Dipanjan Sen
  • Publication number: 20200304935
    Abstract: In general, techniques are described for rendering metadata to control user movement based audio rendering. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store audio data representative of a soundfield. The one or more processors may be coupled to the memory, and configured to obtain rendering metadata indicative of controls for enabling or disabling adaptations, based on an indication of a movement of a user of the device, of a renderer used to render audio data representative of a soundfield, specify, in a bitstream representative of the audio data, the rendering metadata, and output the bitstream.
    Type: Application
    Filed: March 18, 2020
    Publication date: September 24, 2020
    Inventors: Nils Günther Peters, Moo Young Kim, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Isaac Garcia Munoz, Dipanjan Sen
  • Patent number: 10770087
    Abstract: In general, techniques are described for performing codebook selection when coding vectors decomposed from higher-order ambisonic coefficients. A device comprising a memory and a processor may perform the techniques. The memory may be configured to store a plurality of codebooks to use when performing vector dequantization with respect to a vector quantized spatial component of a soundfield. The vector quantized spatial component may be obtained through application of a decomposition to a plurality of higher order ambisonic coefficients. The processor may be configured to select one of the plurality of codebooks.
    Type: Grant
    Filed: May 14, 2015
    Date of Patent: September 8, 2020
    Assignee: Qualcomm Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen
  • Publication number: 20200260210
    Abstract: An example audio decoding device includes processing circuitry and a memory device coupled to the processing circuitry. The processing circuitry is configured to receive, in a bitstream, encoded representations of audio objects of a three-dimensional (3D) soundfield, to receive metadata associated with the bitstream, to obtain, from the received metadata, one or more transmission factors associated with one or more of the audio objects, and to apply the transmission factors to the one or more audio objects to obtain parallax-adjusted audio objects of the 3D soundfield. The memory device is configured to store at least a portion of the received bitstream, the received metadata, or the parallax-adjusted audio objects of the 3D soundfield.
    Type: Application
    Filed: April 30, 2020
    Publication date: August 13, 2020
    Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen
  • Patent number: 10728689
    Abstract: Methods, systems, computer-readable media, and apparatuses for characterizing portions of a soundfield are presented. Some configurations include estimating a total energy of a soundfield associated with a scene space and, for each of at least some of a plurality of regions of the scene space, estimating an energy of a portion of the soundfield that corresponds to the region and creating a corresponding metadata field that indicates a location of the region within the space and a relation between the estimated total energy and the estimated energy that corresponds to the region.
    Type: Grant
    Filed: December 13, 2018
    Date of Patent: July 28, 2020
    Assignee: Qualcomm Incorporated
    Inventors: Siddhartha Goutham Swaminathan, S M Akramus Salehin, Dipanjan Sen, Michael Ericson
  • Publication number: 20200204939
    Abstract: A device for processing coded audio is disclosed. The device is configured to store an audio object and audio object metadata associated with the audio object. The audio object metadata includes frequency dependent beam pattern metadata. The device may apply, based on the frequency dependent beam pattern metadata, a renderer to the audio object to obtain one or more speaker feeds and output the one or more speaker feeds.
    Type: Application
    Filed: December 18, 2019
    Publication date: June 25, 2020
    Inventors: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Dipanjan Sen
  • Patent number: 10693936
    Abstract: In one example, a device for retrieving audio data includes one or more processors configured to receive availability data representative of a plurality of available adaptation sets, the available adaptation sets including a scene-based audio adaptation set and one or more object-based audio adaptation sets, receive selection data identifying which of the scene-based audio adaptation set and the one or more object-based audio adaptation sets are to be retrieved, and provide instruction data to a streaming client to cause the streaming client to retrieve data for each of the adaptation sets identified by the selection data, and a memory configured to store the retrieved data for the audio adaptation sets.
    Type: Grant
    Filed: August 24, 2016
    Date of Patent: June 23, 2020
    Assignee: QUALCOMM Incorporated
    Inventors: Thomas Stockhammer, Dipanjan Sen, Nils Günther Peters, Moo Young Kim
  • Publication number: 20200196086
    Abstract: Methods, systems, computer-readable media, and apparatuses for characterizing portions of a soundfield are presented. Some configurations include estimating a total energy of a soundfield associated with a scene space and, for each of at least some of a plurality of regions of the scene space, estimating an energy of a portion of the soundfield that corresponds to the region and creating a corresponding metadata field that indicates a location of the region within the space and a relation between the estimated total energy and the estimated energy that corresponds to the region.
    Type: Application
    Filed: December 13, 2018
    Publication date: June 18, 2020
    Inventors: Siddhartha Goutham SWAMINATHAN, S M Akramus SALEHIN, Dipanjan SEN, Michael ERICSON
  • Patent number: 10659906
    Abstract: An example audio decoding device includes processing circuitry and a memory device coupled to the processing circuitry. The processing circuitry is configured to receive, in a bitstream, encoded representations of audio objects of a three-dimensional (3D) soundfield, to receive metadata associated with the bitstream, to obtain, from the received metadata, one or more transmission factors associated with one or more of the audio objects, and to apply the transmission factors to the one or more audio objects to obtain parallax-adjusted audio objects of the 3D soundfield. The memory device is configured to store at least a portion of the received bitstream, the received metadata, or the parallax-adjusted audio objects of the 3D soundfield.
    Type: Grant
    Filed: January 11, 2018
    Date of Patent: May 19, 2020
    Assignee: Qualcomm Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters, Dipanjan Sen
  • Patent number: 10657974
    Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.
    Type: Grant
    Filed: December 20, 2018
    Date of Patent: May 19, 2020
    Assignee: Qualcomm Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters, Shankar Thagadur Shivappa, Dipanjan Sen
  • Publication number: 20200120438
    Abstract: In general, techniques are described for recursively defined audio metadata. A device comprising one or more memories and one or more processors may be configured to perform various aspects of the techniques. The one or more memories may store at least a portion of the bitstream. The one or more processors may obtain, from the bitstream, recursively defined audio metadata, and obtain, from the bitstream, a representation of the audio data. The one or more processors may process, based on the recursively defined audio metadata, the representation of the audio data to obtain one or more speaker feeds, and output the one or more speaker feeds to one or more speakers.
    Type: Application
    Filed: September 26, 2019
    Publication date: April 16, 2020
    Inventors: Moo Young Kim, Shankar Thagadur Shivappa, Dipanjan Sen, Ferdinando Olivieri
  • Publication number: 20200112814
    Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.
    Type: Application
    Filed: September 11, 2019
    Publication date: April 9, 2020
    Inventors: Moo Young Kim, Nils Günther Peters, S M Akramus Salehin, Siddhartha Goutham Swaminathan, Dipanjan Sen
  • Publication number: 20200107118
    Abstract: A device to apply noise reduction to ambisonic signals includes a memory configured to store noise data corresponding to microphones in a microphone array. A processor is configured to perform signal processing operations on signals captured by microphones in the microphone array to generate multiple sets of ambisonic signals including a first set corresponding to a first particular ambisonic order and a second set corresponding to a second particular ambisonic order. The processor is configured to perform a first noise reduction operation that includes applying a first gain factor to each ambisonic signal in the first set and to perform a second noise reduction operation that includes applying a second gain factor to each ambisonic signal in the second set. The first gain factor and the second gain factor are based on the noise data, and the second gain factor is distinct from the first gain factor.
    Type: Application
    Filed: March 13, 2019
    Publication date: April 2, 2020
    Inventors: S M Akramus SALEHIN, Dipanjan SEN
  • Publication number: 20200107147
    Abstract: In general, techniques are described for modeling occlusions when rendering audio data. A device comprising a memory and one or more processors may perform the techniques. The memory may store audio data representative of a soundfield. The one or more processors may obtain occlusion metadata representative of an occlusion within the soundfield in terms of propagation of sound through the occlusion, the occlusion separating the soundfield into two or more sound spaces. The one or more processors may obtain a location of the device, and obtain, based on the occlusion metadata and the location, a renderer by which to render the audio data into one or more speaker feeds that account for propagation of the sound in one of the two or more sound spaces in which the device resides. The one or more processors may apply the renderer to the audio data to generate the speaker feeds.
    Type: Application
    Filed: September 26, 2019
    Publication date: April 2, 2020
    Inventors: Isaac Garcia Munoz, Siddhartha Goutham Swaminathan, S M Akramus Salehin, Moo Young Kim, Nils Günther Peters, Dipanjan Sen
  • Publication number: 20200053505
    Abstract: One or more processors may obtain a first distance between a first audio zone of the two or more audio zones associated with the one or more interest points within the first audio zone, and a first device position of a device, obtain a second distance between a second audio zone of the two or more audio zones associated with the one or more interest points within the second audio zone, and the first device position of the device, and obtain an updated first distance and updated second distance after movement of the device has changed from the first device position to a second device position. The one or more processor(s) may independently control the first audio zone and the second audio zone, such that the audio data within the first audio zone and the second audio zone are adjusted based on the updated first distance and updated second distance.
    Type: Application
    Filed: August 8, 2018
    Publication date: February 13, 2020
    Inventors: Nils Gunther PETERS, S M Akramus SALEHIN, Shankar THAGADUR SHIVAPPA, Moo Young KIM, Dipanjan SEN