Patents by Inventor Deepen Sinha

Deepen Sinha has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12640163
    Abstract: The invention provides a method for identifying similarity between two audio files or tracks. The method comprises receiving a processed audio file and an original audio file, uncompressing the processed audio file, applying global loudness normalization and short-term loudness normalization on the processed audio file and the original audio file, converting the processed audio file and the original audio file into processed spectral image by time-frequency mapping, scaling, using linear interpolation, the processed spectral image, dividing the scaled-up processed spectral image into slices, searching for minimum Sum of Absolute Difference (SAD), using original spectral image as reference, for each slice.
    Type: Grant
    Filed: January 19, 2024
    Date of Patent: May 26, 2026
    Assignee: Audio Technologies and Codecs, Inc.
    Inventors: Deepen Sinha, Mohd Aamir Khan
  • Publication number: 20250104739
    Abstract: Systems and methods are presented for efficient cross-fading of compressed domain information streams on a user/client device. Exemplary systems may provide cross-fade between AAC/Enhanced AAC Plus information streams, between MP3 information streams, or between information streams of unmatched formats. These systems are distinguished in that cross-fade is directly applied to compressed bitstreams so a single decode operation is performed on the resulting bitstream. Thus, a set of frames from each input stream associated with the time interval in which a cross fade is decoded, and combined and recoded with a cross fade or other effect now in the compressed bitstream. Once sent through the client device's decoder, the user hears the transitional effect. The only input data that is decoded and processed is that associated with the portion of each stream used the crossfade, blend or other interstitial, and thus the vast majority of input streams are left compressed.
    Type: Application
    Filed: April 15, 2024
    Publication date: March 27, 2025
    Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
  • Publication number: 20250069592
    Abstract: The invention provides a method and a system for hierarchical audio classification. For getting high accuracy prediction with high resolution and low predictor complexity, the disclosed method uses a hierarchical classification approach with stateful prediction per frame aided by parallel AI transient detector for resetting the states of all stages at class transitions. To improve accuracy perfectly tagged database by innovative techniques of labeling are utilized. Further data augmentation is also done using signal processing techniques like audio mixing, blending of different type of data. The disclosed method applies short term audio normalization on database for normalized training and prediction of AI based Long Short-Term Memory (LSTM) networks. The disclosed method then uses a novel hierarchical classification approach with stateful LSTM prediction per frame aided by a parallel transient detector for resetting the states of all stages of hierarchical LSTM classifiers at class transitions.
    Type: Application
    Filed: January 19, 2024
    Publication date: February 27, 2025
    Applicant: Audio Technologies and Codecs, Inc.
    Inventors: Deepen Sinha, Mohd Aamir Khan
  • Publication number: 20250069591
    Abstract: This invention provides a method and system for hierarchical audio classification, aimed at achieving high accuracy in prediction with enhanced decision time resolution. The disclosed method uses a perfectly tagged database which utilizes innovative techniques of labelling. Further data augmentation is also done using signal processing techniques like audio mixing and blending of different types of data. The disclosed method applies short term audio normalization on database for normalized training and prediction using AI based Long Short-Term Memory (LSTM) networks. The method and system employ the LSTM networks in a hierarchical structure to classify audio into desired 3 or more audio classes which include at least a background noise audio class. Decision time accuracy is improved by running the LSTM predictors over time overlapped slices and by using a separate transition detection neural network.
    Type: Application
    Filed: January 19, 2024
    Publication date: February 27, 2025
    Inventors: Deepen Sinha, Mohd Aamir Khan, Anush Kapoor
  • Publication number: 20250069618
    Abstract: The invention provides a method for identifying similarity between two audio files or tracks. The method comprises receiving a processed audio file and an original audio file, uncompressing the processed audio file, applying global loudness normalization and short-term loudness normalization on the processed audio file and the original audio file, converting the processed audio file and the original audio file into processed spectral image by time-frequency mapping, scaling, using linear interpolation, the processed spectral image, dividing the scaled-up processed spectral image into slices, searching for minimum Sum of Absolute Difference (SAD), using original spectral image as reference, for each slice.
    Type: Application
    Filed: January 19, 2024
    Publication date: February 27, 2025
    Applicant: Audio Technologies and Codecs, Inc.
    Inventors: Deepen Sinha, Mohd Aamir Khan
  • Patent number: 11961538
    Abstract: Systems and methods are presented for efficient cross-fading of compressed domain information streams on a user/client device. Exemplary systems may provide cross-fade between AAC/Enhanced AAC Plus information streams, between MP3 information streams, or between information streams of unmatched formats. These systems are distinguished in that cross-fade is directly applied to compressed bitstreams so a single decode operation is performed on the resulting bitstream. Thus, a set of frames from each input stream associated with the time interval in which a cross fade is decoded, and combined and recoded with a cross fade or other effect now in the compressed bitstream. Once sent through the client device's decoder, the user hears the transitional effect. The only input data that is decoded and processed is that associated with the portion of each stream used the crossfade, blend or other interstitial, and thus the vast majority of input streams are left compressed.
    Type: Grant
    Filed: November 9, 2021
    Date of Patent: April 16, 2024
    Assignee: Sirius XM Radio Inc.
    Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
  • Publication number: 20220328051
    Abstract: Systems and methods are presented for efficient cross-fading of compressed domain information streams on a user/client device. Exemplary systems may provide cross-fade between AAC/Enhanced AAC Plus information streams, between MP3 information streams, or between information streams of unmatched formats. These systems are distinguished in that cross-fade is directly applied to compressed bitstreams so a single decode operation is performed on the resulting bitstream. Thus, a set of frames from each input stream associated with the time interval in which a cross fade is decoded, and combined and recoded with a cross fade or other effect now in the compressed bitstream. Once sent through the client device's decoder, the user hears the transitional effect. The only input data that is decoded and processed is that associated with the portion of each stream used the crossfade, blend or other interstitial, and thus the vast majority of input streams are left compressed.
    Type: Application
    Filed: November 9, 2021
    Publication date: October 13, 2022
    Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
  • Patent number: 11170791
    Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPlus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.
    Type: Grant
    Filed: July 30, 2019
    Date of Patent: November 9, 2021
    Assignee: Sirius XM Radio Inc.
    Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
  • Publication number: 20200202871
    Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPlus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.
    Type: Application
    Filed: July 30, 2019
    Publication date: June 25, 2020
    Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
  • Patent number: 10366694
    Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPlus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.
    Type: Grant
    Filed: October 2, 2017
    Date of Patent: July 30, 2019
    Assignee: Sirius XM Radio Inc.
    Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
  • Patent number: 10096326
    Abstract: Systems and methods for increasing transmission bandwidth efficiency by the analysis and synthesis of the ultimate components of transmitted content are presented. To implement such a system, a dictionary or database of elemental codewords can be generated from a set of audio clips. Using such a database, a given arbitrary song or other audio file can be expressed as a series of such codewords, where each given codeword in the series is a compressed audio packet that can be used as is, or, for example, can be tagged to be modified to better match the corresponding portion of the original audio file. Each codeword in the database has an index number or unique identifier. For a relatively small number of bits used in a unique ID, e.g. 27-30, several hundreds of millions of codewords can be uniquely identified.
    Type: Grant
    Filed: September 15, 2017
    Date of Patent: October 9, 2018
    Assignee: Sirius XM Radio Inc.
    Inventors: Paul Marko, Deepen Sinha, Hariom Aggrawal
  • Publication number: 20180068665
    Abstract: Systems and methods for increasing transmission bandwidth efficiency by the analysis and synthesis of the ultimate components of transmitted content are presented. To implement such a system, a dictionary or database of elemental codewords can be generated from a set of audio clips. Using such a database, a given arbitrary song or other audio file can be expressed as a series of such codewords, where each given codeword in the series is a compressed audio packet that can be used as is, or, for example, can be tagged to be modified to better match the corresponding portion of the original audio file. Each codeword in the database has an index number or unique identifier. For a relatively small number of bits used in a unique ID, e.g. 27-30, several hundreds of millions of codewords can be uniquely identified.
    Type: Application
    Filed: September 15, 2017
    Publication date: March 8, 2018
    Inventors: Paul Marko, Deepen Sinha, Hariom Aggrawal
  • Publication number: 20180025735
    Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPlus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.
    Type: Application
    Filed: October 2, 2017
    Publication date: January 25, 2018
    Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
  • Patent number: 9779736
    Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPIus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.
    Type: Grant
    Filed: April 17, 2013
    Date of Patent: October 3, 2017
    Assignee: Sirius XM Radio Inc.
    Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
  • Patent number: 9767812
    Abstract: Systems and methods for increasing transmission bandwidth efficiency by the analysis and synthesis of the ultimate components of transmitted content are presented. To implement such a system, a dictionary or database of elemental codewords can be generated from a set of audio clips. Using such a database, a given arbitrary song or other audio file can be expressed as a series of such codewords, where each given codeword in the series is a compressed audio packet that can be used as is, or, for example, can be tagged to be modified to better match the corresponding portion of the original audio file. Each codeword in the database has an index number or unique identifier. For a relatively small number of bits used in a unique ID, e.g. 27-30, several hundreds of millions of codewords can be uniquely identified.
    Type: Grant
    Filed: March 26, 2014
    Date of Patent: September 19, 2017
    Assignee: Sirus XM Radio Inc.
    Inventors: Paul Marko, Deepen Sinha, Hariom Aggrawal
  • Publication number: 20150142456
    Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPIus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.
    Type: Application
    Filed: April 17, 2013
    Publication date: May 21, 2015
    Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
  • Publication number: 20140297292
    Abstract: Systems and methods for increasing transmission bandwidth efficiency by the analysis and synthesis of the ultimate components of transmitted content are presented. To implement such a system, a dictionary or database of elemental codewords can be generated from a set of audio clips. Using such a database, a given arbitrary song or other audio file can be expressed as a series of such codewords, where each given codeword in the series is a compressed audio packet that can be used as is, or, for example, can be tagged to be modified to better match the corresponding portion of the original audio file. Each codeword in the database has an index number or unique identifier. For a relatively small number of bits used in a unique ID, e.g. 27-30, several hundreds of millions of codewords can be uniquely identified.
    Type: Application
    Filed: March 26, 2014
    Publication date: October 2, 2014
    Applicant: Sirius XM Radio Inc.
    Inventors: Paul Marko, Deepen Sinha, Hariom Aggrawal
  • Patent number: 7953605
    Abstract: A novel bandwidth extension technique allows information to be encoded and decoded using a fractal self similarity model or an accurate spectral replacement model, or both. Also a multi-band temporal amplitude coding technique, useful as an enhancement to any coding/decoding technique, helps with accurate reconstruction of the temporal envelope and employs a utility filterbank. A perceptual coder using a comodulation masking release model, operating typically with more conventional perceptual coders, makes the perceptual model more accurate and hence increases the efficiency of the overall perceptual coder.
    Type: Grant
    Filed: October 6, 2006
    Date of Patent: May 31, 2011
    Inventors: Deepen Sinha, Anibal J. S. Ferreira, Erumbi Vallabhan Harinarayanan
  • Publication number: 20090125722
    Abstract: A method and system for generating and controlling access to copy-protected digital media files. Digital media content is obtained and encoded in electronic file using a media codec. The encoded media content is encrypted in the electronic file and a multi-format renderer configured to render the encoded, encrypted electronic file is embedded in the electronic file. When the digital file is accessed, the multi-format renderer generates an invocation code identifying an operation-type in response to a requested operation. A transaction ID storing a user-access policy and associated with the electronic file is retrieved and compared to the invocation code. Based on a result of the comparison of the invocation code and the user-access policy, the multi-format renderer selectively allows the invocation code.
    Type: Application
    Filed: September 12, 2008
    Publication date: May 14, 2009
    Applicant: iMedia Streams, LLC
    Inventors: Ahmed A. Gomaa, Deepen Sinha
  • Publication number: 20070238415
    Abstract: A novel bandwidth extension technique allows information to be encoded and decoded using a fractal self similarity model or an accurate spectral replacement model, or both. Also a multi-band temporal amplitude coding technique, useful as an enhancement to any coding/decoding technique, helps with accurate reconstruction of the temporal envelope and employs a utility filterbank. A perceptual coder using a comodulation masking release model, operating typically with more conventional perceptual coders, makes the perceptual model more accurate and hence increases the efficiency of the overall perceptual coder.
    Type: Application
    Filed: October 6, 2006
    Publication date: October 11, 2007
    Inventors: Deepen Sinha, Anibal Ferreira, Erumbi Harinarayanan