Patents by Inventor Deepen Sinha
Deepen Sinha has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12640163Abstract: The invention provides a method for identifying similarity between two audio files or tracks. The method comprises receiving a processed audio file and an original audio file, uncompressing the processed audio file, applying global loudness normalization and short-term loudness normalization on the processed audio file and the original audio file, converting the processed audio file and the original audio file into processed spectral image by time-frequency mapping, scaling, using linear interpolation, the processed spectral image, dividing the scaled-up processed spectral image into slices, searching for minimum Sum of Absolute Difference (SAD), using original spectral image as reference, for each slice.Type: GrantFiled: January 19, 2024Date of Patent: May 26, 2026Assignee: Audio Technologies and Codecs, Inc.Inventors: Deepen Sinha, Mohd Aamir Khan
-
Publication number: 20250104739Abstract: Systems and methods are presented for efficient cross-fading of compressed domain information streams on a user/client device. Exemplary systems may provide cross-fade between AAC/Enhanced AAC Plus information streams, between MP3 information streams, or between information streams of unmatched formats. These systems are distinguished in that cross-fade is directly applied to compressed bitstreams so a single decode operation is performed on the resulting bitstream. Thus, a set of frames from each input stream associated with the time interval in which a cross fade is decoded, and combined and recoded with a cross fade or other effect now in the compressed bitstream. Once sent through the client device's decoder, the user hears the transitional effect. The only input data that is decoded and processed is that associated with the portion of each stream used the crossfade, blend or other interstitial, and thus the vast majority of input streams are left compressed.Type: ApplicationFiled: April 15, 2024Publication date: March 27, 2025Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
-
Publication number: 20250069592Abstract: The invention provides a method and a system for hierarchical audio classification. For getting high accuracy prediction with high resolution and low predictor complexity, the disclosed method uses a hierarchical classification approach with stateful prediction per frame aided by parallel AI transient detector for resetting the states of all stages at class transitions. To improve accuracy perfectly tagged database by innovative techniques of labeling are utilized. Further data augmentation is also done using signal processing techniques like audio mixing, blending of different type of data. The disclosed method applies short term audio normalization on database for normalized training and prediction of AI based Long Short-Term Memory (LSTM) networks. The disclosed method then uses a novel hierarchical classification approach with stateful LSTM prediction per frame aided by a parallel transient detector for resetting the states of all stages of hierarchical LSTM classifiers at class transitions.Type: ApplicationFiled: January 19, 2024Publication date: February 27, 2025Applicant: Audio Technologies and Codecs, Inc.Inventors: Deepen Sinha, Mohd Aamir Khan
-
Publication number: 20250069591Abstract: This invention provides a method and system for hierarchical audio classification, aimed at achieving high accuracy in prediction with enhanced decision time resolution. The disclosed method uses a perfectly tagged database which utilizes innovative techniques of labelling. Further data augmentation is also done using signal processing techniques like audio mixing and blending of different types of data. The disclosed method applies short term audio normalization on database for normalized training and prediction using AI based Long Short-Term Memory (LSTM) networks. The method and system employ the LSTM networks in a hierarchical structure to classify audio into desired 3 or more audio classes which include at least a background noise audio class. Decision time accuracy is improved by running the LSTM predictors over time overlapped slices and by using a separate transition detection neural network.Type: ApplicationFiled: January 19, 2024Publication date: February 27, 2025Inventors: Deepen Sinha, Mohd Aamir Khan, Anush Kapoor
-
Publication number: 20250069618Abstract: The invention provides a method for identifying similarity between two audio files or tracks. The method comprises receiving a processed audio file and an original audio file, uncompressing the processed audio file, applying global loudness normalization and short-term loudness normalization on the processed audio file and the original audio file, converting the processed audio file and the original audio file into processed spectral image by time-frequency mapping, scaling, using linear interpolation, the processed spectral image, dividing the scaled-up processed spectral image into slices, searching for minimum Sum of Absolute Difference (SAD), using original spectral image as reference, for each slice.Type: ApplicationFiled: January 19, 2024Publication date: February 27, 2025Applicant: Audio Technologies and Codecs, Inc.Inventors: Deepen Sinha, Mohd Aamir Khan
-
Patent number: 11961538Abstract: Systems and methods are presented for efficient cross-fading of compressed domain information streams on a user/client device. Exemplary systems may provide cross-fade between AAC/Enhanced AAC Plus information streams, between MP3 information streams, or between information streams of unmatched formats. These systems are distinguished in that cross-fade is directly applied to compressed bitstreams so a single decode operation is performed on the resulting bitstream. Thus, a set of frames from each input stream associated with the time interval in which a cross fade is decoded, and combined and recoded with a cross fade or other effect now in the compressed bitstream. Once sent through the client device's decoder, the user hears the transitional effect. The only input data that is decoded and processed is that associated with the portion of each stream used the crossfade, blend or other interstitial, and thus the vast majority of input streams are left compressed.Type: GrantFiled: November 9, 2021Date of Patent: April 16, 2024Assignee: Sirius XM Radio Inc.Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
-
Publication number: 20220328051Abstract: Systems and methods are presented for efficient cross-fading of compressed domain information streams on a user/client device. Exemplary systems may provide cross-fade between AAC/Enhanced AAC Plus information streams, between MP3 information streams, or between information streams of unmatched formats. These systems are distinguished in that cross-fade is directly applied to compressed bitstreams so a single decode operation is performed on the resulting bitstream. Thus, a set of frames from each input stream associated with the time interval in which a cross fade is decoded, and combined and recoded with a cross fade or other effect now in the compressed bitstream. Once sent through the client device's decoder, the user hears the transitional effect. The only input data that is decoded and processed is that associated with the portion of each stream used the crossfade, blend or other interstitial, and thus the vast majority of input streams are left compressed.Type: ApplicationFiled: November 9, 2021Publication date: October 13, 2022Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
-
Patent number: 11170791Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPlus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.Type: GrantFiled: July 30, 2019Date of Patent: November 9, 2021Assignee: Sirius XM Radio Inc.Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
-
Publication number: 20200202871Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPlus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.Type: ApplicationFiled: July 30, 2019Publication date: June 25, 2020Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
-
Patent number: 10366694Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPlus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.Type: GrantFiled: October 2, 2017Date of Patent: July 30, 2019Assignee: Sirius XM Radio Inc.Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
-
Patent number: 10096326Abstract: Systems and methods for increasing transmission bandwidth efficiency by the analysis and synthesis of the ultimate components of transmitted content are presented. To implement such a system, a dictionary or database of elemental codewords can be generated from a set of audio clips. Using such a database, a given arbitrary song or other audio file can be expressed as a series of such codewords, where each given codeword in the series is a compressed audio packet that can be used as is, or, for example, can be tagged to be modified to better match the corresponding portion of the original audio file. Each codeword in the database has an index number or unique identifier. For a relatively small number of bits used in a unique ID, e.g. 27-30, several hundreds of millions of codewords can be uniquely identified.Type: GrantFiled: September 15, 2017Date of Patent: October 9, 2018Assignee: Sirius XM Radio Inc.Inventors: Paul Marko, Deepen Sinha, Hariom Aggrawal
-
Publication number: 20180068665Abstract: Systems and methods for increasing transmission bandwidth efficiency by the analysis and synthesis of the ultimate components of transmitted content are presented. To implement such a system, a dictionary or database of elemental codewords can be generated from a set of audio clips. Using such a database, a given arbitrary song or other audio file can be expressed as a series of such codewords, where each given codeword in the series is a compressed audio packet that can be used as is, or, for example, can be tagged to be modified to better match the corresponding portion of the original audio file. Each codeword in the database has an index number or unique identifier. For a relatively small number of bits used in a unique ID, e.g. 27-30, several hundreds of millions of codewords can be uniquely identified.Type: ApplicationFiled: September 15, 2017Publication date: March 8, 2018Inventors: Paul Marko, Deepen Sinha, Hariom Aggrawal
-
Publication number: 20180025735Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPlus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.Type: ApplicationFiled: October 2, 2017Publication date: January 25, 2018Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
-
Patent number: 9779736Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPIus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.Type: GrantFiled: April 17, 2013Date of Patent: October 3, 2017Assignee: Sirius XM Radio Inc.Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
-
Patent number: 9767812Abstract: Systems and methods for increasing transmission bandwidth efficiency by the analysis and synthesis of the ultimate components of transmitted content are presented. To implement such a system, a dictionary or database of elemental codewords can be generated from a set of audio clips. Using such a database, a given arbitrary song or other audio file can be expressed as a series of such codewords, where each given codeword in the series is a compressed audio packet that can be used as is, or, for example, can be tagged to be modified to better match the corresponding portion of the original audio file. Each codeword in the database has an index number or unique identifier. For a relatively small number of bits used in a unique ID, e.g. 27-30, several hundreds of millions of codewords can be uniquely identified.Type: GrantFiled: March 26, 2014Date of Patent: September 19, 2017Assignee: Sirus XM Radio Inc.Inventors: Paul Marko, Deepen Sinha, Hariom Aggrawal
-
Publication number: 20150142456Abstract: Systems and methods are presented for efficient cross-fading (or other multiple clip processing) of compressed domain information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Exemplary implementation systems may provide cross-fade between AAC/Enhanced AAC Plus (EAACPIus) information streams or between MP3 information streams or even between information streams of unmatched formats (e.g. AAC to MP3 or MP3 to AAC). Furthermore, these systems are distinguished by the fact that cross-fade is directly applied to the compressed bitstreams so that a single decode operation may be performed on the resulting bitstream. Moreover, using the described methods, similar cross fade in the compressed domain between information streams utilizing other formats of compression, such as, for example, MP2, AC-3, PAC, etc. can also be advantageously implemented.Type: ApplicationFiled: April 17, 2013Publication date: May 21, 2015Inventors: Raymond Lowe, Mark Kalman, Deepen Sinha, Christopher Ward
-
Publication number: 20140297292Abstract: Systems and methods for increasing transmission bandwidth efficiency by the analysis and synthesis of the ultimate components of transmitted content are presented. To implement such a system, a dictionary or database of elemental codewords can be generated from a set of audio clips. Using such a database, a given arbitrary song or other audio file can be expressed as a series of such codewords, where each given codeword in the series is a compressed audio packet that can be used as is, or, for example, can be tagged to be modified to better match the corresponding portion of the original audio file. Each codeword in the database has an index number or unique identifier. For a relatively small number of bits used in a unique ID, e.g. 27-30, several hundreds of millions of codewords can be uniquely identified.Type: ApplicationFiled: March 26, 2014Publication date: October 2, 2014Applicant: Sirius XM Radio Inc.Inventors: Paul Marko, Deepen Sinha, Hariom Aggrawal
-
Patent number: 7953605Abstract: A novel bandwidth extension technique allows information to be encoded and decoded using a fractal self similarity model or an accurate spectral replacement model, or both. Also a multi-band temporal amplitude coding technique, useful as an enhancement to any coding/decoding technique, helps with accurate reconstruction of the temporal envelope and employs a utility filterbank. A perceptual coder using a comodulation masking release model, operating typically with more conventional perceptual coders, makes the perceptual model more accurate and hence increases the efficiency of the overall perceptual coder.Type: GrantFiled: October 6, 2006Date of Patent: May 31, 2011Inventors: Deepen Sinha, Anibal J. S. Ferreira, Erumbi Vallabhan Harinarayanan
-
Publication number: 20090125722Abstract: A method and system for generating and controlling access to copy-protected digital media files. Digital media content is obtained and encoded in electronic file using a media codec. The encoded media content is encrypted in the electronic file and a multi-format renderer configured to render the encoded, encrypted electronic file is embedded in the electronic file. When the digital file is accessed, the multi-format renderer generates an invocation code identifying an operation-type in response to a requested operation. A transaction ID storing a user-access policy and associated with the electronic file is retrieved and compared to the invocation code. Based on a result of the comparison of the invocation code and the user-access policy, the multi-format renderer selectively allows the invocation code.Type: ApplicationFiled: September 12, 2008Publication date: May 14, 2009Applicant: iMedia Streams, LLCInventors: Ahmed A. Gomaa, Deepen Sinha
-
Publication number: 20070238415Abstract: A novel bandwidth extension technique allows information to be encoded and decoded using a fractal self similarity model or an accurate spectral replacement model, or both. Also a multi-band temporal amplitude coding technique, useful as an enhancement to any coding/decoding technique, helps with accurate reconstruction of the temporal envelope and employs a utility filterbank. A perceptual coder using a comodulation masking release model, operating typically with more conventional perceptual coders, makes the perceptual model more accurate and hence increases the efficiency of the overall perceptual coder.Type: ApplicationFiled: October 6, 2006Publication date: October 11, 2007Inventors: Deepen Sinha, Anibal Ferreira, Erumbi Harinarayanan