Patents by Inventor Prem Seetharaman
Prem Seetharaman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11907288Abstract: Example systems and methods are audio identification based on data structure are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to execute a constant Q transform on query time slices of query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, sequentially order the two-dimensional Fourier transforms in a query data structure, and identify the query audio as a cover rendition of reference audio based on a comparison between the query data structure and a reference data structure associated with the reference audio.Type: GrantFiled: July 13, 2020Date of Patent: February 20, 2024Assignee: Gracenote, Inc.Inventors: Zafar Rafii, Prem Seetharaman
-
Publication number: 20230008776Abstract: Example systems and methods for automated cover song identification are disclosed. An example apparatus includes at least one memory, machine-readable instructions, and one or more processors to execute the machine-readable instructions to at least execute a constant Q transform on time slices of first audio data to output constant Q transformed time slices, binarize the constant Q transformed time slices to output binarized and constant Q transformed time slices, execute a two-dimensional Fourier transform on time windows within the binarized and constant Q transformed time slices to output two-dimensional Fourier transforms of the time windows, generate a reference data structure based on a sequential order of the two-dimensional Fourier transforms, store the reference data structure in a database, and identify a query data structure associated with query audio data as a cover rendition of the audio data based on a comparison of the query and reference data structures.Type: ApplicationFiled: September 16, 2022Publication date: January 12, 2023Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
-
Patent number: 11461390Abstract: Example systems and methods for automated cover song identification are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to identify query audio from a content source based on a search query using rights metadata associated with the query audio, execute a constant Q transform on query time slices of the query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, generate a query data structure based on a sequential order of the two-dimensional Fourier transforms, select a subset including reference audio of a reference database based on the rights metadata, and identify the query audio as a cover rendition of the reference audio based on a comparison between the query and reference data structures.Type: GrantFiled: October 7, 2020Date of Patent: October 4, 2022Assignee: Gracenote, Inc.Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
-
Patent number: 11138989Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for sound quality prediction and real-time feedback about sound quality, such as room acoustics quality and background noise. Audio data can be sampled from a live sound source and stored in an audio buffer. The audio data in the buffer is analyzed to calculate a stream of values of one or more sound quality measures, such as speech transmission index and signal-to-noise ratio. Speech transmission index can be calculated using a convolution neural network configured to predict speech transmission index from reverberant speech. The stream of values can be used to provide real-time feedback about sound quality of the audio data. For example, a visual indicator on a graphical user interface can be updated based on consistency of the values over time. The real-time feedback about sound quality can help users optimize their recording setup.Type: GrantFiled: March 7, 2019Date of Patent: October 5, 2021Assignee: Adobe Inc.Inventors: Prem Seetharaman, Gautham J. Mysore, Bryan A. Pardo
-
Publication number: 20210034665Abstract: Example systems and methods for automated cover song identification are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to identify query audio from a content source based on a search query using rights metadata associated with the query audio, execute a constant Q transform on query time slices of the query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, generate a query data structure based on a sequential order of the two-dimensional Fourier transforms, select a subset including reference audio of a reference database based on the rights metadata, and identify the query audio as a cover rendition of the reference audio based on a comparison between the query and reference data structures.Type: ApplicationFiled: October 7, 2020Publication date: February 4, 2021Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
-
Publication number: 20200342024Abstract: Example systems and methods are audio identification based on data structure are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to execute a constant Q transform on query time slices of query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, sequentially order the two-dimensional Fourier transforms in a query data structure, and identify the query audio as a cover rendition of reference audio based on a comparison between the query data structure and a reference data structure associated with the reference audio.Type: ApplicationFiled: July 13, 2020Publication date: October 29, 2020Inventors: Zafar Rafii, Prem Seetharaman
-
Patent number: 10803119Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for automated cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews that occur in cover songs found in content repositories. The systems and methods allow copyright holders to search the content repositories for unlicensed cover song.Type: GrantFiled: September 7, 2017Date of Patent: October 13, 2020Assignee: GRACENOTE, INC.Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
-
Publication number: 20200286504Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for sound quality prediction and real-time feedback about sound quality, such as room acoustics quality and background noise. Audio data can be sampled from a live sound source and stored in an audio buffer. The audio data in the buffer is analyzed to calculate a stream of values of one or more sound quality measures, such as speech transmission index and signal-to-noise ratio. Speech transmission index can be calculated using a convolution neural network configured to predict speech transmission index from reverberant speech. The stream of values can be used to provide real-time feedback about sound quality of the audio data. For example, a visual indicator on a graphical user interface can be updated based on consistency of the values over time. The real-time feedback about sound quality can help users optimize their recording setup.Type: ApplicationFiled: March 7, 2019Publication date: September 10, 2020Inventors: Prem Seetharaman, Gautham J. Mysore, Bryan A. Pardo
-
Patent number: 10713296Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews. In particular, a special data structure provides a time-series representation of audio, and this time-series representation is robust to key changes, timbral changes, and small local tempo deviations. Accordingly, the systems and methods described herein analyze cross-similarity between these time-series representations. In some example embodiments, such systems and methods extract features from an audio fingerprint and calculate a distance measure that is robust and invariant to changes in musical structure.Type: GrantFiled: September 7, 2017Date of Patent: July 14, 2020Assignee: GRACENOTE, INC.Inventors: Zafar Rafii, Prem Seetharaman
-
Publication number: 20180189390Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for automated cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews that occur in cover songs found in content repositories. The systems and methods allow copyright holders to search the content repositories for unlicensed cover song.Type: ApplicationFiled: September 7, 2017Publication date: July 5, 2018Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
-
Publication number: 20180075140Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews. In particular, a special data structure provides a time-series representation of audio, and this time-series representation is robust to key changes, timbral changes, and small local tempo deviations. Accordingly, the systems and methods described herein analyze cross-similarity between these time-series representations. In some example embodiments, such systems and methods extract features from an audio fingerprint and calculate a distance measure that is robust and invariant to changes in musical structure.Type: ApplicationFiled: September 7, 2017Publication date: March 15, 2018Inventors: Zafar Rafii, Prem Seetharaman