Patents by Inventor Prem Seetharaman

Prem Seetharaman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11907288
    Abstract: Example systems and methods are audio identification based on data structure are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to execute a constant Q transform on query time slices of query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, sequentially order the two-dimensional Fourier transforms in a query data structure, and identify the query audio as a cover rendition of reference audio based on a comparison between the query data structure and a reference data structure associated with the reference audio.
    Type: Grant
    Filed: July 13, 2020
    Date of Patent: February 20, 2024
    Assignee: Gracenote, Inc.
    Inventors: Zafar Rafii, Prem Seetharaman
  • Publication number: 20230008776
    Abstract: Example systems and methods for automated cover song identification are disclosed. An example apparatus includes at least one memory, machine-readable instructions, and one or more processors to execute the machine-readable instructions to at least execute a constant Q transform on time slices of first audio data to output constant Q transformed time slices, binarize the constant Q transformed time slices to output binarized and constant Q transformed time slices, execute a two-dimensional Fourier transform on time windows within the binarized and constant Q transformed time slices to output two-dimensional Fourier transforms of the time windows, generate a reference data structure based on a sequential order of the two-dimensional Fourier transforms, store the reference data structure in a database, and identify a query data structure associated with query audio data as a cover rendition of the audio data based on a comparison of the query and reference data structures.
    Type: Application
    Filed: September 16, 2022
    Publication date: January 12, 2023
    Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
  • Patent number: 11461390
    Abstract: Example systems and methods for automated cover song identification are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to identify query audio from a content source based on a search query using rights metadata associated with the query audio, execute a constant Q transform on query time slices of the query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, generate a query data structure based on a sequential order of the two-dimensional Fourier transforms, select a subset including reference audio of a reference database based on the rights metadata, and identify the query audio as a cover rendition of the reference audio based on a comparison between the query and reference data structures.
    Type: Grant
    Filed: October 7, 2020
    Date of Patent: October 4, 2022
    Assignee: Gracenote, Inc.
    Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
  • Patent number: 11138989
    Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for sound quality prediction and real-time feedback about sound quality, such as room acoustics quality and background noise. Audio data can be sampled from a live sound source and stored in an audio buffer. The audio data in the buffer is analyzed to calculate a stream of values of one or more sound quality measures, such as speech transmission index and signal-to-noise ratio. Speech transmission index can be calculated using a convolution neural network configured to predict speech transmission index from reverberant speech. The stream of values can be used to provide real-time feedback about sound quality of the audio data. For example, a visual indicator on a graphical user interface can be updated based on consistency of the values over time. The real-time feedback about sound quality can help users optimize their recording setup.
    Type: Grant
    Filed: March 7, 2019
    Date of Patent: October 5, 2021
    Assignee: Adobe Inc.
    Inventors: Prem Seetharaman, Gautham J. Mysore, Bryan A. Pardo
  • Publication number: 20210034665
    Abstract: Example systems and methods for automated cover song identification are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to identify query audio from a content source based on a search query using rights metadata associated with the query audio, execute a constant Q transform on query time slices of the query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, generate a query data structure based on a sequential order of the two-dimensional Fourier transforms, select a subset including reference audio of a reference database based on the rights metadata, and identify the query audio as a cover rendition of the reference audio based on a comparison between the query and reference data structures.
    Type: Application
    Filed: October 7, 2020
    Publication date: February 4, 2021
    Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
  • Publication number: 20200342024
    Abstract: Example systems and methods are audio identification based on data structure are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to execute a constant Q transform on query time slices of query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, sequentially order the two-dimensional Fourier transforms in a query data structure, and identify the query audio as a cover rendition of reference audio based on a comparison between the query data structure and a reference data structure associated with the reference audio.
    Type: Application
    Filed: July 13, 2020
    Publication date: October 29, 2020
    Inventors: Zafar Rafii, Prem Seetharaman
  • Patent number: 10803119
    Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for automated cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews that occur in cover songs found in content repositories. The systems and methods allow copyright holders to search the content repositories for unlicensed cover song.
    Type: Grant
    Filed: September 7, 2017
    Date of Patent: October 13, 2020
    Assignee: GRACENOTE, INC.
    Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
  • Publication number: 20200286504
    Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for sound quality prediction and real-time feedback about sound quality, such as room acoustics quality and background noise. Audio data can be sampled from a live sound source and stored in an audio buffer. The audio data in the buffer is analyzed to calculate a stream of values of one or more sound quality measures, such as speech transmission index and signal-to-noise ratio. Speech transmission index can be calculated using a convolution neural network configured to predict speech transmission index from reverberant speech. The stream of values can be used to provide real-time feedback about sound quality of the audio data. For example, a visual indicator on a graphical user interface can be updated based on consistency of the values over time. The real-time feedback about sound quality can help users optimize their recording setup.
    Type: Application
    Filed: March 7, 2019
    Publication date: September 10, 2020
    Inventors: Prem Seetharaman, Gautham J. Mysore, Bryan A. Pardo
  • Patent number: 10713296
    Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews. In particular, a special data structure provides a time-series representation of audio, and this time-series representation is robust to key changes, timbral changes, and small local tempo deviations. Accordingly, the systems and methods described herein analyze cross-similarity between these time-series representations. In some example embodiments, such systems and methods extract features from an audio fingerprint and calculate a distance measure that is robust and invariant to changes in musical structure.
    Type: Grant
    Filed: September 7, 2017
    Date of Patent: July 14, 2020
    Assignee: GRACENOTE, INC.
    Inventors: Zafar Rafii, Prem Seetharaman
  • Publication number: 20180189390
    Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for automated cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews that occur in cover songs found in content repositories. The systems and methods allow copyright holders to search the content repositories for unlicensed cover song.
    Type: Application
    Filed: September 7, 2017
    Publication date: July 5, 2018
    Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
  • Publication number: 20180075140
    Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews. In particular, a special data structure provides a time-series representation of audio, and this time-series representation is robust to key changes, timbral changes, and small local tempo deviations. Accordingly, the systems and methods described herein analyze cross-similarity between these time-series representations. In some example embodiments, such systems and methods extract features from an audio fingerprint and calculate a distance measure that is robust and invariant to changes in musical structure.
    Type: Application
    Filed: September 7, 2017
    Publication date: March 15, 2018
    Inventors: Zafar Rafii, Prem Seetharaman