Patents by Inventor Prem Seetharaman

Prem Seetharaman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Audio identification based on data structure

Patent number: 11907288

Abstract: Example systems and methods are audio identification based on data structure are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to execute a constant Q transform on query time slices of query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, sequentially order the two-dimensional Fourier transforms in a query data structure, and identify the query audio as a cover rendition of reference audio based on a comparison between the query data structure and a reference data structure associated with the reference audio.

Type: Grant

Filed: July 13, 2020

Date of Patent: February 20, 2024

Assignee: Gracenote, Inc.

Inventors: Zafar Rafii, Prem Seetharaman
AUTOMATED COVER SONG IDENTIFICATION

Publication number: 20230008776

Abstract: Example systems and methods for automated cover song identification are disclosed. An example apparatus includes at least one memory, machine-readable instructions, and one or more processors to execute the machine-readable instructions to at least execute a constant Q transform on time slices of first audio data to output constant Q transformed time slices, binarize the constant Q transformed time slices to output binarized and constant Q transformed time slices, execute a two-dimensional Fourier transform on time windows within the binarized and constant Q transformed time slices to output two-dimensional Fourier transforms of the time windows, generate a reference data structure based on a sequential order of the two-dimensional Fourier transforms, store the reference data structure in a database, and identify a query data structure associated with query audio data as a cover rendition of the audio data based on a comparison of the query and reference data structures.

Type: Application

Filed: September 16, 2022

Publication date: January 12, 2023

Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
Automated cover song identification

Patent number: 11461390

Abstract: Example systems and methods for automated cover song identification are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to identify query audio from a content source based on a search query using rights metadata associated with the query audio, execute a constant Q transform on query time slices of the query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, generate a query data structure based on a sequential order of the two-dimensional Fourier transforms, select a subset including reference audio of a reference database based on the rights metadata, and identify the query audio as a cover rendition of the reference audio based on a comparison between the query and reference data structures.

Type: Grant

Filed: October 7, 2020

Date of Patent: October 4, 2022

Assignee: Gracenote, Inc.

Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
Sound quality prediction and interface to facilitate high-quality voice recordings

Patent number: 11138989

Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for sound quality prediction and real-time feedback about sound quality, such as room acoustics quality and background noise. Audio data can be sampled from a live sound source and stored in an audio buffer. The audio data in the buffer is analyzed to calculate a stream of values of one or more sound quality measures, such as speech transmission index and signal-to-noise ratio. Speech transmission index can be calculated using a convolution neural network configured to predict speech transmission index from reverberant speech. The stream of values can be used to provide real-time feedback about sound quality of the audio data. For example, a visual indicator on a graphical user interface can be updated based on consistency of the values over time. The real-time feedback about sound quality can help users optimize their recording setup.

Type: Grant

Filed: March 7, 2019

Date of Patent: October 5, 2021

Assignee: Adobe Inc.

Inventors: Prem Seetharaman, Gautham J. Mysore, Bryan A. Pardo
AUTOMATED COVER SONG IDENTIFICATION

Publication number: 20210034665

Abstract: Example systems and methods for automated cover song identification are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to identify query audio from a content source based on a search query using rights metadata associated with the query audio, execute a constant Q transform on query time slices of the query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, generate a query data structure based on a sequential order of the two-dimensional Fourier transforms, select a subset including reference audio of a reference database based on the rights metadata, and identify the query audio as a cover rendition of the reference audio based on a comparison between the query and reference data structures.

Type: Application

Filed: October 7, 2020

Publication date: February 4, 2021

Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
AUDIO IDENTIFICATION BASED ON DATA STRUCTURE

Publication number: 20200342024

Abstract: Example systems and methods are audio identification based on data structure are disclosed. An example apparatus includes memory, and one or more processors to execute instructions to execute a constant Q transform on query time slices of query audio, binarize the constant Q transformed query time slices, execute a two-dimensional Fourier transform on query time windows within the binarized and constant Q transformed query time slices to generate two-dimensional Fourier transforms of the query time windows, sequentially order the two-dimensional Fourier transforms in a query data structure, and identify the query audio as a cover rendition of reference audio based on a comparison between the query data structure and a reference data structure associated with the reference audio.

Type: Application

Filed: July 13, 2020

Publication date: October 29, 2020

Inventors: Zafar Rafii, Prem Seetharaman
Automated cover song identification

Patent number: 10803119

Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for automated cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews that occur in cover songs found in content repositories. The systems and methods allow copyright holders to search the content repositories for unlicensed cover song.

Type: Grant

Filed: September 7, 2017

Date of Patent: October 13, 2020

Assignee: GRACENOTE, INC.

Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
SOUND QUALITY PREDICTION AND INTERFACE TO FACILITATE HIGH-QUALITY VOICE RECORDINGS

Publication number: 20200286504

Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for sound quality prediction and real-time feedback about sound quality, such as room acoustics quality and background noise. Audio data can be sampled from a live sound source and stored in an audio buffer. The audio data in the buffer is analyzed to calculate a stream of values of one or more sound quality measures, such as speech transmission index and signal-to-noise ratio. Speech transmission index can be calculated using a convolution neural network configured to predict speech transmission index from reverberant speech. The stream of values can be used to provide real-time feedback about sound quality of the audio data. For example, a visual indicator on a graphical user interface can be updated based on consistency of the values over time. The real-time feedback about sound quality can help users optimize their recording setup.

Type: Application

Filed: March 7, 2019

Publication date: September 10, 2020

Inventors: Prem Seetharaman, Gautham J. Mysore, Bryan A. Pardo
Audio identification based on data structure

Patent number: 10713296

Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews. In particular, a special data structure provides a time-series representation of audio, and this time-series representation is robust to key changes, timbral changes, and small local tempo deviations. Accordingly, the systems and methods described herein analyze cross-similarity between these time-series representations. In some example embodiments, such systems and methods extract features from an audio fingerprint and calculate a distance measure that is robust and invariant to changes in musical structure.

Type: Grant

Filed: September 7, 2017

Date of Patent: July 14, 2020

Assignee: GRACENOTE, INC.

Inventors: Zafar Rafii, Prem Seetharaman
AUTOMATED COVER SONG IDENTIFICATION

Publication number: 20180189390

Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for automated cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews that occur in cover songs found in content repositories. The systems and methods allow copyright holders to search the content repositories for unlicensed cover song.

Type: Application

Filed: September 7, 2017

Publication date: July 5, 2018

Inventors: Markus K. Cremer, Zafar Rafii, Robert Coover, Prem Seetharaman
AUDIO IDENTIFICATION BASED ON DATA STRUCTURE

Publication number: 20180075140

Abstract: Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews. In particular, a special data structure provides a time-series representation of audio, and this time-series representation is robust to key changes, timbral changes, and small local tempo deviations. Accordingly, the systems and methods described herein analyze cross-similarity between these time-series representations. In some example embodiments, such systems and methods extract features from an audio fingerprint and calculate a distance measure that is robust and invariant to changes in musical structure.

Type: Application

Filed: September 7, 2017

Publication date: March 15, 2018

Inventors: Zafar Rafii, Prem Seetharaman