Patents by Inventor Mihajlo Velimirovic

Mihajlo Velimirovic has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

PERFORMING TASKS USING GENERATIVE NEURAL NETWORKS

Publication number: 20240428056

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing tasks. One of the methods includes obtaining a sequence of input tokens, where each token is selected from a vocabulary of tokens that includes text tokens and audio tokens, and wherein the sequence of input tokens includes tokens that describe a task to be performed and data for performing the task; generating a sequence of embeddings by embedding each token in the sequence of input tokens in an embedding space; and processing the sequence of embeddings using a language model neural network to generate a sequence of output tokens for the task, where each token is selected from the vocabulary.

Type: Application

Filed: June 21, 2024

Publication date: December 26, 2024

Inventors: Paul Kishan Rubenstein, Matthew Sharifi, Alexandru Tudor, Chulayuth Asawaroengchai, Duc Dung Nguyen, Marco Tagliasacchi, Neil Zeghidour, Zalán Borsos, Christian Frank, Dalia Salem Hassan Fahmy Elbadawy, Hannah Raphaelle Muckenhirn, Dirk Ryan Padfield, Damien Vincent, Evgeny Kharitonov, Michelle Dana Tadmor, Mihajlo Velimirovic, Feifan Chen, Victoria Zayats
Self-supervised pitch estimation

Patent number: 11756530

Abstract: Example embodiments relate to techniques for training artificial neural networks or oilier machine-learning encoders to accurately predict the pitch of input audio samples in a semitone or otherwise logarithmically-scaled pitch space. An example method may include generating, from a sample of audio data, two training samples by applying two different pitch shifts to the sample of audio training data. This can be done by converting the sample of audio data into the frequency domain and then shifting the transformed data. These known shifts are then compared to the predicted pitches generated by applying the two training samples to the encoder. The encoder is then updated based on the comparison, such that the relative pitch output by the encoder is improved with respect to accuracy. One or more audio samples, labeled with absolute pitch values, can then be used to calibrate the relative pitch values generated by the trained encoder.

Type: Grant

Filed: September 25, 2020

Date of Patent: September 12, 2023

Assignee: Google LLC

Inventors: Marco Tagliasacchi, Mihajlo Velimirovic, Matthew Sharifi, Dominik Roblek, Christian Frank, Beat Gfeller
SELF-SUPERVISED PITCH ESTIMATION

Publication number: 20220343896

Abstract: Example embodiments relate to techniques for training artificial neural networks or oilier machine-learning encoders to accurately predict the pitch of input audio samples in a semitone or otherwise logarithmically-scaled pitch space. An example method may include generating, from a sample of audio data, two training samples by applying two different pitch shifts to the sample of audio training data. This can be done by converting the sample of audio data into the frequency domain and then shifting the transformed data. These known shifts are then compared to the predicted pitches generated by applying the two training samples to the encoder. The encoder is then updated based on the comparison, such that the relative pitch output by the encoder is improved with respect to accuracy. One or more audio samples, labeled with absolute pitch values, can then be used to calibrate the relative pitch values generated by the trained encoder.

Type: Application

Filed: September 25, 2020

Publication date: October 27, 2022

Inventors: Marco TAGLIASACCHI, Mihajlo VELIMIROVIC, Matthew SHARIFI, Dominik ROBLEK, Christian FRANK, Beat GFELLER
Determining that Audio Includes Music and then Identifying the Music as a Particular Song

Publication number: 20200401367

Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products. A computing device stores reference song characterization data and receives digital audio data. The computing device determines whether the digital audio data represents music and then performs a different process to recognize that the digital audio data represents a particular reference song. The computing device then outputs an indication of the particular reference song.

Type: Application

Filed: September 2, 2020

Publication date: December 24, 2020

Applicant: Google LLC

Inventors: Dominik Roblek, Blaise Hilary Aguera-Arcas, Thomas W. Hume, Marvin Karl Ritter, Brandon Charles Barbello, Kevin I. Kilgour, Mihajlo Velimirovic, Christopher Thornton, Gabriel Oak Taubman, James David Lyon, Jan Heinrich Althaus, Katsiaryna Naliuka, Julian James Odell, Matthew Sharifi, Beat Gfeller
Determining that audio includes music and then identifying the music as a particular song

Patent number: 10809968

Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products. A computing device stores reference song characterization data and receives digital audio data. The computing device determines whether the digital audio data represents music and then performs a different process to recognize that the digital audio data represents a particular reference song. The computing device then outputs an indication of the particular reference song.

Type: Grant

Filed: October 1, 2018

Date of Patent: October 20, 2020

Assignee: Google LLC

Inventors: Dominik Roblek, Blaise Hilary Aguera-Arcas, Thomas W. Hume, Marvin Karl Ritter, Brandon Charles Barbello, Kevin I. Kilgour, Mihajlo Velimirovic, Christopher Thornton, Gabriel Oak Taubman, James David Lyon, Jan Heinrich Althaus, Katsiaryna Naliuka, Julian James Odell, Matthew Sharifi, Beat Gfeller
Identifying Music as a Particular Song

Publication number: 20190102144

Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products for indicating a reference song. A computing device stores reference song characterization data that identifies a plurality of audio characteristics for each reference song in a plurality of reference songs. The computing device receives digital audio data that represents audio recorded by a microphone, converts the digital audio data from time-domain format into frequency-domain format, and uses the digital audio data in the frequency-domain format in a music-characterization process. In response to determining that characterization values for the digital audio data are most relevant to characterization values for a particular reference song, the computing device outputs an indication of the particular reference song.

Type: Application

Filed: October 1, 2018

Publication date: April 4, 2019

Inventors: Dominik Roblek, Blaise Aguera-Arcas, Tom Hume, Marvin Ritter, Brandon Barbello, Kevin Kilgour, Mihajlo Velimirovic, Christopher Walter George Thornton, Gabriel Taubman, James David Lyon, Jan Althaus, Katsiaryna Naliuka, Julian Odell, Matthew Sharifi, Beat Gfeller
Determining that Audio Includes Music and then Identifying the Music as a Particular Song

Publication number: 20190102458

Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products. A computing device stores reference song characterization data and receives digital audio data. The computing device determines whether the digital audio data represents music and then performs a different process to recognize that the digital audio data represents a particular reference song. The computing device then outputs an indication of the particular reference song.

Type: Application

Filed: October 1, 2018

Publication date: April 4, 2019

Inventors: Dominik Roblek, Blaise Aguera-Arcas, Tom Hume, Marvin Ritter, Brandon Barbello, Kevin Kilgour, Mihajlo Velimirovic, Christopher Walter George Thornton, Gabriel Taubman, James David Lyon, Jan Athaus, Katsiaryna Naliuka, Julian Odell, Matthew Sharifi, Beat Gfeller

PERFORMING TASKS USING GENERATIVE NEURAL NETWORKS

Self-supervised pitch estimation

SELF-SUPERVISED PITCH ESTIMATION

Determining that Audio Includes Music and then Identifying the Music as a Particular Song

Determining that audio includes music and then identifying the music as a particular song

Identifying Music as a Particular Song

Determining that Audio Includes Music and then Identifying the Music as a Particular Song