Patents by Inventor Mihajlo Velimirovic

Mihajlo Velimirovic has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240428056
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing tasks. One of the methods includes obtaining a sequence of input tokens, where each token is selected from a vocabulary of tokens that includes text tokens and audio tokens, and wherein the sequence of input tokens includes tokens that describe a task to be performed and data for performing the task; generating a sequence of embeddings by embedding each token in the sequence of input tokens in an embedding space; and processing the sequence of embeddings using a language model neural network to generate a sequence of output tokens for the task, where each token is selected from the vocabulary.
    Type: Application
    Filed: June 21, 2024
    Publication date: December 26, 2024
    Inventors: Paul Kishan Rubenstein, Matthew Sharifi, Alexandru Tudor, Chulayuth Asawaroengchai, Duc Dung Nguyen, Marco Tagliasacchi, Neil Zeghidour, Zalán Borsos, Christian Frank, Dalia Salem Hassan Fahmy Elbadawy, Hannah Raphaelle Muckenhirn, Dirk Ryan Padfield, Damien Vincent, Evgeny Kharitonov, Michelle Dana Tadmor, Mihajlo Velimirovic, Feifan Chen, Victoria Zayats
  • Patent number: 11756530
    Abstract: Example embodiments relate to techniques for training artificial neural networks or oilier machine-learning encoders to accurately predict the pitch of input audio samples in a semitone or otherwise logarithmically-scaled pitch space. An example method may include generating, from a sample of audio data, two training samples by applying two different pitch shifts to the sample of audio training data. This can be done by converting the sample of audio data into the frequency domain and then shifting the transformed data. These known shifts are then compared to the predicted pitches generated by applying the two training samples to the encoder. The encoder is then updated based on the comparison, such that the relative pitch output by the encoder is improved with respect to accuracy. One or more audio samples, labeled with absolute pitch values, can then be used to calibrate the relative pitch values generated by the trained encoder.
    Type: Grant
    Filed: September 25, 2020
    Date of Patent: September 12, 2023
    Assignee: Google LLC
    Inventors: Marco Tagliasacchi, Mihajlo Velimirovic, Matthew Sharifi, Dominik Roblek, Christian Frank, Beat Gfeller
  • Publication number: 20220343896
    Abstract: Example embodiments relate to techniques for training artificial neural networks or oilier machine-learning encoders to accurately predict the pitch of input audio samples in a semitone or otherwise logarithmically-scaled pitch space. An example method may include generating, from a sample of audio data, two training samples by applying two different pitch shifts to the sample of audio training data. This can be done by converting the sample of audio data into the frequency domain and then shifting the transformed data. These known shifts are then compared to the predicted pitches generated by applying the two training samples to the encoder. The encoder is then updated based on the comparison, such that the relative pitch output by the encoder is improved with respect to accuracy. One or more audio samples, labeled with absolute pitch values, can then be used to calibrate the relative pitch values generated by the trained encoder.
    Type: Application
    Filed: September 25, 2020
    Publication date: October 27, 2022
    Inventors: Marco TAGLIASACCHI, Mihajlo VELIMIROVIC, Matthew SHARIFI, Dominik ROBLEK, Christian FRANK, Beat GFELLER
  • Publication number: 20200401367
    Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products. A computing device stores reference song characterization data and receives digital audio data. The computing device determines whether the digital audio data represents music and then performs a different process to recognize that the digital audio data represents a particular reference song. The computing device then outputs an indication of the particular reference song.
    Type: Application
    Filed: September 2, 2020
    Publication date: December 24, 2020
    Applicant: Google LLC
    Inventors: Dominik Roblek, Blaise Hilary Aguera-Arcas, Thomas W. Hume, Marvin Karl Ritter, Brandon Charles Barbello, Kevin I. Kilgour, Mihajlo Velimirovic, Christopher Thornton, Gabriel Oak Taubman, James David Lyon, Jan Heinrich Althaus, Katsiaryna Naliuka, Julian James Odell, Matthew Sharifi, Beat Gfeller
  • Patent number: 10809968
    Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products. A computing device stores reference song characterization data and receives digital audio data. The computing device determines whether the digital audio data represents music and then performs a different process to recognize that the digital audio data represents a particular reference song. The computing device then outputs an indication of the particular reference song.
    Type: Grant
    Filed: October 1, 2018
    Date of Patent: October 20, 2020
    Assignee: Google LLC
    Inventors: Dominik Roblek, Blaise Hilary Aguera-Arcas, Thomas W. Hume, Marvin Karl Ritter, Brandon Charles Barbello, Kevin I. Kilgour, Mihajlo Velimirovic, Christopher Thornton, Gabriel Oak Taubman, James David Lyon, Jan Heinrich Althaus, Katsiaryna Naliuka, Julian James Odell, Matthew Sharifi, Beat Gfeller
  • Publication number: 20190102144
    Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products for indicating a reference song. A computing device stores reference song characterization data that identifies a plurality of audio characteristics for each reference song in a plurality of reference songs. The computing device receives digital audio data that represents audio recorded by a microphone, converts the digital audio data from time-domain format into frequency-domain format, and uses the digital audio data in the frequency-domain format in a music-characterization process. In response to determining that characterization values for the digital audio data are most relevant to characterization values for a particular reference song, the computing device outputs an indication of the particular reference song.
    Type: Application
    Filed: October 1, 2018
    Publication date: April 4, 2019
    Inventors: Dominik Roblek, Blaise Aguera-Arcas, Tom Hume, Marvin Ritter, Brandon Barbello, Kevin Kilgour, Mihajlo Velimirovic, Christopher Walter George Thornton, Gabriel Taubman, James David Lyon, Jan Althaus, Katsiaryna Naliuka, Julian Odell, Matthew Sharifi, Beat Gfeller
  • Publication number: 20190102458
    Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products. A computing device stores reference song characterization data and receives digital audio data. The computing device determines whether the digital audio data represents music and then performs a different process to recognize that the digital audio data represents a particular reference song. The computing device then outputs an indication of the particular reference song.
    Type: Application
    Filed: October 1, 2018
    Publication date: April 4, 2019
    Inventors: Dominik Roblek, Blaise Aguera-Arcas, Tom Hume, Marvin Ritter, Brandon Barbello, Kevin Kilgour, Mihajlo Velimirovic, Christopher Walter George Thornton, Gabriel Taubman, James David Lyon, Jan Athaus, Katsiaryna Naliuka, Julian Odell, Matthew Sharifi, Beat Gfeller