Patents Assigned to Krisp Technologies, Inc.
  • Patent number: 12223979
    Abstract: Techniques are described for generating parallel data for real-time speech form conversion. In an embodiment, based at least in part on input speech data of an original form, a speech machine learning (ML) model generates parallel speech data. The parallel speech data includes the input speech data of the original form and temporally aligned output speech data of a target form different than the original form. Each frame of the input speech data temporally corresponds to the corresponding output speech frame of the target speech form and contains a same portion of the particular content. The techniques further include training a teacher machine learning model that is offline and is substantially larger than a student machine learning model for converting speech form. Transferring “knowledge” from the trained Teacher model for training the Production Student Model that performs the speech form conversion on an end-user computing device.
    Type: Grant
    Filed: July 17, 2024
    Date of Patent: February 11, 2025
    Assignee: KRISP TECHNOLOGIES, INC.
    Inventors: Stepan Sargsyan, Artur Kobelyan, Levon Galoyan, Kajik Hakobyan, Rima Shahbazyan, Daniel Baghdasaryan, Ruben Hasratyan, Nairi Hakobyan, Hayk Aleksanyan, Tigran Tonoyan, Aris Hovsepyan
  • Publication number: 20250029629
    Abstract: Techniques are described for generating parallel data for real-time speech form conversion. In an embodiment, based at least in part on input speech data of an original form, a speech machine learning (ML) model generates parallel speech data. The parallel speech data includes the input speech data of the original form and temporally aligned output speech data of a target form different than the original form. Each frame of the input speech data temporally corresponds to the corresponding output speech frame of the target speech form and contains a same portion of the particular content. The techniques further include training a teacher machine learning model that is offline and is substantially larger than a student machine learning model for converting speech form. Transferring “knowledge” from the trained Teacher model for training the Production Student Model that performs the speech form conversion on an end-user computing device.
    Type: Application
    Filed: July 17, 2024
    Publication date: January 23, 2025
    Applicant: KRISP TECHNOLOGIES, INC.
    Inventors: STEPAN SARGSYAN, ARTUR KOBELYAN, LEVON GALOYAN, KAJIK HAKOBYAN, RIMA SHAHBAZYAN, DANIEL BAGHDASARYAN, RUBEN HASRATYAN, NAIRI HAKOBYAN, HAYK ALEKSANYAN, TIGRAN TONOYAN, ARIS HOVSEPYAN
  • Publication number: 20250029628
    Abstract: Techniques are described for generating parallel data for real-time speech form conversion. In an embodiment, based at least in part on input speech data of an original form, a speech machine learning (ML) model generates parallel speech data. The parallel speech data includes the input speech data of the original form and temporally aligned output speech data of a target form different than the original form. Each frame of the input speech data temporally corresponds to the corresponding output speech frame of the target speech form and contains a same portion of the particular content. The techniques further include training a teacher machine learning model that is offline and is substantially larger than a student machine learning model for converting speech form. Transferring “knowledge” from the trained Teacher model for training the Production Student Model that performs the speech form conversion on an end-user computing device.
    Type: Application
    Filed: July 17, 2024
    Publication date: January 23, 2025
    Applicant: KRISP TECHNOLOGIES, INC.
    Inventors: STEPAN SARGSYAN, ARTUR KOBELYAN, LEVON GALOYAN, KAJIK HAKOBYAN, RIMA SHAHBAZYAN, DANIEL BAGHDASARYAN, RUBEN HASRATYAN, NAIRI HAKOBYAN, HAYK ALEKSANYAN, TIGRAN TONOYAN, ARIS HOVSEPYAN
  • Patent number: 12205609
    Abstract: Techniques are described for generating parallel data for real-time speech form conversion. In an embodiment, based at least in part on input speech data of an original form, a speech machine learning (ML) model generates parallel speech data. The parallel speech data includes the input speech data of the original form and temporally aligned output speech data of a target form different than the original form. Each frame of the input speech data temporally corresponds to the corresponding output speech frame of the target speech form and contains a same portion of the particular content. The techniques further include training a teacher machine learning model that is offline and is substantially larger than a student machine learning model for converting speech form. Transferring “knowledge” from the trained Teacher model for training the Production Student Model that performs the speech form conversion on an end-user computing device.
    Type: Grant
    Filed: July 17, 2024
    Date of Patent: January 21, 2025
    Assignee: KRISP TECHNOLOGIES, INC.
    Inventors: Stepan Sargsyan, Artur Kobelyan, Levon Galoyan, Kajik Hakobyan, Rima Shahbazyan, Daniel Baghdasaryan, Ruben Hasratyan, Nairi Hakobyan, Hayk Aleksanyan, Tigran Tonoyan, Aris Hovsepyan
  • Patent number: 11100941
    Abstract: Example speech enhancement and noise suppression systems and methods are described. In one implementation, a method receives an audio file comprising a combination of voice data and noise data, and divides the audio file into multiple frames. The method performs a discrete Fourier transform on each frame of a first subset of the multiple frames to provide a plurality of frequency-domain outputs, which are input to a neural network. A ratio mask is obtained as an output from the neural network and clean voice coefficients are computed using the ratio mask. The method outputs an audio file having enhanced speech based on the computed clean voice coefficients.
    Type: Grant
    Filed: August 21, 2019
    Date of Patent: August 24, 2021
    Assignee: Krisp Technologies, Inc.
    Inventors: Stepan Sargsyan, Artavazd Minasyan