Patents by Inventor Takaaki SAEKI

Takaaki SAEKI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240153484
    Abstract: A method includes receiving training data that includes a plurality of sets of text-to-speech (TTS) spoken utterances each associated with a respective language and including TTS utterances of synthetic speech spoken that includes a corresponding reference speech representation paired with a corresponding input text sequence. For each TTS utterance in each set of the TTS spoken training utterances of the received training data, the method includes generating a corresponding TTS encoded textual representation for the corresponding input text sequence, generating a corresponding speech encoding for the corresponding TTS utterance of synthetic speech, generating a shared encoder output, generating a predicted speech representation for the corresponding TTS utterance of synthetic speech, and determining a reconstruction loss. The method also includes training a TTS model based on the reconstruction losses determined for the TTS utterances in each set of the TTS spoken training utterances.
    Type: Application
    Filed: October 25, 2023
    Publication date: May 9, 2024
    Applicant: Google LLC
    Inventors: Andrew M. Rosenberg, Takaaki Saeki, Zhehuai Chen, Byungha Chun, Bhuvana Ramabhadran
  • Publication number: 20230360631
    Abstract: A voice conversion device and so forth, capable of realizing both high voice quality and real-time nature using spectral differentials, are provided. The voice conversion device 10 includes an acquisition unit 11 that acquires signals of a voice of a subject, a filter calculation unit 12 that performs transform of features representing a voice timbre of the voice by a trained transformer model, and subjects the features following transform to liftering by a trained lifter, thereby calculating a spectrum of a filter, a shortened filter calculation unit 13 that performs inverse Fourier transform of the spectrum of the filter, and applies a predetermined window function, thereby calculating a shortened filter, and a generating unit 14 that applies a spectrum, obtained by Fourier transform of the shortened filter, to the spectrum of the signals, and performs inverse Fourier transform, thereby generating a synthesized voice.
    Type: Application
    Filed: August 18, 2020
    Publication date: November 9, 2023
    Inventors: Shinnosuke Takamichi, Yuki Saito, Takaaki Saeki, Hiroshi Saruwatari
  • Publication number: 20230086642
    Abstract: The present invention provides a voice conversion apparatus and the like using a differential spectral method which is capable of implementing both high voice quality and real-time performance even in wideband. A voice conversion apparatus 10 includes: an acquiring unit 11 configured to acquire a signal of a voice of a subject; a dividing unit 12 configured to divide the signal into sub-band signals corresponding to a plurality of frequency bands; a converting unit configured to convert one or a plurality of sub-band signals corresponding to one or a plurality of lower frequency bands, out of the sub-band signals corresponding to the plurality of frequency bands; and a synthesizing unit 16 configured to generate a synthesized voice by synthesizing the one or plurality of sub-band signals after conversion and the remaining sub-band signals that are not converted.
    Type: Application
    Filed: February 5, 2021
    Publication date: March 23, 2023
    Inventors: Shinnosuke Takamichi, Yuki Saito, Takaaki Saeki, Hiroshi Saruwatari
  • Patent number: 9576569
    Abstract: A playback control apparatus includes a playback controller configured to control playback of first content and second content. The first content is to output first sound which is generated based on text information using speech synthesis processing. The second content is to output second sound which is generated not using the speech synthesis processing. The playback controller causes an attribute of content to be played back to be displayed on the screen, the attribute indicating whether or not the content is to output sound which is generated based on text information using speech synthesis processing.
    Type: Grant
    Filed: May 15, 2015
    Date of Patent: February 21, 2017
    Assignee: SONY CORPORATION
    Inventors: Takaaki Saeki, Yukiyoshi Hirose
  • Patent number: 9159313
    Abstract: A playback control apparatus includes a playback controller configured to control playback of first content and second content. The first content is to output first sound which is generated based on text information using speech synthesis processing. The second content is to output second sound which is generated not using the speech synthesis processing. The playback controller causes an attribute of content to be played back to be displayed on the screen, the attribute indicating whether or not the content is to output sound which is generated based on text information using speech synthesis processing.
    Type: Grant
    Filed: November 28, 2012
    Date of Patent: October 13, 2015
    Assignee: SONY CORPORATION
    Inventors: Takaaki Saeki, Yukiyoshi Hirose
  • Publication number: 20150248272
    Abstract: A playback control apparatus includes a playback controller configured to control playback of first content and second content. The first content is to output first sound which is generated based on text information using speech synthesis processing. The second content is to output second sound which is generated not using the speech synthesis processing. The playback controller causes an attribute of content to be played back to be displayed on the screen, the attribute indicating whether or not the content is to output sound which is generated based on text information using speech synthesis processing.
    Type: Application
    Filed: May 15, 2015
    Publication date: September 3, 2015
    Inventors: Takaaki Saeki, Yukiyoshi Hirose
  • Publication number: 20130262118
    Abstract: A playback control apparatus includes a playback controller configured to control playback of first content and second content. The first content is to output first sound which is generated based on text information using speech synthesis processing. The second content is to output second sound which is generated not using the speech synthesis processing. The playback controller causes an attribute of content to be played back to be displayed on the screen, the attribute indicating whether or not the content is to output sound which is generated based on text information using speech synthesis processing.
    Type: Application
    Filed: November 28, 2012
    Publication date: October 3, 2013
    Applicant: Sony Corporation
    Inventors: Takaaki Saeki, Yukiyoshi Hirose
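
As an illustration of the filter-shortening step described in the abstract of publication 20230360631 (inverse Fourier transform of the filter spectrum, windowing to obtain a shortened filter, then applying the shortened filter's spectrum to the signal spectrum), the following is a minimal sketch and not the patented implementation. The function names, the half-Hann taper, the tap count, and the toy frame are all assumptions made for the example; the trained transformer model and lifter that produce the filter spectrum are omitted, and the filtering shown is a circular convolution on a single frame.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform; adequate for a small demo."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def shorten_filter(filter_spectrum, taps):
    """Return a time-domain filter truncated to `taps` samples.

    Assumption: the window is a half-Hann taper over the first `taps`
    samples and zero afterwards. The abstract only says "a predetermined
    window function", so this choice is hypothetical.
    """
    n = len(filter_spectrum)
    # Assumes a Hermitian-symmetric spectrum, so the impulse response is real.
    impulse = [v.real for v in dft(filter_spectrum, inverse=True)]
    window = [
        0.5 * (1.0 + math.cos(math.pi * i / taps)) if i < taps else 0.0
        for i in range(n)
    ]
    return [h * w for h, w in zip(impulse, window)]


def apply_filter(frame, short_filter):
    """Multiply the frame spectrum by the shortened filter's spectrum."""
    frame_spectrum = dft(frame)
    filter_spectrum = dft(short_filter)
    product = [s * f for s, f in zip(frame_spectrum, filter_spectrum)]
    return [v.real for v in dft(product, inverse=True)]


# Toy data (hypothetical): an all-ones spectrum is an identity filter,
# so the filtered frame should match the input frame.
frame = [0.0, 1.0, 0.5, -0.5, -1.0, 0.0, 0.25, -0.25]
identity_spectrum = [1 + 0j] * len(frame)
short_filter = shorten_filter(identity_spectrum, taps=4)
filtered = apply_filter(frame, short_filter)
print(filtered)
```

Truncating the filter in the time domain bounds how many samples each output depends on, which is one plausible reading of how the shortened filter supports the real-time operation the abstract mentions.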