Patents by Inventor Takaaki SAEKI

Takaaki SAEKI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

MASSIVE MULTILINGUAL SPEECH-TEXT JOINT SEMI-SUPERVISED LEARNING FOR TEXT-TO-SPEECH

Publication number: 20240153484

Abstract: A method includes receiving training data that includes a plurality of sets of text-to-speech (TTS) spoken utterances each associated with a respective language and including TTS utterances of synthetic speech spoken that includes a corresponding reference speech representation paired with a corresponding input text sequence. For each TTS utterance in each set of the TTS spoken training utterances of the received training data, the method includes generating a corresponding TTS encoded textual representation for the corresponding input text sequence, generating a corresponding speech encoding for the corresponding TTS utterance of synthetic speech, generating a shared encoder output, generating a predicted speech representation for the corresponding TTS utterance of synthetic speech, and determining a reconstruction loss. The method also includes training a TTS model based on the reconstruction losses determined for the TTS utterances in each set of the TTS spoken training utterances.

Type: Application

Filed: October 25, 2023

Publication date: May 9, 2024

Applicant: Google LLC

Inventors: Andrew M. Rosenberg, Takaaki Saeki, Zhehuai Chen, Byungha Chun, Bhuvana Ramabhadran
VOICE CONVERSION DEVICE, VOICE CONVERSION METHOD, AND VOICE CONVERSION PROGRAM

Publication number: 20230360631

Abstract: A voice conversion device and so forth, capable of realizing both high voice quality and real-time nature using spectral differentials, are provided. The voice conversion device 10 includes an acquisition unit 11 that acquires signals of a voice of a subject, a filter calculation unit 12 that performs transform of features representing a voice timbre of the voice by a trained transformer model, and subjects the features following transform to liftering by a trained lifter, thereby calculating a spectrum of a filter, a shortened filter calculation unit 13 that performs inverse Fourier transform of the spectrum of the filter, and applies a predetermined window function, thereby calculating a shortened filter, and a generating unit 14 that applies a spectrum, obtained by Fourier transform of the shortened filter, to the spectrum of the signals, and performs inverse Fourier transform, thereby generating a synthesized voice.

Type: Application

Filed: August 18, 2020

Publication date: November 9, 2023

Inventors: Shinnosuke Takamichi, Yuki Saito, Takaaki Saeki, Hiroshi Saruwatari
VOICE CONVERSION DEVICE, VOICE CONVERSION METHOD, AND VOICE CONVERSION PROGRAM

Publication number: 20230086642

Abstract: The present invention provides a voice conversion apparatus and the like using a differential spectral method which is capable of implementing both high voice quality and real-time performance even in wideband. A voice conversion apparatus 10 includes: an acquiring unit 11 configured to acquire a signal of a voice of a subject; a dividing unit 12 configured to divide the signal into sub-band signals corresponding to a plurality of frequency bands; a converting unit configured to convert one or a plurality of sub-band signals corresponding to one or a plurality of lower frequency bands, out of the sub-band signals corresponding to the plurality of frequency bands; and a synthesizing unit 16 configured to generate a synthesized voice by synthesizing the one or plurality of sub-band signals after conversion and the remaining sub-band signals that are not converted.

Type: Application

Filed: February 5, 2021

Publication date: March 23, 2023

Inventors: Shinnosuke Takamichi, Yuki Saito, Takaaki Saeki, Hiroshi Saruwatari
Playback control apparatus, playback control method, and medium for playing a program including segments generated using speech synthesis

Patent number: 9576569

Abstract: A playback control apparatus includes a playback controller configured to control playback of first content and second content. The first content is to output first sound which is generated based on text information using speech synthesis processing. The second content is to output second sound which is generated not using the speech synthesis processing. The playback controller causes an attribute of content to be played back to be displayed on the screen, the attribute indicating whether or not the content is to output sound which is generated based on text information using speech synthesis processing.

Type: Grant

Filed: May 15, 2015

Date of Patent: February 21, 2017

Assignee: SONY CORPORATION

Inventors: Takaaki Saeki, Yukiyoshi Hirose
Playback control apparatus, playback control method, and medium for playing a program including segments generated using speech synthesis and segments not generated using speech synthesis

Patent number: 9159313

Abstract: A playback control apparatus includes a playback controller configured to control playback of first content and second content. The first content is to output first sound which is generated based on text information using speech synthesis processing. The second content is to output second sound which is generated not using the speech synthesis processing. The playback controller causes an attribute of content to be played back to be displayed on the screen, the attribute indicating whether or not the content is to output sound which is generated based on text information using speech synthesis processing.

Type: Grant

Filed: November 28, 2012

Date of Patent: October 13, 2015

Assignee: SONY CORPORATION

Inventors: Takaaki Saeki, Yukiyoshi Hirose
PLAYBACK CONTROL APPARATUS, PLAYBACK CONTROL METHOD, AND PROGRAM

Publication number: 20150248272

Abstract: A playback control apparatus includes a playback controller configured to control playback of first content and second content. The first content is to output first sound which is generated based on text information using speech synthesis processing. The second content is to output second sound which is generated not using the speech synthesis processing. The playback controller causes an attribute of content to be played back to be displayed on the screen, the attribute indicating whether or not the content is to output sound which is generated based on text information using speech synthesis processing.

Type: Application

Filed: May 15, 2015

Publication date: September 3, 2015

Inventors: Takaaki Saeki, Yukiyoshi HIROSE
PLAYBACK CONTROL APPARATUS, PLAYBACK CONTROL METHOD, AND PROGRAM

Publication number: 20130262118

Abstract: A playback control apparatus includes a playback controller configured to control playback of first content and second content. The first content is to output first sound which is generated based on text information using speech synthesis processing. The second content is to output second sound which is generated not using the speech synthesis processing. The playback controller causes an attribute of content to be played back to be displayed on the screen, the attribute indicating whether or not the content is to output sound which is generated based on text information using speech synthesis processing.

Type: Application

Filed: November 28, 2012

Publication date: October 3, 2013

Applicant: Sony Corporation

Inventors: Takaaki SAEKI, Yukiyoshi HIROSE

MASSIVE MULTILINGUAL SPEECH-TEXT JOINT SEMI-SUPERVISED LEARNING FOR TEXT-TO-SPEECH

VOICE CONVERSION DEVICE, VOICE CONVERSION METHOD, AND VOICE CONVERSION PROGRAM

VOICE CONVERSION DEVICE, VOICE CONVERSION METHOD, AND VOICE CONVERSION PROGRAM

Playback control apparatus, playback control method, and medium for playing a program including segments generated using speech synthesis

Playback control apparatus, playback control method, and medium for playing a program including segments generated using speech synthesis and segments not generated using speech synthesis

PLAYBACK CONTROL APPARATUS, PLAYBACK CONTROL METHOD, AND PROGRAM

PLAYBACK CONTROL APPARATUS, PLAYBACK CONTROL METHOD, AND PROGRAM