Patents by Inventor Hiroki KANAGAWA

Hiroki KANAGAWA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

LABELING METHOD, LABELING DEVICE, AND LABELING PROGRAM

Publication number: 20240054992

Abstract: A labeling processing device (100) generates first label information by labeling time information in a forward direction with respect to a plurality of phoneme boundaries set in speech information for learning. The labeling processing device (100) generates second label information by labeling time information in a direction opposite to the forward direction with respect to a plurality of phoneme boundaries set in speech information for learning and inverting the order of the labeled time information. The labeling processing device (100) detects whether phoneme boundaries are appropriate on the basis of a difference between time information on a plurality of phoneme boundaries included in the first label information and time information on a plurality of phoneme boundaries included in the second label information.
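
The comparison described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the use of integer milliseconds, the mapping of reverse-direction times back onto the forward timeline (`duration_ms - t`), and the tolerance threshold are all assumptions made for illustration only.

```python
from typing import List


def invert_backward_labels(backward_ms: List[int], duration_ms: int) -> List[int]:
    """Map boundary times labeled in the reverse direction back to forward order.

    Assumption: a boundary found at time t on the reversed timeline sits at
    duration_ms - t on the forward timeline, and the list order is reversed.
    """
    return [duration_ms - t for t in reversed(backward_ms)]


def flag_suspect_boundaries(
    forward_ms: List[int], backward_ms: List[int], tolerance_ms: int
) -> List[bool]:
    """Return True for each boundary whose two labelings disagree.

    tolerance_ms is a hypothetical threshold; the abstract only says that the
    decision is based on the difference between the two sets of times.
    """
    if len(forward_ms) != len(backward_ms):
        raise ValueError("both labelings must contain the same number of boundaries")
    return [abs(f - b) > tolerance_ms for f, b in zip(forward_ms, backward_ms)]


# Example with made-up boundary times for a 500 ms utterance.
first_label = [100, 250, 400]                                 # forward labeling
second_label = invert_backward_labels([100, 220, 400], 500)   # reverse labeling, inverted
print(flag_suspect_boundaries(first_label, second_label, tolerance_ms=20))
```

In this made-up example only the middle boundary differs by more than the tolerance, so it alone is flagged for review.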

Type: Application

Filed: November 25, 2020

Publication date: February 15, 2024

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventor: Hiroki KANAGAWA

GENERATING METHOD, GENERATING DEVICE, AND GENERATING PROGRAM

Publication number: 20240038213

Abstract: A generation device (100) extracts a plurality of integrated speech samples by repeatedly executing processing of integrating a plurality of consecutive speech samples included in speech waveform information into one speech sample, and generates a compressed speech sample by compressing the plurality of integrated speech samples extracted. The generation device (100) generates a plurality of new integrated speech samples subsequent to the plurality of integrated speech samples by inputting the compressed speech sample and an acoustic feature value calculated from the speech waveform information to a speech waveform generation model, and repeatedly executes processing of inputting a compressed speech sample obtained by compressing the plurality of new integrated speech samples and the acoustic feature value to the speech waveform generation model, to generate a plurality of new integrated speech samples a plurality of times.

Type: Application

Filed: November 25, 2020

Publication date: February 1, 2024

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventor: Hiroki KANAGAWA

Acoustic model learning device, voice synthesis device, and program

Patent number: 11545135

Abstract: An acoustic model learning device is provided for obtaining an acoustic model used to synthesize voice signals with intonation. The device includes a first learning unit that learns the acoustic model to estimate synthetic acoustic feature values using voice and speaker determination models based on acoustic feature values of speakers, language feature values corresponding to the acoustic feature values and speaker data items, a second learning unit that learns the voice determination model to determine whether the synthetic acoustic feature value is a predetermined acoustic feature value or not based on the acoustic feature values and the synthetic acoustic feature values, and a third learning unit that learns the speaker determination model to determine whether the speaker of the synthetic acoustic feature value is a predetermined speaker or not based on the acoustic feature values and the synthetic acoustic feature values.

Type: Grant

Filed: September 25, 2019

Date of Patent: January 3, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Hiroki Kanagawa, Yusuke Ijima

DETECTION APPARATUS, METHOD AND PROGRAM FOR THE SAME

Publication number: 20220406289

Abstract: A detection device includes a labeling acoustic feature calculation unit configured to calculate a labeling acoustic feature from voice data, a time information acquisition unit configured to acquire a label with time information corresponding to the voice data from a label with no time information corresponding to the voice data and the labeling acoustic feature through a use of a labeling acoustic model configured to receive, as inputs, a label with no time information and a labeling acoustic feature and output a label with time information, an acoustic feature prediction unit configured to predict an acoustic feature corresponding to the label with time information and acquire a predicted value through a use of an acoustic model configured to receive, as an input, a label with time information and output an acoustic feature, an acoustic feature calculation unit configured to calculate an acoustic feature from the voice data, a difference calculation unit configured to determine an acoustic difference betwe

Type: Application

Filed: November 25, 2019

Publication date: December 22, 2022

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Hiroki KANAGAWA, Yusuke IJIMA

ACOUSTIC MODEL LEARNING DEVICE, VOICE SYNTHESIS DEVICE, AND PROGRAM

Publication number: 20220051655

Abstract: An acoustic model learning device is provided for obtaining an acoustic model used to synthesize voice signals with intonation. The device includes a first learning unit that learns the acoustic model to estimate synthetic acoustic feature values using voice and speaker determination models based on acoustic feature values of speakers, language feature values corresponding to the acoustic feature values and speaker data items, a second learning unit that learns the voice determination model to determine whether the synthetic acoustic feature value is a predetermined acoustic feature value or not based on the acoustic feature values and the synthetic acoustic feature values, and a third learning unit that learns the speaker determination model to determine whether the speaker of the synthetic acoustic feature value is a predetermined speaker or not based on the acoustic feature values and the synthetic acoustic feature values.

Type: Application

Filed: September 25, 2019

Publication date: February 17, 2022

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Hiroki KANAGAWA, Yusuke IJIMA
