Patents by Inventor Hosana KAMIYAMA
Hosana KAMIYAMA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240078999
Abstract: A learning method includes the following processes. A shuffling process acquires learning data arranged in a time series and rearranges the learning data in an order different from the order of the time series. A learning process trains an acoustic model using the learning data rearranged through the shuffling process.
Type: Application
Filed: January 15, 2021
Publication date: March 7, 2024
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Hosana KAMIYAMA, Yoshikazu YAMAGUCHI
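The shuffling process this abstract describes can be sketched in a few lines of Python. The representation of the learning data as (feature, label) pairs and the fixed seed are illustrative assumptions, not part of the publication.

```python
import random

def shuffle_time_series(learning_data, seed=0):
    """Rearrange time-series learning data into an order different
    from the original time series (the 'shuffling process')."""
    shuffled = list(learning_data)
    random.Random(seed).shuffle(shuffled)
    return shuffled

# hypothetical learning data: (feature, label) pairs in time order
data = [([0.1], "a"), ([0.2], "b"), ([0.3], "c"), ([0.4], "d")]
shuffled = shuffle_time_series(data)
```

The learning process would then iterate over `shuffled` rather than `data` when training the acoustic model.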
-
Publication number: 20230410834
Abstract: A pre-adaptation model storage unit (14) stores a satisfaction estimation model obtained by connecting a speech satisfaction estimation model part that estimates a speech satisfaction for each speech using a feature amount of each speech as an input and a conversation satisfaction estimation model part that estimates a conversation satisfaction using at least the speech satisfaction for each speech as an input. An adaptation data storage unit (15) stores adaptation data including a conversation voice in which a conversation including a plurality of speeches is recorded and a correct value of a conversation satisfaction for the conversation. A model adaptation unit (18) fixes, by using a feature amount of each speech extracted from the conversation voice and a correct value of the conversation satisfaction, a parameter of the conversation satisfaction estimation model part to update a parameter of the speech satisfaction estimation model part.
Type: Application
Filed: November 4, 2020
Publication date: December 21, 2023
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Atsushi ANDO, Hosana KAMIYAMA, Takeshi MORI, Satoshi KOBASHIKAWA
-
Patent number: 11798578
Abstract: To increase the accuracy of paralinguistic information estimation. A paralinguistic information estimation model storage unit 20 stores a paralinguistic information estimation model outputting, with a plurality of independent features as inputs, paralinguistic information estimation results. A feature extraction unit 11 extracts the features from an input utterance. A paralinguistic information estimation unit 20 estimates paralinguistic information of the input utterance from the features extracted from the input utterance, by using the paralinguistic information estimation model.
Type: Grant
Filed: October 8, 2019
Date of Patent: October 24, 2023
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Atsushi Ando, Hosana Kamiyama, Satoshi Kobashikawa
-
Patent number: 11756554
Abstract: An attribute identification technology that can reject an attribute identification result if the reliability thereof is low is provided. An attribute identification device includes: a posteriori probability calculation unit 110 that calculates, from input speech, a posteriori probability sequence {q(c, i)} which is a sequence of the posteriori probabilities q(c, i) that a frame i of the input speech is a class c; a reliability calculation unit 120 that calculates, from the posteriori probability sequence {q(c, i)}, reliability r(c) indicating the extent to which the class c is a correct attribute identification result; and an attribute identification result generating unit 130 that generates an attribute identification result L of the input speech from the posteriori probability sequence {q(c, i)} and the reliability r(c).
Type: Grant
Filed: August 23, 2021
Date of Patent: September 12, 2023
Assignee: Nippon Telegraph and Telephone Corporation
Inventors: Hosana Kamiyama, Satoshi Kobashikawa, Atsushi Ando
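The reject-if-unreliable pipeline in this abstract can be sketched as follows. Taking the reliability r(c) to be the mean of the frame posteriors q(c, i) is one plausible choice made for illustration; the patent leaves the exact definition to the claims, and the threshold value here is likewise an assumption.

```python
def identify_attribute(posteriors, threshold=0.6):
    """posteriors[c][i] holds q(c, i), the posterior probability that
    frame i of the input speech belongs to class c.  Reliability r(c)
    is computed here as the mean frame posterior for class c.
    Returns the winning class index, or None (reject) when that
    class's reliability falls below the threshold."""
    r = [sum(frames) / len(frames) for frames in posteriors]
    best = max(range(len(r)), key=r.__getitem__)
    return best if r[best] >= threshold else None

# confident input: class 0 dominates every frame -> accepted
q_high = [[0.8, 0.7, 0.9],
          [0.2, 0.3, 0.1]]
# ambiguous input: no class is reliable -> rejected (None)
q_low = [[0.5, 0.5],
         [0.5, 0.5]]
```

Returning `None` instead of a low-confidence class is what lets a downstream system fall back to, say, a human operator rather than act on a doubtful attribute.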
-
Publication number: 20230206118
Abstract: Provided is a model learning technology to learn a model in consideration of a difference in label assignment accuracy between experts and non-experts.
Type: Application
Filed: March 19, 2020
Publication date: June 29, 2023
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Hosana KAMIYAMA, Yuki KITAGISHI, Atsushi ANDO, Ryo MASUMURA, Takeshi MORI, Satoshi KOBASHIKAWA
-
Publication number: 20230095088
Abstract: The present invention provides emotion recognition technology that achieves high emotion recognition accuracy for all speakers.
Type: Application
Filed: February 28, 2020
Publication date: March 30, 2023
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Atsushi ANDO, Yuki KITAGISHI, Hosana KAMIYAMA, Takeshi MORI
-
Publication number: 20230069908
Abstract: A recognition apparatus includes a classification unit that estimates the non-linguistic and para-linguistic information label that would be imparted by an n-th listener from an acoustic feature amount of speech data to be recognized, using an n-th classification model, and an integration unit that integrates the estimation results of the non-linguistic and para-linguistic information labels for N listeners and obtains a non-linguistic and para-linguistic information estimation result for the speech data to be recognized. The n-th classification model is a classification model trained using training speech data and the non-linguistic and para-linguistic information label imparted to the training speech data by the n-th listener as training data.
Type: Application
Filed: February 21, 2020
Publication date: March 9, 2023
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Atsushi ANDO, Yuki KITAGISHI, Hosana KAMIYAMA, Takeshi MORI
-
Patent number: 11568761
Abstract: The present invention provides a pronunciation error detection apparatus capable of following a text without the need for a correct sentence even when erroneous recognition such as a reading error occurs.
Type: Grant
Filed: September 13, 2018
Date of Patent: January 31, 2023
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Satoshi Kobashikawa, Ryo Masumura, Hosana Kamiyama, Yusuke Ijima, Yushi Aono
-
Publication number: 20230013385
Abstract: A learning apparatus includes: a speaker vector learning unit configured to learn a speaker vector extraction parameter ? based on one or more items of learning speech voice data in a speaker vector voice database; a non-speaker-individuality sound model learning unit configured to create a probability distribution model using a frequency component of one or more items of non-speaker-individuality sound data in a non-speaker-individuality sound database and calculate an internal parameter of the probability distribution model; and an age level estimation model learning unit configured to extract a speaker vector from voice data in an age level estimation model-learning voice database using the speaker vector extraction parameter ?, calculate a non-speaker-individuality sound likelihood vector from voice data in the age level estimation model-learning voice database using the internal parameters ? and ?, and learn, with input of the speaker vector and the non-speaker-individuality sound likelihood vector, a pa
Type: Application
Filed: December 9, 2019
Publication date: January 19, 2023
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Yuki KITAGISHI, Takeshi MORI, Hosana KAMIYAMA, Atsushi ANDO, Satoshi KOBASHIKAWA
-
Patent number: 11557311
Abstract: Estimation accuracies of a conversation satisfaction and a speech satisfaction are improved. A learning data storage unit (10) stores learning data including a conversation voice containing a conversation including a plurality of speeches, a correct answer value of a conversation satisfaction for the conversation, and a correct answer value of a speech satisfaction for each speech included in the conversation.
Type: Grant
Filed: July 20, 2018
Date of Patent: January 17, 2023
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Atsushi Ando, Hosana Kamiyama, Satoshi Kobashikawa
-
Patent number: 11551708
Abstract: With correct emotion classes selected as correct values of an emotion of an utterer of a first utterance from among a plurality of emotion classes C1, …, CK by listeners who have listened to the first utterance, as an input, the numbers of times ni that emotion classes Ci have been selected as the correct emotion classes are obtained, and rates of the numbers of times nk to a sum total of the numbers of times n1, …, nK, or smoothed values of the rates, are obtained as correct emotion soft labels tk(s) corresponding to the first utterance.
Type: Grant
Filed: November 12, 2018
Date of Patent: January 10, 2023
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Atsushi Ando, Hosana Kamiyama, Satoshi Kobashikawa
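The soft-label computation in this abstract is a simple vote-counting step, sketched below. The listener votes, class names, and additive smoothing constant are illustrative; the abstract only specifies rates nk / (n1 + … + nK) or smoothed values thereof.

```python
from collections import Counter

def emotion_soft_labels(selected, classes, smoothing=0.0):
    """Given the emotion classes selected by listeners for one
    utterance, return the soft labels tk: the (optionally smoothed)
    rate at which each class in `classes` was chosen."""
    counts = Counter(selected)
    n = [counts.get(c, 0) + smoothing for c in classes]
    total = sum(n)
    return [x / total for x in n]

# hypothetical example: 5 listeners label one utterance
votes = ["happy", "happy", "neutral", "happy", "sad"]
labels = emotion_soft_labels(votes, ["happy", "neutral", "sad"])
# -> [0.6, 0.2, 0.2]
```

With `smoothing=1.0` the same votes yield [0.5, 0.25, 0.25], pulling the distribution toward uniform so that no class gets a hard zero from a small listener pool.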
-
Patent number: 11521641
Abstract: State-of-satisfaction change pattern models, each including a set of transition weights in state sequences of the states of satisfaction, are obtained for predetermined change patterns of the states of satisfaction, and a state-of-satisfaction estimation model for obtaining the posteriori probability of the utterance feature amount given the state of satisfaction of an utterer is obtained by using the utterance-for-learning feature amount and a correct value of the state of satisfaction of an utterer who gave an utterance for learning corresponding to the utterance-for-learning feature amount. By using the input utterance feature amount, the state-of-satisfaction change pattern models, and the state-of-satisfaction estimation model, an estimated value of the state of satisfaction of an utterer who gave an utterance corresponding to the input utterance feature amount is obtained.
Type: Grant
Filed: February 2, 2018
Date of Patent: December 6, 2022
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Atsushi Ando, Hosana Kamiyama, Satoshi Kobashikawa
-
Patent number: 11495245
Abstract: An urgency level estimation technique of estimating an urgency level of a speaker for free uttered speech, which does not require a specific word, is provided. An urgency level estimation apparatus includes a feature amount extracting part configured to extract a feature amount of an utterance from uttered speech, and an urgency level estimating part configured to estimate an urgency level of a speaker of the uttered speech from the feature amount based on a relationship, determined in advance, between a feature amount extracted from uttered speech and an urgency level of a speaker of the uttered speech. The feature amount includes at least one of a feature indicating speaking speed of the uttered speech, a feature indicating voice pitch of the uttered speech, and a feature indicating a power level of the uttered speech.
Type: Grant
Filed: November 15, 2018
Date of Patent: November 8, 2022
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Hosana Kamiyama, Satoshi Kobashikawa, Atsushi Ando
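A minimal sketch of the estimating part, assuming the "relationship determined in advance" is a linear combination of the three feature types named in the abstract. The weights, and the use of a weighted sum at all, are assumptions for illustration; the patent covers any predetermined relationship.

```python
def urgency_level(speaking_speed, pitch, power,
                  weights=(0.5, 0.3, 0.2)):
    """Combine normalized speaking-speed, pitch, and power features
    into an urgency score.  The linear form and weights are
    hypothetical stand-ins for the predetermined relationship."""
    w_speed, w_pitch, w_power = weights
    return w_speed * speaking_speed + w_pitch * pitch + w_power * power
```

Under this sketch, faster speech, higher pitch, or greater power each push the estimated urgency upward, matching the intuition behind the claimed features.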
-
Publication number: 20220335928
Abstract: An estimation apparatus clusters a group of voice signals including a voice signal having a speaker attribute to be estimated into a plurality of clusters. Subsequently, the estimation apparatus identifies, from the plurality of clusters, a cluster to which the voice signal to be estimated belongs. Next, the estimation apparatus uses a speaker attribute estimation model to estimate speaker attributes of respective voice signals in the identified cluster. After that, the estimation apparatus estimates an attribute of the entire cluster, by using an estimation result of the speaker attributes of the voice signals in the identified cluster, and outputs an estimation result of the speaker attribute of the entire cluster, as an estimation result of the speaker attribute of the voice signal to be estimated.
Type: Application
Filed: August 19, 2019
Publication date: October 20, 2022
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Naohiro TAWARA, Hosana KAMIYAMA, Satoshi KOBASHIKAWA, Atsunori OGAWA
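The final step of this pipeline, estimating one attribute for the whole cluster from the per-signal estimates, could be realized by a simple majority vote, as sketched below. Majority voting is one plausible aggregation, not necessarily the one the publication claims.

```python
from collections import Counter

def cluster_attribute(cluster_estimates):
    """Estimate the attribute of an entire cluster as the most
    frequent per-signal attribute estimate within it."""
    return Counter(cluster_estimates).most_common(1)[0][0]

# hypothetical per-signal estimates for one identified cluster
estimates = ["60s", "60s", "40s"]
attribute = cluster_attribute(estimates)  # -> "60s"
```

The aggregated result is then returned for every signal in the cluster, including the one whose attribute was originally queried.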
-
Publication number: 20220277761
Abstract: An impression estimation technique without the need of voice recognition is provided. An impression estimation device includes an estimation unit configured to estimate an impression of a voice signal s by defining p1<p2 and using a first feature amount obtained based on a first analysis time length p1 for the voice signal s and a second feature amount obtained based on a second analysis time length p2 for the voice signal s. A learning device includes a learning unit configured to learn an estimation model which estimates the impression of the voice signal by defining p1<p2 and using a first feature amount for learning obtained based on the first analysis time length p1 for a voice signal for learning sL, a second feature amount for learning obtained based on the second analysis time length p2 for the voice signal for learning sL, and an impression label imparted to the voice signal for learning sL.
Type: Application
Filed: July 29, 2019
Publication date: September 1, 2022
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Hosana KAMIYAMA, Atsushi ANDO, Satoshi KOBASHIKAWA
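The core idea of pairing a short analysis length p1 with a long one p2 can be sketched as below. Mean absolute amplitude per window is an illustrative stand-in for whatever feature amounts the claims actually cover.

```python
def windowed_means(signal, window):
    """Mean absolute amplitude per non-overlapping analysis window."""
    return [sum(abs(x) for x in signal[i:i + window]) / window
            for i in range(0, len(signal) - window + 1, window)]

def impression_features(signal, p1, p2):
    """First feature amount from the short analysis length p1,
    second from the long analysis length p2, with p1 < p2 as the
    abstract requires.  The concrete feature is hypothetical."""
    assert p1 < p2
    return windowed_means(signal, p1), windowed_means(signal, p2)
```

The short windows capture fast local variation while the long windows capture slower trends; an estimation model would consume both feature sequences together.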
-
Publication number: 20220180188
Abstract: A model is learned that is capable of accurate label estimation even if learning data is used for which the number of evaluators per piece of data is small.
Type: Application
Filed: February 25, 2020
Publication date: June 9, 2022
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Hosana KAMIYAMA, Atsushi ANDO, Satoshi KOBASHIKAWA
-
Publication number: 20220122584
Abstract: Paralinguistic information is estimated with high accuracy even when an utterance for which it is difficult to identify paralinguistic information is used for model learning. An acoustic feature extraction unit 11 extracts an acoustic feature from an utterance. An anti-teacher decision unit 12 decides, based on a paralinguistic information label indicating a determination result of paralinguistic information given by a plurality of listeners for each utterance, an anti-teacher label indicating an anti-teacher serving as incorrect paralinguistic information for the utterance. An anti-teacher estimation model learning unit 13 learns, based on an acoustic feature extracted from the utterance and the anti-teacher label, an anti-teacher estimation model for outputting a posterior probability of anti-teacher for an input acoustic feature.
Type: Application
Filed: January 27, 2020
Publication date: April 21, 2022
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Atsushi ANDO, Hosana KAMIYAMA, Satoshi KOBASHIKAWA
-
Publication number: 20220108217
Abstract: A model capable of estimating a label with high accuracy is learned even when training data involving a small number of raters per data item is used. Learning processing is performed in which a plurality of data items and label expectation values that are indicators representing degrees of correctness of individual labels on the data items are used in pairs as training data, and a model that estimates a label on an input data item is obtained.
Type: Application
Filed: January 29, 2020
Publication date: April 7, 2022
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Hosana KAMIYAMA, Satoshi KOBASHIKAWA, Atsushi ANDO, Ryo MASUMURA
-
Publication number: 20210398552
Abstract: To increase the accuracy of paralinguistic information estimation. A paralinguistic information estimation model storage unit 20 stores a paralinguistic information estimation model outputting, with a plurality of independent features as inputs, paralinguistic information estimation results. A feature extraction unit 11 extracts the features from an input utterance. A paralinguistic information estimation unit 20 estimates paralinguistic information of the input utterance from the features extracted from the input utterance, by using the paralinguistic information estimation model.
Type: Application
Filed: October 8, 2019
Publication date: December 23, 2021
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Inventors: Atsushi ANDO, Hosana KAMIYAMA, Satoshi KOBASHIKAWA
-
Publication number: 20210383812
Abstract: An attribute identification technology that can reject an attribute identification result if the reliability thereof is low is provided. An attribute identification device includes: a posteriori probability calculation unit 110 that calculates, from input speech, a posteriori probability sequence {q(c, i)} which is a sequence of the posteriori probabilities q(c, i) that a frame i of the input speech is a class c; a reliability calculation unit 120 that calculates, from the posteriori probability sequence {q(c, i)}, reliability r(c) indicating the extent to which the class c is a correct attribute identification result; and an attribute identification result generating unit 130 that generates an attribute identification result L of the input speech from the posteriori probability sequence {q(c, i)} and the reliability r(c).
Type: Application
Filed: August 23, 2021
Publication date: December 9, 2021
Applicant: Nippon Telegraph and Telephone Corporation
Inventors: Hosana KAMIYAMA, Satoshi Kobashikawa, Atsushi Ando