Patents by Inventor Hosana KAMIYAMA

Hosana KAMIYAMA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240078999
    Abstract: A learning method includes the following processes. A shuffling process acquires learning data arranged in a time series and rearranges the learning data in an order different from the order of the time series. A learning process trains an acoustic model using the learning data rearranged through the shuffling process.
    Type: Application
    Filed: January 15, 2021
    Publication date: March 7, 2024
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Hosana KAMIYAMA, Yoshikazu YAMAGUCHI
  • Publication number: 20230410834
    Abstract: A pre-adaptation model storage unit (14) stores a satisfaction estimation model obtained by connecting a speech satisfaction estimation model part that estimates a speech satisfaction for each speech using a feature amount of each speech as an input and a conversation satisfaction estimation model part that estimates a conversation satisfaction using at least the speech satisfaction for each speech as an input. An adaptation data storage unit (15) stores adaptation data including a conversation voice in which a conversation including a plurality of speeches is recorded and a correct value of a conversation satisfaction for the conversation. A model adaptation unit (18) fixes, by using a feature amount of each speech extracted from the conversation voice and a correct value of the conversation satisfaction, a parameter of the conversation satisfaction estimation model part to update a parameter of the speech satisfaction estimation model part.
    Type: Application
    Filed: November 4, 2020
    Publication date: December 21, 2023
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Atsushi ANDO, Hosana KAMIYAMA, Takeshi MORI, Satoshi KOBASHIKAWA
  • Patent number: 11798578
    Abstract: To increase the accuracy of paralinguistic information estimation. A paralinguistic information estimation model storage unit 20 stores a paralinguistic information estimation model outputting, with a plurality of independent features as inputs, paralinguistic information estimation results. A feature extraction unit 11 extracts the features from an input utterance. A paralinguistic information estimation unit 20 estimates paralinguistic information of the input utterance from the features extracted from the input utterance, by using the paralinguistic information estimation model.
    Type: Grant
    Filed: October 8, 2019
    Date of Patent: October 24, 2023
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Atsushi Ando, Hosana Kamiyama, Satoshi Kobashikawa
  • Patent number: 11756554
    Abstract: An attribute identification technology that can reject an attribute identification result if the reliability thereof is low is provided. An attribute identification device includes: a posteriori probability calculation unit 110 that calculates, from input speech, a posteriori probability sequence {q(c, i)} which is a sequence of the posteriori probabilities q(c, i) that a frame i of the input speech is a class c; a reliability calculation unit 120 that calculates, from the posteriori probability sequence {q(c, i)}, reliability r(c) indicating the extent to which the class c is a correct attribute identification result; and an attribute identification result generating unit 130 that generates an attribute identification result L of the input speech from the posteriori probability sequence {q(c, i)} and the reliability r(c).
    Type: Grant
    Filed: August 23, 2021
    Date of Patent: September 12, 2023
    Assignee: Nippon Telegraph and Telephone Corporation
    Inventors: Hosana Kamiyama, Satoshi Kobashikawa, Atsushi Ando
  • Publication number: 20230206118
    Abstract: Provided is a model learning technology to learn a model in consideration of a difference in label assignment accuracy between experts and non-experts.
    Type: Application
    Filed: March 19, 2020
    Publication date: June 29, 2023
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Hosana KAMIYAMA, Yuki KITAGISHI, Atsushi ANDO, Ryo MASUMURA, Takeshi MORI, Satoshi KOBASHIKAWA
  • Publication number: 20230095088
    Abstract: The present invention provides emotion recognition technology that achieves high emotion recognition accuracy for all speakers.
    Type: Application
    Filed: February 28, 2020
    Publication date: March 30, 2023
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Atsushi ANDO, Yuki KITAGISHI, Hosana KAMIYAMA, Takeshi MORI
  • Publication number: 20230069908
    Abstract: A recognition apparatus includes a classification unit that estimates a non-linguistic and para-linguistic information label to be imparted by an n-th listener from an acoustic feature amount of speech data to be recognized using an n-th classification model, and an integration unit that integrates estimation results of the non-linguistic and para-linguistic information labels for N listeners and obtains non-linguistic and para-linguistic information estimation results as a recognition apparatus for the speech data to be recognized, and the n-th classification model is a classification model trained using training speech data and a non-linguistic and para-linguistic information label imparted to the training speech data by the n-th listener as training data.
    Type: Application
    Filed: February 21, 2020
    Publication date: March 9, 2023
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Atsushi ANDO, Yuki KITAGISHI, Hosana KAMIYAMA, Takeshi MORI
  • Patent number: 11568761
    Abstract: The present invention provides a pronunciation error detection apparatus capable of following a text without the need for a correct sentence even when erroneous recognition such as a reading error occurs.
    Type: Grant
    Filed: September 13, 2018
    Date of Patent: January 31, 2023
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Satoshi Kobashikawa, Ryo Masumura, Hosana Kamiyama, Yusuke Ijima, Yushi Aono
  • Publication number: 20230013385
    Abstract: A learning apparatus includes: a speaker vector learning unit configured to learn a speaker vector extraction parameter ? based on one or more items of learning speech voice data in a speaker vector voice database; a non-speaker-individuality sound model learning unit configured to create a probability distribution model using a frequency component of one or more items of non-speaker-individuality sound data in a non-speaker-individuality sound database and calculate an internal parameter of the probability distribution model; and an age level estimation model learning unit configured to extract a speaker vector from voice data in an age level estimation model-learning voice database using the speaker vector extraction parameter ?, calculate a non-speaker-individuality sound likelihood vector from voice data in the age level estimation model-learning voice database using the internal parameters ? and ?, and learn, with input of the speaker vector and the non-speaker-individuality sound likelihood vector, a pa
    Type: Application
    Filed: December 9, 2019
    Publication date: January 19, 2023
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Yuki KITAGISHI, Takeshi MORI, Hosana KAMIYAMA, Atsushi ANDO, Satoshi KOBASHIKAWA
  • Patent number: 11557311
    Abstract: Estimation accuracies of a conversation satisfaction and a speech satisfaction are improved. A learning data storage unit (10) stores learning data including a conversation voice containing a conversation including a plurality of speeches, a correct answer value of a conversation satisfaction for the conversation, and a correct answer value of a speech satisfaction for each speech included in the conversation.
    Type: Grant
    Filed: July 20, 2018
    Date of Patent: January 17, 2023
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Atsushi Ando, Hosana Kamiyama, Satoshi Kobashikawa
  • Patent number: 11551708
    Abstract: With correct emotion classes selected as correct values of an emotion of an utterer of a first utterance from among a plurality of emotion classes C1, . . . , CK by listeners who have listened to the first utterance, as an input, the numbers of times ni that emotion classes Ci have been selected as the correct emotion classes are obtained, and rates of the numbers of times nk to a sum total of the numbers of times n1, . . . , nK or smoothed values of the rates are obtained as correct emotion soft labels tk(s) corresponding to the first utterance.
    Type: Grant
    Filed: November 12, 2018
    Date of Patent: January 10, 2023
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Atsushi Ando, Hosana Kamiyama, Satoshi Kobashikawa
  • Patent number: 11521641
    Abstract: State-of-satisfaction change pattern models each including a set of transition weights in state sequences of the states of satisfaction are obtained for predetermined change patterns of the states of satisfaction, and a state-of-satisfaction estimation model for obtaining the posteriori probability of the utterance feature amount given the state of satisfaction of an utterer is obtained by using the utterance-for-learning feature amount and a correct value of the state of satisfaction of an utterer who gave an utterance for learning corresponding to the utterance-for-learning feature amount. By using the input utterance feature amount and the state-of-satisfaction change pattern models and the state-of-satisfaction estimation model, an estimated value of the state of satisfaction of an utterer who gave an utterance corresponding to the input utterance feature amount is obtained.
    Type: Grant
    Filed: February 2, 2018
    Date of Patent: December 6, 2022
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Atsushi Ando, Hosana Kamiyama, Satoshi Kobashikawa
  • Patent number: 11495245
    Abstract: An urgency level estimation technique of estimating an urgency level of a speaker for free uttered speech, which does not require a specific word, is provided. An urgency level estimation apparatus includes a feature amount extracting part configured to extract a feature amount of an utterance from uttered speech, and an urgency level estimating part configured to estimate an urgency level of a speaker of the uttered speech from the feature amount based on a relationship between a feature amount extracted from uttered speech and an urgency level of a speaker of the uttered speech, the relationship being determined in advance, and the feature amount includes at least one of a feature indicating speaking speed of the uttered speech, a feature indicating voice pitch of the uttered speech and a feature indicating a power level of the uttered speech.
    Type: Grant
    Filed: November 15, 2018
    Date of Patent: November 8, 2022
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Hosana Kamiyama, Satoshi Kobashikawa, Atsushi Ando
  • Publication number: 20220335928
    Abstract: An estimation apparatus clusters a group of voice signals including a voice signal having a speaker attribute to be estimated into a plurality of clusters. Subsequently, the estimation apparatus identifies, from the plurality of clusters, a duster to which the voice signal to be estimated belongs. Next, the estimation apparatus uses a speaker attribute estimation model to estimate speaker attributes of respective voice signals in the identified cluster. After that, the estimation apparatus estimates an attribute of the entire cluster, by using an estimation result of the speaker attributes of the voice signals in the identified cluster, and outputs an estimation result of the speaker attribute of the entire cluster, as an estimation result of the speaker attribute of the voice signal to be estimated.
    Type: Application
    Filed: August 19, 2019
    Publication date: October 20, 2022
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Naohiro TAWARA, Hosana KAMIYAMA, Satoshi KOBASHIKAWA, Atsunori OGAWA
  • Publication number: 20220277761
    Abstract: An impression estimation technique without the need of voice recognition is provided. An impression estimation device includes an estimation unit configured to estimate an impression of a voice signal s by defining p1<p2 and using a first feature amount obtained based on a first analysis time length p1 for the voice signal s and a second feature amount obtained based on a second analysis time length p2 for the voice signal s. A learning device includes a learning unit configured to learn an estimation model which estimates the impression of the voice signal by defining p1<p2 and using a first feature amount for learning obtained based on the first analysis time length p1 for a voice signal for learning sL, a second feature amount for learning obtained based on the second analysis time length p2 for the voice signal for learning sL, and an impression label imparted to the voice signal for learning sL.
    Type: Application
    Filed: July 29, 2019
    Publication date: September 1, 2022
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Hosana KAMIYAMA, Atsushi ANDO, Satoshi KOBASHIKAWA
  • Publication number: 20220180188
    Abstract: A model is learned that is capable of accurate label estimation even if learning data is used for which the number of evaluators per piece of data is small.
    Type: Application
    Filed: February 25, 2020
    Publication date: June 9, 2022
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Hosana KAMIYAMA, Atsushi ANDO, Satoshi KOBASHIKAWA
  • Publication number: 20220122584
    Abstract: Paralinguistic information is estimated with high accuracy even when an utterance for which it is difficult to identify paralinguistic information is used for model learning. An acoustic feature extraction unit 11 extracts an acoustic feature from an utterance. An anti-teacher decision unit 12 decides, based on a paralinguistic information label indicating a determination result of paralinguistic information given by a plurality of listeners for each utterance, an anti-teacher label indicating an anti-teacher serving as incorrect paralinguistic information for the utterance. An anti-teacher estimation model learning unit 13 learns, based on an acoustic feature extracted from the utterance and the anti-teacher label, an anti-teacher estimation model for outputting a posterior probability of anti-teacher for an input acoustic feature.
    Type: Application
    Filed: January 27, 2020
    Publication date: April 21, 2022
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Atsushi ANDO, Hosana KAMIYAMA, Satoshi KOBASHIKAWA
  • Publication number: 20220108217
    Abstract: A model capable of estimating a label with high accuracy is learned even when training data involving a small number of raters per data item is used. Learning processing is performed in which a plurality of data items and label expectation values that are indicators representing degrees of correctness of individual labels on the data items are used in pairs as training data, and a model that estimates a label on an input data item is obtained.
    Type: Application
    Filed: January 29, 2020
    Publication date: April 7, 2022
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Hosana KAMIYAMA, Satoshi KOBASHIKAWA, Atsushi ANDO, Ryo MASUMURA
  • Publication number: 20210398552
    Abstract: To increase the accuracy of paralinguistic information estimation. A paralinguistic information estimation model storage unit 20 stores a paralinguistic information estimation model outputting, with a plurality of independent features as inputs, paralinguistic information estimation results. A feature extraction unit 11 extracts the features from an input utterance. A paralinguistic information estimation unit 20 estimates paralinguistic information of the input utterance from the features extracted from the input utterance, by using the paralinguistic information estimation model.
    Type: Application
    Filed: October 8, 2019
    Publication date: December 23, 2021
    Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Atsushi ANDO, Hosana KAMIYAMA, Satoshi KOBASHIKAWA
  • Publication number: 20210383812
    Abstract: An attribute identification technology that can reject an attribute identification result if the reliability thereof is low is provided. An attribute identification device includes: a posteriori probability calculation unit 110 that calculates, from input speech, a posteriori probability sequence {q(c, i)} which is a sequence of the posteriori probabilities q(c, i) that a frame i of the input speech is a class c; a reliability calculation unit 120 that calculates, from the posteriori probability sequence {q(c, i)}, reliability r(c) indicating the extent to which the class c is a correct attribute identification result; and an attribute identification result generating unit 130 that generates an attribute identification result L of the input speech from the posteriori probability sequence {q(c, i)} and the reliability r(c).
    Type: Application
    Filed: August 23, 2021
    Publication date: December 9, 2021
    Applicant: Nippon Telegraph and Telephone Corporation
    Inventors: Hosana KAMIYAMA, Satoshi Kobashikawa, Atsushi Ando