Patents by Inventor Kousuke ITAKURA

Kousuke ITAKURA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240282313
    Abstract: A speaker recognition device acquires a registered voice, converts the acquired registered voice to a plurality of property converted voices having respective acoustic properties different from each other, extracts a speaker feature indicative of a characteristic of a speaker from the registered voice, extracts a speaker feature from each of the property converted voices, compares all pairs of speaker features of a part or all of the speaker feature extracted from the registered voice and the speaker features extracted from the property converted voices, and calculates a threshold used for recognition of a speaker of an input voice on the basis of a result of the comparison.
    Type: Application
    Filed: May 2, 2024
    Publication date: August 22, 2024
    Applicant: Panasonic Intellectual Property Corporation of America
    Inventor: Kousuke ITAKURA
  • Publication number: 20240273883
    Abstract: An information processing device performs: acquiring a face similarity indicating a similarity between a face of a first person and a face of a second person; acquiring a voice similarity indicating a similarity between a voice of the first person and a voice of the second person; calculating an integrated similarity by integrating the face similarity and the voice similarity, and determining the integrated similarity as a final similarity when the face similarity falls within an integrated range including a threshold which is used to determine whether the first person and the second person are identical to each other, and calculating the face similarity as a final similarity when the face similarity is out of the integrated range; and outputting the final similarity.
    Type: Application
    Filed: April 24, 2024
    Publication date: August 15, 2024
    Applicant: Panasonic Intellectual Property Corporation of America
    Inventors: Shintaro OKADA, Masanari MIYAMOTO, Kousuke ITAKURA
  • Publication number: 20240112682
    Abstract: An utterer identification device executes: performing voice recognition from input utterance data; selecting, from among a plurality of registered utterance contents set in advance, a registered utterance content closest to a recognized utterance content indicated by a result of the voice recognition as a selected utterance content; selecting, from among a plurality of databases respectively associated with the registered utterance contents, a database associated with the selected utterance content; calculating a similarity between a feature quantity of the input utterance data and a feature quantity stored in the selected database; and identifying a certain utterer on the basis of the similarity, and outputting a result of the identification.
    Type: Application
    Filed: December 7, 2023
    Publication date: April 4, 2024
    Applicant: Panasonic Intellectual Property Corporation of America
    Inventors: Takahiro KAMAI, Misaki DOI, Katsunori DAIMO, Kousuke ITAKURA
  • Publication number: 20240105174
    Abstract: A voice recognition device includes an estimation unit that compares a plurality of pieces of registration voice data stored in a database with input voice data uttered by a speaker who gets on a mobile body to estimate a registration command corresponding to the input command, a presentation unit that presents an estimation result, a second acquisition unit that acquires an error instruction indicating that the estimation result is an error, a determination unit that, in a case where the error instruction is acquired, determines a correct command corresponding to the input command based on an operation by the speaker, and a database management unit that stores the correct command and the input voice data in the database in association with each other.
    Type: Application
    Filed: December 4, 2023
    Publication date: March 28, 2024
    Applicant: Panasonic Intellectual Property Corporation of America
    Inventors: Takahiro KAMAI, Katsunori DAIMO, Misaki DOI, Kousuke ITAKURA
  • Publication number: 20240087570
    Abstract: A voice recognition device includes: a calculation unit that calculates a first feature amount that is a feature amount of input voice data acquired by a first acquisition unit; an estimation unit that estimates a driving situation of a mobile object on the basis of operation information acquired by a second acquisition unit; an extraction unit that extracts, from a feature amount database, a second feature amount corresponding to the driving situation; a recognition unit that recognizes an input command on the basis of similarity between the first feature amount and the second feature amount; and an output unit that outputs a recognition result.
    Type: Application
    Filed: November 22, 2023
    Publication date: March 14, 2024
    Applicant: Panasonic Intellectual Property Corporation of America
    Inventors: Takahiro KAMAI, Kousuke ITAKURA, Misaki DOI, Katsunori DAIMO
  • Patent number: 11580989
    Abstract: A training method of training a speaker identification model which receives voice data as an input and outputs speaker identification information for identifying a speaker of an utterance included in the voice data is provided. The training method includes: performing voice quality conversion of first voice data of a first speaker to generate second voice data of a second speaker; and performing training of the speaker identification model using, as training data, the first voice data and the second voice data.
    Type: Grant
    Filed: August 18, 2020
    Date of Patent: February 14, 2023
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Misaki Doi, Takahiro Kamai, Kousuke Itakura
  • Publication number: 20230016655
    Abstract: A speaker identification device acquires identification target voice data; acquires registered voice data; selects a first speaker identification model machine-learned using male voice data to identify a male speaker in a case where one of a sex of a speaker of the identification target voice data and a sex of a speaker of the registered voice data is male, and selects a second speaker identification model machine-learned using female voice data to identify a female speaker in a case where one of a sex of the speaker of the identification target voice data and a sex of the speaker of the registered voice data is female; and inputs a feature amount of the identification target voice data and a feature amount of the registered voice data to one of the selected first speaker identification model and second speaker identification model to identify the speaker of the identification target voice data.
    Type: Application
    Filed: September 21, 2022
    Publication date: January 19, 2023
    Applicant: Panasonic Intellectual Property Corporation of America
    Inventor: Kousuke ITAKURA
  • Patent number: 11501209
    Abstract: In a behavior identification method, surrounding sound is acquired, a feature value that is specified by a spectrum pattern included in spectrum information generated from sound made by a person performing a predetermined behavior is extracted from the sound acquired, the predetermined behavior is identified by the feature value, and information indicating the predetermined behavior identified is output.
    Type: Grant
    Filed: November 12, 2019
    Date of Patent: November 15, 2022
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Kousuke Itakura, Ko Mizuno
  • Patent number: 11315550
    Abstract: A speaker recognition device according to the present disclosure includes: an acoustic feature calculator that calculates, from utterance data indicating a voice of an obtained utterance, acoustic feature of the voice of the utterance; a statistic calculator that calculates an utterance data statistic from the calculated acoustic feature; a speaker feature extractor that extracts speaker feature of a speaker of the utterance data from the calculated utterance data statistic using a deep neural network (DNN); a similarity calculator that calculates a similarity between the extracted speaker feature and pre-stored speaker feature of at least one registered speaker; and a speaker recognizer that recognizes the speaker of the utterance data based on the calculated similarity.
    Type: Grant
    Filed: November 13, 2019
    Date of Patent: April 26, 2022
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Kousuke Itakura, Ko Mizuno, Misaki Doi
  • Patent number: 11222641
    Abstract: A speaker recognition device includes: a feature calculator that calculates two or more acoustic features of a voice of an utterance obtained; a similarity calculator that calculates two or more similarities, each being a similarity between one of one or more speaker-specific features of a target speaker for recognition and one of the two or more acoustic features; a combination unit that combines the two or more similarities to obtain a combined value; and a determiner that determines whether a speaker of the utterance is the target speaker based on the combined value. Here, (i) at least two of the two or more acoustic features have different properties, (ii) at least two of the two or more similarities have different properties, or (iii) at least two of the two or more acoustic features have different properties and at least two of the two or more similarities have different properties.
    Type: Grant
    Filed: September 19, 2019
    Date of Patent: January 11, 2022
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventor: Kousuke Itakura
  • Publication number: 20210056955
    Abstract: A training method of training a speaker identification model which receives voice data as an input and outputs speaker identification information for identifying a speaker of an utterance included in the voice data is provided. The training method includes: performing voice quality conversion of first voice data of a first speaker to generate second voice data of a second speaker; and performing training of the speaker identification model using, as training data, the first voice data and the second voice data.
    Type: Application
    Filed: August 18, 2020
    Publication date: February 25, 2021
    Inventors: Misaki DOI, Takahiro KAMAI, Kousuke ITAKURA
  • Publication number: 20200160846
    Abstract: A speaker recognition device according to the present disclosure includes: an acoustic feature calculator that calculates, from utterance data indicating a voice of an obtained utterance, acoustic feature of the voice of the utterance; a statistic calculator that calculates an utterance data statistic from the calculated acoustic feature; a speaker feature extractor that extracts speaker feature of a speaker of the utterance data from the calculated utterance data statistic using a deep neural network (DNN); a similarity calculator that calculates a similarity between the extracted speaker feature and pre-stored speaker feature of at least one registered speaker; and a speaker recognizer that recognizes the speaker of the utterance data based on the calculated similarity.
    Type: Application
    Filed: November 13, 2019
    Publication date: May 21, 2020
    Inventors: Kousuke ITAKURA, Ko MIZUNO, Misaki DOI
  • Publication number: 20200160218
    Abstract: In a behavior identification method, surrounding sound is acquired, a feature value that is specified by a spectrum pattern included in spectrum information generated from sound made by a person performing a predetermined behavior is extracted from the sound acquired, the predetermined behavior is identified by the feature value, and information indicating the predetermined behavior identified is output.
    Type: Application
    Filed: November 12, 2019
    Publication date: May 21, 2020
    Inventors: Kousuke ITAKURA, Ko MIZUNO
  • Publication number: 20200111496
    Abstract: A speaker recognition device includes: a feature calculator that calculates two or more acoustic features of a voice of an utterance obtained; a similarity calculator that calculates two or more similarities, each being a similarity between one of one or more speaker-specific features of a target speaker for recognition and one of the two or more acoustic features; a combination unit that combines the two or more similarities to obtain a combined value; and a determiner that determines whether a speaker of the utterance is the target speaker based on the combined value. Here, (i) at least two of the two or more acoustic features have different properties, (ii) at least two of the two or more similarities have different properties, or (iii) at least two of the two or more acoustic features have different properties and at least two of the two or more similarities have different properties.
    Type: Application
    Filed: September 19, 2019
    Publication date: April 9, 2020
    Inventor: Kousuke ITAKURA
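
The decision rule in the abstract of publication 20240273883 above (use an integrated face-and-voice similarity only when the face similarity falls inside a range around the decision threshold, otherwise use the face similarity alone) can be sketched as follows. This is a minimal illustration, not the method claimed in the application: the function name, the symmetric `margin` around the threshold, the weighted average used for integration, and all default values are assumptions made for this example.

```python
def final_similarity(face_sim, voice_sim, threshold=0.5, margin=0.1, face_weight=0.5):
    """Return a final similarity score for two persons.

    Assumptions (not taken from the abstract): the "integrated range" is
    [threshold - margin, threshold + margin], and "integrating" means a
    weighted average of the face and voice similarities.
    """
    lower = threshold - margin
    upper = threshold + margin

    if lower <= face_sim <= upper:
        # Face similarity is close to the threshold, so the face alone is
        # ambiguous: combine it with the voice similarity.
        return face_weight * face_sim + (1.0 - face_weight) * voice_sim

    # Face similarity is clearly above or below the threshold: use it as is.
    return face_sim


if __name__ == "__main__":
    # Ambiguous face score: the voice score contributes to the result.
    print(final_similarity(0.55, 0.95))
    # Clearly low face score: the voice score is ignored.
    print(final_similarity(0.20, 0.99))
```

In this sketch the voice score only influences borderline cases, which mirrors the abstract's distinction between face similarities inside and outside the integrated range.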