Patents by Inventor Kousuke ITAKURA

Kousuke ITAKURA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Voice recognition device, voice recognition method, and non-transitory computer readable recording medium

Patent number: 12626699

Abstract: A voice recognition device includes an estimation unit that compares a plurality of pieces of registration voice data stored in a database with input voice data uttered by a speaker who gets on a mobile body to estimate a registration command corresponding to the input command, a presentation unit that presents an estimation result, a second acquisition unit that acquires an error instruction indicating that the estimation result is an error, a determination unit that, in a case where the error instruction is acquired, determines a correct command corresponding to the input command based on an operation by the speaker, and a database management unit that stores the correct command and the input voice data in the database in association with each other.

Type: Grant

Filed: December 4, 2023

Date of Patent: May 12, 2026

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Takahiro Kamai, Katsunori Daimo, Misaki Doi, Kousuke Itakura
Information processing method, information processing device, and non-transitory computer readable recording medium storing information processing program

Patent number: 12592238

Abstract: A speaker recognition device acquires a registered voice, converts the acquired registered voice to a plurality of property converted voices having respective acoustic properties different from each other, extracts a speaker feature indicative of a characteristic of a speaker from the registered voice, extracts a speaker feature from each of the property converted voices, compares all pairs of speaker features of a part or all of the speaker feature extracted from the registered voice and the speaker features extracted from the property converted voices, and calculates a threshold used for recognition of a speaker of an input voice on the basis of a result of the comparison.

Type: Grant

Filed: May 2, 2024

Date of Patent: March 31, 2026

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventor: Kousuke Itakura
Speaker identification method, speaker identification device, non-transitory computer readable recording medium storing speaker identification program, sex identification model generation method, and speaker identification model generation method

Patent number: 12354606

Abstract: A speaker identification device acquires identification target voice data; acquires registered voice data; selects a first speaker identification model machine-learned using male voice data to identify a male speaker in a case where one of a sex of a speaker of the identification target voice data and a sex of a speaker of the registered voice data is male, and selects a second speaker identification model machine-learned using female voice data to identify a female speaker in a case where one of a sex of the speaker of the identification target voice data and a sex of the speaker of the registered voice data is female; and inputs a feature amount of the identification target voice data and a feature amount of the registered voice data to one of the selected first speaker identification model and second speaker identification model to identify the speaker of the identification target voice data.

Type: Grant

Filed: September 21, 2022

Date of Patent: July 8, 2025

Assignee: Panasonic Intellectual Property Corporation of America

Inventor: Kousuke Itakura
INFORMATION PROCESSING METHOD, INFORMATION PROCESSING DEVICE, AND NON-TRANSITORY COMPUTER READABLE RECORDING MEDIUM STORING INFORMATION PROCESSING PROGRAM

Publication number: 20240282313

Abstract: A speaker recognition device acquires a registered voice, converts the acquired registered voice to a plurality of property converted voices having respective acoustic properties different from each other, extracts a speaker feature indicative of a characteristic of a speaker from the registered voice, extracts a speaker feature from each of the property converted voices, compares all pairs of speaker features of a part or all of the speaker feature extracted from the registered voice and the speaker features extracted from the property converted voices, and calculates a threshold used for recognition of a speaker of an input voice on the basis of a result of the comparison.

Type: Application

Filed: May 2, 2024

Publication date: August 22, 2024

Applicant: Panasonic Intellectual Property Corporation of America

Inventor: Kousuke ITAKURA
INFORMATION PROCESSING METHOD, INFORMATION PROCESSING DEVICE, AND NON-TRANSITORY COMPUTER READABLE RECORDING MEDIUM

Publication number: 20240273883

Abstract: An information processing device performs: acquiring a face similarity indicating a similarity between a face of a first person and a face of a second person; acquiring a voice similarity indicating a similarity between a voice of the first person and a voice of the second person; calculating an integrated similarity by integrating the face similarity and the voice similarity, and determining the integrated similarity as a final similarity when the face similarity falls within an integrated range including a threshold which is used to determine whether the first person and the second person are identical to each other, and calculating the face similarity as a final similarity when the face similarity is out of the integrated range; and outputting the final similarity.

Type: Application

Filed: April 24, 2024

Publication date: August 15, 2024

Applicant: Panasonic Intellectual Property Corporation of America

Inventors: Shintaro OKADA, Masanari MIYAMOTO, Kousuke ITAKURA
SPEAKER IDENTIFICATION METHOD, SPEAKER IDENTIFICATION DEVICE, AND NON-TRANSITORY COMPUTER READABLE RECORDING MEDIUM

Publication number: 20240112682

Abstract: An utterer identification device executes: performing voice recognition from input utterance data; selecting, from among a plurality of registered utterance contents set in advance, a registered utterance content closest to a recognized utterance content indicated by a result of the voice recognition as a selected utterance content; selecting, from among a plurality of databases respectively associated with the registered utterance contents, a database associated with the selected utterance content; calculating a similarity between a feature quantity of the input utterance data and a feature quantity stored in the selected database; and identifying a certain utterer on the basis of the similarity, and outputting a result of the identification.

Type: Application

Filed: December 7, 2023

Publication date: April 4, 2024

Applicant: Panasonic Intellectual Property Corporation of America

Inventors: Takahiro KAMAI, Misaki DOI, Katsunori DAIMO, Kousuke ITAKURA
VOICE RECOGNITION DEVICE, VOICE RECOGNITION METHOD, AND NON-TRANSITORY COMPUTER READABLE RECORDING MEDIUM

Publication number: 20240105174

Abstract: A voice recognition device includes an estimation unit that compares a plurality of pieces of registration voice data stored in a database with input voice data uttered by a speaker who gets on a mobile body to estimate a registration command corresponding to the input command, a presentation unit that presents an estimation result, a second acquisition unit that acquires an error instruction indicating that the estimation result is an error, a determination unit that, in a case where the error instruction is acquired, determines a correct command corresponding to the input command based on an operation by the speaker, and a database management unit that stores the correct command and the input voice data in the database in association with each other

Type: Application

Filed: December 4, 2023

Publication date: March 28, 2024

Applicant: Panasonic Intellectual Property Corporation of America

Inventors: Takahiro KAMAI, Katsunori DAIMO, Misaki DOI, Kousuke ITAKURA
VOICE RECOGNITION DEVICE, VOICE RECOGNITION METHOD, AND NON-TRANSITORY COMPUTER READABLE RECORDING MEDIUM

Publication number: 20240087570

Abstract: A voice recognition device includes: a calculation unit that calculates a first feature amount that is a feature amount of input voice data acquired by a first acquisition unit; an estimation unit that estimates a driving situation of a mobile object on the basis of operation information acquired by a second acquisition unit; an extraction unit that extracts, from a feature amount database, a second feature amount corresponding to the driving situation; a recognition unit that recognizes an input command on the basis of similarity between the first feature amount and the second feature amount; and an output unit that outputs a recognition result.

Type: Application

Filed: November 22, 2023

Publication date: March 14, 2024

Applicant: Panasonic Intellectual Property Corporation of America

Inventors: Takahiro KAMAI, Kousuke ITAKURA, Misaki DOI, Katsunori DAIMO
Training method of a speaker identification model based on a first language and a second language

Patent number: 11580989

Abstract: A training method of training a speaker identification model which receives voice data as an input and outputs speaker identification information for identifying a speaker of an utterance included in the voice data is provided. The training method includes: performing voice quality conversion of first voice data of a first speaker to generate second voice data of a second speaker; and performing training of the speaker identification model using, as training data, the first voice data and the second voice data.

Type: Grant

Filed: August 18, 2020

Date of Patent: February 14, 2023

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Misaki Doi, Takahiro Kamai, Kousuke Itakura
SPEAKER IDENTIFICATION METHOD, SPEAKER IDENTIFICATION DEVICE, NON-TRANSITORY COMPUTER READABLE RECORDING MEDIUM STORING SPEAKER IDENTIFICATION PROGRAM, SEX IDENTIFICATION MODEL GENERATION METHOD, AND SPEAKER IDENTIFICATION MODEL GENERATION METHOD

Publication number: 20230016655

Abstract: A speaker identification device acquires identification target voice data; acquires registered voice data; selects a first speaker identification model machine-learned using male voice data to identify a male speaker in a case where one of a sex of a speaker of the identification target voice data and a sex of a speaker of the registered voice data is male, and selects a second speaker identification model machine-learned using female voice data to identify a female speaker in a case where one of a sex of the speaker of the identification target voice data and a sex of the speaker of the registered voice data is female; and inputs a feature amount of the identification target voice data and a feature amount of the registered voice data to one of the selected first speaker identification model and second speaker identification model to identify the speaker of the identification target voice data.

Type: Application

Filed: September 21, 2022

Publication date: January 19, 2023

Applicant: Panasonic Intellectual Property Corporation of America

Inventor: Kousuke ITAKURA
Behavior identification method, behavior identification device, non-transitory computer-readable recording medium recording therein behavior identification program, machine learning method, machine learning device, and non-transitory computer-readable recording medium recording therein machine learning program

Patent number: 11501209

Abstract: In a behavior identification method, surrounding sound is acquired, a feature value that is specified by a spectrum pattern included in spectrum information generated from sound made by a person performing a predetermined behavior is extracted from the sound acquired, the predetermined behavior is identified by the feature value, and information indicating the predetermined behavior identified is output.

Type: Grant

Filed: November 12, 2019

Date of Patent: November 15, 2022

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Kousuke Itakura, Ko Mizuno
Speaker recognition device, speaker recognition method, and recording medium

Patent number: 11315550

Abstract: A speaker recognition device according to the present disclosure includes: an acoustic feature calculator that calculates, from utterance data indicating a voice of an obtained utterance, acoustic feature of the voice of the utterance; a statistic calculator that calculates an utterance data statistic from the calculated acoustic feature; a speaker feature extractor that extracts speaker feature of a speaker of the utterance data from the calculated utterance data statistic using a deep neural network (DNN); a similarity calculator that calculates a similarity between the extracted speaker feature and pre-stored speaker feature of at least one registered speaker; and a speaker recognizer that recognizes the speaker of the utterance data based on the calculated similarity.

Type: Grant

Filed: November 13, 2019

Date of Patent: April 26, 2022

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Kousuke Itakura, Ko Mizuno, Misaki Doi
Speaker recognition device, speaker recognition method, and recording medium

Patent number: 11222641

Abstract: A speaker recognition device includes: a feature calculator that calculates two or more acoustic features of a voice of an utterance obtained; a similarity calculator that calculates two or more similarities, each being a similarity between one of one or more speaker-specific features of a target speaker for recognition and one of the two or more acoustic features; a combination unit that combines the two or more similarities to obtain a combined value; and a determiner that determines whether a speaker of the utterance is the target speaker based on the combined value. Here, (i) at least two of the two or more acoustic features have different properties, (ii) at least two of the two or more similarities have different properties, or (iii) at least two of the two or more acoustic features have different properties and at least two of the two or more similarities have different properties.

Type: Grant

Filed: September 19, 2019

Date of Patent: January 11, 2022

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventor: Kousuke Itakura
TRAINING METHOD, SPEAKER IDENTIFICATION METHOD, AND RECORDING MEDIUM

Publication number: 20210056955

Abstract: A training method of training a speaker identification model which receives voice data as an input and outputs speaker identification information for identifying a speaker of an utterance included in the voice data is provided. The training method includes: performing voice quality conversion of first voice data of a first speaker to generate second voice data of a second speaker; and performing training of the speaker identification model using, as training data, the first voice data and the second voice data.

Type: Application

Filed: August 18, 2020

Publication date: February 25, 2021

Inventors: Misaki DOI, Takahiro KAMAI, Kousuke ITAKURA
BEHAVIOR IDENTIFICATION METHOD, BEHAVIOR IDENTIFICATION DEVICE, NON-TRANSITORY COMPUTER-READABLE RECORDING MEDIUM RECORDING THEREIN BEHAVIOR IDENTIFICATION PROGRAM, MACHINE LEARNING METHOD, MACHINE LEARNING DEVICE, AND NON-TRANSITORY COMPUTER-READABLE RECORDING MEDIUM RECORDING THEREIN MACHINE LEARNING PROGRAM

Publication number: 20200160218

Abstract: In a behavior identification method, surrounding sound is acquired, a feature value that is specified by a spectrum pattern included in spectrum information generated from sound made by a person performing a predetermined behavior is extracted from the sound acquired, the predetermined behavior is identified by the feature value, and information indicating the predetermined behavior identified is output.

Type: Application

Filed: November 12, 2019

Publication date: May 21, 2020

Inventors: Kousuke ITAKURA, Ko MIZUNO
SPEAKER RECOGNITION DEVICE, SPEAKER RECOGNITION METHOD, AND RECORDING MEDIUM

Publication number: 20200160846

Abstract: A speaker recognition device according to the present disclosure includes: an acoustic feature calculator that calculates, from utterance data indicating a voice of an obtained utterance, acoustic feature of the voice of the utterance; a statistic calculator that calculates an utterance data statistic from the calculated acoustic feature; a speaker feature extractor that extracts speaker feature of a speaker of the utterance data from the calculated utterance data statistic using a deep neural network (DNN); a similarity calculator that calculates a similarity between the extracted speaker feature and pre-stored speaker feature of at least one registered speaker; and a speaker recognizer that recognizes the speaker of the utterance data based on the calculated similarity.

Type: Application

Filed: November 13, 2019

Publication date: May 21, 2020

Inventors: Kousuke ITAKURA, Ko MIZUNO, Misaki DOI
SPEAKER RECOGNITION DEVICE, SPEAKER RECOGNITION METHOD, AND RECORDING MEDIUM

Publication number: 20200111496

Abstract: A speaker recognition device includes: a feature calculator that calculates two or more acoustic features of a voice of an utterance obtained; a similarity calculator that calculates two or more similarities, each being a similarity between one of one or more speaker-specific features of a target speaker for recognition and one of the two or more acoustic features; a combination unit that combines the two or more similarities to obtain a combined value; and a determiner that determines whether a speaker of the utterance is the target speaker based on the combined value. Here, (i) at least two of the two or more acoustic features have different properties, (ii) at least two of the two or more similarities have different properties, or (iii) at least two of the two or more acoustic features have different properties and at least two of the two or more similarities have different properties.

Type: Application

Filed: September 19, 2019

Publication date: April 9, 2020

Inventor: Kousuke ITAKURA