Patents by Inventor Shuji KOMEIJI

Shuji KOMEIJI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Processing system, processing method, and non-transitory storage medium

Patent number: 11818300

Abstract: The example embodiments provide a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.

Type: Grant

Filed: October 3, 2022

Date of Patent: November 14, 2023

Assignee: NEC CORPORATION

Inventors: Hiroki Matsuura, Shuji Komeiji, Takayuki Shirokaze, Ryoji Yoshida

SPEECH RECOGNITION APPARATUS, CONTROL METHOD, AND NON-TRANSITORY STORAGE MEDIUM

Publication number: 20230109867

Abstract: A speech recognition apparatus (2000) acquires source data (10) representing an audio signal including an utterance. The speech recognition apparatus (2000) converts the source data (10) into a text string (30). The speech recognition apparatus (2000) generates a concatenated text (40) representing a content of an utterance by concatenating a text (32) included in the text string (30). Herein, texts (32) adjacent to each other in the text string (30) are such that parts of associated audio signals overlap each other on a time axis. At a time of concatenating texts (32) adjacent to each other, the speech recognition apparatus (2000) eliminates a trailing portion of a preceding text (32) and a leading portion of a succeeding text (32).
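The concatenation step in this abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that every pair of adjacent texts overlaps by the same known number of characters and that the overlap is split evenly between the two texts; the function name `concatenate_overlapping` and the parameter `overlap_chars` are hypothetical and do not come from the application.

```python
from typing import List


def concatenate_overlapping(texts: List[str], overlap_chars: int) -> str:
    """Join texts whose neighbours overlap by `overlap_chars` characters.

    Assumption (not from the application): the overlap length is fixed and
    known in advance. Part of the overlap is removed from the end of the
    preceding text and the rest from the start of the succeeding text.
    """
    if overlap_chars < 0:
        raise ValueError("overlap_chars must be non-negative")

    tail_trim = overlap_chars // 2          # dropped from the preceding text
    head_trim = overlap_chars - tail_trim   # dropped from the succeeding text

    pieces = []
    last_index = len(texts) - 1
    for i, text in enumerate(texts):
        # The first text keeps its beginning; the last text keeps its end.
        start = head_trim if i > 0 else 0
        end = len(text) - tail_trim if i < last_index else len(text)
        pieces.append(text[start:end])
    return "".join(pieces)


if __name__ == "__main__":
    # Three windows over "abcdefghijklm", each sharing 4 characters
    # with its neighbour.
    chunks = ["abcdefg", "defghij", "ghijklm"]
    print(concatenate_overlapping(chunks, 4))
```

Trimming both sides of each boundary, rather than only one, reflects the abstract's description of eliminating a trailing portion of the preceding text and a leading portion of the succeeding text. A real recognizer would likely work with word or time alignments instead of a fixed character count.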

Type: Application

Filed: March 9, 2020

Publication date: April 13, 2023

Applicant: NEC Corporation

Inventors: Shuji KOMEIJI, Hitoshi YAMAMOTO

UTTERANCE END DETECTION APPARATUS, CONTROL METHOD, AND NON-TRANSITORY STORAGE MEDIUM

Publication number: 20230082325

Abstract: An utterance end detection apparatus (2000) acquires source data (10) representing an audio signal including one or more utterances. The utterance end detection apparatus (2000) converts the source data (10) into text data (30). The utterance end detection apparatus (2000) analyzes the text data (30) and thereby detects an end of each utterance included in the audio signal represented by the source data (10).

Type: Application

Filed: February 26, 2020

Publication date: March 16, 2023

Applicant: NEC Corporation

Inventors: Shuji KOMEIJI, Hitoshi YAMAMOTO

SPEECH RECOGNITION APPARATUS, CONTROL METHOD, AND NON-TRANSITORY STORAGE MEDIUM

Publication number: 20230076709

Abstract: A speech recognition apparatus (2000) acquires a plurality of pieces of audio data (20) for a source audio signal including an utterance. The speech recognition apparatus (2000) generates a candidate text group (30) for each of the plurality of pieces of audio data (20). The candidate text group (30) includes a plurality of candidate texts (32). The candidate text (32) is a candidate of a text representing a content of an utterance corresponding to the audio data (20), and represents a sentence. The speech recognition apparatus (2000) selects, based on a comparison result between the plurality of candidate text groups (30), for each of the pieces of audio data (20), a candidate text (32) representing a content of an utterance represented by the piece of audio data (20) from the candidate text group (30) generated for the piece of audio data (20).

Type: Application

Filed: March 16, 2020

Publication date: March 9, 2023

Applicant: NEC Corporation

Inventors: Shuji KOMEIJI, Hitoshi YAMAMOTO

SPEECH RECOGNITION APPARATUS, ACOUSTIC MODEL LEARNING APPARATUS, SPEECH RECOGNITION METHOD, AND COMPUTER-READABLE RECORDING MEDIUM

Publication number: 20230064137

Abstract: A speech recognition apparatus 20 includes: a data acquisition unit 21 that acquires speech data and sensor data to be recognized; and a speech recognition unit 22 that converts the acquired speech data into text data by applying the acquired speech data and the acquired sensor data to an acoustic model which is constructed by machine learning using an embedded vector generated from sensor data related to training data in addition to speech data to be the training data and teacher data to be the training data.

Type: Application

Filed: February 17, 2020

Publication date: March 2, 2023

Applicant: NEC Corporation

Inventors: Shuji KOMEIJI, Yasuo IIMURA, Hitoshi YAMAMOTO

SPEECH RECOGNITION APPARATUS, CONTROL METHOD, AND NON-TRANSITORY STORAGE MEDIUM

Publication number: 20230046763

Abstract: A speech recognition apparatus (2000) includes a first model (10) and a second model (20). The first model (10) is learned by training data with an audio frame as input data, and with, as correct answer data, compressed character string data acquired by encoding character string data represented by the audio frame. The second model (20) is a learned decoder (44) acquired by learning an autoencoder (40) being constituted of an encoder (42) converting input character string data into compressed character string data, and the decoder (44) converting, into character string data, the compressed character string data output from the encoder. The speech recognition apparatus (2000) inputs an audio frame to the first model (10), inputs, to the second model (20), compressed character string data output from the first model (10), and thereby generates character string data corresponding to the audio frame.

Type: Application

Filed: February 19, 2020

Publication date: February 16, 2023

Applicant: NEC Corporation

Inventors: Shuji Komeiji, Ryoji Yoshida, Hitoshi Yamamoto

PROCESSING SYSTEM, PROCESSING METHOD, AND NON-TRANSITORY STORAGE MEDIUM

Publication number: 20230027992

Abstract: The example embodiments provide a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.

Type: Application

Filed: October 3, 2022

Publication date: January 26, 2023

Applicant: NEC Corporation

Inventors: Hiroki MATSUURA, Shuji KOMEIJI, Takayuki SHIROKAZE, Ryoji YOSHIDA

Processing system, processing method, and non-transitory storage medium

Patent number: 11503161

Abstract: The example embodiments provide a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.

Type: Grant

Filed: September 13, 2019

Date of Patent: November 15, 2022

Assignee: NEC CORPORATION

Inventors: Hiroki Matsuura, Shuji Komeiji, Takayuki Shirokaze, Ryoji Yoshida

COMPARISON APPARATUS, COMPARISON SYSTEM, COMPARISON METHOD, AND NON-TRANSITORY COMPUTER-READABLE MEDIUM STORING COMPARISON PROGRAM

Publication number: 20220358787

Abstract: According to an example embodiment, a display apparatus includes adjustment means for comparing a plurality of first feature points specified in a face region, which is extracted from a shot image obtained by shooting an inspection target person, of the inspection target person and a plurality of first feature points specified in a face region, which is extracted from image data on a person registered in a database, of the person and adjusting a positional relationship between the shot image of the inspection target person and a registered image of the person to be generated based on the image data, and display control means for displaying the shot image of the inspection target person and a mark representing a visually recognizable second feature point to be specified from the registered image of the person in an overlapping manner on a display device.

Type: Application

Filed: April 21, 2020

Publication date: November 10, 2022

Applicant: NEC Corporation

Inventors: Yasushi HAMADA, Shuji KOMEIJI

SPEECH RECOGNITION DEVICE, SPEECH RECOGNITION METHOD, AND PROGRAM

Publication number: 20220335951

Abstract: A speech recognition apparatus (100) includes: a speech reproduction unit (102) that reproduces, for each predetermined section, target speech for speech recognition being divided for each predetermined section; a speech recognition unit (104) that recognizes, for each target speech, spoken speech acquired by repeating the target speech by a user; a text information generation unit (106) that generates text information about the spoken speech, based on a recognition result of the speech recognition unit (104); and a storage processing unit (108) that stores, as learning data, identification information by the user, the spoken speech, and the recognition result corresponding to the spoken speech in association with one another, in which the speech recognition unit (104) performs recognition by using a recognition engine that learns the learning data by the user.

Type: Application

Filed: September 8, 2020

Publication date: October 20, 2022

Applicant: NEC Corporation

Inventor: Shuji KOMEIJI

LANGUAGE INFERENCE APPARATUS, LANGUAGE INFERENCE METHOD, AND PROGRAM

Publication number: 20220319512

Abstract: A language inference apparatus (100) includes an acquisition unit (102) that acquires nationality information, a selection unit (104) that selects a language inference engine by using the acquired nationality information, and a determination unit (106) that determines a language used by a speaker, by analyzing voice information of the speaker using the selected language inference engine (110).
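The three units named in this abstract (acquisition, selection, determination) can be pictured with a small sketch. Everything concrete here is an assumption made for illustration: the engines are stand-in functions that return fixed language codes, and the names `ENGINES`, `default_engine`, and `infer_language` are hypothetical rather than taken from the application.

```python
from typing import Callable, Dict

# An "engine" is modelled as any callable that maps raw voice data
# to a language code. Real engines would analyse the audio.
Engine = Callable[[bytes], str]


def japanese_focused_engine(voice: bytes) -> str:
    # Placeholder: a real engine would inspect `voice`.
    return "ja"


def default_engine(voice: bytes) -> str:
    # Placeholder fallback used when no engine matches the nationality.
    return "en"


# Hypothetical mapping from nationality information to an engine.
ENGINES: Dict[str, Engine] = {
    "JP": japanese_focused_engine,
}


def infer_language(
    nationality: str,
    voice: bytes,
    engines: Dict[str, Engine],
    fallback: Engine,
) -> str:
    """Select an engine from the nationality, then run it on the voice data."""
    engine = engines.get(nationality, fallback)  # selection step
    return engine(voice)                         # determination step


if __name__ == "__main__":
    print(infer_language("JP", b"", ENGINES, default_engine))
    print(infer_language("FR", b"", ENGINES, default_engine))
```

Passing the engine table and the fallback as arguments keeps the selection rule separate from the engines themselves, so either can be swapped without touching the other.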

Type: Application

Filed: September 7, 2020

Publication date: October 6, 2022

Applicant: NEC Corporation

Inventor: Shuji KOMEIJI

PROCESSING SYSTEM, PROCESSING METHOD, AND NON-TRANSITORY STORAGE MEDIUM

Publication number: 20220014628

Abstract: The example embodiments provide a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.

Type: Application

Filed: September 13, 2019

Publication date: January 13, 2022

Applicant: NEC Corporation

Inventors: Hiroki MATSUURA, Shuji KOMEIJI, Takayuki SHIROKAZE, Ryoji YOSHIDA

Speech processing apparatus, speech processing method, and recording medium

Patent number: 10347273

Abstract: A speech processing apparatus includes: an expectation value calculation unit configured to calculate, using an input signal spectrum and a speech model that models a feature quantity of speech, a spectrum expectation value which is an expectation value of a spectrum of an acoustic component included in the input signal spectrum; and an acoustic power estimation unit configured to estimate an acoustic power of the acoustic component of the input signal spectrum based on the input signal spectrum and the spectrum expectation value.

Type: Grant

Filed: December 8, 2015

Date of Patent: July 9, 2019

Assignee: NEC CORPORATION

Inventors: Shuji Komeiji, Masanori Tsujikawa, Ryosuke Isotani

INFORMATION PROCESSING DEVICE, CONDUCT EVALUATION METHOD, AND PROGRAM STORAGE MEDIUM

Publication number: 20170364854

Abstract: The purpose of the present invention is to provide a technology which is capable of appropriately evaluating a person's conduct with respect to another person. Provided is an information processing device, comprising a recognition unit 11, a detection unit 12, and an evaluation unit 13. The recognition unit 11 recognizes an evaluation subject's conduct. The detection unit 12 detects a trigger which is a state of a person other than the evaluation subject which triggers the evaluation subject's conduct. Using the detected trigger and the result of recognition by the recognition unit 11 relating to the evaluation subject's conduct, the evaluation unit 13 evaluates the evaluation subject's conduct.

Type: Application

Filed: December 2, 2015

Publication date: December 21, 2017

Inventors: Terumi UMEMATSU, Ryosuke ISOTANI, Yoshifumi OMISHI, Masanori TSUJIKAWA, Makoto TERAO, Tasuku KITADE, Shuji KOMEIJI

SPEECH PROCESSING APPARATUS, SPEECH PROCESSING METHOD, AND RECORDING MEDIUM

Publication number: 20170337935

Abstract: A speech processing apparatus includes: an expectation value calculation unit configured to calculate, using an input signal spectrum and a speech model that models a feature quantity of speech, a spectrum expectation value which is an expectation value of a spectrum of an acoustic component included in the input signal spectrum; and an acoustic power estimation unit configured to estimate an acoustic power of the acoustic component of the input signal spectrum based on the input signal spectrum and the spectrum expectation value.

Type: Application

Filed: December 8, 2015

Publication date: November 23, 2017

Applicant: NEC Corporation

Inventors: Shuji KOMEIJI, Masanori TSUJIKAWA, Ryosuke ISOTANI

Noise reduction system, speech detection system, speech recognition system, noise reduction method, and noise reduction program

Patent number: 9449616

Abstract: Provided are a noise reduction system that highly precisely estimates noise contained in an input signal and highly precisely reduces the noise contained in the input signal using the estimated noise, a speech detection system, a speech recognition system, a noise reduction method, and a noise reduction program.

Type: Grant

Filed: December 25, 2013

Date of Patent: September 20, 2016

Assignee: NEC CORPORATION

Inventors: Masanori Tsujikawa, Ken Hanazawa, Shuji Komeiji

Speech recognition device, speech recognition method, and computer readable medium

Patent number: 9245524

Abstract: The present invention can increase the types of noises that can be dealt with enough to enable speech recognition with a speech recognition rate of high accuracy.

Type: Grant

Filed: November 10, 2011

Date of Patent: January 26, 2016

Assignee: NEC CORPORATION

Inventors: Shuji Komeiji, Takayuki Arakawa, Takafumi Koshinaka

NOISE REDUCTION SYSTEM, SPEECH DETECTION SYSTEM, SPEECH RECOGNITION SYSTEM, NOISE REDUCTION METHOD, AND NOISE REDUCTION PROGRAM

Publication number: 20150356983

Abstract: Provided are a noise reduction system that highly precisely estimates noise contained in an input signal and highly precisely reduces the noise contained in the input signal using the estimated noise, a speech detection system, a speech recognition system, a noise reduction method, and a noise reduction program.

Type: Application

Filed: December 25, 2013

Publication date: December 10, 2015

Inventors: Masanori TSUJIKAWA, Ken HANAZAWA, Shuji KOMEIJI