Patents by Inventor Shuji KOMEIJI
Shuji KOMEIJI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11818300
Abstract: The example embodiments provide a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.
Type: Grant
Filed: October 3, 2022
Date of Patent: November 14, 2023
Assignee: NEC CORPORATION
Inventors: Hiroki Matsuura, Shuji Komeiji, Takayuki Shirokaze, Ryoji Yoshida
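The loop described in the abstract above (infer, output, check, keep confirmed pairs as learning data) can be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `collect_learning_data` and the callables `infer` and `is_correct` are hypothetical stand-ins for the inference unit (12) and determination unit (14), and the abstract does not say how correctness is determined.

```python
from typing import Callable, Iterable, List, Tuple, TypeVar

Features = TypeVar("Features")


def collect_learning_data(
    samples: Iterable[Features],
    infer: Callable[[Features], str],
    is_correct: Callable[[Features, str], bool],
) -> List[Tuple[Features, str]]:
    """Return (speech data or feature value, language) pairs confirmed as correct.

    `infer` and `is_correct` are hypothetical placeholders; the abstract only
    states that an inference is made and that its correctness is determined.
    """
    learning_data: List[Tuple[Features, str]] = []
    for features in samples:
        language = infer(features)          # inference result
        print(language)                     # stands in for the result output unit
        if is_correct(features, language):  # determination step
            # Only confirmed results are kept as learning data.
            learning_data.append((features, language))
    return learning_data
```

The returned pairs would then be available for regenerating the inference model; that retraining step is outside the scope of this sketch.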
-
Publication number: 20230109867
Abstract: A speech recognition apparatus (2000) acquires source data (10) representing an audio signal including an utterance. The speech recognition apparatus (2000) converts the source data (10) into a text string (30). The speech recognition apparatus (2000) generates a concatenated text (40) representing a content of an utterance by concatenating a text (32) included in the text string (30). Herein, texts (32) adjacent to each other in the text string (30) are such that parts of associated audio signals overlap each other on a time axis. At a time of concatenating texts (32) adjacent to each other, the speech recognition apparatus (2000) eliminates a trailing portion of a preceding text (32) and a leading portion of a succeeding text (32).
Type: Application
Filed: March 9, 2020
Publication date: April 13, 2023
Applicant: NEC Corporation
Inventors: Shuji KOMEIJI, Hitoshi YAMAMOTO
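One way to picture the trimming described in the abstract above is the sketch below. It assumes, purely for illustration, that each text comes with word-level timestamps and that the cut point between two adjacent texts is the midpoint of their overlapping audio; the abstract itself does not say how the eliminated portions are chosen. The names `Word`, `Chunk`, and `concatenate` are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    text: str
    start: float  # seconds on the time axis of the source audio
    end: float


@dataclass
class Chunk:
    start: float  # start of the audio section this text was recognized from
    end: float    # end of that section; overlaps the next chunk's start
    words: List[Word]


def concatenate(chunks: List[Chunk]) -> str:
    """Join adjacent texts, dropping words on the far side of each overlap midpoint."""
    kept: List[str] = []
    for i, chunk in enumerate(chunks):
        # Assumed rule: cut at the midpoint of the overlap with each neighbour.
        lo = -math.inf if i == 0 else (chunk.start + chunks[i - 1].end) / 2
        hi = math.inf if i == len(chunks) - 1 else (chunks[i + 1].start + chunk.end) / 2
        for word in chunk.words:
            mid = (word.start + word.end) / 2
            if lo <= mid < hi:
                kept.append(word.text)
    return " ".join(kept)
```

With this rule, a word truncated at the end of one audio section is discarded in favour of the complete word recognized in the next section, and vice versa at the start of a section.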
-
Publication number: 20230082325
Abstract: An utterance end detection apparatus (2000) acquires source data (10) representing an audio signal including one or more utterances. The utterance end detection apparatus (2000) converts the source data (10) into text data (30). The utterance end detection apparatus (2000) analyzes the text data (30) and thereby detects an end of each utterance included in an audio signal represented by the source data (10).
Type: Application
Filed: February 26, 2020
Publication date: March 16, 2023
Applicant: NEC Corporation
Inventors: Shuji KOMEIJI, Hitoshi YAMAMOTO
-
Publication number: 20230076709
Abstract: A speech recognition apparatus (2000) acquires a plurality of pieces of audio data (20) for a source audio signal including an utterance. The speech recognition apparatus (2000) generates a candidate text group (30) for each of the plurality of pieces of audio data (20). The candidate text group (30) includes a plurality of candidate texts (32). The candidate text (32) is a candidate of a text representing a content of an utterance corresponding to the audio data (20), and represents a sentence. The speech recognition apparatus (2000) selects, based on a comparison result between the plurality of candidate text groups (30), for each of the pieces of audio data (20), a candidate text (32) representing a content of an utterance represented by the piece of audio data (20) from the candidate text group (30) generated for the piece of audio data (20).
Type: Application
Filed: March 16, 2020
Publication date: March 9, 2023
Applicant: NEC Corporation
Inventors: Shuji KOMEIJI, Hitoshi YAMAMOTO
-
Publication number: 20230064137
Abstract: A speech recognition apparatus 20 includes: a data acquisition unit 21 that acquires speech data and sensor data to be recognized; and a speech recognition unit 22 that converts the acquired speech data into text data by applying the acquired speech data and the acquired sensor data to an acoustic model which is constructed by machine learning using an embedded vector generated from sensor data related to training data in addition to speech data to be the training data and teacher data to be the training data.
Type: Application
Filed: February 17, 2020
Publication date: March 2, 2023
Applicant: NEC Corporation
Inventors: Shuji KOMEIJI, Yasuo IIMURA, Hitoshi YAMAMOTO
-
Publication number: 20230046763
Abstract: A speech recognition apparatus (2000) includes a first model (10) and a second model (20). The first model (10) is learned by training data with an audio frame as input data, and with, as correct answer data, compressed character string data acquired by encoding character string data represented by the audio frame. The second model (20) is a learned decoder (44) acquired by learning an autoencoder (40) being constituted of an encoder (42) converting input character string data into compressed character string data, and the decoder (44) converting, into character string data, the compressed character string data output from the encoder. The speech recognition apparatus (2000) inputs an audio frame to the first model (10), inputs, to the second model (20), compressed character string data output from the first model (10), and thereby generates character string data corresponding to the audio frame.
Type: Application
Filed: February 19, 2020
Publication date: February 16, 2023
Applicant: NEC Corporation
Inventors: Shuji Komeiji, Ryoji Yoshida, Hitoshi Yamamoto
-
Publication number: 20230027992
Abstract: The example embodiments provide a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.
Type: Application
Filed: October 3, 2022
Publication date: January 26, 2023
Applicant: NEC Corporation
Inventors: Hiroki MATSUURA, Shuji KOMEIJI, Takayuki SHIROKAZE, Ryoji YOSHIDA
-
Patent number: 11503161
Abstract: The example embodiments provide a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.
Type: Grant
Filed: September 13, 2019
Date of Patent: November 15, 2022
Assignee: NEC CORPORATION
Inventors: Hiroki Matsuura, Shuji Komeiji, Takayuki Shirokaze, Ryoji Yoshida
-
Publication number: 20220358787
Abstract: According to an example embodiment, a display apparatus includes adjustment means for comparing a plurality of first feature points specified in a face region, which is extracted from a shot image obtained by shooting an inspection target person, of the inspection target person and a plurality of first feature points specified in a face region, which is extracted from image data on a person registered in a database, of the person and adjusting a positional relationship between the shot image of the inspection target person and a registered image of the person to be generated based on the image data, and display control means for displaying the shot image of the inspection target person and a mark representing a visually recognizable second feature point to be specified from the registered image of the person in an overlapping manner on a display device.
Type: Application
Filed: April 21, 2020
Publication date: November 10, 2022
Applicant: NEC Corporation
Inventors: Yasushi HAMADA, Shuji KOMEIJI
-
Publication number: 20220335951
Abstract: A speech recognition apparatus (100) includes: a speech reproduction unit (102) that reproduces, for each predetermined section, target speech for speech recognition being divided for each predetermined section; a speech recognition unit (104) that recognizes, for each target speech, spoken speech acquired by repeating the target speech by a user; a text information generation unit (106) that generates text information about the spoken speech, based on a recognition result of the speech recognition unit (104); and a storage processing unit (108) that stores, as learning data, identification information by the user, the spoken speech, and the recognition result corresponding to the spoken speech in association with one another, in which the speech recognition unit (104) performs recognition by using a recognition engine that learns the learning data by the user.
Type: Application
Filed: September 8, 2020
Publication date: October 20, 2022
Applicant: NEC Corporation
Inventor: Shuji KOMEIJI
-
Publication number: 20220319512
Abstract: A language inference apparatus (100) includes an acquisition unit (102) that acquires nationality information, a selection unit (104) that selects a language inference engine by using the acquired nationality information, and a determination unit (106) that determines a language used by a speaker, by analyzing voice information of the speaker using the selected language inference engine (110).
Type: Application
Filed: September 7, 2020
Publication date: October 6, 2022
Applicant: NEC Corporation
Inventor: Shuji KOMEIJI
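The acquire, select, determine flow in the abstract above might look something like the following sketch. Everything concrete here is an assumption for illustration: the engine table, the country codes, the candidate language lists, and the idea that an engine picks the best-scoring language from per-language scores (`voice_scores`, a stand-in for real analysis of voice information) are all hypothetical and not taken from the application.

```python
from typing import Callable, Dict, Mapping, Sequence

Engine = Callable[[Mapping[str, float]], str]


def make_engine(candidates: Sequence[str]) -> Engine:
    """Build a toy engine that only considers the given candidate languages."""
    def engine(voice_scores: Mapping[str, float]) -> str:
        # Pick the candidate with the highest score; unseen languages rank last.
        return max(candidates, key=lambda lang: voice_scores.get(lang, float("-inf")))
    return engine


# Hypothetical mapping from nationality information to a specialised engine.
ENGINES: Dict[str, Engine] = {
    "CH": make_engine(["de", "fr", "it"]),
    "CA": make_engine(["en", "fr"]),
}
DEFAULT_ENGINE: Engine = make_engine(["en", "zh", "es", "fr", "de", "it", "ja"])


def determine_language(nationality: str, voice_scores: Mapping[str, float]) -> str:
    """Select an engine from the nationality, then determine the spoken language."""
    engine = ENGINES.get(nationality, DEFAULT_ENGINE)
    return engine(voice_scores)
```

Restricting each engine to a short candidate list is one plausible reason to select by nationality: fewer candidates leave less room for confusion between acoustically similar languages.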
-
Publication number: 20220014628
Abstract: The example embodiments provide a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.
Type: Application
Filed: September 13, 2019
Publication date: January 13, 2022
Applicant: NEC Corporation
Inventors: Hiroki MATSUURA, Shuji KOMEIJI, Takayuki SHIROKAZE, Ryoji YOSHIDA
-
Patent number: 10347273
Abstract: A speech processing apparatus includes: an expectation value calculation unit configured to calculate, using an input signal spectrum and a speech model that models a feature quantity of speech, a spectrum expectation value which is an expectation value of a spectrum of an acoustic component included in the input signal spectrum; and an acoustic power estimation unit configured to estimate an acoustic power of the acoustic component of the input signal spectrum based on the input signal spectrum and the spectrum expectation value.
Type: Grant
Filed: December 8, 2015
Date of Patent: July 9, 2019
Assignee: NEC CORPORATION
Inventors: Shuji Komeiji, Masanori Tsujikawa, Ryosuke Isotani
-
Publication number: 20170364854
Abstract: The purpose of the present invention is to provide a technology which is capable of appropriately evaluating a person's conduct with respect to another person. Provided is an information processing device, comprising a recognition unit 11, a detection unit 12, and an evaluation unit 13. The recognition unit 11 recognizes an evaluation subject's conduct. The detection unit 12 detects a trigger which is a state of a person other than the evaluation subject which triggers the evaluation subject's conduct. Using the detected trigger and the result of recognition by the recognition unit 11 relating to the evaluation subject's conduct, the evaluation unit 13 evaluates the evaluation subject's conduct.
Type: Application
Filed: December 2, 2015
Publication date: December 21, 2017
Inventors: Terumi UMEMATSU, Ryosuke ISOTANI, Yoshifumi OMISHI, Masanori TSUJIKAWA, Makoto TERAO, Tasuku KITADE, Shuji KOMEIJI
-
Publication number: 20170337935
Abstract: A speech processing apparatus includes: an expectation value calculation unit configured to calculate, using an input signal spectrum and a speech model that models a feature quantity of speech, a spectrum expectation value which is an expectation value of a spectrum of an acoustic component included in the input signal spectrum; and an acoustic power estimation unit configured to estimate an acoustic power of the acoustic component of the input signal spectrum based on the input signal spectrum and the spectrum expectation value.
Type: Application
Filed: December 8, 2015
Publication date: November 23, 2017
Applicant: NEC Corporation
Inventors: Shuji KOMEIJI, Masanori TSUJIKAWA, Ryosuke ISOTANI
-
Patent number: 9449616
Abstract: Provided are a noise reduction system that highly precisely estimates noise contained in an input signal and highly precisely reduces the noise contained in the input signal using the estimated noise, a speech detection system, a speech recognition system, a noise reduction method, and a noise reduction program.
Type: Grant
Filed: December 25, 2013
Date of Patent: September 20, 2016
Assignee: NEC CORPORATION
Inventors: Masanori Tsujikawa, Ken Hanazawa, Shuji Komeiji
-
Patent number: 9245524
Abstract: The present invention can increase the types of noises that can be dealt with enough to enable speech recognition with a speech recognition rate of high accuracy.
Type: Grant
Filed: November 10, 2011
Date of Patent: January 26, 2016
Assignee: NEC CORPORATION
Inventors: Shuji Komeiji, Takayuki Arakawa, Takafumi Koshinaka
-
Publication number: 20150356983
Abstract: Provided are a noise reduction system that highly precisely estimates noise contained in an input signal and highly precisely reduces the noise contained in the input signal using the estimated noise, a speech detection system, a speech recognition system, a noise reduction method, and a noise reduction program.
Type: Application
Filed: December 25, 2013
Publication date: December 10, 2015
Inventors: Masanori TSUJIKAWA, Ken HANAZAWA, Shuji KOMEIJI