Patents by Inventor Shuji KOMEIJI

Shuji KOMEIJI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11818300
    Abstract: The example embodiments provides a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.
    Type: Grant
    Filed: October 3, 2022
    Date of Patent: November 14, 2023
    Assignee: NEC CORPORATION
    Inventors: Hiroki Matsuura, Shuji Komeiji, Takayuki Shirokaze, Ryoji Yoshida
  • Publication number: 20230109867
    Abstract: A speech recognition apparatus (2000) acquires source data (10) representing an audio signal including an utterance. The speech recognition apparatus (2000) converts the source data (10) into a text string (30). The speech recognition apparatus (2000) generates a concatenated text (40) representing a content of an utterance by concatenating a text (32) included in the text string (30). Herein, texts (32) adjacent to each other in the text string (30) are such that parts of associated audio signals overlap each other on a time axis. At a time of concatenating texts (32) adjacent to each other, the speech recognition apparatus (2000) eliminates a trailing portion of a preceding text (32) and a leading portion of a succeeding text (32).
    Type: Application
    Filed: March 9, 2020
    Publication date: April 13, 2023
    Applicant: NEC Corporation
    Inventors: Shuji KOMEIJI, Hitoshi YAMAMOTO
  • Publication number: 20230082325
    Abstract: An utterance end detection apparatus (2000) acquires source data 10 representing an audio signal including one or more utterances. The utterance end detection apparatus (2000) converts the source data (10) into text data (30). The utterance end detection apparatus (2000) detects a conversion unit that analyzes text data (30), acquires source data, and converts the source data into text data, and an end of each utterance included in an audio signal represented by the source data (10).
    Type: Application
    Filed: February 26, 2020
    Publication date: March 16, 2023
    Applicant: NEC Corporation
    Inventors: Shuji KOMEIJI, Hitoshi YAMAMOTO
  • Publication number: 20230076709
    Abstract: A speech recognition apparatus (2000) acquires a plurality of pieces of audio data (20) for a source audio signal including an utterance. The speech recognition apparatus (2000) generates a candidate text group (30) for each of the plurality of pieces of audio data (20). The candidate text group (30) includes a plurality of candidate texts (32). The candidate text (32) is a candidate of a text representing a content of an utterance corresponding to the audio data (20), and represents a sentence. The speech recognition apparatus (2000) selects, based on a comparison result between the plurality of candidate text groups (30), for each of the pieces of audio data (20), a candidate text (32) representing a content of an utterance represented by the piece of audio data (20) from the candidate text group (30) generated for the piece of audio data (20).
    Type: Application
    Filed: March 16, 2020
    Publication date: March 9, 2023
    Applicant: NEC Corporation
    Inventors: Shuji KOMEIJI, Hitoshi YAMAMOTO
  • Publication number: 20230064137
    Abstract: A speech recognition apparatus 20, includes; a data acquisition unit 21 that acquires speech data and sensor data to be recognized; a speech recognition unit 22 that converts the acquired speech data into text data by applying the acquired speech data and the acquired sensor data to an acoustic model which is constructed by machine learning using an embedded vector generated from sensor data related to training data in addition to speech data to be the training data and teacher data to be the training data.
    Type: Application
    Filed: February 17, 2020
    Publication date: March 2, 2023
    Applicant: NEC Corporation
    Inventors: Shuji KOMEIJI, Yasuo IIMURA, Hitoshi YAMAMOTO
  • Publication number: 20230046763
    Abstract: A speech recognition apparatus (2000) includes a first model (10) and a second model (20). The first model (10) is learned by training data with an audio frame as input data, and with, as correct answer data, compressed character string data acquired by encoding character string data represented by the audio frame. The second model (20) is a learned decoder (44) acquired by learning an autoencoder (40) being constituted of an encoder (42) converting input character string data into compressed character string data, and the decoder (44) converting, into character string data, the compressed character string data output from the encoder. The speech recognition apparatus (2000) inputs an audio frame to the first model (10), inputs, to the second model (20), compressed character string data output from the first model (10), and thereby generates character string data corresponding to the audio frame.
    Type: Application
    Filed: February 19, 2020
    Publication date: February 16, 2023
    Applicant: NEC Corporation
    Inventors: Shuji Komeiji, Ryoji Yoshida, Hitoshi Yamamoto
  • Publication number: 20230027992
    Abstract: The example embodiments provides a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.
    Type: Application
    Filed: October 3, 2022
    Publication date: January 26, 2023
    Applicant: NEC Corporation
    Inventors: Hiroki MATSUURA, Shuji KOMEIJI, Takayuki SHIROKAZE, Ryoji YOSHIDA
  • Patent number: 11503161
    Abstract: The example embodiments provides a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.
    Type: Grant
    Filed: September 13, 2019
    Date of Patent: November 15, 2022
    Assignee: NEC CORPORATION
    Inventors: Hiroki Matsuura, Shuji Komeiji, Takayuki Shirokaze, Ryoji Yoshida
  • Publication number: 20220358787
    Abstract: According to an example embodiment, a display apparatus includes adjustment means for comparing a plurality of first feature points specified in a face region, which is extracted from a shot image obtained by shooting an inspection target person, of the inspection target person and a plurality of first feature points specified in a face region, which is extracted from image data on a person registered in a database, of the person and adjusting a positional relationship between the shot image of the inspection target person and a registered image of the person to be generated based on the image data, and display control means for displaying the shot image of the inspection target person and a mark representing a visually recognizable second feature point to be specified from the registered image of the person in an overlapping manner on a display device.
    Type: Application
    Filed: April 21, 2020
    Publication date: November 10, 2022
    Applicant: NEC Corporation
    Inventors: Yasushi HAMADA, Shuji KOMEIJI
  • Publication number: 20220335951
    Abstract: A speech recognition apparatus (100) includes: a speech reproduction unit (102) that reproduces, for each predetermined section, target speech for speech recognition being divided for each predetermined section; a speech recognition unit (104) that recognizes, for each target speech, spoken speech acquired by repeating the target speech by a user; a text information generation unit (106) that generates text information about the spoken speech, based on a recognition result of the speech recognition unit (104); and a storage processing unit (108) that stores, as learning data, identification information by the user, the spoken speech, and the recognition result corresponding to the spoken speech in association with one another, in which the speech recognition unit (104) performs recognition by using a recognition engine that learns the learning data by the user.
    Type: Application
    Filed: September 8, 2020
    Publication date: October 20, 2022
    Applicant: NEC Corporation
    Inventor: Shuji KOMEIJI
  • Publication number: 20220319512
    Abstract: A language inference apparatus (100) includes an acquisition unit (102) that acquires nationality information, a selection unit (104) that selects a language inference engine by using the acquired nationality information, and a determination unit (106) that determines a language used by a speaker, by analyzing voice information of the speaker using the selected language inference engine (110).
    Type: Application
    Filed: September 7, 2020
    Publication date: October 6, 2022
    Applicant: NEC Corporation
    Inventor: Shuji KOMEIJI
  • Publication number: 20220014628
    Abstract: The example embodiments provides a processing system (10) including: an acquisition unit (11) that acquires target speech data in which a target speech is recorded or a target feature value that indicates a feature of the target speech; an inference unit (12) that infers a language of the target speech, based on an inference model for inferring a language of a speech from speech data or a speech feature value and the target speech data or the target feature value; a result output unit (13) that outputs an inference result by the inference unit (12); a determination unit (14) that determines whether the inference result is correct; and a learning data output unit (15) that outputs the inference result determined to be correct by the determination unit (14) and the target speech data or the target feature value, as learning data for generating the inference model.
    Type: Application
    Filed: September 13, 2019
    Publication date: January 13, 2022
    Applicant: NEC Corporation
    Inventors: Hiroki MATSUURA, Shuji KOMEIJI, Takayuki SHIROKAZE, Ryoji YOSHIDA
  • Patent number: 10347273
    Abstract: A speech processing apparatus includes: an expectation value calculation unit configured to calculate, using an input signal spectrum and a speech model that models a feature quantity of speech, a spectrum expectation value which is an expectation value of a spectrum of an acoustic component included in the input signal spectrum; and an acoustic power estimation unit configured to estimate an acoustic power of the acoustic component of the input signal spectrum based on the input signal spectrum and the spectrum expectation value.
    Type: Grant
    Filed: December 8, 2015
    Date of Patent: July 9, 2019
    Assignee: NEC CORPORATION
    Inventors: Shuji Komeiji, Masanori Tsujikawa, Ryosuke Isotani
  • Publication number: 20170364854
    Abstract: The purpose of the present invention is to provide a technology which is capable of appropriately evaluating a person's conduct with respect to another person. Provided is an information processing device, comprising a recognition unit 11, a detection unit 12, and an evaluation unit 13. The recognition unit 11 recognizes an evaluation subject's conduct. The detection unit 12 detects a trigger which is a state of a person other than the evaluation subject which triggers the evaluation subject's conduct. Using the detected trigger and the result of recognition by the recognition unit 13 relating to the evaluation subject's conduct, the evaluation unit 13 evaluates the evaluation subject's conduct.
    Type: Application
    Filed: December 2, 2015
    Publication date: December 21, 2017
    Inventors: Terumi UMEMATSU, Ryosuke ISOTANI, Yoshifumi OMISHI, Masanori TSUJIKAWA, Makoto TERAO, Tasuku KITADE, Shuji KOMEIJI
  • Publication number: 20170337935
    Abstract: A speech processing apparatus includes: an expectation value calculation unit configured to calculate, using an input signal spectrum and a speech model that models a feature quantity of speech, a spectrum expectation value which is an expectation value of a spectrum of an acoustic component included in the input signal spectrum; and an acoustic power estimation unit configured to estimate an acoustic power of the acoustic component of the input signal spectrum based on the input signal spectrum and the spectrum expectation value.
    Type: Application
    Filed: December 8, 2015
    Publication date: November 23, 2017
    Applicant: NEC Corporation
    Inventors: Shuji KOMEIJI, Masanori TSUJIKAWA, Ryosuke ISOTANI
  • Patent number: 9449616
    Abstract: Provided are a noise reduction system that highly precisely estimates noise contained in an input signal and highly precisely reduces the noise contained in the input signal using the estimated noise, a speech detection system, a speech recognition system, a noise reduction method, and a noise reduction program.
    Type: Grant
    Filed: December 25, 2013
    Date of Patent: September 20, 2016
    Assignee: NEC CORPORATION
    Inventors: Masanori Tsujikawa, Ken Hanazawa, Shuji Komeiji
  • Patent number: 9245524
    Abstract: The present invention can increase the types of noises that can be dealt with enough to enable speech recognition with a speech recognition rate of high accuracy.
    Type: Grant
    Filed: November 10, 2011
    Date of Patent: January 26, 2016
    Assignee: NEC CORPORATION
    Inventors: Shuji Komeiji, Takayuki Arakawa, Takafumi Koshinaka
  • Publication number: 20150356983
    Abstract: Provided are a noise reduction system that highly precisely estimates noise contained in an input signal and highly precisely reduces the noise contained in the input signal using the estimated noise, a speech detection system, a speech recognition system, a noise reduction method, and a noise reduction program.
    Type: Application
    Filed: December 25, 2013
    Publication date: December 10, 2015
    Inventors: Masanori TSUJIKAWA, Ken HANAZAWA, Shuji KOMEIJI