Patents by Inventor Takehiko Kagoshima

Takehiko Kagoshima has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speech recognition apparatus, method and non-transitory computer-readable storage medium

Patent number: 11978441

Abstract: According to one embodiment, a speech recognition apparatus includes processing circuitry. The processing circuitry generates, based on sensor information, environmental information relating to an environment in which the sensor information has been acquired, generates, based on the environmental information and generic speech data, an adapted acoustic model obtained by adapting a base acoustic model to the environment, acquires speech uttered in the environment as input speech data, and subjects the input speech data to a speech recognition process using the adapted acoustic model.

Type: Grant

Filed: February 26, 2021

Date of Patent: May 7, 2024

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventors: Daichi Hayakawa, Takehiko Kagoshima, Kenji Iwata
Signal processing apparatus and non-transitory computer readable medium

Patent number: 11908487

Abstract: A signal processing apparatus according an embodiment includes an acquisition unit and an application unit. The acquisition unit acquires M detection signals output from M detector devices having N-fold symmetry (M is an integer equal to or greater than 2, and N is an integer equal to or greater than 2). Each of the M detector devices detects original signals generated from K signal sources (K is an integer equal to or greater than 2) having the N-fold symmetry. The application unit applies a trained neural network to M input vectors corresponding to the M detection signals and outputs K output vectors. The same parameter is set to, of multiple weights included in a weight matrix of the trained neural network, weights that are commutative based on the N-fold symmetry.

Type: Grant

Filed: February 26, 2021

Date of Patent: February 20, 2024

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventors: Takehiko Kagoshima, Daichi Hayakawa
THRESHOLD GENERATION METHOD, THRESHOLD GENERATION DEVICE, AND COMPUTER PROGRAM PRODUCT

Publication number: 20240029713

Abstract: According to one embodiment, a threshold generation method includes generating a threshold to be set in a keyword detection device. The keyword detection device detects, based on a result of comparison of the threshold with a keyword score representing a degree of similarity between voice included in an audio signal and a preset keyword, whether the audio signal includes the keyword. The threshold generation method includes: calculating keyword scores representing degrees of similarity between the keyword and a plurality of reference audio signals; calculating parameters representing a distribution of a score set including the keyword scores calculated based on the reference audio signals; and generating the threshold based on the parameters representing the distribution of the score set.

Type: Application

Filed: February 13, 2023

Publication date: January 25, 2024

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventor: Takehiko KAGOSHIMA
SPEECH RECOGNITION APPARATUS AND METHOD

Publication number: 20220383860

Abstract: According to one embodiment, a speech recognition apparatus includes processing circuitry. The processing circuitry generates a plurality of augmented speech data, based on input speech data, generates a plurality of acoustic scores, based on the plurality of augmented speech data and an acoustic model, generates a plurality of adjusted acoustic scores by resampling the acoustic scores, generates an integrated acoustic score by integrating the adjusted acoustic scores, generates an integrated lattice, based on the integrated acoustic score, a pronunciation dictionary, and a language model, and searches a speech recognition result with a highest likelihood from the integrated lattice.

Type: Application

Filed: February 28, 2022

Publication date: December 1, 2022

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Daichi HAYAKAWA, Takehiko KAGOSHIMA
Moving body operation support system

Patent number: 11453411

Abstract: According to one embodiment, a moving body operation support system includes an acquirer, a microphone, a transceiver, and a processor. The acquirer acquires moving body information relating to a state of a moving body. The microphone is provided in the moving body. The transceiver performs transmitting to and receiving from an operator communication device. The processor implements one of a first operation or a second operation based on the moving body information and instruction information. The instruction information relates to an instruction based on a sound acquired by the microphone. The instruction is of a user riding in the moving body. In the first operation, the processor causes the moving body to perform an operation corresponding to the instruction information. In the second operation, the processor enables communication between the user and the operator by the transmitting and receiving between the transceiver and the operator communication device.

Type: Grant

Filed: February 28, 2018

Date of Patent: September 27, 2022

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventors: Takehiko Kagoshima, Noriko Yamanaka, Tatsuma Ishihara
Signal processing apparatus and signal processing method

Patent number: 11395061

Abstract: According to one embodiment, a signal processing apparatus includes the following units. The transform unit transforms a first signal into a time-frequency domain to obtain a second signal, the first signal obtained by detecting sound at each of different positions. The first calculation unit calculates a first spatial correlation matrix based on the second signal. The second calculation unit calculates a second spatial correlation matrix based on a third signal obtained by delaying the second signal by a predetermined time. The spatial filter unit generates a spatial filter based on the first spatial correlation matrix and the second spatial correlation matrix, and filters the second signal by using the spatial filter.

Type: Grant

Filed: February 20, 2020

Date of Patent: July 19, 2022

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventor: Takehiko Kagoshima
DICTIONARY EDITING APPARATUS AND DICTIONARY EDITING METHOD

Publication number: 20220138405

Abstract: According to one embodiment, a dictionary editing apparatus includes processing circuitry. The processing circuitry is configured to extract words from text data, append character pronunciations to the extracted words, and specify, when a modification is made to word information including the extracted words and the appended character pronunciations, a modification candidate that is a word or character pronunciation to be modified in relation to the modification.

Type: Application

Filed: August 26, 2021

Publication date: May 5, 2022

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Kenji IWATA, Takehiko KAGOSHIMA
DIFFERENCE EXTRACTION DEVICE, METHOD AND PROGRAM

Publication number: 20220138420

Abstract: According to one embodiment, a difference extraction device includes processing circuitry. The processing circuitry acquires a text in which an input notation string is described. The processing circuitry converts the input notation string into a pronunciation string. The processing circuitry executes a pronunciation string conversion process in which the pronunciation string is converted into an output notation string. The processing circuitry extracts a difference by comparing the input notation string and the output notation string with each other.

Type: Application

Filed: August 31, 2021

Publication date: May 5, 2022

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Daiki TANAKA, Takehiko KAGOSHIMA, Kenji IWATA, Hiroshi FUJIMURA
DICTIONARY EDITING APPARATUS, DICTIONARY EDITING METHOD, AND RECORDING MEDIUM RECORDING THEREON DICTIONARY EDITING PROGRAM

Publication number: 20220138416

Abstract: According to one embodiment, a dictionary editing apparatus includes a processor with hardware. The processor extracts a word from text data. The processor appends a character pronunciation to the extracted word. The processor calculates at least one of a first reliability denoting a reliability of the extracted word and a second reliability denoting a reliability of the appended character pronunciation. The processor specifies a word to be a modification candidate in accordance with the first reliability. The processor specifies a character pronunciation to be a modification candidate in accordance with the second reliability.

Type: Application

Filed: August 26, 2021

Publication date: May 5, 2022

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Kenji IWATA, Takehiko KAGOSHIMA
Acoustic signal processing with neural network using amplitude, phase, and frequency

Patent number: 11282505

Abstract: According to one embodiment, a signal generation device includes one or more processors. The processors convert an acoustic signal and output amplitude and phase at a plurality of frequencies. The processors, for each of a plurality of nodes of a hidden layer included in a neural network that treats the amplitude and the phase as input, obtain frequency based on a plurality of weights used in arithmetic operation of the node. The processors generate an acoustic signal based on the plurality of obtained frequencies and based on amplitude and phase corresponding to each of the plurality of nodes.

Type: Grant

Filed: March 8, 2019

Date of Patent: March 22, 2022

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventors: Daichi Hayakawa, Takehiko Kagoshima, Hiroshi Fujimura
SIGNAL PROCESSING APPARATUS AND NON-TRANSITORY COMPUTER READABLE MEDIUM

Publication number: 20220084539

Abstract: A signal processing apparatus according an embodiment includes an acquisition unit and an application unit. The acquisition unit acquires M detection signals output from M detector devices having N-fold symmetry (M is an integer equal to or greater than 2, and N is an integer equal to or greater than 2). Each of the M detector devices detects original signals generated from K signal sources (K is an integer equal to or greater than 2) having the N-fold symmetry. The application unit applies a trained neural network to M input vectors corresponding to the M detection signals and outputs K output vectors. The same parameter is set to, of multiple weights included in a weight matrix of the trained neural network, weights that are commutative based on the N-fold symmetry.

Type: Application

Filed: February 26, 2021

Publication date: March 17, 2022

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Takehiko KAGOSHIMA, Daichi HAYAKAWA
SPEECH RECOGNITION APPARATUS, METHOD AND NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM

Publication number: 20220076667

Abstract: According to one embodiment, a speech recognition apparatus includes processing circuitry. The processing circuitry generates, based on sensor information, environmental information relating to an environment in which the sensor information has been acquired, generates, based on the environmental information and generic speech data, an adapted acoustic model obtained by adapting a base acoustic model to the environment, acquires speech uttered in the environment as input speech data, and subjects the input speech data to a speech recognition process using the adapted acoustic model.

Type: Application

Filed: February 26, 2021

Publication date: March 10, 2022

Inventors: Daichi Hayakawa, Takehiko Kagoshima, Kenji Iwata
Information processing apparatus, information processing method, and computer program product

Patent number: 11062705

Abstract: According to one embodiment, an information processing apparatus includes one or more processors configured to detect a trigger from a voice signal, the trigger indicating start of voice recognition; and to perform voice recognition of a recognition sound section subsequent to a trigger sound section including the detected trigger, referring to a trigger and voice recognition dictionary corresponding to the trigger.

Type: Grant

Filed: February 27, 2019

Date of Patent: July 13, 2021

Assignee: Kabushiki Kaisha Toshiba

Inventors: Nayuko Watanabe, Takehiko Kagoshima, Hiroshi Fujimura
Sound processing apparatus, speech recognition apparatus, sound processing method, speech recognition method, storage medium

Patent number: 10950227

Abstract: According to one embodiment, a sound processing apparatus extracts a feature of first speech uttered outside an objective area from first speech obtained at positions different from each other in a space of the objective area and a place outside the objective area. The apparatus creates, by learning, a determination model configured to determine whether an utterance position of second speech in the space is outside the objective area based at least in part on the feature uttered outside the objective area. The apparatus eliminates a portion of the second speech uttered outside the objective area from the second speech obtained by a second microphone based at least in part on the feature and the model. The apparatus detects and outputs remaining speech from the second speech.

Type: Grant

Filed: February 26, 2018

Date of Patent: March 16, 2021

Assignee: Kabushiki Kaisha Toshiba

Inventor: Takehiko Kagoshima
Signal processing apparatus, signal processing method, and computer program product

Patent number: 10951982

Abstract: A signal processing apparatus includes one or more processors. The processors acquire a plurality of observed signals acquired from a plurality of microphone groups each including at least one microphone selected from a plurality of microphones. The microphone groups include respective microphone combinations each including at least one microphone, the combinations are different from each other, and at least one of the microphone groups includes a plurality of microphones. The processors estimate a mask indicating occupancy for each of time frequency points of a sound signal of a space corresponding to the observed signal in a plurality of spaces, for each of the observed signals. The processors integrate masks estimated for the observed signals to generate an integrated mask indicating occupancy for each of time frequency points of a sound signal in a space determined based on the spaces.

Type: Grant

Filed: August 19, 2019

Date of Patent: March 16, 2021

Assignee: Kabushiki Kaisha Toshiba

Inventors: Daichi Hayakawa, Takehiko Kagoshima, Hiroshi Fujimura
SIGNAL PROCESSING APPARATUS AND SIGNAL PROCESSING METHOD

Publication number: 20210067867

Abstract: According to one embodiment, a signal processing apparatus includes the following units. The transform unit transforms a first signal into a time-frequency domain to obtain a second signal, the first signal obtained by detecting sound at each of different positions. The first calculation unit calculates a first spatial correlation matrix based on the second signal. The second calculation unit calculates a second spatial correlation matrix based on a third signal obtained by delaying the second signal by a predetermined time. The spatial filter unit generates a spatial filter based on the first spatial correlation matrix and the second spatial correlation matrix, and filters the second signal by using the spatial filter.

Type: Application

Filed: February 20, 2020

Publication date: March 4, 2021

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventor: Takehiko KAGOSHIMA
SIGNAL PROCESSING APPARATUS, SIGNAL PROCESSING METHOD, AND COMPUTER PROGRAM PRODUCT

Publication number: 20200296507

Abstract: A signal processing apparatus includes one or more processors. The processors acquire a plurality of observed signals acquired from a plurality of microphone groups each including at least one microphone selected from a plurality of microphones. The microphone groups include respective microphone combinations each including at least one microphone, the combinations are different from each other, and at least one of the microphone groups includes a plurality of microphones. The processors estimate a mask indicating occupancy for each of time frequency points of a sound signal of a space corresponding to the observed signal in a plurality of spaces, for each of the observed signals. The processors integrate masks estimated for the observed signals to generate an integrated mask indicating occupancy for each of time frequency points of a sound signal in a space determined based on the spaces.

Type: Application

Filed: August 19, 2019

Publication date: September 17, 2020

Inventors: Daichi Hayakawa, Takehiko Kagoshima, Hiroshi Fujimura
Speech recognition device, speech recognition method and storage medium using recognition results to adjust volume level threshold

Patent number: 10579327

Abstract: In a speech recognition device according to one embodiment, a microphone detects sound and generates an audio signal corresponding to the sound, an adjustment processor adjusts a threshold to be a value less than a first volume level of first input audio signal generated by the microphone, and registers the adjusted threshold, a recognition processor reads the registered threshold, compares the registered threshold with a second input audio signal, discards the second input audio signal when a second volume level of the second input audio signal is less than the registered threshold, and performs a recognition process as the audio signal of a user to be recognized when the second volume level of the second input audio signal is greater than or equal to the registered threshold.

Type: Grant

Filed: September 14, 2017

Date of Patent: March 3, 2020

Assignee: Kabushiki Kaisha Toshiba

Inventor: Takehiko Kagoshima
SIGNAL GENERATION DEVICE, SIGNAL GENERATION SYSTEM, SIGNAL GENERATION METHOD, AND COMPUTER PROGRAM PRODUCT

Publication number: 20200066260

Abstract: According to one embodiment, a signal generation device includes one or more processors. The processors convert an acoustic signal and output amplitude and phase at a plurality of frequencies. The processors, for each of a plurality of nodes of a hidden layer included in a neural network that treats the amplitude and the phase as input, obtain frequency based on a plurality of weights used in arithmetic operation of the node. The processors generate an acoustic signal based on the plurality of obtained frequencies and based on amplitude and phase corresponding to each of the plurality of nodes.

Type: Application

Filed: March 8, 2019

Publication date: February 27, 2020

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Daichi HAYAKAWA, Takehiko KAGOSHIMA, Hiroshi FUJIMURA
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND COMPUTER PROGRAM PRODUCT

Publication number: 20200027453

Abstract: According to one embodiment, an information processing apparatus includes one or more processors configured to detect a trigger from a voice signal, the trigger indicating start of voice recognition; and to perform voice recognition of a recognition sound section subsequent to a trigger sound section including the detected trigger, referring to a trigger and voice recognition dictionary corresponding to the trigger.

Type: Application

Filed: February 27, 2019

Publication date: January 23, 2020

Inventors: Nayuko WATANABE, Takehiko KAGOSHIMA, Hiroshi FUJIMURA

1 2 3 4 5 … next