Patents by Inventor Takehiko Kagoshima

Takehiko Kagoshima has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND COMPUTER PROGRAM PRODUCT

Publication number: 20200027453

Abstract: According to one embodiment, an information processing apparatus includes one or more processors configured to detect a trigger from a voice signal, the trigger indicating start of voice recognition; and to perform voice recognition of a recognition sound section subsequent to a trigger sound section including the detected trigger, referring to a trigger and voice recognition dictionary corresponding to the trigger.

Type: Application

Filed: February 27, 2019

Publication date: January 23, 2020

Inventors: Nayuko WATANABE, Takehiko KAGOSHIMA, Hiroshi FUJIMURA
Voice processing device, voice processing method, and computer program product

Patent number: 10504523

Abstract: According to an embodiment, a voice processing device includes a receiver, a separator, and an output controller. The receiver is configured to receive n input signals input into n voice input devices respectively corresponding to n sound sources, where n is an integer of 2 or more. The separator is configured to separate the input signals by the sound sources to produce n separation signals. The output controller is configured to, according to the number of sound sources having uttered voice sounds, switch between an output signal produced based on the input signal and an output signal produced based on the separation signal, and output the output signal.

Type: Grant

Filed: February 7, 2018

Date of Patent: December 10, 2019

Assignee: Kabushiki Kaisha Toshiba

Inventors: Ning Ding, Takehiko Kagoshima
SOUND PROCESSING APPARATUS, SPEECH RECOGNITION APPARATUS, SOUND PROCESSING METHOD, SPEECH RECOGNITION METHOD, STORAGE MEDIUM

Publication number: 20190080689

Abstract: According to one embodiment, a sound processing apparatus extracts a feature of first speech uttered outside an objective area from first speech obtained at positions different from each other in a space of the objective area and a place outside the objective area. The apparatus creates, by learning, a determination model configured to determine whether an utterance position of second speech in the space is outside the objective area based at least in part on the feature uttered outside the objective area. The apparatus eliminates a portion of the second speech uttered outside the objective area from the second speech obtained by a second microphone based at least in part on the feature and the model. The apparatus detects and outputs remaining speech from the second speech.

Type: Application

Filed: February 26, 2018

Publication date: March 14, 2019

Inventor: Takehiko Kagoshima
MOVING BODY OPERATION SUPPORT SYSTEM

Publication number: 20190061781

Abstract: According to one embodiment, a moving body operation support system includes an acquirer, a microphone, a transceiver, and a processor. The acquirer acquires moving body information relating to a state of a moving body. The microphone is provided in the moving body. The transceiver performs transmitting to and receiving from an operator communication device. The processor implements one of a first operation or a second operation based on the moving body information and instruction information. The instruction information relates to an instruction based on a sound acquired by the microphone. The instruction is of a user riding in the moving body. In the first operation, the processor causes the moving body to perform an operation corresponding to the instruction information. In the second operation, the processor enables communication between the user and the operator by the transmitting and receiving between the transceiver and the operator communication device.

Type: Application

Filed: February 28, 2018

Publication date: February 28, 2019

Inventors: Takehiko KAGOSHIMA, Noriko YAMANAKA, Tatsuma ISHIHARA
Acoustic processing apparatus, acoustic processing method, and computer program product

Patent number: 10152986

Abstract: An acoustic processing apparatus includes a storage, an estimation unit, and a removal unit. The storage stores therein a reference signal indicating a signal obtained by completing removal of reverberation from a first observation signal included in a first processing section. The estimation unit estimates, on the basis of a model representing an observation signal as a signal obtained by adding a signal obtained by applying a reverberation removal filter to an acoustic signal that is input with a delay and the acoustic signal, a filter coefficient of the reverberation removal filter by using a second observation signal and the reference signal. The removal unit determines an output signal indicating a signal obtained by removing reverberation from the second observation signal by using the second observation signal, the reference signal, and the reverberation removal filter having the estimated filter coefficient.

Type: Grant

Filed: July 10, 2017

Date of Patent: December 11, 2018

Assignee: Kabushiki Kaisha Toshiba

Inventors: Takehiko Kagoshima, Toru Taniguchi
VOICE PROCESSING DEVICE, VOICE PROCESSING METHOD, AND COMPUTER PROGRAM PRODUCT

Publication number: 20180350370

Abstract: According to an embodiment, a voice processing device includes a receiver, a separator, and an output controller. The receiver is configured to receive n input signals input into n voice input devices respectively corresponding to n sound sources, where n is an integer of 2 or more. The separator is configured to separate the input signals by the sound sources to produce n separation signals. The output controller is configured to, according to the number of sound sources having uttered voice sounds, switch between an output signal produced based on the input signal and an output signal produced based on the separation signal, and output the output signal.

Type: Application

Filed: February 7, 2018

Publication date: December 6, 2018

Inventors: Ning DING, Takehiko KAGOSHIMA
Speech synthesizer, audio watermarking information detection apparatus, speech synthesizing method, audio watermarking information detection method, and computer program product

Patent number: 10109286

Abstract: According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.

Type: Grant

Filed: September 14, 2017

Date of Patent: October 23, 2018

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventors: Kentaro Tachibana, Takehiko Kagoshima, Masatsune Tamura, Masahiro Morita
SPEECH RECOGNITION DEVICE, SPEECH RECOGNITION METHOD AND STORAGE MEDIUM

Publication number: 20180275951

Abstract: In a speech recognition device according to one embodiment, a microphone detects sound and generates an audio signal corresponding to the sound, an adjustment processor adjusts a threshold to be a value less than a first volume level of first input audio signal generated by the microphone, and registers the adjusted threshold, a recognition processor reads the registered threshold, compares the registered threshold with a second input audio signal, discards the second input audio signal when a second volume level of the second input audio signal is less than the registered threshold, and performs a recognition process as the audio signal of a user to be recognized when the second volume level of the second input audio signal is greater than or equal to the registered threshold.

Type: Application

Filed: September 14, 2017

Publication date: September 27, 2018

Inventor: Takehiko Kagoshima
ACOUSTIC PROCESSING APPARATUS, ACOUSTIC PROCESSING METHOD, AND COMPUTER PROGRAM PRODUCT

Publication number: 20180233161

Abstract: An acoustic processing apparatus includes a storage, an estimation unit, and a removal unit. The storage stores therein a reference signal indicating a signal obtained by completing removal of reverberation from a first observation signal included in a first processing section. The estimation unit estimates, on the basis of a model representing an observation signal as a signal obtained by adding a signal obtained by applying a reverberation removal filter to an acoustic signal that is input with a delay and the acoustic signal, a filter coefficient of the reverberation removal filter by using a second observation signal and the reference signal. The removal unit determines an output signal indicating a signal obtained by removing reverberation from the second observation signal by using the second observation signal, the reference signal, and the reverberation removal filter having the estimated filter coefficient.

Type: Application

Filed: July 10, 2017

Publication date: August 16, 2018

Applicant: Kabushiki Kaisha Toshiba

Inventors: Takehiko Kagoshima, Toru Taniguchi
Speech synthesis apparatus, method, and computer-readable medium that generates synthesized speech having prosodic feature

Patent number: 9905219

Abstract: According to one embodiment, a speech synthesis apparatus is provided with generation, normalization, interpolation and synthesis units. The generation unit generates a first parameter using a prosodic control dictionary of a target speaker and one or more second parameters using a prosodic control dictionary of one or more standard speakers based on language information for an input text. The normalization unit normalizes the one or more second parameters based a normalization parameter. The interpolation unit interpolates the first parameter and the one or more normalized second parameters based on weight information to generate a third parameter and the synthesis unit generates synthesized speech using the third parameter.

Type: Grant

Filed: August 16, 2013

Date of Patent: February 27, 2018

Assignee: Kabushiki Kaisha Toshiba

Inventors: Kentaro Tachibana, Takehiko Kagoshima, Masahiro Morita
Speech synthesizer, audio watermarking information detection apparatus, speech synthesizing method, audio watermarking information detection method, and computer program product

Patent number: 9870779

Abstract: According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.

Type: Grant

Filed: July 16, 2015

Date of Patent: January 16, 2018

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventors: Kentaro Tachibana, Takehiko Kagoshima, Masatsune Tamura, Masahiro Morita
SPEECH SYNTHESIZER, AUDIO WATERMARKING INFORMATION DETECTION APPARATUS, SPEECH SYNTHESIZING METHOD, AUDIO WATERMARKING INFORMATION DETECTION METHOD, AND COMPUTER PROGRAM PRODUCT

Publication number: 20180005637

Abstract: According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.

Type: Application

Filed: September 14, 2017

Publication date: January 4, 2018

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Kentaro TACHIBANA, Takehiko KAGOSHIMA, Masatsune TAMURA, Masahiro MORITA
Speech synthesis dictionary creating device and method

Patent number: 9792894

Abstract: According to an embodiment, a speech synthesis dictionary creating device includes a first speech input unit, a second speech input unit, a determining unit, and a creating unit. The first speech input unit receives input of first speech data. The second speech input unit receives input of second speech data which is considered to be appropriate speech data. The determining unit determines whether or not a speaker of the first speech data is the same as a speaker of the second speech data. When the determining unit determines that the speaker of the first speech data is the same as the speaker of the second speech data, the creating unit creates a speech synthesis dictionary using the first speech data and using a text corresponding to the first speech data.

Type: Grant

Filed: December 16, 2015

Date of Patent: October 17, 2017

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventors: Kentaro Tachibana, Masahiro Morita, Takehiko Kagoshima
Speech synthesis apparatus and method utilizing acquisition of at least two speech unit waveforms acquired from a continuous memory region by one access

Patent number: 9666179

Abstract: A waveform memory that stores a plurality of speech unit waveforms corresponding to respective speech units, wherein an address order of the speech unit waveforms is determined by a sort order of speech units included in a speech unit sequence corresponding to a phoneme sequence of training data, and the speech units included in the speech unit sequence are selected so as to synthesize a speech of the phone sequence.

Type: Grant

Filed: February 26, 2014

Date of Patent: May 30, 2017

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventor: Takehiko Kagoshima
Prosody editing apparatus and method

Patent number: 9601106

Abstract: According to one embodiment, a prosody editing apparatus includes a storage, a first selection unit, a search unit, a normalization unit, a mapping unit, a display, a second selection unit, a restoring unit and a replacing unit. The search unit searches the storage for one or more second prosodic patterns corresponding to attribute information that matches attribute information of the selected phrase. The mapping maps each of the normalized second prosodic patterns on a low-dimensional space. The restoring unit restores a restored prosodic pattern according to the selected coordinates. The replacing unit replaces prosody of synthetic speech generated based on the selected phrase by the restored prosodic pattern.

Type: Grant

Filed: August 15, 2013

Date of Patent: March 21, 2017

Assignee: Kabushiki Kaisha Toshiba

Inventors: Kouichirou Mori, Takehiko Kagoshima, Masahiro Morita
SPEECH SYNTHESIS DICTIONARY CREATING DEVICE AND METHOD

Publication number: 20160104475

Abstract: According to an embodiment, a speech synthesis dictionary creating device includes a first speech input unit, a second speech input unit, a determining unit, and a creating unit. The first speech input unit receives input of first speech data. The second speech input unit receives input of second speech data which is considered to be appropriate speech data. The determining unit determines whether or not a speaker of the first speech data is the same as a speaker of the second speech data. When the determining unit determines that the speaker of the first speech data is the same as the speaker of the second speech data, the creating unit creates a speech synthesis dictionary using the first speech data and using a text corresponding to the first speech data.

Type: Application

Filed: December 16, 2015

Publication date: April 14, 2016

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Kentaro TACHIBANA, Masahiro MORITA, Takehiko KAGOSHIMA
Apparatus and method for estimating utterance style of each sentence in documents, and non-transitory computer readable medium thereof

Patent number: 9280967

Abstract: According to one embodiment, an apparatus for supporting reading of a document includes a model storage unit, a document acquisition unit, a feature information extraction, and an utterance style estimation unit. The model storage unit is configured to store a model which has trained a correspondence relationship between first feature information and an utterance style. The first feature information is extracted from a plurality of sentences in a training document. The document acquisition unit is configured to acquire a document to be read. The feature information extraction unit is configured to extract second feature information from each sentence in the document to be read. The utterance style estimation unit is configured to compare the second feature information of a plurality of sentences in the document to be read with the model, and to estimate an utterance style of the each sentence of the document to be read.

Type: Grant

Filed: September 14, 2011

Date of Patent: March 8, 2016

Assignee: Kabushiki Kaisha Toshiba

Inventors: Kosei Fume, Masaru Suzuki, Masahiro Morita, Kentaro Tachibana, Kouichirou Mori, Yuji Shimizu, Takehiko Kagoshima, Masatsune Tamura, Tomohiro Yamasaki
SPEECH SYNTHESIZER, AUDIO WATERMARKING INFORMATION DETECTION APPARATUS, SPEECH SYNTHESIZING METHOD, AUDIO WATERMARKING INFORMATION DETECTION METHOD, AND COMPUTER PROGRAM PRODUCT

Publication number: 20150325232

Abstract: According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.

Type: Application

Filed: July 16, 2015

Publication date: November 12, 2015

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Kentaro TACHIBANA, Takehiko KAGOSHIMA, Masatsune TAMURA, Masahiro MORITA
Apparatus and method for creating dictionary for speech synthesis utilizing a display to aid in assessing synthesis quality

Patent number: 9129596

Abstract: Apparatus for creating a dictionary for speech synthesis includes a sentence storage unit configured to store N sentences, a sentence display unit configured to selectively display a first sentence which is one of the N sentences, a recording unit configured to record each user speech, a necessity determination unit configured to make a determination of whether to create the dictionary, a dictionary creation unit configured to create the dictionary by utilizing the user speech, and a speech synthesis unit configured to convert a second sentence to a synthesized speech with the dictionary. The display unit is configured to stop displaying the currently displayed sentence according to an evaluation of a quality of its synthesis. The determination unit makes the determination under a condition that the recording unit records the user speech of M first sentences (M is less than N) and the determination is based on at least one of an instruction from the user, M and an amount of the recorded user speech.

Type: Grant

Filed: June 28, 2012

Date of Patent: September 8, 2015

Assignee: Kabushiki Kaisha Toshiba

Inventors: Kentaro Tachibana, Masahiro Morita, Takehiko Kagoshima
Speech synthesizer, speech synthesis method and computer program product

Patent number: 9058807

Abstract: According to one embodiment, a first storage unit stores n band noise signals obtained by applying n band-pass filters to a noise signal. A second storage unit stores n band pulse signals. A parameter input unit inputs a fundamental frequency, n band noise intensities, and a spectrum parameter. A extraction unit extracts for each pitch mark the n band noise signals while shifting. An amplitude control unit changes amplitudes of the extracted band noise signals and band pulse signals in accordance with the band noise intensities. A generation unit generates a mixed sound source signal by adding the n band noise signals and the n band pulse signals. A generation unit generates the mixed sound source signal generated based on the pitch mark. A vocal tract filter unit generates a speech waveform by applying a vocal tract filter using the spectrum parameter to the generated mixed sound source signal.

Type: Grant

Filed: March 18, 2011

Date of Patent: June 16, 2015

Assignee: Kabushiki Kaisha Toshiba

Inventors: Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima

prev 1 2 3 4 5 6 next