Patents by Inventor Takehiko Kagoshima

Takehiko Kagoshima has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20200027453
    Abstract: According to one embodiment, an information processing apparatus includes one or more processors configured to detect a trigger from a voice signal, the trigger indicating start of voice recognition; and to perform voice recognition of a recognition sound section subsequent to a trigger sound section including the detected trigger, referring to a trigger and voice recognition dictionary corresponding to the trigger.
    Type: Application
    Filed: February 27, 2019
    Publication date: January 23, 2020
    Inventors: Nayuko Watanabe, Takehiko Kagoshima, Hiroshi Fujimura
  • Patent number: 10504523
    Abstract: According to an embodiment, a voice processing device includes a receiver, a separator, and an output controller. The receiver is configured to receive n input signals input into n voice input devices respectively corresponding to n sound sources, where n is an integer of 2 or more. The separator is configured to separate the input signals by the sound sources to produce n separation signals. The output controller is configured to, according to the number of sound sources having uttered voice sounds, switch between an output signal produced based on the input signal and an output signal produced based on the separation signal, and output the output signal.
    Type: Grant
    Filed: February 7, 2018
    Date of Patent: December 10, 2019
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Ning Ding, Takehiko Kagoshima
  • Publication number: 20190080689
    Abstract: According to one embodiment, a sound processing apparatus extracts a feature of first speech uttered outside an objective area from first speech obtained at positions different from each other in a space of the objective area and a place outside the objective area. The apparatus creates, by learning, a determination model configured to determine whether an utterance position of second speech in the space is outside the objective area based at least in part on the feature uttered outside the objective area. The apparatus eliminates a portion of the second speech uttered outside the objective area from the second speech obtained by a second microphone based at least in part on the feature and the model. The apparatus detects and outputs remaining speech from the second speech.
    Type: Application
    Filed: February 26, 2018
    Publication date: March 14, 2019
    Inventor: Takehiko Kagoshima
  • Publication number: 20190061781
    Abstract: According to one embodiment, a moving body operation support system includes an acquirer, a microphone, a transceiver, and a processor. The acquirer acquires moving body information relating to a state of a moving body. The microphone is provided in the moving body. The transceiver performs transmitting to and receiving from an operator communication device. The processor implements one of a first operation or a second operation based on the moving body information and instruction information. The instruction information relates to an instruction based on a sound acquired by the microphone. The instruction is of a user riding in the moving body. In the first operation, the processor causes the moving body to perform an operation corresponding to the instruction information. In the second operation, the processor enables communication between the user and the operator by the transmitting and receiving between the transceiver and the operator communication device.
    Type: Application
    Filed: February 28, 2018
    Publication date: February 28, 2019
    Inventors: Takehiko Kagoshima, Noriko Yamanaka, Tatsuma Ishihara
  • Patent number: 10152986
    Abstract: An acoustic processing apparatus includes a storage, an estimation unit, and a removal unit. The storage stores therein a reference signal indicating a signal obtained by completing removal of reverberation from a first observation signal included in a first processing section. The estimation unit estimates, on the basis of a model representing an observation signal as a signal obtained by adding a signal obtained by applying a reverberation removal filter to an acoustic signal that is input with a delay and the acoustic signal, a filter coefficient of the reverberation removal filter by using a second observation signal and the reference signal. The removal unit determines an output signal indicating a signal obtained by removing reverberation from the second observation signal by using the second observation signal, the reference signal, and the reverberation removal filter having the estimated filter coefficient.
    Type: Grant
    Filed: July 10, 2017
    Date of Patent: December 11, 2018
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Takehiko Kagoshima, Toru Taniguchi
  • Publication number: 20180350370
    Abstract: According to an embodiment, a voice processing device includes a receiver, a separator, and an output controller. The receiver is configured to receive n input signals input into n voice input devices respectively corresponding to n sound sources, where n is an integer of 2 or more. The separator is configured to separate the input signals by the sound sources to produce n separation signals. The output controller is configured to, according to the number of sound sources having uttered voice sounds, switch between an output signal produced based on the input signal and an output signal produced based on the separation signal, and output the output signal.
    Type: Application
    Filed: February 7, 2018
    Publication date: December 6, 2018
    Inventors: Ning Ding, Takehiko Kagoshima
  • Patent number: 10109286
    Abstract: According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.
    Type: Grant
    Filed: September 14, 2017
    Date of Patent: October 23, 2018
    Assignee: KABUSHIKI KAISHA TOSHIBA
    Inventors: Kentaro Tachibana, Takehiko Kagoshima, Masatsune Tamura, Masahiro Morita
  • Publication number: 20180275951
    Abstract: In a speech recognition device according to one embodiment, a microphone detects sound and generates an audio signal corresponding to the sound. An adjustment processor adjusts a threshold to a value less than a first volume level of a first input audio signal generated by the microphone, and registers the adjusted threshold. A recognition processor reads the registered threshold and compares it with a second input audio signal. It discards the second input audio signal when a second volume level of the second input audio signal is less than the registered threshold, and performs a recognition process on the second input audio signal as the audio signal of a user to be recognized when the second volume level is greater than or equal to the registered threshold.
    Type: Application
    Filed: September 14, 2017
    Publication date: September 27, 2018
    Inventor: Takehiko Kagoshima
  • Publication number: 20180233161
    Abstract: An acoustic processing apparatus includes a storage, an estimation unit, and a removal unit. The storage stores therein a reference signal indicating a signal obtained by completing removal of reverberation from a first observation signal included in a first processing section. The estimation unit estimates, on the basis of a model representing an observation signal as a signal obtained by adding a signal obtained by applying a reverberation removal filter to an acoustic signal that is input with a delay and the acoustic signal, a filter coefficient of the reverberation removal filter by using a second observation signal and the reference signal. The removal unit determines an output signal indicating a signal obtained by removing reverberation from the second observation signal by using the second observation signal, the reference signal, and the reverberation removal filter having the estimated filter coefficient.
    Type: Application
    Filed: July 10, 2017
    Publication date: August 16, 2018
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Takehiko Kagoshima, Toru Taniguchi
  • Patent number: 9905219
    Abstract: According to one embodiment, a speech synthesis apparatus is provided with generation, normalization, interpolation and synthesis units. The generation unit generates a first parameter using a prosodic control dictionary of a target speaker and one or more second parameters using a prosodic control dictionary of one or more standard speakers based on language information for an input text. The normalization unit normalizes the one or more second parameters based on a normalization parameter. The interpolation unit interpolates the first parameter and the one or more normalized second parameters based on weight information to generate a third parameter, and the synthesis unit generates synthesized speech using the third parameter.
    Type: Grant
    Filed: August 16, 2013
    Date of Patent: February 27, 2018
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Kentaro Tachibana, Takehiko Kagoshima, Masahiro Morita
  • Patent number: 9870779
    Abstract: According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.
    Type: Grant
    Filed: July 16, 2015
    Date of Patent: January 16, 2018
    Assignee: KABUSHIKI KAISHA TOSHIBA
    Inventors: Kentaro Tachibana, Takehiko Kagoshima, Masatsune Tamura, Masahiro Morita
  • Publication number: 20180005637
    Abstract: According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.
    Type: Application
    Filed: September 14, 2017
    Publication date: January 4, 2018
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventors: Kentaro Tachibana, Takehiko Kagoshima, Masatsune Tamura, Masahiro Morita
  • Patent number: 9792894
    Abstract: According to an embodiment, a speech synthesis dictionary creating device includes a first speech input unit, a second speech input unit, a determining unit, and a creating unit. The first speech input unit receives input of first speech data. The second speech input unit receives input of second speech data which is considered to be appropriate speech data. The determining unit determines whether or not a speaker of the first speech data is the same as a speaker of the second speech data. When the determining unit determines that the speaker of the first speech data is the same as the speaker of the second speech data, the creating unit creates a speech synthesis dictionary using the first speech data and using a text corresponding to the first speech data.
    Type: Grant
    Filed: December 16, 2015
    Date of Patent: October 17, 2017
    Assignee: KABUSHIKI KAISHA TOSHIBA
    Inventors: Kentaro Tachibana, Masahiro Morita, Takehiko Kagoshima
  • Patent number: 9666179
    Abstract: A waveform memory that stores a plurality of speech unit waveforms corresponding to respective speech units, wherein an address order of the speech unit waveforms is determined by a sort order of speech units included in a speech unit sequence corresponding to a phoneme sequence of training data, and the speech units included in the speech unit sequence are selected so as to synthesize a speech of the phoneme sequence.
    Type: Grant
    Filed: February 26, 2014
    Date of Patent: May 30, 2017
    Assignee: KABUSHIKI KAISHA TOSHIBA
    Inventor: Takehiko Kagoshima
  • Patent number: 9601106
    Abstract: According to one embodiment, a prosody editing apparatus includes a storage, a first selection unit, a search unit, a normalization unit, a mapping unit, a display, a second selection unit, a restoring unit and a replacing unit. The search unit searches the storage for one or more second prosodic patterns corresponding to attribute information that matches attribute information of the selected phrase. The mapping unit maps each of the normalized second prosodic patterns on a low-dimensional space. The restoring unit restores a restored prosodic pattern according to the selected coordinates. The replacing unit replaces prosody of synthetic speech generated based on the selected phrase by the restored prosodic pattern.
    Type: Grant
    Filed: August 15, 2013
    Date of Patent: March 21, 2017
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Kouichirou Mori, Takehiko Kagoshima, Masahiro Morita
  • Publication number: 20160104475
    Abstract: According to an embodiment, a speech synthesis dictionary creating device includes a first speech input unit, a second speech input unit, a determining unit, and a creating unit. The first speech input unit receives input of first speech data. The second speech input unit receives input of second speech data which is considered to be appropriate speech data. The determining unit determines whether or not a speaker of the first speech data is the same as a speaker of the second speech data. When the determining unit determines that the speaker of the first speech data is the same as the speaker of the second speech data, the creating unit creates a speech synthesis dictionary using the first speech data and using a text corresponding to the first speech data.
    Type: Application
    Filed: December 16, 2015
    Publication date: April 14, 2016
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventors: Kentaro Tachibana, Masahiro Morita, Takehiko Kagoshima
  • Patent number: 9280967
    Abstract: According to one embodiment, an apparatus for supporting reading of a document includes a model storage unit, a document acquisition unit, a feature information extraction unit, and an utterance style estimation unit. The model storage unit is configured to store a model which has trained a correspondence relationship between first feature information and an utterance style. The first feature information is extracted from a plurality of sentences in a training document. The document acquisition unit is configured to acquire a document to be read. The feature information extraction unit is configured to extract second feature information from each sentence in the document to be read. The utterance style estimation unit is configured to compare the second feature information of a plurality of sentences in the document to be read with the model, and to estimate an utterance style of each sentence of the document to be read.
    Type: Grant
    Filed: September 14, 2011
    Date of Patent: March 8, 2016
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Kosei Fume, Masaru Suzuki, Masahiro Morita, Kentaro Tachibana, Kouichirou Mori, Yuji Shimizu, Takehiko Kagoshima, Masatsune Tamura, Tomohiro Yamasaki
  • Publication number: 20150325232
    Abstract: According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.
    Type: Application
    Filed: July 16, 2015
    Publication date: November 12, 2015
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventors: Kentaro Tachibana, Takehiko Kagoshima, Masatsune Tamura, Masahiro Morita
  • Patent number: 9129596
    Abstract: An apparatus for creating a dictionary for speech synthesis includes a sentence storage unit configured to store N sentences, a sentence display unit configured to selectively display a first sentence which is one of the N sentences, a recording unit configured to record each user speech, a necessity determination unit configured to make a determination of whether to create the dictionary, a dictionary creation unit configured to create the dictionary by utilizing the user speech, and a speech synthesis unit configured to convert a second sentence to a synthesized speech with the dictionary. The display unit is configured to stop displaying the currently displayed sentence according to an evaluation of a quality of its synthesis. The determination unit makes the determination under a condition that the recording unit records the user speech of M first sentences (M is less than N) and the determination is based on at least one of an instruction from the user, M and an amount of the recorded user speech.
    Type: Grant
    Filed: June 28, 2012
    Date of Patent: September 8, 2015
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Kentaro Tachibana, Masahiro Morita, Takehiko Kagoshima
  • Patent number: 9058807
    Abstract: According to one embodiment, a first storage unit stores n band noise signals obtained by applying n band-pass filters to a noise signal. A second storage unit stores n band pulse signals. A parameter input unit inputs a fundamental frequency, n band noise intensities, and a spectrum parameter. An extraction unit extracts for each pitch mark the n band noise signals while shifting. An amplitude control unit changes amplitudes of the extracted band noise signals and band pulse signals in accordance with the band noise intensities. A generation unit generates a mixed sound source signal by adding the n band noise signals and the n band pulse signals. A generation unit generates the mixed sound source signal generated based on the pitch mark. A vocal tract filter unit generates a speech waveform by applying a vocal tract filter using the spectrum parameter to the generated mixed sound source signal.
    Type: Grant
    Filed: March 18, 2011
    Date of Patent: June 16, 2015
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima