Patents by Inventor Tomohiro Nakatani

Tomohiro Nakatani has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

MASK ESTIMATION DEVICE, MASK ESTIMATION METHOD, AND MASK ESTIMATION PROGRAM

Publication number: 20210216687

Abstract: A mask estimation apparatus includes processing circuitry configured to estimate, for a target segment to be processed among a plurality of segments of a continuous time, a first mask which is an occupancy ratio of a target signal to an observation signal of the target segment, based on a first feature obtained from a plurality of the observation signals of the target segment recorded at a plurality of locations, and estimate a parameter for modeling a second feature and a second mask which is an occupancy ratio of the target signal to the observation signal based on an estimation result of the first mask in the target segment and the second feature obtained from the plurality of the observation signals of the target segment.

Type: Application

Filed: August 23, 2019

Publication date: July 15, 2021

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Tomohiro NAKATANI, Marc DELCROIX, Keisuke KINOSHITA, Nobutaka ITO, Shoko ARAKI
ACOUSTIC MODEL TRAINING METHOD, SPEECH RECOGNITION METHOD, ACOUSTIC MODEL TRAINING APPARATUS, SPEECH RECOGNITION APPARATUS, ACOUSTIC MODEL TRAINING PROGRAM, AND SPEECH RECOGNITION PROGRAM

Publication number: 20210193161

Abstract: To begin with, an acoustic model training apparatus extracts speech features representing speech characteristics, and calculates an acoustic-condition feature representing a feature of an acoustic condition of the speech data using an acoustic-condition calculation model that is represented as a neural network, based on an acoustic-condition calculation model parameter characterizing the acoustic-condition calculation model. The acoustic model training apparatus then generates an adjusted parameter that is an acoustic model parameter adjusted based on the acoustic-condition feature, the acoustic model parameter characterizing an acoustic model represented as a neural network to which an output layer of the acoustic-condition calculation model is coupled. The acoustic model training apparatus then updates the acoustic model parameter based on the adjusted parameter and the speech features, and updates the acoustic-condition calculation model parameters based on the adjusted parameter and the speech features.

Type: Application

Filed: January 26, 2017

Publication date: June 24, 2021

Applicant: NIPPON TELEGRAPH AND TELEPHPNE CORPORATION

Inventors: Marc DELCROIX, Keisuke KINOSHITA, Astunori OGAWA, Takuya YOSHIOKA, Tomohiro NAKATANI
LEARNING DEVICE, LEARNING METHOD AND LEARNING PROGRAM

Publication number: 20210056954

Abstract: A learning device (10) includes a feature extracting unit (11) that extracts features of speech from speech data for training, a probability calculating unit (12) that, on the basis of the features of speech, performs prefix searching using a speech recognition model of which a neural network is representative, and calculates a posterior probability of a recognition character string to obtain a plurality of hypothetical character strings, an error calculating unit (13) that calculates an error by word error rates of the plurality of hypothetical character strings and a correct character string for training, and obtains a parameter for the entire model that minimizes an expected value of summation of loss in the word error rates, and an updating unit (14) that updates a parameter of the model in accordance with the parameter obtained by the error calculating unit (13).

Type: Application

Filed: February 1, 2019

Publication date: February 25, 2021

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Shigeki KARITA, Atsunori OGAWA, Marc DELCROIX, Tomohiro NAKATANI
APPARATUS, METHOD, AND PROGRAM FOR UTILIZING LANGUAGE MODEL

Publication number: 20210049324

Abstract: Disclosed is a model adaptation technology of a language model with higher adaptability. An aspect of the present disclosure relates to an apparatus includes a first neural network unit that transforms an input symbol and outputs an intermediate state; and a second neural network unit that transforms input auxiliary information and the intermediate state and predicts a symbol following the input symbol, wherein the second neural network unit includes a plurality of hidden layers receiving, as input, the intermediate state and auxiliary information, and pieces of the auxiliary information input to each hidden layer are different from each other.

Type: Application

Filed: February 18, 2019

Publication date: February 18, 2021

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Marc DELCROIX, Atsunori OGAWA, Tomohiro NAKATANI, Michael HENTSCHEL
DETERMINATION DEVICE, DETERMINATION METHOD, AND DETERMINATION PROGRAM

Publication number: 20210035564

Abstract: A determination device includes a memory, and processing circuitry coupled to the memory and configured to accept input of a plurality of sequences provided as candidates for a solution to one given input, and determine, for two sequences of the plurality of sequences, a sequence that has a higher accuracy than the other sequence of the two sequences, using a model expressed as a neural network.

Type: Application

Filed: February 1, 2019

Publication date: February 4, 2021

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Atsunori OGAWA, Marc DELCROIX, Shigeki KARITA, Tomohiro NAKATANI
SIGNAL ANALYSIS DEVICE, SIGNAL ANALYSIS METHOD, AND SIGNAL ANALYSIS PROGRAM

Publication number: 20210012790

Abstract: A signal analysis device (1) includes an estimation unit (10) that, when a parameter for modeling spatial characteristics of signals from N signal sources (where N is an integer equal to or larger than 2) is a spatial parameter, estimates a signal source position prior probability which is a mixture weight for modeling a prior distribution of the spatial parameter with respect to each signal source using a mixture distribution that is a linear combination of prior distributions of the spatial parameter with respect to K signal source position candidates (where K is an integer equal to or larger than 2), and which is a probability that a signal arrives from each signal source position candidate per signal source.

Type: Application

Filed: April 5, 2019

Publication date: January 14, 2021

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Nobutaka ITO, Tomohiro NAKATANI, Shoko ARAKI
SIGNAL ANALYSIS DEVICE, SIGNAL ANALYSIS METHOD, AND RECORDING MEDIUM

Publication number: 20200411031

Abstract: A signal analysis device includes a memory and processing circuitry coupled to the memory and configured to obtain, for a spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling spatial characteristics of J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, a simultaneous decorrelation matrix P as a matrix in which all PHRjP are diagonal matrices, or/and Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to the J source signals for observation signal vectors based on observation signals acquired at I (I is an integral number equal to or larger than 2) different positions.

Type: Application

Filed: February 1, 2019

Publication date: December 31, 2020

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Nobutaka ITO, Tomohiro NAKATANI, Shoko ARAKI
SIGNAL ANALYSIS DEVICE, SIGNAL ANALYSIS METHOD, AND SIGNAL ANALYSIS PROGRAM

Publication number: 20200411027

Abstract: A signal analysis device includes an estimation unit that models a sound source position occurrence probability matrix Q using a product of a sound source position probability matrix B and a sound source existence probability matrix A, and estimates at least one of the sound source position probability matrix B and the sound source existence probability matrix A based on the modeling, the sound source position occurrence probability matrix Q being composed of probabilities of arrival of a signal from each sound source position candidate per frame, which is a time section, with respect to a plurality of sound source position candidates. The sound source position probability matrix B being composed of probabilities of arrival of a signal from each sound source position candidate per sound source with respect to a plurality of sound sources.

Type: Application

Filed: April 4, 2019

Publication date: December 31, 2020

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Nobutaka ITO, Tomohiro NAKATANI, Shoko ARAKI
Mask estimation apparatus, mask estimation method, and mask estimation program

Patent number: 10878832

Abstract: A feature extraction unit in a mask estimation apparatus extracts, from a plurality of observation signals obtained by observing a plurality of acoustic signals at different positions, feature vectors obtained by collecting time-frequency components of the observation signals for each time-frequency point. A mask update unit uses the feature vectors, a mixture weight of each component distribution, and a shape parameter that is a model parameter capable of controlling a shape of each component distribution, where a probability distribution of the feature vectors is modeled by a mixture distribution consisting of a plurality of component distributions, to estimate masks indicating a proportion in which each component distribution contributes to each time-frequency point. A mixture weight update unit updates the mixture weight based on the updated masks. A parameter update unit updates the shape parameter by using the feature vectors and the masks.

Type: Grant

Filed: December 20, 2016

Date of Patent: December 29, 2020

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Nobutaka Ito, Shoko Araki, Tomohiro Nakatani
MASK ESTIMATION APPARATUS, MODEL LEARNING APPARATUS, SOUND SOURCE SEPARATION APPARATUS, MASK ESTIMATION METHOD, MODEL LEARNING METHOD, SOUND SOURCE SEPARATION METHOD, AND PROGRAM

Publication number: 20200395037

Abstract: A mask estimation apparatus for estimating mask information for specifying a mask used to extract a signal of a specific sound source from an input audio signal includes a converter which converts the input audio signal into embedded vectors of a predetermined dimension using a trained neural network model and a mask calculator which calculates the mask information by fitting the embedded vectors to a mixed Gaussian model.

Type: Application

Filed: February 19, 2019

Publication date: December 17, 2020

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Takuya HIGUCHI, Tomohiro NAKATANI, Keisuke KINOSHITA
LEARNING DEVICE, LEARNING METHOD, AND LEARNING PROGRAM

Publication number: 20200365143

Abstract: A learning device includes a memory, and processing circuitry coupled to the memory and configured to receive an input of a plurality of series for learning having known accuracy, and learn a model represented by a neural network, the model being capable of determining accuracy levels of two series when given feature amounts of the two series among the plurality of series.

Type: Application

Filed: February 1, 2019

Publication date: November 19, 2020

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Atsunori OGAWA, Marc DELCROIX, Shigeki KARITA, Tomohiro NAKATANI
MASK CALCULATION DEVICE, CLUSTER WEIGHT LEARNING DEVICE, MASK CALCULATION NEURAL NETWORK LEARNING DEVICE, MASK CALCULATION METHOD, CLUSTER WEIGHT LEARNING METHOD, AND MASK CALCULATION NEURAL NETWORK LEARNING METHOD

Publication number: 20200143819

Abstract: A cluster weight calculator calculates weights corresponding to respective clusters in a mask calculation NN with at least one of the layers divided into the clusters, based on the signals of speech of a target speaker using a cluster weight calculation NN. A mask calculator calculates a mask for extracting features of speech of the target speaker from features in observed speech signals of one or more speakers based on the features in the observation signals of the speech of the one or more speakers using the mask calculator NN weighted by the weights calculated by the cluster weight calculator.

Type: Application

Filed: July 18, 2018

Publication date: May 7, 2020

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Marc DELCROIX, Keisuke KINOSHITA, Atsunori OGAWA, Takuya HIGUCHI, Tomohiro NAKATANI
Spatial correlation matrix estimation device, spatial correlation matrix estimation method, and spatial correlation matrix estimation program

Patent number: 10643633

Abstract: An observation feature value vector is calculated based on observation signals recorded at different positions in a situation in which target sound sources and background noise are present in a mixed manner; masks associated with the target sound sources and a mask associated with the background noise are estimated; a spatial correlation matrix of the target sound sources that includes the background noise is calculated based on the masks associated with the observation signals and the target sound sources; a spatial correlation matrix of the background noise is calculated based on the masks associated with the observation signals and the background noise; and a spatial correlation matrix of the target sound sources is estimated based on the matrix obtained by weighting each of the spatial correlation matrices by predetermined coefficients.

Type: Grant

Filed: December 1, 2016

Date of Patent: May 5, 2020

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Tomohiro Nakatani, Nobutaka Ito, Takuya Higuchi, Shoko Araki, Takuya Yoshioka
MASK ESTIMATION APPARATUS, MASK ESTIMATION METHOD, AND MASK ESTIMATION PROGRAM

Publication number: 20190267019

Abstract: A feature extraction unit in a mask estimation apparatus extracts, from a plurality of observation signals obtained by observing a plurality of acoustic signals at different positions, feature vectors obtained by collecting time-frequency components of the observation signals for each time-frequency point. A mask update unit uses the feature vectors, a mixture weight of each component distribution, and a shape parameter that is a model parameter capable of controlling a shape of each component distribution, where a probability distribution of the feature vectors is modeled by a mixture distribution consisting of a plurality of component distributions, to estimate masks indicating a proportion in which each component distribution contributes to each time-frequency point. A mixture weight update unit updates the mixture weight based on the updated masks. A parameter update unit updates the shape parameter by using the feature vectors and the masks.

Type: Application

Filed: December 20, 2016

Publication date: August 29, 2019

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Nobutaka ITO, Shoko ARAKI, Tomohiro NAKATANI
SPATIAL CORRELATION MATRIX ESTIMATION DEVICE, SPATIAL CORRELATION MATRIX ESTIMATION METHOD, AND SPATIAL CORRELATION MATRIX ESTIMATION PROGRAM

Publication number: 20180366135

Abstract: An observation feature value vector is calculated based on observation signals recorded at different positions in a situation in which target sound sources and background noise are present in a mixed manner; masks associated with the target sound sources and a mask associated with the background noise are estimated; a spatial correlation matrix of the target sound sources that includes the background noise is calculated based on the masks associated with the observation signals and the target sound sources; a spatial correlation matrix of the background noise is calculated based on the masks associated with the observation signals and the background noise; and a spatial correlation matrix of the target sound sources is estimated based on the matrix obtained by weighting each of the spatial correlation matrices by predetermined coefficients.

Type: Application

Filed: December 1, 2016

Publication date: December 20, 2018

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Tomohiro NAKATANI, Nobutaka ITO, Takuya HIGUCHI, Shoko ARAKI, Takuya YOSHIOKA
Noise estimation apparatus, noise estimation method, noise estimation program, and recording medium

Patent number: 9754608

Abstract: A noise estimation apparatus which estimates a non-stationary noise component on the basis of the likelihood maximization criterion is provided. The noise estimation apparatus obtains the variance of a noise signal that causes a large value to be obtained by weighted addition of the sums each of which is obtained by adding the product of the log likelihood of a model of an observed signal expressed by a Gaussian distribution in a speech segment and a speech posterior probability in each frame, and the product of the log likelihood of a model of an observed signal expressed by a Gaussian distribution in a non-speech segment and a non-speech posterior probability in each frame, by using complex spectra of a plurality of observed signals up to the current frame.

Type: Grant

Filed: January 30, 2013

Date of Patent: September 5, 2017

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Mehrez Souden, Keisuke Kinoshita, Tomohiro Nakatani, Marc Delcroix, Takuya Yoshioka
Audio signal section estimating apparatus, audio signal section estimating method, and recording medium

Patent number: 9208780

Abstract: The processing efficiency and estimation accuracy of a voice activity detection apparatus are improved. An acoustic signal analyzer receives a digital acoustic signal containing a speech signal and a noise signal, generates a non-speech GMM and a speech GMM adapted to a noise environment, by using a silence GMM and a clean-speech GMM in each frame of the digital acoustic signal, and calculates the output probabilities of dominant Gaussian distributions of the GMMs. A speech state probability to non-speech state probability ratio calculator calculates a speech state probability to non-speech state probability ratio based on a state transition model of a speech state and a non-speech state, by using the output probabilities; and a voice activity detection unit judges, from the speech state probability to non-speech state probability ratio, whether the acoustic signal in the frame is in the speech state or in the non-speech state and outputs only the acoustic signal in the speech state.

Type: Grant

Filed: July 15, 2010

Date of Patent: December 8, 2015

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Masakiyo Fujimoto, Tomohiro Nakatani
NOISE ESTIMATION APPARATUS, NOISE ESTIMATION METHOD, NOISE ESTIMATION PROGRAM, AND RECORDING MEDIUM

Publication number: 20150032445

Abstract: A noise estimation apparatus which estimates a non-stationary noise component on the basis of the likelihood maximization criterion is provided. The noise estimation apparatus obtains the variance of a noise signal that causes a large value to be obtained by weighted addition of the sums each of which is obtained by adding the product of the log likelihood of a model of an observed signal expressed by a Gaussian distribution in a speech segment and a speech posterior probability in each frame, and the product of the log likelihood of a model of an observed signal expressed by a Gaussian distribution in a non-speech segment and a non-speech posterior probability in each frame, by using complex spectra of a plurality of observed signals up to the current frame.

Type: Application

Filed: January 30, 2013

Publication date: January 29, 2015

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Mehrez Souden, Keisuke Kinoshita, Tomohiro Nakatani, Marc Delcroix, Takuya Yoshioka
Signal enhancement device, method thereof, program, and recording medium

Patent number: 8848933

Abstract: The initial values of parameter estimates are set, including reverberation parameter estimates, which includes a regression coefficient used in a linear convolutional operation for calculating an estimated value of reverberation included in an observed signal, source parameter estimates, which includes estimated values of a linear prediction coefficient and a prediction residual power that identify the power spectrum of a source signal, and noise parameter estimates, which include noise power spectrum estimates. Then, the maximum likelihood estimation is used to alternately repeat processing for updating at least one of the reverberation parameter estimates and the noise parameter estimates and processing for updating the source parameter estimates until a predetermined termination condition is satisfied.

Type: Grant

Filed: March 5, 2009

Date of Patent: September 30, 2014

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Takuya Yoshioka, Tomohiro Nakatani, Masato Miyoshi
SUBSTRATE WITH THROUGH-ELECTRODE AND METHOD FOR PRODUCING SAME

Publication number: 20130168141

Abstract: A method for producing a substrate with through-electrode includes the steps of: forming recesses or through-holes in either one of a silicon substrate and a glass substrate; forming protrusions in the other substrate; laying the silicon substrate and glass substrate on each other so that the protrusions are inserted in the respective recesses or through-holes; and bonding the silicon substrate and the glass substrate to each other.

Type: Application

Filed: January 24, 2012

Publication date: July 4, 2013

Applicant: PANASONIC CORPORATION

Inventors: Junichi Hozumi, Takumi Taura, Shin Okumura, Tomohiro Nakatani, Ryo Tomoida

prev 1 2 3 4 next