Patents by Inventor Qiongqiong WANG

Qiongqiong WANG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240038244
    Abstract: A speaker subset selection means 81 selects speakers corresponding to an attribute from subset information of an entire speaker to determine a subset of a speech model from which test utterance is identified. A speaker identification means 82 identifies a speaker of the test utterance from a subset of the determined speech model based on features extracted from the test utterance.
    Type: Application
    Filed: December 25, 2020
    Publication date: February 1, 2024
    Applicant: NEC Corporation
    Inventors: Qiongqiong Wang, Takafumi KOSHINAKA
  • Publication number: 20230368809
    Abstract: A speech enhancement means 81 determines an enhancement mask generated based on a mask for speech enhancement, when a test utterance is input as speech data. A first hyper-parameter optimization means 82 determines, when the test utterance is input, a first hyper-parameter which is a hyper-parameter representing the degree to which the signal representing the test utterance is kept using the mask, and the first hyper-parameter which is set to take into account a downstream task that is processed using an enhanced test utterance. A mask generation means 83 generates an adaptive mask from the determined enhancement mask and the first hyper-parameter that enhances the test utterance for the downstream task. The mask generation means 83 generates the adaptive mask in which the first hyper-parameter is a power of the mask.
    Type: Application
    Filed: October 15, 2020
    Publication date: November 16, 2023
    Applicant: NEC Corporation
    Inventors: Qiongqiong WANG, Takafumi Koshinaka
  • Patent number: 11817103
    Abstract: Provided is a pattern recognition apparatus to provide classification robustness to any kind of domain variability. The pattern recognition apparatus 500 based on Neural Network (NN) includes: NN training unit 501 that trains an NN model to generate NN parameters, based on at least one first feature vector and at least one domain vector indicating one of subsets in a specific domain, wherein, the first feature vector is extracted from each of the subsets, the domain vector indicates an identifier corresponding to the each of the subsets; and NN verification unit 502 that verifies a pair of second feature vectors in the specific domain to output whether the pair indicates same individual or not, based on a target domain vector and the NN parameters.
    Type: Grant
    Filed: September 15, 2017
    Date of Patent: November 14, 2023
    Assignee: NEC CORPORATION
    Inventors: Qiongqiong Wang, Takafumi Koshinaka
  • Patent number: 11798564
    Abstract: A spoofing detection apparatus 100 includes a multi-channel spectrogram creation unit 10 and an evaluation unit 40. The multi-channel spectrogram creation unit 10 extracts different type of spectrograms from speech data and integrates the different type of spectrograms to create a multi-channel spectrogram. The evaluation unit 40 evaluates the created multi-channel spectrogram by applying the created multi-channel spectrogram to a classifier constructed using labeled multi-channel spectrograms as training data and classifies it to either genuine or spoof.
    Type: Grant
    Filed: June 28, 2019
    Date of Patent: October 24, 2023
    Assignee: NEC CORPORATION
    Inventors: Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka
  • Patent number: 11600273
    Abstract: The speech processing apparatus 100 includes an air microphone speech recognition unit 101 which recognizes speech from an air microphone 200 acquiring speech through air, a wearable microphone speech recognition unit 102 which recognizes speech from a wearable microphone 300, a sensing unit 103 which measures environmental conditions, a weight decision unit 104 which calculates the weights for recognition results of the air microphone speech recognition unit 101 and the wearable microphone speech recognition unit 102 on the basis of the environmental conditions, and a combination unit 105 which combines the recognition results outputted from the air microphone speech recognition unit 101 and the wearable microphone speech recognition unit 102, using the weights.
    Type: Grant
    Filed: February 14, 2018
    Date of Patent: March 7, 2023
    Assignee: NEC CORPORATION
    Inventors: Qiongqiong Wang, Takafumi Koshinaka
  • Patent number: 11580967
    Abstract: A speech feature extraction apparatus 100 includes a voice activity detection unit 103 that drops non-voice frames from frames corresponding to an input speech utterance, and calculates a posterior of being voiced for each frame, a voice activity detection process unit 106 calculates a function value as weights in pooling frames to produce an utterance-level feature, from a given a voice activity detection posterior, and an utterance-level feature extraction unit 112 that extracts an utterance-level feature, from the frame on a basis of multiple frame-level features, using the function values.
    Type: Grant
    Filed: June 29, 2018
    Date of Patent: February 14, 2023
    Assignee: NEC CORPORATION
    Inventors: Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Takafumi Koshinaka
  • Publication number: 20220358934
    Abstract: A spoofing detection apparatus 100 includes a multi-channel spectrogram creation unit 10 and an evaluation unit 40. The multi-channel spectrogram creation unit 10 extracts different type of spectrograms from speech data and integrates the different type of spectrograms to create a multi-channel spectrogram. The evaluation unit 40 evaluates the created multi-channel spectrogram by applying the created multi-channel spectrogram to a classifier constructed using labeled multi-channel spectrograms as training data and classifies it to either genuine or spoof.
    Type: Application
    Filed: June 28, 2019
    Publication date: November 10, 2022
    Applicant: NEC Corporation
    Inventors: Qiongqiong WANG, Kong Aik LEE, Takafumi KOSHINAKA
  • Publication number: 20220335950
    Abstract: A spoofing detection apparatus 100 includes a multi-channel spectrogram creation unit 10 and an evaluation unit 40. The multi-channel spectrogram creation unit 10 extracts different type of spectrograms from speech data and integrates the different type of spectrograms to create a multi-channel spectrogram. The evaluation unit 40 evaluates the created multi-channel spectrogram by applying the created multi-channel spectrogram to a classifier constructed using labeled multi-channel spectrograms as training data and classifies it to either genuine or spoof.
    Type: Application
    Filed: October 18, 2019
    Publication date: October 20, 2022
    Applicant: NEC Corporation
    Inventors: Qiongqiong WANG, Takafumi KOSHINAKA, Kong Aik LEE
  • Patent number: 11403545
    Abstract: A pattern recognition apparatus for discriminative training includes: a similarity calculator that calculates similarities among training data; a statistics calculator that calculates statistics from the similarities in accordance with current labels for the training data; and a discriminative probabilistic linear discriminant analysis (PLDA) trainer that receives the training data, the statistics of the training data, the current labels and PLDA parameters, and updates the PLDA parameters and the labels of the training data.
    Type: Grant
    Filed: March 9, 2017
    Date of Patent: August 2, 2022
    Assignee: NEC CORPORATION
    Inventors: Qiongqiong Wang, Takafumi Koshinaka
  • Publication number: 20220130397
    Abstract: A speaker recognition system includes a non-transitory computer readable medium configured to store instructions. The speaker recognition system further includes a processor connected to the non-transitory computer readable medium. The processor is configured to execute the instructions for extracting acoustic features from each frame of a plurality of frames in input speech data. The processor is configured to execute the instructions for calculating a saliency value for each frame of the plurality of frames using a first neural network (NN) based on the extracted acoustic features, wherein the first NN is a trained NN using speaker posteriors. The processor is configured to execute the instructions for extracting a speaker feature using the saliency value for each frame of the plurality of frames.
    Type: Application
    Filed: February 5, 2020
    Publication date: April 28, 2022
    Inventors: Qiongqiong WANG, Koji OKABE, Takafumi KOSHINAKA
  • Publication number: 20210390158
    Abstract: A covariance matrix computation unit 81 computes a pseudo-in-domain covariance matrix from one or both of a within class covariance matrix and a between class covariance matrix of an out-of-domain Probabilistic Linear Discriminant Analysis (PLDA) model. A simultaneous diagonalization unit 82 computes a generalized eigenvalue and an eigenvector for a pseudo-in-domain covariance matrix and the class covariance matrix of the out-of-domain PLDA model on the basis of simultaneous diagonalization. An adaptation unit 83 computes one or both of a within class covariance matrix and a between class covariance matrix of an in-domain PLDA model using the generalized eigenvalues and eigenvectors. The covariance matrix computation unit 81 computes the pseudo-in-domain covariance matrix based on the out-of-domain PLDA model and a covariance matrix of in-domain data.
    Type: Application
    Filed: March 28, 2019
    Publication date: December 16, 2021
    Applicant: NEC Corporation
    Inventors: Kong Aik LEE, Qiongqiong WANG, Takafumi KOSHINAKA
  • Publication number: 20210256970
    Abstract: A speech feature extraction apparatus 100 includes a voice activity detection unit 103 that drops non-voice frames from frames corresponding to an input speech utterance, and calculates a posterior of being voiced for each frame, a voice activity detection process unit 106 calculates a function value as weights in pooling frames to produce an utterance-level feature, from a given a voice activity detection posterior, and an utterance-level feature extraction unit 112 that extracts an utterance-level feature, from the frame on a basis of multiple frame-level features, using the function values.
    Type: Application
    Filed: June 29, 2018
    Publication date: August 19, 2021
    Applicant: NEC Corporation
    Inventors: Qiongqiong WANG, Koji OKABE, Kong Aik LEE, Takafumi KOSHINAKA
  • Publication number: 20210027778
    Abstract: The speech processing apparatus 100 includes an air microphone speech recognition unit 101 which recognizes speech from an air microphone 200 acquiring speech through air, a wearable microphone speech recognition unit 102 which recognizes speech from a wearable microphone 300, a sensing unit 103 which measures environmental conditions, a weight decision unit 104 which calculates the weights for recognition results of the air microphone speech recognition unit 101 and the wearable microphone speech recognition unit 102 on the basis of the environmental conditions, and a combination unit 105 which combines the recognition results outputted from the air microphone speech recognition unit 101 and the wearable microphone speech recognition unit 102, using the weights.
    Type: Application
    Filed: February 14, 2018
    Publication date: January 28, 2021
    Applicant: NEC Corporation
    Inventors: Qiongqiong WANG, Takafumi KOSHINAKA
  • Patent number: 10803875
    Abstract: A speaker recognition system includes a non-transitory computer readable medium configured to store instructions. The speaker recognition system further includes a processor connected to the non-transitory computer readable medium. The processor is configured to execute the instructions for extracting acoustic features from each frame of a plurality of frames in input speech data. The processor is configured to execute the instructions for calculating a saliency value for each frame of the plurality of frames using a first neural network (NN) based on the extracted acoustic features, wherein the first NN is a trained NN using speaker posteriors. The processor is configured to execute the instructions for extracting a speaker feature using the saliency value for each frame of the plurality of frames.
    Type: Grant
    Filed: February 8, 2019
    Date of Patent: October 13, 2020
    Assignee: NEC CORPORATION
    Inventors: Qiongqiong Wang, Koji Okabe, Takafumi Koshinaka
  • Publication number: 20200258527
    Abstract: A speaker recognition system includes a non-transitory computer readable medium configured to store instructions. The speaker recognition system further includes a processor connected to the non-transitory computer readable medium. The processor is configured to execute the instructions for extracting acoustic features from each frame of a plurality of frames in input speech data. The processor is configured to execute the instructions for calculating a saliency value for each frame of the plurality of frames using a first neural network (NN) based on the extracted acoustic features, wherein the first NN is a trained NN using speaker posteriors. The processor is configured to execute the instructions for extracting a speaker feature using the saliency value for each frame of the plurality of frames.
    Type: Application
    Filed: February 8, 2019
    Publication date: August 13, 2020
    Inventors: Qiongqiong WANG, Koji OKABE, Takafumi KOSHINAKA
  • Publication number: 20200211567
    Abstract: Provided is a pattern recognition apparatus to provide classification robustness to any kind of domain variability. The pattern recognition apparatus 500 based on Neural Network (NN) includes: NN training unit 501 that trains an NN model to generate NN parameters, based on at least one first feature vector and at least one domain vector indicating one of subsets in a specific domain, wherein, the first feature vector is extracted from each of the subsets, the domain vector indicates an identifier corresponding to the each of the subsets; and NN verification unit 502 that verifies a pair of second feature vectors in the specific domain to output whether the pair indicates same individual or not, based on a target domain vector and the NN parameters.
    Type: Application
    Filed: September 15, 2017
    Publication date: July 2, 2020
    Applicant: NEC Corporation
    Inventors: Qiongqiong WANG, Takafumi KOSHINAKA
  • Patent number: 10614343
    Abstract: The A pattern recognition apparatus using domain adaptation 10 comprises an estimation unit 11. The estimation unit 11 estimates PLDA (Probabilistic Linear Discriminant Analysis) parameters and transformation parameters from features of a first domain data and a second domain data so as to maximize/minimize an objective function with respect to the features.
    Type: Grant
    Filed: September 16, 2015
    Date of Patent: April 7, 2020
    Assignee: NEC CORPORATION
    Inventors: Qiongqiong Wang, Takafumi Koshinaka
  • Publication number: 20190347565
    Abstract: A pattern recognition apparatus for discriminative training includes: a similarity calculator that calculates similarities among training data; a statistics calculator that calculates statistics from the similarities in accordance with current labels for the training data; and a discriminative probabilistic linear discriminant analysis (PLDA) trainer that receives the training data, the statistics of the training data, the current labels and PLDA parameters, and updates the PLDA parameters and the labels of the training data.
    Type: Application
    Filed: March 9, 2017
    Publication date: November 14, 2019
    Applicant: NEC Corporation
    Inventors: Qiongqiong WANG, Takafumi KOSHINAKA
  • Publication number: 20180253628
    Abstract: The A pattern recognition apparatus using domain adaptation 10 comprises an estimation unit 11. The estimation unit 11 estimates PLDA (Probabilistic Linear Discriminant Analysis) parameters and transformation parameters from features of a first domain data and a second domain data so as to maximize/minimize an objective function with respect to the features.
    Type: Application
    Filed: September 16, 2015
    Publication date: September 6, 2018
    Applicant: NEC Corporation
    Inventors: Qiongqiong WANG, Takafumi KOSHINAKA