Score Normalization (epo) Patents (Class 704/E17.01)
  • Patent number: 11929058
    Abstract: Novel methods and systems for adapting a voice cloning synthesizer for a new speaker using real speech data are disclosed. Utterances from one or more target speakers are parameterized and are used to initialize an embedding vector for use with a voice synthesizer, by means of clustering the utterance data and determining the centroid of the data, using a speaker identification neural network, and/or by finding the closest stored embedded vector to the utterance data.
    Type: Grant
    Filed: August 18, 2020
    Date of Patent: March 12, 2024
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Cong Zhou, Xiaoyu Liu, Michael Getty Horgan, Vivek Kumar
  • Publication number: 20080235007
    Abstract: A method and system for speaker recognition and identification includes transforming features of a speaker utterance in a first condition state to match a second condition state and provide a transformed utterance. A discriminative criterion is used to generate a transform that maps an utterance to obtain a computed result. The discriminative criterion is maximized over a plurality of speakers to obtain a best transform for recognizing speech and/or identifying a speaker under the second condition state. Speech recognition and speaker identity may be determined by employing the best transform for decoding speech to reduce channel mismatch.
    Type: Application
    Filed: June 3, 2008
    Publication date: September 25, 2008
    Inventors: Jiri Navratil, Jagon Pelecanos, Ganesh N. Ramaswamy