Patents Examined by Talivaldis Ivars Smith
  • Patent number: 8224648
    Abstract: A hybrid approach is described for combining frequency warping and Gaussian Mixture Modeling (GMM) to achieve better speaker identity and speech quality. To train the voice conversion GMM model, line spectral frequency and other features are extracted from a set of source sounds to generate a source feature vector and from a set of target sounds to generate a target feature vector. The GMM model is estimated based on the aligned source feature vector and the target feature vector. A mixture-specific warping function is generated for each set of mixture mean pairs of the GMM model, and a warping function is generated based on a weighting of each of the mixture-specific warping functions. The warping function can be used to convert sounds received from a source speaker to approximate speech of a target speaker.
    Type: Grant
    Filed: December 28, 2007
    Date of Patent: July 17, 2012
    Assignee: Nokia Corporation
    Inventors: Jilei Tian, Victor Popa, Jani Kristian Nurminen
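
The weighting step described in the abstract of patent 8224648 can be illustrated with a short sketch. This is not the patented implementation. It assumes that each mixture-specific warping function is a piecewise-linear map through the paired source and target mixture means (taken here as sorted line spectral frequencies in radians), and that the per-mixture weights (for example, GMM posterior probabilities for the current frame) are already available. The names `mixture_warp`, `weighted_warp`, and `nyquist` are hypothetical.

```python
import numpy as np


def mixture_warp(freqs, src_mean, tgt_mean, nyquist=np.pi):
    """Warp `freqs` with one mixture's function.

    Assumption: the warp is piecewise linear through the anchor points
    (src_mean[i], tgt_mean[i]), pinned at (0, 0) and (nyquist, nyquist).
    `src_mean` must be strictly increasing and lie inside (0, nyquist).
    """
    xp = np.concatenate(([0.0], src_mean, [nyquist]))
    fp = np.concatenate(([0.0], tgt_mean, [nyquist]))
    return np.interp(freqs, xp, fp)


def weighted_warp(freqs, src_means, tgt_means, weights, nyquist=np.pi):
    """Combine the mixture-specific warps as a weighted sum.

    `weights` holds one non-negative value per mixture; they are
    normalized here so that they sum to 1.
    """
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()
    warps = np.stack([
        mixture_warp(freqs, s, t, nyquist)
        for s, t in zip(src_means, tgt_means)
    ])
    # Shape (M,) @ (M, F) -> (F,): one warped value per input frequency.
    return weights @ warps
```

With two mixtures, passing weights of `[1.0, 0.0]` reduces the result to the first mixture's warp alone, while equal weights give the average of the two warped frequency axes.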