Patents by Inventor Mustafa Arslan

Mustafa Arslan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20070213987
    Abstract: The conversion of speech can be used to transform an utterance by a source speaker to match the speech characteristic of a target speaker, for applications such as dubbing a motion picture. During a training phase, utterances corresponding to the same sentences by both the target speaker and source speaker are force aligned according to the phonemes within the sentences. A transformation or mapping is trained so that each frame of the source utterances is mapped to a corresponding frame of the target utterance. After the completion of the training phase, a source utterance is divided into frames, which are transformed into target frames. After all target frames are created from the sequence of frames from the source utterance, a target utterance is created having the speech of the source speaker, but with the vocal characteristics of the target speaker.
    Type: Application
    Filed: March 8, 2006
    Publication date: September 13, 2007
    Applicant: Voxonic, Inc.
    Inventors: Oytun Turk, Levent Mustafa Arslan, Fred Deutsch
  • Patent number: 6615174
    Abstract: A voice conversion system employs a codebook mapping approach to transforming a source voice to sound like a target voice. Each speech frame is represented by a weighted average of codebook entries. The weights represent a perceptual distance of the speech frame and may be refined by a gradient descent analysis. The vocal tract characteristics, represented by a line spectral frequency vector, the excitation characteristics, represented by a linear predictive coding residual, the duration, and the amplitude of the speech frame are transformed in the same weighted-average framework.
    Type: Grant
    Filed: February 22, 2000
    Date of Patent: September 2, 2003
    Assignee: Microsoft Corporation
    Inventors: Levent Mustafa Arslan, David Thieme Talkin
  • Patent number: 6449595
    Abstract: A system and method for synthesizing a facial image, compares a speech frame from an incoming speech signal with acoustic features stored within visually similar entries in an audio-visual codebook to produce a set of weights. The audio-visual codebook also stores visual features corresponding to the acoustic features. A composite visual feature is generated as a weighted sum of the corresponding visual features, from which the facial image is synthesized. The audio-visual codebook may include multiple samples of the acoustic and visual features for each entry, which corresponds to a sequence of one or more phonemes.
    Type: Grant
    Filed: March 11, 1999
    Date of Patent: September 10, 2002
    Assignee: Microsoft Corporation
    Inventors: Levent Mustafa Arslan, David Thieme Talkin