Patents by Inventor Mustafa Arslan

Mustafa Arslan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Codebook-less speech conversion method and system

Publication number: 20070213987

Abstract: The conversion of speech can be used to transform an utterance by a source speaker to match the speech characteristic of a target speaker, for applications such as dubbing a motion picture. During a training phase, utterances corresponding to the same sentences by both the target speaker and source speaker are force aligned according to the phonemes within the sentences. A transformation or mapping is trained so that each frame of the source utterances is mapped to a corresponding frame of the target utterance. After the completion of the training phase, a source utterance is divided into frames, which are transformed into target frames. After all target frames are created from the sequence of frames from the source utterance, a target utterance is created having the speech of the source speaker, but with the vocal characteristics of the target speaker.

Type: Application

Filed: March 8, 2006

Publication date: September 13, 2007

Applicant: Voxonic, Inc.

Inventors: Oytun Turk, Levent Mustafa Arslan, Fred Deutsch
Voice conversion system and methodology

Patent number: 6615174

Abstract: A voice conversion system employs a codebook mapping approach to transforming a source voice to sound like a target voice. Each speech frame is represented by a weighted average of codebook entries. The weights represent a perceptual distance of the speech frame and may be refined by a gradient descent analysis. The vocal tract characteristics, represented by a line spectral frequency vector, the excitation characteristics, represented by a linear predictive coding residual, the duration, and the amplitude of the speech frame are transformed in the same weighted-average framework.

Type: Grant

Filed: February 22, 2000

Date of Patent: September 2, 2003

Assignee: Microsoft Corporation

Inventors: Levent Mustafa Arslan, David Thieme Talkin
Face synthesis system and methodology

Patent number: 6449595

Abstract: A system and method for synthesizing a facial image, compares a speech frame from an incoming speech signal with acoustic features stored within visually similar entries in an audio-visual codebook to produce a set of weights. The audio-visual codebook also stores visual features corresponding to the acoustic features. A composite visual feature is generated as a weighted sum of the corresponding visual features, from which the facial image is synthesized. The audio-visual codebook may include multiple samples of the acoustic and visual features for each entry, which corresponds to a sequence of one or more phonemes.

Type: Grant

Filed: March 11, 1999

Date of Patent: September 10, 2002

Assignee: Microsoft Corporation

Inventors: Levent Mustafa Arslan, David Thieme Talkin

prev 1 2 3 4

Codebook-less speech conversion method and system

Voice conversion system and methodology

Face synthesis system and methodology