Patents by Inventor Ilya Vladimirovich EDRENKIN

Ilya Vladimirovich EDRENKIN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9916825
    Abstract: Methods and systems are disclosed for text-to-speech synthesis that outputs synthetic speech having a selected speech attribute. First, an acoustic space model is trained on a set of training data of speech attributes, using a deep neural network (DNN) to determine interdependency factors between the speech attributes in the training data. The DNN generates a single, continuous acoustic space model based on the interdependency factors; the acoustic space model thereby takes into account a plurality of interdependent speech attributes and allows for modelling of a continuous spectrum of the interdependent speech attributes. Next, a text is received; a selection of one or more speech attributes is received, each speech attribute having a selected attribute weight; the text is converted into synthetic speech using the acoustic space model, the synthetic speech having the selected speech attribute; and the synthetic speech is outputted as audio having the selected speech attribute.
    Type: Grant
    Filed: September 13, 2016
    Date of Patent: March 13, 2018
    Assignee: YANDEX EUROPE AG
    Inventor: Ilya Vladimirovich Edrenkin
  • Publication number: 20170092258
    Abstract: Methods and systems are disclosed for text-to-speech synthesis that outputs synthetic speech having a selected speech attribute. First, an acoustic space model is trained on a set of training data of speech attributes, using a deep neural network (DNN) to determine interdependency factors between the speech attributes in the training data. The DNN generates a single, continuous acoustic space model based on the interdependency factors; the acoustic space model thereby takes into account a plurality of interdependent speech attributes and allows for modelling of a continuous spectrum of the interdependent speech attributes. Next, a text is received; a selection of one or more speech attributes is received, each speech attribute having a selected attribute weight; the text is converted into synthetic speech using the acoustic space model, the synthetic speech having the selected speech attribute; and the synthetic speech is outputted as audio having the selected speech attribute.
    Type: Application
    Filed: September 13, 2016
    Publication date: March 30, 2017
    Inventor: Ilya Vladimirovich EDRENKIN
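
The abstracts above describe selecting one or more speech attributes, each with an attribute weight, and mapping that selection into a single continuous acoustic space. The sketch below illustrates only that weighting step in minimal form; the function name, the toy attribute embeddings, and the vector-blend formulation are illustrative assumptions, not taken from the patent claims, and a real system would feed the resulting point into a trained DNN-based synthesizer.

```python
# Hypothetical sketch: blend per-attribute embedding vectors by their
# selected weights into one point in a continuous acoustic space.
# All names and values here are illustrative, not from the patent.

def combine_attributes(attribute_vectors, weights):
    """Return the weighted sum of the selected attribute vectors.

    attribute_vectors: dict mapping attribute name -> embedding (list of floats)
    weights: dict mapping a selected attribute name -> its attribute weight
    """
    dim = len(next(iter(attribute_vectors.values())))
    point = [0.0] * dim
    for name, weight in weights.items():
        vec = attribute_vectors[name]
        for i in range(dim):
            point[i] += weight * vec[i]
    return point

# Toy embeddings for two interdependent speech attributes.
embeddings = {
    "speaker_female": [1.0, 0.0, 0.2],
    "emotion_happy":  [0.0, 1.0, 0.5],
}

# A selection of attributes, each with a selected attribute weight,
# mirroring the selection step recited in the abstract.
selection = {"speaker_female": 1.0, "emotion_happy": 0.5}

point = combine_attributes(embeddings, selection)
print(point)
```

Because the space is continuous, intermediate weights (e.g. 0.5 for an emotion attribute) interpolate between attribute settings rather than switching between discrete voices, which is the property the abstract emphasizes.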