Patents by Inventor Leyuan Sheng

Leyuan Sheng has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Computer-implemented method for speech synthesis, computer device, and non-transitory computer readable storage medium

Patent number: 11763796

Abstract: A computer-implemented method for speech synthesis, a computer device, and a non-transitory computer readable storage medium are provided. The method includes: obtaining a speech text to be synthesized; obtaining a Mel spectrum corresponding to the speech text to be synthesized according to the speech text to be synthesized; inputting the Mel spectrum into a complex neural network, and obtaining a complex spectrum corresponding to the speech text to be synthesized, wherein the complex spectrum comprises real component information and imaginary component information; and obtaining a synthetic speech corresponding to the speech text to be synthesized, according to the complex spectrum. The method can efficiently and simply complete speech synthesis.

Type: Grant

Filed: December 10, 2020

Date of Patent: September 19, 2023

Assignee: UBTECH ROBOTICS CORP LTD

Inventors: Dongyan Huang, Leyuan Sheng, Youjun Xiong
Speech synthesis method and apparatus and computer readable storage medium using the same

Patent number: 11417316

Abstract: The present disclosure provides a speech synthesis method as well as an apparatus and a computer readable storage medium using the same. The method includes: obtaining a to-be-synthesized text, and extracting to-be-processed Mel spectrum features of the to-be-synthesized text through a preset speech feature extraction algorithm; inputting the to-be-processed Mel spectrum features into a preset ResUnet network model to obtain first intermediate features; performing an average pooling and a first down sampling on the to-be-processed Mel spectrum features to obtain second intermediate features; taking the second intermediate features and the first intermediate features output by the ResUnet network model as an input to perform a deconvolution and a first up sampling so as to obtain target Mel spectrum features corresponding to the to-be-processed Mel spectrum features; and converting the target Mel spectrum features into a target speech corresponding to the to-be-synthesized text.

Type: Grant

Filed: December 8, 2020

Date of Patent: August 16, 2022

Assignee: UBTECH ROBOTICS CORP LTD

Inventors: Dongyan Huang, Leyuan Sheng, Youjun Xiong
COMPUTER-IMPLEMENTED METHOD FOR SPEECH SYNTHESIS, COMPUTER DEVICE, AND NON-TRANSITORY COMPUTER READABLE STORAGE MEDIUM

Publication number: 20220189454

Abstract: A computer-implemented method for speech synthesis, a computer device, and a non-transitory computer readable storage medium are provided. The method includes: obtaining a speech text to be synthesized; obtaining a Mel spectrum corresponding to the speech text to be synthesized according to the speech text to be synthesized; inputting the Mel spectrum into a complex neural network, and obtaining a complex spectrum corresponding to the speech text to be synthesized, wherein the complex spectrum comprises real component information and imaginary component information; and obtaining a synthetic speech corresponding to the speech text to be synthesized, according to the complex spectrum. The method can efficiently and simply complete speech synthesis.

Type: Application

Filed: December 10, 2020

Publication date: June 16, 2022

Inventors: Dongyan Huang, Leyuan Sheng, Youjun Xiong
SPEECH SYNTHESIS METHOD AND APPARATUS AND COMPUTER READABLE STORAGE MEDIUM USING THE SAME

Publication number: 20210193113

Abstract: The present disclosure provides a speech synthesis method as well as an apparatus and a computer readable storage medium using the same. The method includes: obtaining a to-be-synthesized text, and extracting to-be-processed Mel spectrum features of the to-be-synthesized text through a preset speech feature extraction algorithm; inputting the to-be-processed Mel spectrum features into a preset ResUnet network model to obtain first intermediate features; performing an average pooling and a first down sampling on the to-be-processed Mel spectrum features to obtain second intermediate features; taking the second intermediate features and the first intermediate features output by the ResUnet network model as an input to perform a deconvolution and a first up sampling so as to obtain target Mel spectrum features corresponding to the to-be-processed Mel spectrum features; and converting the target Mel spectrum features into a target speech corresponding to the to-be-synthesized text.

Type: Application

Filed: December 8, 2020

Publication date: June 24, 2021

Inventors: Dongyan Huang, Leyuan Sheng, Youjun Xiong

Computer-implemented method for speech synthesis, computer device, and non-transitory computer readable storage medium

Speech synthesis method and apparatus and computer readable storage medium using the same

COMPUTER-IMPLEMENTED METHOD FOR SPEECH SYNTHESIS, COMPUTER DEVICE, AND NON-TRANSITORY COMPUTER READABLE STORAGE MEDIUM

SPEECH SYNTHESIS METHOD AND APPARATUS AND COMPUTER READABLE STORAGE MEDIUM USING THE SAME