Patents by Inventor Zhizheng WU

Zhizheng WU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speech synthesis method, device and computer readable storage medium

Patent number: 11881205

Abstract: The present disclosure relates to a speech synthesis method and device, and a computer-readable storage medium, and relates to the field of computer technology. The method of the present disclosure includes: dividing a text into a plurality of segments according to a language category to which each of the segments belongs; converting each of the segments into a phoneme corresponding to the segment to generate a phoneme sequence of the text according to the language category to which each of the segments belongs; inputting the phoneme sequence into a speech synthesis model trained in advance and converting the phoneme sequence into a vocoder characteristic parameter; and inputting the vocoder characteristic parameter into a vocoder to generate a speech.

Type: Grant

Filed: March 30, 2020

Date of Patent: January 23, 2024

Assignees: BEIJING JINGDONG SHANGKE INFORMATION TECHNOLOGY CO, LTD., BEIJING JINGDONG CENTURY TRADING CO., LTD.

Inventors: Zhizheng Wu, Zhengchen Zhang, Wei Song, Yonghui Rao, Zhihang Xie, Guanghui Xu, Shuyong Liu, Bosen Ma, Shuangwen Qiu, Junmin Lin
TEXT INFORMATION PROCESSING METHOD AND APPARATUS

Publication number: 20220406290

Abstract: Embodiments of the present application provide a text information processing method and apparatus, the method includes: acquiring a phoneme vector corresponding to an individual phoneme and a semantic vector corresponding to the individual phoneme in text information; acquiring first semantic information output at a last moment, wherein the first semantic information is semantic information corresponding to part of the text information in the text information, and the part of the text information is text information that has been converted into voice information; determining a context vector corresponding to a current moment according to the first semantic information, the phoneme vector corresponding to the individual phoneme and the semantic vector corresponding to the individual phoneme; and determining voice information at the current moment according to the context vector and the first semantic information.

Type: Application

Filed: January 15, 2021

Publication date: December 22, 2022

Inventors: Liumeng XUE, Wei SONG, Zhizheng WU
SPEECH SYNTHESIS METHOD AND APPARATUS, AND STORAGE MEDIUM

Publication number: 20220270587

Abstract: Disclosed are a speech synthesis method and apparatus, and a storage medium.

Type: Application

Filed: March 18, 2020

Publication date: August 25, 2022

Inventors: Zhizheng WU, Wei SONG
SPEECH SYNTHESIS METHOD, DEVICE AND COMPUTER READABLE STORAGE MEDIUM

Publication number: 20220165249

Abstract: The present disclosure relates to a speech synthesis method and device, and a computer-readable storage medium, and relates to the field of computer technology. The method of the present disclosure includes: dividing a text into a plurality of segments according to a language category to which each of the segments belongs; converting each of the segments into a phoneme corresponding to the segment to generate a phoneme sequence of the text according to the language category to which each of the segments belongs; inputting the phoneme sequence into a speech synthesis model trained in advance and converting the phoneme sequence into a vocoder characteristic parameter; and inputting the vocoder characteristic parameter into a vocoder to generate a speech.

Type: Application

Filed: March 30, 2020

Publication date: May 26, 2022

Inventors: Zhizheng WU, Zhengchen ZHANG, Wei SONG, Yonghui RAO, Zhihang XIE, Guanghui XU, Shuyong LIU, Bosen MA, Shuangwen QIU, Junmin LIN

Speech synthesis method, device and computer readable storage medium

TEXT INFORMATION PROCESSING METHOD AND APPARATUS

SPEECH SYNTHESIS METHOD AND APPARATUS, AND STORAGE MEDIUM

SPEECH SYNTHESIS METHOD, DEVICE AND COMPUTER READABLE STORAGE MEDIUM