Patents by Inventor Zhizheng WU

Zhizheng WU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11881205
    Abstract: The present disclosure relates to a speech synthesis method and device, and a computer-readable storage medium, and relates to the field of computer technology. The method of the present disclosure includes: dividing a text into a plurality of segments according to a language category to which each of the segments belongs; converting each of the segments into a phoneme corresponding to the segment to generate a phoneme sequence of the text according to the language category to which each of the segments belongs; inputting the phoneme sequence into a speech synthesis model trained in advance and converting the phoneme sequence into a vocoder characteristic parameter; and inputting the vocoder characteristic parameter into a vocoder to generate a speech.
    Type: Grant
    Filed: March 30, 2020
    Date of Patent: January 23, 2024
    Assignees: BEIJING JINGDONG SHANGKE INFORMATION TECHNOLOGY CO, LTD., BEIJING JINGDONG CENTURY TRADING CO., LTD.
    Inventors: Zhizheng Wu, Zhengchen Zhang, Wei Song, Yonghui Rao, Zhihang Xie, Guanghui Xu, Shuyong Liu, Bosen Ma, Shuangwen Qiu, Junmin Lin
  • Publication number: 20220406290
    Abstract: Embodiments of the present application provide a text information processing method and apparatus, the method includes: acquiring a phoneme vector corresponding to an individual phoneme and a semantic vector corresponding to the individual phoneme in text information; acquiring first semantic information output at a last moment, wherein the first semantic information is semantic information corresponding to part of the text information in the text information, and the part of the text information is text information that has been converted into voice information; determining a context vector corresponding to a current moment according to the first semantic information, the phoneme vector corresponding to the individual phoneme and the semantic vector corresponding to the individual phoneme; and determining voice information at the current moment according to the context vector and the first semantic information.
    Type: Application
    Filed: January 15, 2021
    Publication date: December 22, 2022
    Inventors: Liumeng XUE, Wei SONG, Zhizheng WU
  • Publication number: 20220270587
    Abstract: Disclosed are a speech synthesis method and apparatus, and a storage medium.
    Type: Application
    Filed: March 18, 2020
    Publication date: August 25, 2022
    Inventors: Zhizheng WU, Wei SONG
  • Publication number: 20220165249
    Abstract: The present disclosure relates to a speech synthesis method and device, and a computer-readable storage medium, and relates to the field of computer technology. The method of the present disclosure includes: dividing a text into a plurality of segments according to a language category to which each of the segments belongs; converting each of the segments into a phoneme corresponding to the segment to generate a phoneme sequence of the text according to the language category to which each of the segments belongs; inputting the phoneme sequence into a speech synthesis model trained in advance and converting the phoneme sequence into a vocoder characteristic parameter; and inputting the vocoder characteristic parameter into a vocoder to generate a speech.
    Type: Application
    Filed: March 30, 2020
    Publication date: May 26, 2022
    Inventors: Zhizheng WU, Zhengchen ZHANG, Wei SONG, Yonghui RAO, Zhihang XIE, Guanghui XU, Shuyong LIU, Bosen MA, Shuangwen QIU, Junmin LIN