Patents by Inventor Wei Bin Zhu

Wei Bin Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8595011
    Abstract: The present invention provides a method and apparatus for text to speech conversion, and a method and apparatus for adjusting a corpus. The method for text to speech comprises: text analysis step for parsing the text to obtain descriptive prosody annotations of the text based on a TTS model generated from a first corpus; prosody parameter prediction step for predicting the prosody parameter of the text according to the result of text analysis step; speech synthesis step for synthesizing speech of said text based on said the prosody parameter of the text; wherein descriptive prosody annotations of the text include prosody structure for the text, the prosody structure of the text is adjusted according to a target speech speed for the synthesized speech. The present invention adjusts the prosody structure of the text according to the target speech speed. The synthesized speech will have improved quality.
    Type: Grant
    Filed: July 3, 2008
    Date of Patent: November 26, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Qin Shi, Wei Zhang, Wei Bin Zhu, Hai Xin Chai
  • Patent number: 7617105
    Abstract: A method for text to speech conversion and for adjusting a corpus. The method includes a text analysis step for parsing the text to obtain descriptive prosody annotations of the text based on a TTS model generated from a first corpus, a prosody parameter prediction step for predicting the prosody parameter of the text according to the result of the text analysis step, a speech synthesis step for synthesizing speech of the text based on the prosody parameter of the text, adjusting according to a target speech speed for the synthesized speech when necessary.
    Type: Grant
    Filed: May 27, 2005
    Date of Patent: November 10, 2009
    Assignee: Nuance Communications, Inc.
    Inventors: Qin Shi, Wei Zhang, Wei Bin Zhu, Hai Xin Chai
  • Publication number: 20080270139
    Abstract: The present invention provides a method and apparatus for text to speech conversion, and a method and apparatus for adjusting a corpus. The method for text to speech comprises: text analysis step for parsing the text to obtain descriptive prosody annotations of the text based on a TTS model generated from a first corpus; prosody parameter prediction step for predicting the prosody parameter of the text according to the result of text analysis step; speech synthesis step for synthesizing speech of said text based on said the prosody parameter of the text; wherein descriptive prosody annotations of the text include prosody structure for the text, the prosody structure of the text is adjusted according to a target speech speed for the synthesized speech. The present invention adjusts the prosody structure of the text according to the target speech speed. The synthesized speech will have improved quality.
    Type: Application
    Filed: July 3, 2008
    Publication date: October 30, 2008
    Inventors: Qin Shi, Wei Zhang, Wei Bin Zhu, Hai Xin Chai