Patents by Inventor Chumin Li

Chumin Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SPEECH SYNTHESIS

Publication number: 20250356841

Abstract: Embodiments of the disclosure relate to speech synthesis. A method provided herein includes: constructing, based on target text and prompt speech content, an input sequence corresponding to a sequence template, wherein the sequence template includes a placeholder, and a sequence segment, in the input sequence, corresponding to the placeholder is: preset content independent of the prompt speech content, or a speech feature representation generated based on the prompt speech content; and processing the input sequence with a target model to generate target speech content corresponding to the target text, wherein the target model is trained with a set of training sequences constructed based on the sequence template, the set of training sequences corresponds to a set of training speech content, and the set of training sequences is constructed by replacing the placeholder with the preset content or a training speech feature representation corresponding to respective training speech content.

Type: Application

Filed: May 13, 2025

Publication date: November 20, 2025

Inventors: Jiawei Chen, Yuanzhe Chen, Dongya Jia, Zhengxi Liu, Jian Cong, Chumin Li, Xin Wang, Lin Liu, Zhuo Chen, Yuping Wang, Yuxuan Wang
JOINT TRAINING

Publication number: 20250356836

Abstract: Embodiments in the disclosure relate to joint training. A method provided herein includes: obtaining a first sequence and a second sequence, wherein the first sequence is generated based on text content and the second sequence is generated based on speech content matching the text content, wherein the first sequence includes a plurality of text tokens and the second sequence includes a plurality of speech tokens; constructing a mixed sequence based on an alignment relationship between the plurality of text tokens and the plurality of speech tokens, the mixed sequence including at least one of the plurality of text tokens and at least one of the plurality of speech tokens; and training a target model with the mixed sequence.

Type: Application

Filed: May 13, 2025

Publication date: November 20, 2025

Inventors: Yuanzhe Chen, Jiawei Chen, Dongya Jia, Chumin Li, Jian Cong, Zhengxi Liu, Zhuo Chen, Yuping Wang, Yuxuan Wang

SPEECH SYNTHESIS

JOINT TRAINING