Patents by Inventor Xinkang XU

Xinkang XU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

END-TO-END NATURAL AND CONTROLLABLE EMOTIONAL SPEECH SYNTHESIS METHODS

Publication number: 20240005905

Abstract: The present disclosure provides acoustic model training methods and systems, and speech synthesis methods and systems. An acoustic model training method may include obtaining a plurality of training samples. Each of the plurality of training samples may include a sample text input, a sample emotion label corresponding to the sample text input, and a sample reference mel spectrum corresponding to the sample text input. The acoustic model training method may include inputting the plurality of training samples into a target model. The target model may include the acoustic model and an auxiliary module. The acoustic model training method may further include iteratively adjusting at least one model parameter of the acoustic model based on a loss target.

Type: Application

Filed: June 27, 2023

Publication date: January 4, 2024

Applicant: HANGZHOU TONGHUASHUN DATA PROCESSING CO., LTD.

Inventors: Ming CHEN, Xinkang XU, Xinhui HU, Xudong ZHAO

SYSTEMS AND METHODS FOR SYNTHESIZING SPEECH

Publication number: 20230419948

Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.

Type: Application

Filed: September 11, 2023

Publication date: December 28, 2023

Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.

Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU

Systems and methods for synthesizing speech

Patent number: 11798527

Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.

Type: Grant

Filed: August 18, 2021

Date of Patent: October 24, 2023

Assignee: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.

Inventors: Peng Zhang, Xinhui Hu, Xinkang Xu, Jian Lu

SYSTEMS AND METHODS FOR SPEECH RECOGNITION

Publication number: 20230115271

Abstract: A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.

Type: Application

Filed: April 23, 2022

Publication date: April 13, 2023

Applicant: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.

Inventors: Jinlong WANG, Xinkang XU, Xinhui HU, Ming CHEN

SYSTEMS AND METHODS FOR SYNTHESIZING SPEECH

Publication number: 20220059072

Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.

Type: Application

Filed: August 18, 2021

Publication date: February 24, 2022

Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.

Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU
