Patents by Inventor Xinhui Hu

Xinhui Hu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SYSTEMS AND METHODS FOR MULTIPLE SPEAKER SPEECH RECOGNITION

Publication number: 20250149025

Abstract: A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.

Type: Application

Filed: January 7, 2025

Publication date: May 8, 2025

Applicant: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.

Inventors: Jinlong WANG, Xinkang XU, Xinhui HU
SYSTEMS AND METHODS FOR SYNTHESIZING SPEECH

Publication number: 20250078806

Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.

Type: Application

Filed: November 18, 2024

Publication date: March 6, 2025

Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.

Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU
Systems and methods for multiple speaker speech recognition

Patent number: 12223945

Abstract: A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.

Type: Grant

Filed: April 23, 2022

Date of Patent: February 11, 2025

Assignee: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.

Inventors: Jinlong Wang, Xinkang Xu, Xinhui Hu
Systems and methods for synthesizing speech

Patent number: 12148415

Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.

Type: Grant

Filed: September 11, 2023

Date of Patent: November 19, 2024

Assignee: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.

Inventors: Peng Zhang, Xinhui Hu, Xinkang Xu, Jian Lu
END-TO-END NATURAL AND CONTROLLABLE EMOTIONAL SPEECH SYNTHESIS METHODS

Publication number: 20240005905

Abstract: The present disclosure provides acoustic model training methods and systems, and speech synthesis methods and systems. An acoustic model training method may include obtaining a plurality of training samples. Each of the plurality of training samples may include a sample text input, a sample emotion label corresponding to the sample text input, and a sample reference mel spectrum corresponding to the sample text input. The acoustic model training method may include inputting the plurality of training samples into a target model. The target model may include the acoustic model and an auxiliary module. The acoustic model training method may further include iteratively adjusting at least one model parameter of the acoustic model based on a loss target.

Type: Application

Filed: June 27, 2023

Publication date: January 4, 2024

Applicant: HANGZHOU TONGHUASHUN DATA PROCESSING CO., LTD.

Inventors: Ming CHEN, Xinkang XU, Xinhui HU, Xudong ZHAO
SYSTEMS AND METHODS FOR SYNTHESIZING SPEECH

Publication number: 20230419948

Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.

Type: Application

Filed: September 11, 2023

Publication date: December 28, 2023

Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.

Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU
Systems and methods for synthesizing speech

Patent number: 11798527

Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.

Type: Grant

Filed: August 18, 2021

Date of Patent: October 24, 2023

Assignee: ZHEJIANG TONGHU ASHUN INTELLIGENT TECHNOLOGY CO., LTD.

Inventors: Peng Zhang, Xinhui Hu, Xinkang Xu, Jian Lu
SYSTEMS AND METHODS FOR SPEECH RECOGNITION

Publication number: 20230115271

Abstract: A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.

Type: Application

Filed: April 23, 2022

Publication date: April 13, 2023

Applicant: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.

Inventors: Jinlong WANG, Xinkang XU, Xinhui HU, Ming CHEN
SYSTEMS AND METHODS FOR SYNTHESIZING SPEECH

Publication number: 20220059072

Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.

Type: Application

Filed: August 18, 2021

Publication date: February 24, 2022

Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.

Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU
Underwater multi-hop communications network

Patent number: 9992038

Abstract: A multi-hop ad hoc communications network may allow optical communications between underwater nodes. Each node may be fitted with environmental sensors. Each node may collect data from the sensors and transmit the data to other nodes in the network according to a time division multiple access (TDMA) scheme. The data may propagate through a series of child and parent nodes to reach a master node. The master node may have a wired connection for power and data transfer.

Type: Grant

Filed: June 13, 2014

Date of Patent: June 5, 2018

Assignee: Arizona Board of Regents on Behalf of Arizona State University

Inventors: Cody Youngbull, David Ganger, Andres Mora, Andrea Richa, Jin Zhang, Chenyang Zhou, Xinhui Hu
UNDERWATER MULTI-HOP COMMUNICATIONS NETWORK

Publication number: 20160134433

Abstract: A multi-hop ad hoc communications network may allow optical communications between underwater nodes. Each node may be fitted with environmental sensors. Each node may collect data from the sensors and transmit the data to other nodes in the network according to a time division multiple access (TDMA) scheme. The data may propagate through a series of child and parent nodes to reach a master node. The master node may have a wired connection for power and data transfer.

Type: Application

Filed: June 13, 2014

Publication date: May 12, 2016

Applicant: ARIZONA BOARD OF REGENTS ON BEHALF OF ARIZONA STATE UNIVERSITY

Inventors: Cody Youngbull, David Ganger, Andres Mora, Andrea Richa, Jin Zhang, Chenyang Zhou, Xinhui Hu