Patents by Inventor Xinhui Hu
Xinhui Hu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250149025Abstract: A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.Type: ApplicationFiled: January 7, 2025Publication date: May 8, 2025Applicant: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.Inventors: Jinlong WANG, Xinkang XU, Xinhui HU
-
Publication number: 20250078806Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.Type: ApplicationFiled: November 18, 2024Publication date: March 6, 2025Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU
-
Patent number: 12223945Abstract: A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.Type: GrantFiled: April 23, 2022Date of Patent: February 11, 2025Assignee: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.Inventors: Jinlong Wang, Xinkang Xu, Xinhui Hu
-
Patent number: 12148415Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.Type: GrantFiled: September 11, 2023Date of Patent: November 19, 2024Assignee: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.Inventors: Peng Zhang, Xinhui Hu, Xinkang Xu, Jian Lu
-
Publication number: 20240005905Abstract: The present disclosure provides acoustic model training methods and systems, and speech synthesis methods and systems. An acoustic model training method may include obtaining a plurality of training samples. Each of the plurality of training samples may include a sample text input, a sample emotion label corresponding to the sample text input, and a sample reference mel spectrum corresponding to the sample text input. The acoustic model training method may include inputting the plurality of training samples into a target model. The target model may include the acoustic model and an auxiliary module. The acoustic model training method may further include iteratively adjusting at least one model parameter of the acoustic model based on a loss target.Type: ApplicationFiled: June 27, 2023Publication date: January 4, 2024Applicant: HANGZHOU TONGHUASHUN DATA PROCESSING CO., LTD.Inventors: Ming CHEN, Xinkang XU, Xinhui HU, Xudong ZHAO
-
Publication number: 20230419948Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.Type: ApplicationFiled: September 11, 2023Publication date: December 28, 2023Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU
-
Patent number: 11798527Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.Type: GrantFiled: August 18, 2021Date of Patent: October 24, 2023Assignee: ZHEJIANG TONGHU ASHUN INTELLIGENT TECHNOLOGY CO., LTD.Inventors: Peng Zhang, Xinhui Hu, Xinkang Xu, Jian Lu
-
Publication number: 20230115271Abstract: A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.Type: ApplicationFiled: April 23, 2022Publication date: April 13, 2023Applicant: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.Inventors: Jinlong WANG, Xinkang XU, Xinhui HU, Ming CHEN
-
Publication number: 20220059072Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.Type: ApplicationFiled: August 18, 2021Publication date: February 24, 2022Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU
-
Patent number: 9992038Abstract: A multi-hop ad hoc communications network may allow optical communications between underwater nodes. Each node may be fitted with environmental sensors. Each node may collect data from the sensors and transmit the data to other nodes in the network according to a time division multiple access (TDMA) scheme. The data may propagate through a series of child and parent nodes to reach a master node. The master node may have a wired connection for power and data transfer.Type: GrantFiled: June 13, 2014Date of Patent: June 5, 2018Assignee: Arizona Board of Regents on Behalf of Arizona State UniversityInventors: Cody Youngbull, David Ganger, Andres Mora, Andrea Richa, Jin Zhang, Chenyang Zhou, Xinhui Hu
-
Publication number: 20160134433Abstract: A multi-hop ad hoc communications network may allow optical communications between underwater nodes. Each node may be fitted with environmental sensors. Each node may collect data from the sensors and transmit the data to other nodes in the network according to a time division multiple access (TDMA) scheme. The data may propagate through a series of child and parent nodes to reach a master node. The master node may have a wired connection for power and data transfer.Type: ApplicationFiled: June 13, 2014Publication date: May 12, 2016Applicant: ARIZONA BOARD OF REGENTS ON BEHALF OF ARIZONA STATE UNIVERSITYInventors: Cody Youngbull, David Ganger, Andres Mora, Andrea Richa, Jin Zhang, Chenyang Zhou, Xinhui Hu