Patents by Inventor Xinhui Hu

Xinhui Hu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250149025
    Abstract: A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.
    Type: Application
    Filed: January 7, 2025
    Publication date: May 8, 2025
    Applicant: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.
    Inventors: Jinlong WANG, Xinkang XU, Xinhui HU
  • Publication number: 20250078806
    Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.
    Type: Application
    Filed: November 18, 2024
    Publication date: March 6, 2025
    Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU
  • Patent number: 12223945
    Abstract: A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.
    Type: Grant
    Filed: April 23, 2022
    Date of Patent: February 11, 2025
    Assignee: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.
    Inventors: Jinlong Wang, Xinkang Xu, Xinhui Hu
  • Patent number: 12148415
    Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.
    Type: Grant
    Filed: September 11, 2023
    Date of Patent: November 19, 2024
    Assignee: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Peng Zhang, Xinhui Hu, Xinkang Xu, Jian Lu
  • Publication number: 20240005905
    Abstract: The present disclosure provides acoustic model training methods and systems, and speech synthesis methods and systems. An acoustic model training method may include obtaining a plurality of training samples. Each of the plurality of training samples may include a sample text input, a sample emotion label corresponding to the sample text input, and a sample reference mel spectrum corresponding to the sample text input. The acoustic model training method may include inputting the plurality of training samples into a target model. The target model may include the acoustic model and an auxiliary module. The acoustic model training method may further include iteratively adjusting at least one model parameter of the acoustic model based on a loss target.
    Type: Application
    Filed: June 27, 2023
    Publication date: January 4, 2024
    Applicant: HANGZHOU TONGHUASHUN DATA PROCESSING CO., LTD.
    Inventors: Ming CHEN, Xinkang XU, Xinhui HU, Xudong ZHAO
  • Publication number: 20230419948
    Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.
    Type: Application
    Filed: September 11, 2023
    Publication date: December 28, 2023
    Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU
  • Patent number: 11798527
    Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.
    Type: Grant
    Filed: August 18, 2021
    Date of Patent: October 24, 2023
    Assignee: ZHEJIANG TONGHU ASHUN INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Peng Zhang, Xinhui Hu, Xinkang Xu, Jian Lu
  • Publication number: 20230115271
    Abstract: A speech recognition method is provided. The method may include: obtaining speech data and a speech recognition result of the speech data, the speech data including speech of a plurality of speakers, and the speech recognition result including a plurality of words; determining speaking time of each of the plurality of speakers by processing the speech data; determining, based on the speaking times of the plurality of speakers and the speech recognition result, a corresponding relationship between the plurality of words and the plurality of speakers; determining, based on the corresponding relationship, at least one conversion word from the plurality of words, each of the at least one conversion word corresponding to at least two of the plurality of speakers; and re-determining the corresponding relationship between the plurality of words and the plurality of speakers based on the at least one conversion word.
    Type: Application
    Filed: April 23, 2022
    Publication date: April 13, 2023
    Applicant: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.
    Inventors: Jinlong WANG, Xinkang XU, Xinhui HU, Ming CHEN
  • Publication number: 20220059072
    Abstract: The present disclosure discloses a method for synthesizing a speech. The method includes generating the speech based on a text with a speech synthesis model, wherein the speech synthesis model includes an embedding layer, a speech synthesis layer, and a position layer; and training the speech synthesis model when an evaluation index meets a preset condition, wherein the evaluation index includes one or more quality indexes determined based on at least a part of the text and at least a part of the speech.
    Type: Application
    Filed: August 18, 2021
    Publication date: February 24, 2022
    Applicant: ZHEJIANG TONGHUASHUN INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Peng ZHANG, Xinhui HU, Xinkang XU, Jian LU
  • Patent number: 9992038
    Abstract: A multi-hop ad hoc communications network may allow optical communications between underwater nodes. Each node may be fitted with environmental sensors. Each node may collect data from the sensors and transmit the data to other nodes in the network according to a time division multiple access (TDMA) scheme. The data may propagate through a series of child and parent nodes to reach a master node. The master node may have a wired connection for power and data transfer.
    Type: Grant
    Filed: June 13, 2014
    Date of Patent: June 5, 2018
    Assignee: Arizona Board of Regents on Behalf of Arizona State University
    Inventors: Cody Youngbull, David Ganger, Andres Mora, Andrea Richa, Jin Zhang, Chenyang Zhou, Xinhui Hu
  • Publication number: 20160134433
    Abstract: A multi-hop ad hoc communications network may allow optical communications between underwater nodes. Each node may be fitted with environmental sensors. Each node may collect data from the sensors and transmit the data to other nodes in the network according to a time division multiple access (TDMA) scheme. The data may propagate through a series of child and parent nodes to reach a master node. The master node may have a wired connection for power and data transfer.
    Type: Application
    Filed: June 13, 2014
    Publication date: May 12, 2016
    Applicant: ARIZONA BOARD OF REGENTS ON BEHALF OF ARIZONA STATE UNIVERSITY
    Inventors: Cody Youngbull, David Ganger, Andres Mora, Andrea Richa, Jin Zhang, Chenyang Zhou, Xinhui Hu