Patents by Inventor Zhifu GAO

Zhifu GAO has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250252955
    Abstract: Embodiments of this specification provide a speech recognition method and apparatus. The speech recognition method includes: obtaining speech data to be recognized; extracting a speech feature in the speech data to obtain a first speech feature; performing accent feature recognition on the first speech feature to obtain a second speech feature carrying an accent feature; and recognizing first speech text content corresponding to the speech data based on the second speech feature. The accuracy and efficiency of speech recognition can be improved.
    Type: Application
    Filed: April 10, 2023
    Publication date: August 7, 2025
    Inventors: Yuqin LIN, Shiliang ZHANG, Zhifu GAO
  • Publication number: 20250201239
    Abstract: Embodiments of the present application provide a speech recognition method, a speech recognition model, an electronic device and a storage medium. The speech recognition method includes: obtaining an acoustic representation of to-be-recognized speech; determining a character probability corresponding to each frame vector in the acoustic representation, where the character probability is used to indicate a probability of recognizing corresponding character speech based on a current frame vector; predicting, according to the character probability corresponding to each frame vector, the number of characters included in the to-be-recognized speech and a frame boundary of each character to obtain a prediction result; extracting a vector representation of each piece of character speech from the acoustic representation according to the prediction result; obtaining a recognition result of the to-be-recognized speech according to the vector representation of each piece of character speech.
    Type: Application
    Filed: November 8, 2022
    Publication date: June 19, 2025
    Inventors: Zhifu GAO, Shiliang ZHANG
  • Publication number: 20230064756
    Abstract: A method, an apparatus, and an electronic device for streaming end-to-end speech recognition are described. The method includes: extracting and encoding speech acoustic features of a received voice stream in units of frames; performing block processing, and predicting a number of activation points included in a same block that need to be encoded and outputted; determining position(s) of activation point(s) that need(s) to be decoded and outputted according to a prediction result, to a decoder to perform decoding at the position(s) of the activation point(s) and output a recognition result. Through the embodiments of the present disclosure, the robustness of a streaming end-to-end speech recognition system to noise can be improved, thereby improving the performance and the accuracy of the system.
    Type: Application
    Filed: October 28, 2022
    Publication date: March 2, 2023
    Inventors: Shiliang ZHANG, Zhifu GAO
  • Publication number: 20230009633
    Abstract: A speech processing method, a speech encoder, a speech decoder, and a speech recognition system are provided. The method includes: obtaining a speech signal to be processed; using a first neural network and a second neural network to process the speech signal to obtain first feature information and second feature information corresponding to the speech signal respectively, wherein a computational efficiency of the first neural network is higher than a computational efficiency of the second neural network, and an accuracy of the second feature information outputted by the second neural network is higher than an accuracy of the first feature information outputted by the first neural network; and determining target feature information used to represent semantics in the speech signal based on the first feature information and the second feature information.
    Type: Application
    Filed: September 23, 2022
    Publication date: January 12, 2023
    Inventors: Shiliang ZHANG, Zhifu GAO, Ming Lei