Patents by Inventor Zhanjie Gao

Zhanjie Gao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230206943
    Abstract: An audio recognizing method, including: performing acoustic feature prediction on the audio to be recognized to obtain first audio prediction result and an acoustic feature reference quantity for predicting an audio recognition result; obtaining second audio prediction result based on the acoustic feature reference quantity; and determining the audio recognition result of the audio to be recognized based on the first audio prediction result and the second audio prediction result, the audio recognition result including unvoiced sound or voiced sound. When determining that the audio is unvoiced sound or voiced sound, the first audio prediction result obtained by performing acoustic feature prediction on the audio to be recognized is used, and the second audio prediction result is obtained in combination with other acoustic feature reference quantities, thereby making the determination result of unvoiced sound or voiced sound of the audio more accurate, to improve the audio quality in speech processing.
    Type: Application
    Filed: August 19, 2022
    Publication date: June 29, 2023
    Inventors: Wenjie Li, Zhanjie Gao, Lei Jia
  • Publication number: 20220390230
    Abstract: A method for generating a speech package, an electronic device and a storage medium The method includes: determining a number of texts to be displayed and a speech recording condition based on a type of a recording mode selection control in response to the recording mode selection control being triggered; acquiring speech data with an amount matched with the number based on the speech recording condition; sending the speech data to a server; and acquiring a speech package generated by the server using the speech data.
    Type: Application
    Filed: August 8, 2022
    Publication date: December 8, 2022
    Inventors: Bo PENG, Chao LI, Cong GAO, Zhanjie GAO, Yunfeng LI
  • Patent number: 11200382
    Abstract: This application discloses a prosodic pause prediction method, a prosodic pause prediction device and an electronic device. The specific implementation scheme includes: obtaining a first matrix by mapping a to-be-tested text sequence through a trained embedding layer, where the to-be-tested text sequence includes a to-be-tested input text and an identity of a to-be-tested speaker; inputting the first matrix into a trained attention model, and determining a semantic representation matrix by the trained attention model; and, performing prosodic pause prediction based on the semantic representation matrix and outputting a prosodic pause prediction result of each word in the to-be-tested input text.
    Type: Grant
    Filed: May 8, 2020
    Date of Patent: December 14, 2021
    Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
    Inventors: Zhipeng Nie, Yanyao Bian, Zhanjie Gao, Changbin Chen
  • Publication number: 20210133396
    Abstract: This application discloses a prosodic pause prediction method, a prosodic pause prediction device and an electronic device. The specific implementation scheme includes: obtaining a first matrix by mapping a to-be-tested text sequence through a trained embedding layer, where the to-be-tested text sequence includes a to-be-tested input text and an identity of a to-be-tested speaker; inputting the first matrix into a trained attention model, and determining a semantic representation matrix by the trained attention model; and, performing prosodic pause prediction based on the semantic representation matrix and outputting a prosodic pause prediction result of each word in the to-be-tested input text.
    Type: Application
    Filed: May 8, 2020
    Publication date: May 6, 2021
    Applicant: Baidu Online Network Technology (Beijing) Co., Ltd.
    Inventors: Zhipeng Nie, Yanyao Bian, Zhanjie Gao, Changbin Chen