Patents by Inventor Qiguang ZANG

Qiguang ZANG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

AUDIO RECOGNITION METHOD, METHOD OF TRAINING AUDIO RECOGNITION MODEL, AND ELECTRONIC DEVICE

Publication number: 20230410794

Abstract: An audio recognition method, a method of training an audio recognition model, and an electronic device are provided, which relate to fields of artificial intelligence, speech recognition, deep learning and natural language processing technologies. The audio recognition method includes: truncating an audio feature of target audio data to obtain at least one first audio sequence feature corresponding to a predetermined duration; obtaining, according to a peak information of the audio feature, a peak sub-information corresponding to the first audio sequence feature; performing at least one decoding operation on the first audio sequence feature to obtain a recognition result for the first audio sequence feature, a number of times the decoding operation is performed being identical to a number of peaks corresponding to the first audio sequence feature; obtaining target text data for the target audio data according to the recognition result for the at least one first audio sequence feature.

Type: Application

Filed: August 25, 2023

Publication date: December 21, 2023

Inventors: Xiaoyin FU, Mingshun YANG, Qiguang ZANG, Zhijie CHEN, Yangkai XU, Guibin WANG, Lei JIA
Method and apparatus for speech recognition, and storage medium

Patent number: 11756529

Abstract: Proposed are a method and apparatus for speech recognition, and a storage medium. The specific solution includes: obtaining audio data to be recognized; decoding the audio data to obtain a first syllable of a to-be-converted word, in which the first syllable is a combination of at least one phoneme corresponding to the to-be-converted word; obtaining a sentence to which the to-be-converted word belongs and a converted word in the sentence, and obtaining a second syllable of the converted word; encoding the first syllable and the second syllable to generate first encoding information of the first syllable; and decoding the first encoding information to obtain a text corresponding to the to-be-converted word.

Type: Grant

Filed: December 16, 2020

Date of Patent: September 12, 2023

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Liao Zhang, Xiaoyin Fu, Zhengxiang Jiang, Mingxin Liang, Junyao Shao, Qi Zhang, Zhijie Chen, Qiguang Zang
METHOD FOR TRAINING SPEECH RECOGNITION MODEL, DEVICE AND STORAGE MEDIUM

Publication number: 20220310064

Abstract: A method for training a speech recognition model, a device and a storage medium, which relate to the field of computer technologies, and particularly to the fields of speech recognition technologies, deep learning technologies, or the like, are disclosed. The method for training a speech recognition model includes: obtaining a fusion probability of each of at least one candidate text corresponding to a speech based on an acoustic decoding model and a language model; selecting a preset number of one or more candidate texts based on the fusion probability of each of the at least one candidate text, and determining a predicted text based on the preset number of one or more candidate texts; and obtaining a loss function based on the predicted text and a standard text corresponding to the speech, and training the speech recognition model based on the loss function.

Type: Application

Filed: January 10, 2022

Publication date: September 29, 2022

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Junyao SHAO, Xiaoyin FU, Qiguang ZANG, Zhijie CHEN, Mingxin LIANG, Huanxin ZHENG, Sheng QIAN
METHOD OF RECOGNIZING SPEECH OFFLINE, ELECTRONIC DEVICE, AND STORAGE MEDIUM

Publication number: 20220108684

Abstract: The present disclosure provides a method of recognizing speech offline, electronic device, and a storage medium, relating to a field of artificial intelligence such as speech recognition, natural language processing, and deep learning. The method may include: decoding speech data to be recognized into a syllable recognition result; transforming the syllable recognition result into a corresponding text as a speech recognition result of the speech data.

Type: Application

Filed: December 16, 2021

Publication date: April 7, 2022

Inventors: Xiaoyin FU, Mingxin LIANG, Zhijie CHEN, Qiguang ZANG, Zhengxiang JIANG, Liao ZHANG, Qi ZHANG, Lei JIA
METHOD AND APPARATUS FOR MINING FEATURE INFORMATION, AND ELECTRONIC DEVICE

Publication number: 20220036879

Abstract: A method for mining feature information, an apparatus for mining feature information and an electronic device are disclosed. The method includes: determining a usage scenario of a target device; obtaining raw audio data including real scenario data, speech synthesis data, recorded audio data and other media data; generating target audio data of the usage scenario by simulating the usage scenario based on the raw audio data; and obtaining feature information of the usage scenario by performing feature extraction on the target audio data.

Type: Application

Filed: October 13, 2021

Publication date: February 3, 2022

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Jiaxiang GE, Zhen WU, Maoren ZHOU, Qiguang ZANG, Ming WEN, Xiaoyin FU
METHOD FOR SEMANTIC RECOGNITION, ELECTRONIC DEVICE, AND STORAGE MEDIUM

Publication number: 20220028376

Abstract: The disclosure discloses a method for semantic recognition, an electronic device, and a storage medium. The detailed solution includes: obtaining a speech recognition result of a speech to be processed, in which the speech recognition result includes a newly added recognition result fragment and a historical recognition result fragment; obtaining a semantic vector of each historical object in the historical recognition result fragment, and obtaining a semantic vector of each newly added object by inputting the semantic vector of each historical object and each newly added object in the newly added recognition result fragment into a streaming semantic coding layer; and obtaining a semantic recognition result of the speech by inputting the semantic vector of each historical object and the semantic vector of each newly added object into a streaming semantic vector fusion layer and a semantic understanding multi-task layer sequentially arranged.

Type: Application

Filed: October 13, 2021

Publication date: January 27, 2022

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Yufang WU, Qin QU, Qibo WANG, Chengjian MAN, Qiguang ZANG, Xiaoyin FU
METHOD AND APPARATUS FOR SPEECH RECOGNITION, AND STORAGE MEDIUM

Publication number: 20210375264

Abstract: Proposed are a method and apparatus for speech recognition, and a storage medium. The specific solution includes: obtaining audio data to be recognized; decoding the audio data to obtain a first syllable of a to-be-converted word, in which the first syllable is a combination of at least one phoneme corresponding to the to-be-converted word; obtaining a sentence to which the to-be-converted word belongs and a converted word in the sentence, and obtaining a second syllable of the converted word; encoding the first syllable and the second syllable to generate first encoding information of the first syllable; and decoding the first encoding information to obtain a text corresponding to the to-be-converted word.

Type: Application

Filed: December 16, 2020

Publication date: December 2, 2021

Inventors: Liao ZHANG, Xiaoyin FU, Zhengxiang JIANG, Mingxin LIANG, Junyao SHAO, Qi ZHANG, Zhijie CHEN, Qiguang ZANG

AUDIO RECOGNITION METHOD, METHOD OF TRAINING AUDIO RECOGNITION MODEL, AND ELECTRONIC DEVICE

Method and apparatus for speech recognition, and storage medium

METHOD FOR TRAINING SPEECH RECOGNITION MODEL, DEVICE AND STORAGE MEDIUM

METHOD OF RECOGNIZING SPEECH OFFLINE, ELECTRONIC DEVICE, AND STORAGE MEDIUM

METHOD AND APPARATUS FOR MINING FEATURE INFORMATION, AND ELECTRONIC DEVICE

METHOD FOR SEMANTIC RECOGNITION, ELECTRONIC DEVICE, AND STORAGE MEDIUM

METHOD AND APPARATUS FOR SPEECH RECOGNITION, AND STORAGE MEDIUM