Patents Assigned to iFLYTEK Co., Ltd.
-
Patent number: 11694041Abstract: A discourse-level text translation method and device, the method comprising: acquiring a text to be translated, the text to be translated being a unit text in a discourse-level text to be translated (S101); acquiring an associated text of the text to be translated, the associated text including at least one of a preceding source text, a following source text, and a preceding target text (S102); and translating, according to the associated text, the text to be translated (S103).Type: GrantFiled: April 10, 2019Date of Patent: July 4, 2023Assignee: IFLYTEK CO., LTD.Inventors: Zhiqiang Ma, Junhua Liu, Si Wei, Guoping Hu
-
Publication number: 20230186912Abstract: A speech recognition method and related products are provided. The method includes acquiring a to-be-recognized speech and a configured hot word library; determining, based on the to-be-recognized speech and the hot word library, an audio-related feature used at a current decoding time instant; determining, based on the audio-related feature, a hot word-related feature used at the current decoding time instant from the hot word library; and determining, based on the audio-related feature and the hot word-related feature, a recognition result of the to-be-recognized speech at the current decoding time instant.Type: ApplicationFiled: December 2, 2020Publication date: June 15, 2023Applicant: IFLYTEK CO., LTD.Inventors: Shifu XIONG, Cong LIU, Si WEI, Qingfeng LIU, Jianqing GAO, Jia PAN
-
Patent number: 11651578Abstract: A method and a system for end-to-end modeling are provided. The method includes: determining a topological structure of a target-based end-to-end model, where the topological structure includes an input layer, an encoding layer, an code enhancement layer, a filtering layer, a decoding layer and an output layer; the code enhancement layer adds information of a target unit to a feature sequence outputted by the encoding layer, the filtering layer filters a feature sequence added with the information of the target unit; collecting multiple pieces of training data; and training parameters of the target-based end-to-end model by using the multiple pieces of the training data.Type: GrantFiled: January 11, 2017Date of Patent: May 16, 2023Assignee: IFLYTEK CO., LTD.Inventors: Jia Pan, Shiliang Zhang, Shifu Xiong, Si Wei, Guoping Hu
-
Publication number: 20230035947Abstract: A speech recognition method and related products are provided. The method includes acquiring text contents and text-associated time information transmitted by a plurality of terminals in a preset scenario and determining a shared text for the preset scenario based on the text contents and the text-associated time information, obtaining a customized language model for the preset scenario based on the shared text, and performing speech recognition for the preset scenario with the customized language model. The method provides improved speech recognition for the preset scenario due to the correlation between the customized language model and the preset scenario.Type: ApplicationFiled: December 14, 2020Publication date: February 2, 2023Applicant: IFLYTEK CO., LTD.Inventors: Genshun WAN, Jianqing GAO, Zhiguo WANG
-
Publication number: 20220383853Abstract: A speech recognition error correction method and device, and a readable storage medium are provided. The method includes: acquiring to-be-recognized speech data and a first recognition result of the speech data, re-recognizing the speech data with reference to context information in the first recognition result to obtain a second recognition result, and determining a final recognition result based on the second recognition result. In the method, the speech data is re-recognized with reference to context information in the first recognition result, which fully considers context information in the recognition result and the application scenario of the speech data. If any error occurs in the first recognition result, the first recognition result is corrected based on the second recognition. Therefore, the accuracy of speech recognition can be improved.Type: ApplicationFiled: November 17, 2020Publication date: December 1, 2022Applicant: IFLYTEK CO., LTD.Inventors: Li XU, Jia PAN, Zhiguo WANG, Guoping HU
-
Publication number: 20220375459Abstract: A method for constructing a decoding network, a speech recognition method, a device, an apparatus, and a storage medium are provided. The method for constructing a decoding network includes: acquiring a general language model, a domain language model, and a general decoding network generated based on the general language model; generating a domain decoding network based on the domain language model and the general language model; and integrating the domain decoding network with the general decoding network to obtain a target decoding network. The speech recognition method includes: decoding to-be-recognized speech data by using a target decoding network to obtain a decoding path for the to-be-recognized speech data; and determining a speech recognition result for the to-be-recognized speech data based on the decoding path for the to-be-recognized speech data.Type: ApplicationFiled: December 12, 2019Publication date: November 24, 2022Applicant: IFLYTEK CO., LTD.Inventors: Jianqing GAO, Zhiguo WANG, Guoping HU
-
Patent number: 11508366Abstract: A method, an apparatus and a device for converting a whispered speech, and a readable storage medium are provided. The method is implemented based on the whispered speech converting model. The whispered speech converting model is trained in advance by using recognition results and whispered speech training acoustic features of whispered speech training data as samples and using normal speech acoustic features of normal speech data parallel to the whispered speech training data as sample labels. A whispered speech acoustic feature and a preliminary recognition result of whispered speech data are acquired, then the whispered speech acoustic feature and the preliminary recognition result are inputted into a preset whispered speech converting model to acquire a normal speech acoustic feature outputted by the model. In this way, the whispered speech can be converted to a normal speech.Type: GrantFiled: June 15, 2018Date of Patent: November 22, 2022Assignee: IFLYTEK CO., LTD.Inventors: Jia Pan, Cong Liu, Haikun Wang, Zhiguo Wang, Guoping Hu
-
Patent number: 11308974Abstract: A target voice detection method and a target voice detection apparatus are provided. The method includes: receiving sound signals collected by a microphone array; performing a beamforming process on the sound signals to obtain beams in different directions; extracting a detection feature of each frame based on the sound signals and the beams in different directions; inputting an extracted detection feature of a current frame into a pre-constructed target voice detection model to obtain a model output result; and obtaining a target voice detection result of the current frame based on the model output result.Type: GrantFiled: July 16, 2018Date of Patent: April 19, 2022Assignee: IFLYTEK CO., LTD.Inventors: Feng Ma, Haikun Wang, Zhiguo Wang, Guoping Hu
-
Patent number: 11081123Abstract: A microphone array-based target voice acquisition method and device, said method comprising: receiving voice signals acquired on the basis of a microphone array (101); determining a pre-selected target voice signal and a direction thereof (102); performing strong directional gain and weak directional gain on the pre-selected target voice signal, so as to obtain a strong gain signal and a weak gain signal (103); performing an endpoint detection on the basis of the strong gain signal, so as to obtain an endpoint detection result (104); and performing endpoint processing on the weak gain signal according to the endpoint detection result, so as to obtain a final target voice signal (105). The present invention can obtain an accurate and reliable target voice signal, thereby avoiding an adverse effect of the target voice quality on subsequent target voice processing.Type: GrantFiled: July 16, 2018Date of Patent: August 3, 2021Assignee: IFLYTEK CO., LTD.Inventors: Dongyang Xu, Haikun Wang, Zhiguo Wang, Guoping Hu
-
Patent number: 11064296Abstract: Provided are a voice denoising method and apparatus, a server and a storage medium. The voice denoising method comprises: acquiring voice signals synchronously collected by an acoustic microphone and a non-acoustic microphone (S100); carrying out voice activity detection according to the voice signal collected by the non-acoustic microphone to obtain a voice activity detection result (S110); and according to the voice activity detection result, denoising the voice signal collected by the acoustic microphone to obtain a denoised voice signal (S120). The effect of denoising can be enhanced, and the quality of voice signals can be improved.Type: GrantFiled: June 15, 2018Date of Patent: July 13, 2021Assignee: IFLYTEK CO., LTD.Inventors: Haikun Wang, Feng Ma, Zhiguo Wang
-
Publication number: 20210150154Abstract: A discourse-level text translation method and device, the method comprising: acquiring a text to be translated, the text to be translated being a unit text in a discourse-level text to be translated (S101); acquiring an associated text of the text to be translated, the associated text including at least one of a preceding source text, a following source text, and a preceding target text (S102); and translating, according to the associated text, the text to be translated (S103).Type: ApplicationFiled: April 10, 2019Publication date: May 20, 2021Applicant: IFLYTEK CO., LTD.Inventors: Zhiqiang MA, Junhua LIU, Si WEI, Guoping HU
-
Patent number: 10964337Abstract: A method, a device and a storage medium for evaluating speech quality include: receiving speech data to be evaluated; extracting evaluation features of the speech data to be evaluated; performing quality evaluation to the speech data to be evaluated according to the evaluation features of the speech data to be evaluated and a predetermined speech quality evaluation model, in which the speech quality evaluation model is an indication of a relationship between evaluation features of single-ended speech data and quality information of the single-ended speech data.Type: GrantFiled: February 20, 2019Date of Patent: March 30, 2021Assignee: Iflytek Co., Ltd.Inventors: Bing Yin, Si Wei, Guoping Hu, Su Cheng
-
Patent number: 10949701Abstract: A method for recognizing a character includes: obtaining a character; converting the character into a radical based character recognition result, where the radical based character recognition result comprises symbols indicating radicals of the character and a structure of the radicals of the character; and recognizing the character based on the radical based character recognition result.Type: GrantFiled: November 2, 2018Date of Patent: March 16, 2021Assignee: IFLYTEK CO., LTD.Inventors: Jun Du, Jianshu Zhang, Lirong Dai, Jinshui Hu, Jiajia Wu, Cong Liu, Guoping Hu, Qingfeng Liu
-
Publication number: 20210051404Abstract: An echo cancellation method based on delay estimation is provided. In the method, a microphone signal and a reference signal are received and preprocessed. In the preprocessed microphone signal and the preprocessed reference signal, frequency point signals with non-linearity in a current echo cancellation scenario are determined. A current delay estimation value is calculated based on frequency point signals without non-linearity in the microphone signal and the reference signal. The reference signal is shifted based on the current delay estimation value. An adaptive filter is updated based on the preprocessed microphone signal and the shifted reference signal, to perform echo cancellation.Type: ApplicationFiled: July 16, 2018Publication date: February 18, 2021Applicant: IFLYTEK CO., LTD.Inventors: Mingzi LI, Feng MA, Haikun WANG, Zhiguo WANG, Guoping HU
-
Publication number: 20200389728Abstract: Provided are a voice denoising method and apparatus, a server and a storage medium. The voice denoising method comprises: acquiring voice signals synchronously collected by an acoustic microphone and a non-acoustic microphone (S100); carrying out voice activity detection according to the voice signal collected by the non-acoustic microphone to obtain a voice activity detection result (S110); and according to the voice activity detection result, denoising the voice signal collected by the acoustic microphone to obtain a denoised voice signal (S120). The effect of denoising can be enhanced, and the quality of voice signals can be improved.Type: ApplicationFiled: June 15, 2018Publication date: December 10, 2020Applicant: IFLYTEK CO., LTD.Inventors: Haikun WANG, Feng MA, Zhiguo WANG
-
Publication number: 20200342890Abstract: A target voice detection method and a target voice detection apparatus are provided. The method includes: receiving sound signals collected by a microphone array; performing a beamforming process on the sound signals to obtain beams in different directions; extracting a detection feature of each frame based on the sound signals and the beams in different directions; inputting an extracted detection feature of a current frame into a pre-constructed target voice detection model to obtain a model output result; and obtaining a target voice detection result of the current frame based on the model output result.Type: ApplicationFiled: July 16, 2018Publication date: October 29, 2020Applicant: IFLYTEK CO., LTD.Inventors: Feng MA, Haikun WANG, Zhiguo WANG, Guoping HU
-
Publication number: 20200342887Abstract: A microphone array-based target voice acquisition method and device, said method comprising: receiving voice signals acquired on the basis of a microphone array (101); determining a pre-selected target voice signal and a direction thereof (102); performing strong directional gain and weak directional gain on the pre-selected target voice signal, so as to obtain a strong gain signal and a weak gain signal (103); performing an endpoint detection on the basis of the strong gain signal, so as to obtain an endpoint detection result (104); and performing endpoint processing on the weak gain signal according to the endpoint detection result, so as to obtain a final target voice signal (105). The present invention can obtain an accurate and reliable target voice signal, thereby avoiding an adverse effect of the target voice quality on subsequent target voice processing.Type: ApplicationFiled: July 16, 2018Publication date: October 29, 2020Applicant: IFLYTEK CO., LTD.Inventors: Dongyang XU, Haikun WANG, Zhiguo WANG, Guoping HU
-
Publication number: 20200211550Abstract: A method, an apparatus and a device for converting a whispered speech, and a readable storage medium are provided. The method is implemented based on the whispered speech converting model. The whispered speech converting model is trained in advance by using recognition results and whispered speech training acoustic features of whispered speech training data as samples and using normal speech acoustic features of normal speech data parallel to the whispered speech training data as sample labels. A whispered speech acoustic feature and a preliminary recognition result of whispered speech data are acquired, then the whispered speech acoustic feature and the preliminary recognition result are inputted into a preset whispered speech converting model to acquire a normal speech acoustic feature outputted by the model. In this way, the whispered speech can be converted to a normal speech.Type: ApplicationFiled: June 15, 2018Publication date: July 2, 2020Applicant: IFLYTEK CO., LTD.Inventors: Jia PAN, Cong LIU, Haikun WANG, Zhiguo WANG, Guoping HU
-
Publication number: 20200143191Abstract: A method for recognizing a character includes: obtaining a character; converting the character into a radical based character recognition result, where the radical based character recognition result comprises symbols indicating radicals of the character and a structure of the radicals of the character; and recognizing the character based on the radical based character recognition result.Type: ApplicationFiled: November 2, 2018Publication date: May 7, 2020Applicant: IFLYTEK CO., LTD.Inventors: Jun DU, Jianshu ZHANG, Lirong DAI, Jinshui HU, Jiajia WU, Cong LIU, Guoping HU, Qingfeng LIU
-
Publication number: 20190279036Abstract: A method and a system for end-to-end modeling are provided. The method includes: determining a topological structure of a target-based end-to-end model, where the topological structure includes an input layer, an encoding layer, an code enhancement layer, a filtering layer, a decoding layer and an output layer; the code enhancement layer adds information of a target unit to a feature sequence outputted by the encoding layer, the filtering layer filters a feature sequence added with the information of the target unit collecting multiple pieces of training data; and training parameters of the target-based end-to-end model by using the multiple pieces of the training data.Type: ApplicationFiled: January 11, 2017Publication date: September 12, 2019Applicant: IFLYTEK CO., LTD.Inventors: Jia PAN, Shiliang ZHANG, Shifu XIONG, Si WEI, Guoping HU