Patents by Inventor Jianxiong Ma

Jianxiong Ma has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SPEECH RECOGNITION METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM

Publication number: 20230032385

Abstract: A speech recognition method includes acquiring speech data, inputting a speech feature matrix of the speech data to a speech recognition model, the speech feature matrix being used for representing time-domain and frequency-domain features of the speech data, performing attention encoding on the speech feature matrix through the speech recognition model to obtain an encoded matrix, the encoded matrix including multiple encoded vectors, and decoding the multiple encoded vectors in the encoded matrix according to positions of the multiple encoded vectors in the encoded matrix to output a character string corresponding to the speech data, a decoding sequence of the multiple encoded vectors being related to the positions of the multiple encoded vectors in the encoded matrix.

Type: Application

Filed: October 7, 2022

Publication date: February 2, 2023

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Xilin ZHANG, Bo LIU, Haipeng WANG, Jianxiong MA, Ping ZHENG
Method and computer system for performing audio search on a social networking platform

Patent number: 10453477

Abstract: Methods and computer systems for audio search on a social networking platform are disclosed. While running a social networking application, a computer system receives a first audio input from a user of the computer system and then generates a first audio confusion network from the first audio input. After comparing the first audio confusion network with one or more second audio confusion networks, each corresponding to a second audio input associated with one of a plurality of participants of a chat session of the social networking application, the computer system identifies at least one second audio input corresponding to the at least one second audio confusion network that matches the first audio confusion network and displays a portion of the chat session including a visual icon representing the identified second audio input on a display of the computer system.

Type: Grant

Filed: October 9, 2017

Date of Patent: October 22, 2019

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Lu Li, Jianxiong Ma, Li Lu
METHOD AND COMPUTER SYSTEM FOR PERFORMING AUDIO SEARCH ON A SOCIAL NETWORKING PLATFORM

Publication number: 20180033450

Abstract: Methods and computer systems for audio search on a social networking platform are disclosed. While running a social networking application, a computer system receives a first audio input from a user of the computer system and then generates a first audio confusion network from the first audio input. After comparing the first audio confusion network with one or more second audio confusion networks, each corresponding to a second audio input associated with one of a plurality of participants of a chat session of the social networking application, the computer system identifies at least one second audio input corresponding to the at least one second audio confusion network that matches the first audio confusion network and displays a portion of the chat session including a visual icon representing the identified second audio input on a display of the computer system.

Type: Application

Filed: October 9, 2017

Publication date: February 1, 2018

Inventors: Lu LI, Jianxiong MA, Li LU
Method and computer system for performing audio search on a social networking platform

Patent number: 9818432

Abstract: Methods and computer systems for audio search on a social networking platform are disclosed. The method includes: while running a social networking application, receiving a first audio input from a user of the computer system, the first audio input including one or more search keywords; generating a first audio confusion network from the first audio input; determining whether the first audio confusion network matches at least one of one or more second audio confusion networks, wherein a respective second audio confusion network was generated from a corresponding second audio input associated with a chat session of which the user is a participant; and identifying a second audio input corresponding to the at least one second audio confusion network that matches the first audio confusion network, wherein the identified second audio input includes the one or more search keywords that are included in the first audio input.

Type: Grant

Filed: June 7, 2016

Date of Patent: November 14, 2017

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Lu Li, Jianxiong Ma, Li Lu
Systems and methods for speech recognition

Patent number: 9558741

Abstract: Systems and methods are provided for speech recognition. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as a speech recognition result.

Type: Grant

Filed: May 30, 2014

Date of Patent: January 31, 2017

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventors: Lou Li, Li Lu, Xiang Zhang, Feng Rao, Shuai Yue, Bo Chen, Jianxiong Ma, Haibo Liu
METHOD AND COMPUTER SYSTEM FOR PERFORMING AUDIO SEARCH ON A SOCIAL NETWORKING PLATFORM

Publication number: 20160293184

Abstract: Methods and computer systems for audio search on a social networking platform are disclosed. The method includes: while running a social networking application, receiving a first audio input from a user of the computer system, the first audio input including one or more search keywords; generating a first audio confusion network from the first audio input; determining whether the first audio confusion network matches at least one of one or more second audio confusion networks, wherein a respective second audio confusion network was generated from a corresponding second audio input associated with a Chat session of which the user is a participant; and identifying a second audio input corresponding to the at least one second audio confusion network that matches the first audio confusion network, wherein the identified second audio input includes the one or more search keywords that are included in the first audio input.

Type: Application

Filed: June 7, 2016

Publication date: October 6, 2016

Inventors: Lu LI, Jianxiong MA, Li LU
Method and apparatus for performing speech keyword retrieval

Patent number: 9355637

Abstract: A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.

Type: Grant

Filed: February 11, 2015

Date of Patent: May 31, 2016

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventors: Jianxiong Ma, Lu Li, Li Lu, Xiang Zhang, Shuai Yue, Feng Rao, Eryu Wang, Linghui Kong
Language recognition based on vocabulary lists

Patent number: 9336197

Abstract: A method is implemented at a computer to determine that certain information content is composed or compiled in a specific language selected among two or more similar languages. The computer integrates a first vocabulary list of a first language and a second vocabulary list of a second language into a comprehensive vocabulary list. The integrating includes analyzing the first vocabulary list in view of the second vocabulary list to identify a first vocabulary sub-list that is used in the first language, but not in the second language. The computer then identifies, in the information content, a plurality of expressions that are included in the comprehensive vocabulary list, and a subset of expressions that are included in the first vocabulary sub-list. Upon a determination that a total frequency of occurrence of the subset of expressions meets predetermined occurrence criteria, the computer determines that the information content is composed in the first language.

Type: Grant

Filed: December 16, 2013

Date of Patent: May 10, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Lu Li, Qiang Cheng, Jianxiong Ma, Feng Rao, Duling Lu, Li Lu, Xiang Zhang, Bo Chen
Method and apparatus for performing speech keyword retrieval

Patent number: 9257118

Abstract: A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.

Type: Grant

Filed: February 11, 2015

Date of Patent: February 9, 2016

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventors: Jianxiong Ma, Lu Li, Li Lu, Xiang Zhang, Shuai Yue, Feng Rao, Eryu Wang, Linghui Kong
Keyword detection for speech recognition

Patent number: 9230541

Abstract: This application discloses a method implemented of recognizing a keyword in a speech that includes a sequence of audio frames further including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.

Type: Grant

Filed: December 11, 2014

Date of Patent: January 5, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Lu Ll, Li Lu, Jianxiong Ma, Linghui Kong, Feng Rao, Shuai Yue, Xiang Zhang, Haibo Liu, Eryu Wang, Bo Chen
Method and Apparatus For Performing Speech Keyword Retrieval

Publication number: 20150154955

Abstract: A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.

Type: Application

Filed: February 11, 2015

Publication date: June 4, 2015

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Jianxiong MA, Lu LI, Li LU, Xiang ZHANG, Shuai YUE, Feng RAO, Eryu WANG, Linghui KONG
Keyword Detection For Speech Recognition

Publication number: 20150095032

Abstract: This application discloses a method implemented of recognizing a keyword in a speech that includes a sequence of audio frames further including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.

Type: Application

Filed: December 11, 2014

Publication date: April 2, 2015

Inventors: Lu LI, Li Lu, Jianxiong Ma, Linghui Kong, Feng Rao, Shuai Yue, Xiang Zhang, Haibo Liu, Eryu Wang, Bo Chen
Systems and Methods for Voice Identification

Publication number: 20140350934

Abstract: Systems and methods are provided for voice identification. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as an identification result.

Type: Application

Filed: May 30, 2014

Publication date: November 27, 2014

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lou Li, Li Lu, Xiang Zhang, Feng Rao, Shuai Yue, Bo Chen, Jianxiong Ma, Haibo Liu
LANGUAGE RECOGNITION BASED ON VOCABULARY LISTS

Publication number: 20140207440

Abstract: A method is implemented at a computer to determine that certain information content is composed or compiled in a specific language selected among two or more similar languages. The computer integrates a first vocabulary list of a first language and a second vocabulary list of a second language into a comprehensive vocabulary list. The integrating includes analyzing the first vocabulary list in view of the second vocabulary list to identify a first vocabulary sub-list that is used in the first language, but not in the second language. The computer then identifies, in the information content, a plurality of expressions that are included in the comprehensive vocabulary list, and a subset of expressions that are included in the first vocabulary sub-list. Upon a determination that a total frequency of occurrence of the subset of expressions meets predetermined occurrence criteria, the computer determines that the information content is composed in the first language.

Type: Application

Filed: December 16, 2013

Publication date: July 24, 2014

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lu Li, Qiang Cheng, Jianxiong Ma, Feng Rao, Duling Lu, Li Lu, Xiang Zhang, Bo Chen