Patents by Inventor Lou LI

Lou LI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Model-based automatic correction of typographical errors

Patent number: 10643029

Abstract: A method is performed at a computer for automatically correcting typographical errors. The computer selects a target word in a target sentence and identifies a target word therein as having a typographical error and first and second sequences of words separated by the target word as context. After identifying, among a database of grammatically correct sentences, a set of sentences having the first and second sequences of words, each sentence including a replacement word, the computer selects a set of candidate grammatically correct sentences whose corresponding replacement words have similarities to the target word above a pre-set threshold, Finally, the computer chooses, among the set of candidate grammatically correct sentences, a fittest grammatically correct sentence according to a linguistic model and replaces the target word in the target sentence with the replacement word within the fittest grammatically correct sentence.

Type: Grant

Filed: September 17, 2018

Date of Patent: May 5, 2020

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Lou Li, Qiang Cheng, Feng Rao, Li Lu, Xiang Zhang, Shuai Yue, Bo Chen, Duling Lu
MODEL-BASED AUTOMATIC CORRECTION OF TYPOGRAPHICAL ERRORS

Publication number: 20190102373

Abstract: A method is performed at a computer for automatically correcting typographical errors. The computer selects a target word in a target sentence and identifies a target word therein as having a typographical error and first and second sequences of words separated by the target word as context. After identifying, among a database of grammatically correct sentences, a set of sentences having the first and second sequences of words, each sentence including a replacement word, the computer selects a set of candidate grammatically correct sentences whose corresponding replacement words have similarities to the target word above a pre-set threshold, Finally, the computer chooses, among the set of candidate grammatically correct sentences, a fittest grammatically correct sentence according to a linguistic model and replaces the target word in the target sentence with the replacement word within the fittest grammatically correct sentence.

Type: Application

Filed: September 17, 2018

Publication date: April 4, 2019

Inventors: Lou Li, Qiang Cheng, Feng Rao, Li Lu, Xiang Zhang, Shuai Yue, Bo Chen, Duling Lu
Method and device for voiceprint recognition

Patent number: 9940935

Abstract: A method is performed at a device having one or more processors and memory. The device establishes a first-level Deep Neural Network (DNN) model based on unlabeled speech data, the unlabeled speech data containing no speaker labels and the first-level DNN model specifying a plurality of basic voiceprint features for the unlabeled speech data. The device establishes a second-level DNN model by tuning the first-level DNN model based on labeled speech data, the labeled speech data containing speech samples with respective speaker labels, wherein the second-level DNN model specifies a plurality of high-level voiceprint features. Using the second-level DNN model, registers a first high-level voiceprint feature sequence for a user based on a registration speech sample received from the user. The device performs speaker verification for the user based on the first high-level voiceprint feature sequence registered for the user.

Type: Grant

Filed: August 18, 2016

Date of Patent: April 10, 2018

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Eryu Wang, Li Lu, Xiang Zhang, Haibo Liu, Lou Li, Feng Rao, Duling Lu, Shuai Yue, Bo Chen
Method and system for building a topic specific language model for use in automatic speech recognition

Patent number: 9697821

Abstract: An automatic speech recognition method includes at a computer having one or more processors and memory for storing one or more programs to be executed by the processors, obtaining a plurality of speech corpus categories through classifying and calculating raw speech corpus; obtaining a plurality of classified language models that respectively correspond to the plurality of speech corpus categories through a language model training applied on each speech corpus category; obtaining an interpolation language model through implementing a weighted interpolation on each classified language model and merging the interpolated plurality of classified language models; constructing a decoding resource in accordance with an acoustic model and the interpolation language model; and decoding input speech using the decoding resource, and outputting a character string with a highest probability as a recognition result of the input speech.

Type: Grant

Filed: December 16, 2013

Date of Patent: July 4, 2017

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Feng Rao, Li Lu, Bo Chen, Shuai Yue, Xiang Zhang, Eryu Wang, Dadong Xie, Lou Li, Duling Lu
Systems and methods for speech recognition

Patent number: 9558741

Abstract: Systems and methods are provided for speech recognition. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as a speech recognition result.

Type: Grant

Filed: May 30, 2014

Date of Patent: January 31, 2017

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventors: Lou Li, Li Lu, Xiang Zhang, Feng Rao, Shuai Yue, Bo Chen, Jianxiong Ma, Haibo Liu
METHOD AND DEVICE FOR VOICEPRINT RECOGNITION

Publication number: 20160358610

Abstract: A method is performed at a device having one or more processors and memory. The device establishes a first-level Deep Neural Network (DNN) model based on unlabeled speech data, the unlabeled speech data containing no speaker labels and the first-level DNN model specifying a plurality of basic voiceprint features for the unlabeled speech data. The device establishes a second-level DNN model by tuning the first-level DNN model based on labeled speech data, the labeled speech data containing speech samples with respective speaker labels, wherein the second-level DNN model specifies a plurality of high-level voiceprint features. Using the second-level DNN model, registers a first high-level voiceprint feature sequence for a user based on a registration speech sample received from the user. The device performs speaker verification for the user based on the first high-level voiceprint feature sequence registered for the user.

Type: Application

Filed: August 18, 2016

Publication date: December 8, 2016

Inventors: Eryu WANG, Li LU, Xiang ZHANG, Haibo LIU, Lou LI, Feng RAO, Duling LU, Shuai YUE, Bo CHEN
Method and device for parallel processing in model training

Patent number: 9508347

Abstract: A method and a device for training a DNN model includes: at a device including one or more processors and memory: establishing an initial DNN model; dividing a training data corpus into a plurality of disjoint data subsets; for each of the plurality of disjoint data subsets, providing the data subset to a respective training processing unit of a plurality of training processing units operating in parallel, wherein the respective training processing unit applies a Stochastic Gradient Descent (SGD) process to update the initial DNN model to generate a respective DNN sub-model based on the data subset; and merging the respective DNN sub-models generated by the plurality of training processing units to obtain an intermediate DNN model, wherein the intermediate DNN model is established as either the initial DNN model for a next training iteration or a final DNN model in accordance with a preset convergence condition.

Type: Grant

Filed: December 16, 2013

Date of Patent: November 29, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Eryu Wang, Li Lu, Xiang Zhang, Haibo Liu, Feng Rao, Lou Li, Shuai Yue, Bo Chen
Method and device for voiceprint recognition

Patent number: 9502038

Abstract: A method and device for voiceprint recognition, include: establishing a first-level Deep Neural Network (DNN) model based on unlabeled speech data, the unlabeled speech data containing no speaker labels and the first-level DNN model specifying a plurality of basic voiceprint features for the unlabeled speech data; obtaining a plurality of high-level voiceprint features by tuning the first-level DNN model based on labeled speech data, the labeled speech data containing speech samples with respective speaker labels, and the tuning producing a second-level DNN model specifying the plurality of high-level voiceprint features; based on the second-level DNN model, registering a respective high-level voiceprint feature sequence for a user based on a registration speech sample received from the user; and performing speaker verification for the user based on the respective high-level voiceprint feature sequence registered for the user.

Type: Grant

Filed: December 12, 2013

Date of Patent: November 22, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Eryu Wang, Li Lu, Xiang Zhang, Haibo Liu, Lou Li, Feng Rao, Duling Lu, Shuai Yue, Bo Chen
Method and system for adding punctuation to voice files

Patent number: 9442910

Abstract: A method and system for adding punctuation to a voice file is disclosed. The method includes: utilizing silence or pause duration detection to divide a voice file into a plurality of speech segments for processing, the voice file includes a plurality of features units; identifying all features units that appear in the voice file according to every term or expression and semantics features of the every term or expression that form each of the plurality of speech segments; using a linguistic model to determine a sum of weight of various punctuation modes in the voice file according to all the feature units, the linguistic model is built upon semantics features of various parsed out terms or expressions from a body text of a spoken sentence according to a language library; and adding punctuations to the voice file based on the determined sum of weight of the various punctuation modes.

Type: Grant

Filed: March 19, 2014

Date of Patent: September 13, 2016

Assignee: Tencent Technology (Shenzhen) Co., Ltd.

Inventors: Haibo Liu, Eryu Wang, Xiang Zhang, Li Lu, Shuai Yue, Bo Chen, Lou Li, Jian Liu
User authentication method and apparatus based on audio and video data

Patent number: 9177131

Abstract: A computer-implemented method is performed at a server having one or more processors and memory storing programs executed by the one or more processors for authenticating a user from video and audio data. The method includes: receiving a login request from a mobile device, the login request including video data and audio data; extracting a group of facial features from the video data; extracting a group of audio features from the audio data and recognizing a sequence of words in the audio data; identifying a first user account whose respective facial features match the group of facial features and a second user account whose respective audio features match the group of audio features. If the first user account is the same as the second user account, retrieve the sequence of words associated with the user account and compare the sequences of words for authentication purpose.

Type: Grant

Filed: April 25, 2014

Date of Patent: November 3, 2015

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Xiang Zhang, Li Lu, Eryu Wang, Shuai Yue, Feng Rao, Haibo Liu, Lou Li, Duling Lu, Bo Chen
METHOD AND DEVICE FOR PARALLEL PROCESSING IN MODEL TRAINING

Publication number: 20150019214

Abstract: A method and a device for training a DNN model includes: at a device including one or more processors and memory: establishing an initial DNN model; dividing a training data corpus into a plurality of disjoint data subsets; for each of the plurality of disjoint data subsets, providing the data subset to a respective training processing unit of a plurality of training processing units operating in parallel, wherein the respective training processing unit applies a Stochastic Gradient Descent (SGD) process to update the initial DNN model to generate a respective DNN sub-model based on the data subset; and merging the respective DNN sub-models generated by the plurality of training processing units to obtain an intermediate DNN model, wherein the intermediate DNN model is established as either the initial DNN model for a next training iteration or a final DNN model in accordance with a preset convergence condition.

Type: Application

Filed: December 16, 2013

Publication date: January 15, 2015

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Eryu WANG, Li LU, Xiang ZHANG, Haibo LIU, Feng RAO, Lou LI, Shuai YUE, Bo CHEN
METHOD AND SYSTEM FOR ADDING PUNCTUATION TO VOICE FILES

Publication number: 20140350918

Abstract: A method and system for adding punctuation to a voice file is disclosed. The method includes: utilizing silence or pause duration detection to divide a voice file into a plurality of speech segments for processing, the voice file includes a plurality of features units; identifying all features units that appear in the voice file according to every term or expression and semantics features of the every term or expression that form each of the plurality of speech segments; using a linguistic model to determine a sum of weight of various punctuation modes in the voice file according to all the feature units, the linguistic model is built upon semantics features of various parsed out terms or expressions from a body text of a spoken sentence according to a language library; and adding punctuations to the voice file based on the determined sum of weight of the various punctuation modes.

Type: Application

Filed: March 19, 2014

Publication date: November 27, 2014

Applicant: TENCENT TECHNOLOGY (SHENZHEN) CO., LTD.

Inventors: Haibo LIU, Eryu WANG, Xiang ZHANG, Li LU, Shuai YUE, Bo CHEN, Lou LI, Jian LIU
Systems and Methods for Voice Identification

Publication number: 20140350934

Abstract: Systems and methods are provided for voice identification. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as an identification result.

Type: Application

Filed: May 30, 2014

Publication date: November 27, 2014

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lou Li, Li Lu, Xiang Zhang, Feng Rao, Shuai Yue, Bo Chen, Jianxiong Ma, Haibo Liu
USER AUTHENTICATION METHOD AND APPARATUS BASED ON AUDIO AND VIDEO DATA

Publication number: 20140237576

Abstract: A computer-implemented method is performed at a server having one or more processors and memory storing programs executed by the one or more processors for authenticating a user from video and audio data. The method includes: receiving a login request from a mobile device, the login request including video data and audio data; extracting a group of facial features from the video data; extracting a group of audio features from the audio data and recognizing a sequence of words in the audio data; identifying a first user account whose respective facial features match the group of facial features and a second user account whose respective audio features match the group of audio features. If the first user account is the same as the second user account, retrieve the sequence of words associated with the user account and compare the sequences of words for authentication purpose.

Type: Application

Filed: April 25, 2014

Publication date: August 21, 2014

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Xiang ZHANG, Li LU, Eryu WANG, Shuai YUE, Feng RAO, Haibo LlU, Lou LI, Duling LU, Bo CHEN
METHOD AND DEVICE FOR ERROR CORRECTION MODEL TRAINING AND TEXT ERROR CORRECTION

Publication number: 20140214401

Abstract: A computer-implemented method is performed at a device having one or more processors and memory storing programs executed by the one or more processors. The method comprises: selecting a target word in a target sentence; from the target sentence, acquiring a first sequence of words that precede the target word and a second sequence of words that succeed the target word; from a sentence database, searching and acquiring a group of words, each of which separates the first sequence of words from the second sequence of words in a sentence; creating a candidate sentence for each of the candidate words by replacing the target word in the target sentence with each of the candidate words; determining the fittest sentence among the candidate sentences according to a linguistic model; and suggesting the candidate word within the fittest sentence as a correction.

Type: Application

Filed: December 13, 2013

Publication date: July 31, 2014

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lou LI, Qiang CHENG, Feng RAO, Li LU, Xiang ZHANG, Shuai YUE, Bo CHEN
METHOD AND DEVICE FOR VOICEPRINT RECOGNITION

Publication number: 20140214417

Abstract: A method and device for voiceprint recognition, include: establishing a first-level Deep Neural Network (DNN) model based on unlabeled speech data, the unlabeled speech data containing no speaker labels and the first-level DNN model specifying a plurality of basic voiceprint features for the unlabeled speech data; obtaining a plurality of high-level voiceprint features by tuning the first-level DNN model based on labeled speech data, the labeled speech data containing speech samples with respective speaker labels, and the tuning producing a second-level DNN model specifying the plurality of high-level voiceprint features; based on the second-level DNN model, registering a respective high-level voiceprint feature sequence for a user based on a registration speech sample received from the user; and performing speaker verification for the user based on the respective high-level voiceprint feature sequence registered for the user.

Type: Application

Filed: December 12, 2013

Publication date: July 31, 2014

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Eryu WANG, Li LU, Xiang ZHANG, Haibo LIU, Lou LI, Feng RAO, Duling LU, Shuai YUE, Bo CHEN
METHOD AND SYSTEM FOR AUTOMATIC SPEECH RECOGNITION

Publication number: 20140214419

Abstract: An automatic speech recognition method includes at a computer having one or more processors and memory for storing one or more programs to be executed by the processors, obtaining a plurality of speech corpus categories through classifying and calculating raw speech corpus; obtaining a plurality of classified language models that respectively correspond to the plurality of speech corpus categories through a language model training applied on each speech corpus category; obtaining an interpolation language model through implementing a weighted interpolation on each classified language model and merging the interpolated plurality of classified language models; constructing a decoding resource in accordance with an acoustic model and the interpolation language model; and decoding input speech using the decoding resource, and outputting a character string with a highest probability as a recognition result of the input speech.

Type: Application

Filed: December 16, 2013

Publication date: July 31, 2014

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Feng Rao, Li Lu, Bo Chen, Shuai Yue, Xiang Zhang, Eryu Wang, Dadong Xie, Lou Li, Duling Lu