Patents by Inventor Feng Rao

Feng Rao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for speech recognition

Patent number: 9558741

Abstract: Systems and methods are provided for speech recognition. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as a speech recognition result.

Type: Grant

Filed: May 30, 2014

Date of Patent: January 31, 2017

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventors: Lou Li, Li Lu, Xiang Zhang, Feng Rao, Shuai Yue, Bo Chen, Jianxiong Ma, Haibo Liu
METHOD AND DEVICE FOR VOICEPRINT RECOGNITION

Publication number: 20160358610

Abstract: A method is performed at a device having one or more processors and memory. The device establishes a first-level Deep Neural Network (DNN) model based on unlabeled speech data, the unlabeled speech data containing no speaker labels and the first-level DNN model specifying a plurality of basic voiceprint features for the unlabeled speech data. The device establishes a second-level DNN model by tuning the first-level DNN model based on labeled speech data, the labeled speech data containing speech samples with respective speaker labels, wherein the second-level DNN model specifies a plurality of high-level voiceprint features. Using the second-level DNN model, the device registers a first high-level voiceprint feature sequence for a user based on a registration speech sample received from the user. The device performs speaker verification for the user based on the first high-level voiceprint feature sequence registered for the user.

Type: Application

Filed: August 18, 2016

Publication date: December 8, 2016

Inventors: Eryu WANG, Li LU, Xiang ZHANG, Haibo LIU, Lou LI, Feng RAO, Duling LU, Shuai YUE, Bo CHEN
Method and device for parallel processing in model training

Patent number: 9508347

Abstract: A method and a device for training a DNN model include: at a device including one or more processors and memory: establishing an initial DNN model; dividing a training data corpus into a plurality of disjoint data subsets; for each of the plurality of disjoint data subsets, providing the data subset to a respective training processing unit of a plurality of training processing units operating in parallel, wherein the respective training processing unit applies a Stochastic Gradient Descent (SGD) process to update the initial DNN model to generate a respective DNN sub-model based on the data subset; and merging the respective DNN sub-models generated by the plurality of training processing units to obtain an intermediate DNN model, wherein the intermediate DNN model is established as either the initial DNN model for a next training iteration or a final DNN model in accordance with a preset convergence condition.

Type: Grant

Filed: December 16, 2013

Date of Patent: November 29, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Eryu Wang, Li Lu, Xiang Zhang, Haibo Liu, Feng Rao, Lou Li, Shuai Yue, Bo Chen
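
The parallel training loop described in the abstract of patent 9508347 can be illustrated with a minimal sketch. This is not the patented implementation. It assumes a two-parameter linear model in place of a DNN, round-robin splitting of the corpus, threads standing in for the training processing units, and parameter averaging as the merge step; the abstract specifies none of these, and every function name here is hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def sgd_pass(model, subset, lr=0.05):
    """Run one SGD pass over a data subset, starting from the shared model."""
    w, b = model
    for x, y in subset:
        err = (w * x + b) - y      # prediction error for this sample
        w -= lr * err * x          # gradient step on the weight
        b -= lr * err              # gradient step on the bias
    return (w, b)


def merge(sub_models):
    """Merge sub-models by averaging parameters (an assumed merge rule)."""
    n = len(sub_models)
    return (sum(m[0] for m in sub_models) / n,
            sum(m[1] for m in sub_models) / n)


def split_disjoint(corpus, k):
    """Divide the corpus into k disjoint subsets, round-robin."""
    return [corpus[i::k] for i in range(k)]


def train(corpus, units=4, max_iters=200, tol=1e-6):
    """Iterate: parallel SGD on each subset, merge, check convergence."""
    model = (0.0, 0.0)             # initial model
    subsets = split_disjoint(corpus, units)
    with ThreadPoolExecutor(max_workers=units) as pool:
        for _ in range(max_iters):
            current = model
            sub_models = list(pool.map(lambda s: sgd_pass(current, s), subsets))
            merged = merge(sub_models)
            # Assumed convergence condition: parameters stopped moving.
            change = max(abs(merged[0] - model[0]), abs(merged[1] - model[1]))
            model = merged         # intermediate model seeds the next iteration
            if change < tol:
                break
    return model


# Toy corpus drawn from y = 2x + 1 (illustrative data only).
corpus = [(x / 10.0, 2.0 * (x / 10.0) + 1.0) for x in range(-10, 11)]
final_model = train(corpus)
```

On this noise-free toy corpus the merged model should approach a weight near 2 and a bias near 1. Averaging is only one possible merge rule; a weighted combination would fit the same loop.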
Method and device for voiceprint recognition

Patent number: 9502038

Abstract: A method and device for voiceprint recognition include: establishing a first-level Deep Neural Network (DNN) model based on unlabeled speech data, the unlabeled speech data containing no speaker labels and the first-level DNN model specifying a plurality of basic voiceprint features for the unlabeled speech data; obtaining a plurality of high-level voiceprint features by tuning the first-level DNN model based on labeled speech data, the labeled speech data containing speech samples with respective speaker labels, and the tuning producing a second-level DNN model specifying the plurality of high-level voiceprint features; based on the second-level DNN model, registering a respective high-level voiceprint feature sequence for a user based on a registration speech sample received from the user; and performing speaker verification for the user based on the respective high-level voiceprint feature sequence registered for the user.

Type: Grant

Filed: December 12, 2013

Date of Patent: November 22, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Eryu Wang, Li Lu, Xiang Zhang, Haibo Liu, Lou Li, Feng Rao, Duling Lu, Shuai Yue, Bo Chen
Method and system for automatic speech recognition

Patent number: 9472190

Abstract: A method of recognizing speech is provided that includes generating a decoding network that includes a primary sub-network and a classification sub-network. The primary sub-network includes a classification node corresponding to the classification sub-network. The classification sub-network corresponds to a group of uncommon words. A speech input is received and decoded by instantiating a token in the primary sub-network and passing the token through the primary network. When the token reaches the classification node, the method includes transferring the token to the classification sub-network and passing the token through the classification sub-network. When the token reaches an accept node of the classification sub-network, the method includes returning a result of the token passing through the classification sub-network to the primary sub-network. The result includes one or more words in the group of uncommon words. A string corresponding to the speech input is output that includes the one or more words.

Type: Grant

Filed: April 28, 2014

Date of Patent: October 18, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Shuai Yue, Li Lu, Xiang Zhang, Dadong Xie, Bo Chen, Feng Rao
Keyword detection with international phonetic alphabet by foreground model and background model

Patent number: 9466289

Abstract: An electronic device with one or more processors and memory trains an acoustic model with an international phonetic alphabet (IPA) phoneme mapping collection and audio samples in different languages, where the acoustic model includes: a foreground model; and a background model. The device generates a phone decoder based on the trained acoustic model. The device collects keyword audio samples, decodes the keyword audio samples with the phone decoder to generate phoneme sequence candidates, and selects a keyword phoneme sequence from the phoneme sequence candidates. After obtaining the keyword phoneme sequence, the device detects one or more keywords in an input audio signal with the trained acoustic model, including: matching phonemic keyword portions of the input audio signal with phonemes in the keyword phoneme sequence with the foreground model; and filtering out phonemic non-keyword portions of the input audio signal with the background model.

Type: Grant

Filed: December 11, 2013

Date of Patent: October 11, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Li Lu, Xiang Zhang, Shuai Yue, Feng Rao, Eryu Wang, Lu Li
Method, system and computer storage medium for visual searching based on cloud service

Patent number: 9411849

Abstract: A method, system and computer storage medium for visual searching based on cloud service is disclosed. The method includes: receiving, from a client, an image recognition request of cloud service, the request containing image data; forwarding, according to a set classified forwarding rule, the image data to a corresponding classified visual search service; recognizing, by the respective corresponding classified visual search services, corresponding classified type information in the image data, and determining a corresponding name of the image data in accordance with the respective classified type information, and obtaining a classified visual search result; summarizing and sending, to a client, the classified visual search result of the corresponding classified visual search service.

Type: Grant

Filed: April 9, 2013

Date of Patent: August 9, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Hailong Liu, Jie Hou, Pengfei Xiong, Bo Chen, Xiaobo Zhou, Feng Rao
Method and device for acoustic language model training

Patent number: 9396723

Abstract: A method and a device for training an acoustic language model include: conducting word segmentation for training samples in a training corpus using an initial language model containing no word class labels, to obtain initial word segmentation data containing no word class labels; performing word class replacement for the initial word segmentation data containing no word class labels, to obtain first word segmentation data containing word class labels; using the first word segmentation data containing word class labels to train a first language model containing word class labels; using the first language model containing word class labels to conduct word segmentation for the training samples in the training corpus, to obtain second word segmentation data containing word class labels; and in accordance with the second word segmentation data meeting one or more predetermined criteria, using the second word segmentation data containing word class labels to train the acoustic language model.

Type: Grant

Filed: December 17, 2013

Date of Patent: July 19, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Duling Lu, Lu Li, Feng Rao, Bo Chen, Li Lu, Xiang Zhang, Eryu Wang, Shuai Yue
Method and apparatus for building a language model

Patent number: 9396724

Abstract: A method includes: acquiring data samples; performing categorized sentence mining in the acquired data samples to obtain categorized training samples for multiple categories; building a text classifier based on the categorized training samples; classifying the data samples using the text classifier to obtain a class vocabulary and a corpus for each category; mining the corpus for each category according to the class vocabulary for the category to obtain a respective set of high-frequency language templates; training on the templates for each category to obtain a template-based language model for the category; training on the corpus for each category to obtain a class-based language model for the category; training on the class vocabulary for each category to obtain a lexicon-based language model for the category; building a speech decoder according to an acoustic model, the class-based language model and the lexicon-based language model for any given field, and the data samples.

Type: Grant

Filed: February 14, 2014

Date of Patent: July 19, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Feng Rao, Li Lu, Bo Chen, Xiang Zhang, Shuai Yue, Lu Li
Phase-change storage unit for replacing DRAM and FLASH and manufacturing method thereof

Patent number: 9362493

Abstract: The present invention provides a phase-change storage unit for replacing DRAM and FLASH and a manufacturing method thereof, and the phase-change storage unit includes a phase-change material layer and a cylindrical lower electrode being in contact with and located below the phase-change material layer, where the phase-change material layer is formed by connecting a side wall layer and a round bottom layer, forms a hollow cylinder or hollow inverted conical frustum having an opening at an upper part, and the hollow cylinder or hollow inverted conical frustum is internally filled with a medium layer.

Type: Grant

Filed: December 26, 2012

Date of Patent: June 7, 2016

Assignee: SHANGHAI INSTITUTE OF MICROSYSTEM AND INFORMATION TECHNOLOGY, CHINESE ACADEMY OF SCIENCES

Inventors: Feng Rao, Kun Ren, Zhitang Song, Yuefeng Gong, Wanchun Ren
Method and apparatus for performing speech keyword retrieval

Patent number: 9355637

Abstract: A method and an apparatus are provided for retrieving a keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using the recognition models in the model file one by one, and determines a recognition model based on a language matching rate; determines a decoding model corresponding to the recognition model; decodes the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary against the word recognition result, and outputs a matched keyword.

Type: Grant

Filed: February 11, 2015

Date of Patent: May 31, 2016

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventors: Jianxiong Ma, Lu Li, Li Lu, Xiang Zhang, Shuai Yue, Feng Rao, Eryu Wang, Linghui Kong
Language recognition based on vocabulary lists

Patent number: 9336197

Abstract: A method is implemented at a computer to determine that certain information content is composed or compiled in a specific language selected among two or more similar languages. The computer integrates a first vocabulary list of a first language and a second vocabulary list of a second language into a comprehensive vocabulary list. The integrating includes analyzing the first vocabulary list in view of the second vocabulary list to identify a first vocabulary sub-list that is used in the first language, but not in the second language. The computer then identifies, in the information content, a plurality of expressions that are included in the comprehensive vocabulary list, and a subset of expressions that are included in the first vocabulary sub-list. Upon a determination that a total frequency of occurrence of the subset of expressions meets predetermined occurrence criteria, the computer determines that the information content is composed in the first language.

Type: Grant

Filed: December 16, 2013

Date of Patent: May 10, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Lu Li, Qiang Cheng, Jianxiong Ma, Feng Rao, Duling Lu, Li Lu, Xiang Zhang, Bo Chen
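
The vocabulary-list decision described in the abstract of patent 9336197 can be sketched as follows. This is an illustrative sketch under stated assumptions, not the patented method: tokens are split on whitespace, the "predetermined occurrence criteria" are modeled as a minimum ratio of distinctive expressions to all recognized expressions, and the toy vocabularies, function names, and the `min_ratio` parameter are all hypothetical.

```python
def build_lists(vocab_first, vocab_second):
    """Integrate two vocabulary lists and derive the first-language-only sub-list."""
    comprehensive = set(vocab_first) | set(vocab_second)
    first_only = set(vocab_first) - set(vocab_second)
    return comprehensive, first_only


def is_first_language(text, vocab_first, vocab_second, min_ratio=0.2):
    """Return True when first-language-only expressions are frequent enough."""
    comprehensive, first_only = build_lists(vocab_first, vocab_second)
    tokens = text.lower().split()
    # Expressions in the content that appear in the comprehensive list.
    known = [t for t in tokens if t in comprehensive]
    # The subset of those expressions found only in the first language.
    distinctive = [t for t in known if t in first_only]
    if not known:
        return False
    # Assumed occurrence criterion: share of distinctive expressions.
    return len(distinctive) / len(known) >= min_ratio


# Toy vocabularies for two similar language variants (illustrative only).
vocab_first = ["the", "of", "colour", "lorry"]
vocab_second = ["the", "of", "color", "truck"]

result_first = is_first_language("the colour of the lorry", vocab_first, vocab_second)
result_second = is_first_language("the color of the truck", vocab_first, vocab_second)
```

In the first call two of the five recognized tokens are distinctive to the first list, so the check passes; in the second call none are, so it fails. A real system would need proper segmentation for languages that are not whitespace-delimited.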
SB-TE-TI PHASE-CHANGE MEMORY MATERIAL AND TI-SB2TE3 PHASE-CHANGE MEMORY MATERIAL

Publication number: 20160099050

Abstract: An Sb-Te-Ti phase-change thin-film material applicable to a phase-change memory and preparation thereof. The Sb-Te-Ti phase-change memory material is formed by doping an Sb-Te phase-change material with Ti, Ti forms bonds with both Sb and Te, and the Sb-Te-Ti phase-change memory material has a chemical formula SbxTeyTi100-x-y, where 0<x<80 and 0<y<100-x. When the Sb-Te-Ti phase-change memory material is a Ti-Sb2Te3 phase-change memory material, Ti atoms replace Sb atoms, and phase separation does not occur. The crystallization temperature of the Sb-Te-Ti phase-change memory material is significantly raised, retention is improved, and thermal stability is enhanced; meanwhile, the amorphous state resistance decreases, and the crystalline state resistance increases; and the Sb-Te-Ti phase-change memory material has wide application in phase-change memories.

Type: Application

Filed: December 11, 2015

Publication date: April 7, 2016

Applicant: SHANGHAI INSTITUTE OF MICROSYSTEM AND INFORMATION TECHNOLOGY, CHINESE ACADEMY OF SCIENCES

Inventors: Liangcai WU, Min ZHU, Zhitang SONG, Feng RAO, Cheng PENG, Xilin ZHOU, Kun REN, Songlin FENG
SYSTEMS AND METHODS FOR AUDIO COMMAND RECOGNITION

Publication number: 20160086609

Abstract: The present application discloses a method, an electronic system and a non-transitory computer readable storage medium for recognizing audio commands in an electronic device. The electronic device obtains audio data based on an audio signal provided by a user and extracts characteristic audio fingerprint features from the audio data. The electronic device further determines whether the corresponding audio signal is generated by an authorized user by comparing the characteristic audio fingerprint features with an audio fingerprint model for the authorized user and with a universal background model that represents user-independent audio fingerprint features, respectively. When the corresponding audio signal is generated by the authorized user of the electronic device, an audio command is extracted from the audio data, and an operation is performed according to the audio command.

Type: Application

Filed: December 3, 2015

Publication date: March 24, 2016

Inventors: Shuai Yue, Xiang Zhang, Li Lu, Feng Rao, Eryu Wang, Haibo Liu, Bo Chen, Jian Liu, Lu Li
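
The comparison against a user model and a universal background model described in the abstract of publication 20160086609 is often realized as a log-likelihood ratio test. The sketch below assumes that scoring rule, which the abstract does not state, and it replaces real audio fingerprint models with single one-dimensional Gaussians. All names, parameters, and feature values are hypothetical.

```python
import math


def avg_log_likelihood(features, mean, var):
    """Average per-frame log-likelihood of features under a 1-D Gaussian."""
    total = 0.0
    for x in features:
        total += -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)
    return total / len(features)


def is_authorized(features, user_model, background_model, threshold=0.0):
    """Accept when the user model explains the features better than the background."""
    llr = (avg_log_likelihood(features, *user_model)
           - avg_log_likelihood(features, *background_model))
    return llr > threshold


# Each model is (mean, variance); values are illustrative only.
user_model = (1.0, 0.25)         # stands in for the authorized user's model
background_model = (0.0, 1.0)    # stands in for the universal background model

accepted = is_authorized([0.9, 1.1, 1.0], user_model, background_model)
rejected = is_authorized([-0.5, 0.2, -0.1], user_model, background_model)
```

Features close to the user model's mean score higher under it than under the background model, so the first call accepts and the second rejects. Only after acceptance would the command extraction step in the abstract run.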
Phase-change storage unit containing TiSiN material layer and method for preparing the same

Patent number: 9276202

Abstract: The present invention provides a phase-change storage unit containing a TiSiN material layer and a method for preparing the same. The phase-change storage unit includes a phase-change material layer and a lower electrode located there below, the phase-change material layer and the lower electrode are connected by a TiSiN material layer, the lower electrode includes a bottom and a sheet side connected to the bottom, the sheet side is perpendicular to the bottom to form a blade structure, and the top of the sheet side contacts the TiSiN material layer. The present invention adopts annealing to increase the grain size of the electrode so as to reduce the overall resistance of the device and form a TiSiN material layer on the top of the lower electrode so as to reduce the effective operation region. The phase-change storage unit of the present invention is applied to a phase-change memory to achieve the advantages such as low power consumption, high density and high data retention performance.

Type: Grant

Filed: December 27, 2012

Date of Patent: March 1, 2016

Assignee: SHANGHAI INSTITUTE OF MICROSYSTEM AND INFORMATION TECHNOLOGY, CHINESE ACADEMY OF SCIENCES

Inventors: Zhitang Song, Yuefeng Gong, Feng Rao, Bo Liu, Yong Kang, Bangming Chen
Method and apparatus for performing speech keyword retrieval

Patent number: 9257118

Abstract: A method and an apparatus are provided for retrieving a keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using the recognition models in the model file one by one, and determines a recognition model based on a language matching rate; determines a decoding model corresponding to the recognition model; decodes the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary against the word recognition result, and outputs a matched keyword.

Type: Grant

Filed: February 11, 2015

Date of Patent: February 9, 2016

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventors: Jianxiong Ma, Lu Li, Li Lu, Xiang Zhang, Shuai Yue, Feng Rao, Eryu Wang, Linghui Kong
Keyword detection for speech recognition

Patent number: 9230541

Abstract: This application discloses a method of recognizing a keyword in a speech that includes a sequence of audio frames further including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.

Type: Grant

Filed: December 11, 2014

Date of Patent: January 5, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Lu Li, Li Lu, Jianxiong Ma, Linghui Kong, Feng Rao, Shuai Yue, Xiang Zhang, Haibo Liu, Eryu Wang, Bo Chen
Augmented reality interaction implementation method and system

Patent number: 9189699

Abstract: The present disclosure provides a method and system for realizing interaction in augmented reality. The method includes: collecting a frame image and uploading the frame image; recognizing a template image that matches the frame image and returning the template image; detecting a marker area of the frame image according to the template image; and superposing media data corresponding to the template image on the marker area and displaying the superposed image.

Type: Grant

Filed: May 17, 2013

Date of Patent: November 17, 2015

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventors: Xiao Liu, Hailong Liu, Jie Hou, Feng Rao, Minhui Wu, Bo Chen
User authentication method and apparatus based on audio and video data

Patent number: 9177131

Abstract: A computer-implemented method is performed at a server having one or more processors and memory storing programs executed by the one or more processors for authenticating a user from video and audio data. The method includes: receiving a login request from a mobile device, the login request including video data and audio data; extracting a group of facial features from the video data; extracting a group of audio features from the audio data and recognizing a sequence of words in the audio data; identifying a first user account whose respective facial features match the group of facial features and a second user account whose respective audio features match the group of audio features. If the first user account is the same as the second user account, retrieve the sequence of words associated with the user account and compare the sequences of words for authentication purpose.

Type: Grant

Filed: April 25, 2014

Date of Patent: November 3, 2015

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Xiang Zhang, Li Lu, Eryu Wang, Shuai Yue, Feng Rao, Haibo Liu, Lou Li, Duling Lu, Bo Chen
METHOD AND DEVICE FOR LOCATING FEATURE POINTS ON HUMAN FACE AND STORAGE MEDIUM

Publication number: 20150302240

Abstract: The present invention relates to a method, device and storage medium for locating feature points on a human face. The method for locating feature points on a human face of the present invention comprises: preliminary locating of the position of a human face by combining human face detection with human eye matching, and acquiring preliminary locating information; according to the preliminary locating information, fitting feature points on the human face; and according to the fitting result, completing the locating of the feature points on the human face. The present invention has the beneficial effects that: by using human face detection and human eye matching at the same time, and incorporating at least one feature of an appearance model according to the preliminary locating information, the position information about a human face is more accurately located, and the location of the feature points on the human face is accurately accomplished.

Type: Application

Filed: July 31, 2013

Publication date: October 22, 2015

Inventors: Feng RAO, Bo CHEN, Bin XIAO, Hailong LIU, Pengfei XIONG
