Patents by Inventor Hongxu Ji

Hongxu Ji has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Keyword extraction method, apparatus and medium

Patent number: 11630954

Abstract: A keyword extraction method includes: extracting candidate words from an original document to form a first word set; acquiring a first association degree between each first word thereof and the original document, and determining a second word set according to the first association degree; for each second word in the second word set, inquiring, in a word association topology, at least one node word satisfying a condition of association with the second word and forming a third word set, the word association topology indicating an association relation among multiple node words in a predetermined field; and determining a union set of the second and third word sets, acquiring a second association degree between each candidate keyword in the union set and the original document, and selecting, according to the second association degree, at least one candidate keyword from the union set, to form a keyword set of the original document.

Type: Grant

Filed: March 24, 2020

Date of Patent: April 18, 2023

Assignee: Beijing Xiaomi Intelligent Technology Co., Ltd.

Inventors: Qun Guo, Xiao Lu, Erli Meng, Bin Wang, Liang Shi, Hongxu Ji, Baoyuan Qi
Method and device for keyword extraction and storage medium

Patent number: 11580303

Abstract: A method and device for keyword extraction and a storage medium. The method includes receiving, at a terminal, an original document, acquiring, at the terminal, a candidate set by extracting at least one candidate phrase from the original document, acquiring, at the terminal, an association degree between the at least one candidate phrase in the candidate set and the original document, acquiring, at the terminal, a divergence degree of the at least one candidate phrase in the candidate set, and updating, at the terminal, a key phrase set of the original document by selecting the at least one candidate phrase from the candidate set as at least one key phrase based on the association degree and the divergence degree.

Type: Grant

Filed: March 25, 2020

Date of Patent: February 14, 2023

Assignee: Beijing Xiaomi Mobile Software Co., Ltd.

Inventors: Qun Guo, Xiao Lu, Erli Meng, Bin Wang, Liang Shi, Baoyuan Qi, Hongxu Ji
Method and device for optimizing training set for text classification and storage medium

Patent number: 11507882

Abstract: A method for optimizing a training set for text classification includes: the training set for text classification is acquired; part of samples are selected from the training set as a first initial training subset, and an incorrectly tagged sample in the first initial training subset is corrected to obtain a second initial training subset; a text classification model is trained according to the second initial training subset; the samples in the training set are predicted by the trained text classification model to obtain a prediction result; an incorrectly tagged sample set is generated according to the prediction result; a key incorrectly tagged sample is selected from the incorrectly tagged sample set, and a tag of the key incorrectly tagged sample is corrected to generate a correctly tagged sample corresponding to the key incorrectly tagged sample; and the training set is updated by using the correctly tagged sample.

Type: Grant

Filed: November 25, 2019

Date of Patent: November 22, 2022

Assignee: Beijing Xiaomi Intelligent Technology Co., Ltd.

Inventors: Hongxu Ji, Qun Guo, Xiao Lu, Erli Meng
Method and device for evaluating quality of content, electronic equipment, and storage medium

Patent number: 11475879

Abstract: Text content is determined. The text content is input to a content classifying model. The content classifying model is adapted to determine a probability of the text content belonging to a category. An evaluated value of quality of the text content is determined according to the probability of the category and a weight of the category. The weight represents importance of the category.

Type: Grant

Filed: August 14, 2020

Date of Patent: October 18, 2022

Assignee: Beijing Xiaomi Pinecone Electronics Co., Ltd.

Inventors: Xiao Lu, Qun Guo, Erli Meng, Bin Wang, Hongxu Ji, Lei Sun
METHOD AND DEVICE FOR EVALUATING QUALITY OF CONTENT, ELECTRONIC EQUIPMENT, AND STORAGE MEDIUM

Publication number: 20210295827

Abstract: Text content is determined. The text content is input to a content classifying model. The content classifying model is adapted to determine a probability of the text content belonging to a category. An evaluated value of quality of the text content is determined according to the probability of the category and a weight of the category. The weight represents importance of the category.

Type: Application

Filed: August 14, 2020

Publication date: September 23, 2021

Applicant: BEIJING XIAOMI PINECONE ELECTRONICS CO., LTD.

Inventors: Xiao LU, Qun GUO, Erli MENG, Bin WANG, Hongxu JI, Lei SUN
METHOD AND DEVICE FOR KEYWORD EXTRACTION AND STORAGE MEDIUM

Publication number: 20210182490

Abstract: A method and device for keyword extraction and a storage medium. The method includes receiving, at a terminal, an original document, acquiring, at the terminal, a candidate set by extracting at least one candidate phrase from the original document, acquiring, at the terminal, an association degree between the at least one candidate phrase in the candidate set and the original document, acquiring, at the terminal, a divergence degree of the at least one candidate phrase in the candidate set, and updating, at the terminal, a key phrase set of the original document by selecting the at least one candidate phrase from the candidate set as at least one key phrase based on the association degree and the divergence degree.

Type: Application

Filed: March 25, 2020

Publication date: June 17, 2021

Applicant: BEIJING XIAOMI MOBILE SOFTWARE CO., LTD.

Inventors: Qun GUO, Xiao LU, Erli MENG, Bin WANG, Liang SHI, Baoyuan QI, Hongxu JI
METHOD AND DEVICE FOR OPTIMIZING TRAINING SET FOR TEXT CLASSIFICATION AND STORAGE MEDIUM

Publication number: 20210081832

Abstract: A method for optimizing a training set for text classification includes: the training set for text classification is acquired; part of samples are selected from the training set as a first initial training subset, and an incorrectly tagged sample in the first initial training subset is corrected to obtain a second initial training subset; a text classification model is trained according to the second initial training subset; the samples in the training set are predicted by the trained text classification model to obtain a prediction result; an incorrectly tagged sample set is generated according to the prediction result; a key incorrectly tagged sample is selected from the incorrectly tagged sample set, and a tag of the key incorrectly tagged sample is corrected to generate a correctly tagged sample corresponding to the key incorrectly tagged sample; and the training set is updated by using the correctly tagged sample.

Type: Application

Filed: November 25, 2019

Publication date: March 18, 2021

Applicant: Beijing Xiaomi Intelligent Technology Co., Ltd.

Inventors: Hongxu JI, Qun GUO, Xiao LU, Erli MENG
KEYWORD EXTRACTION METHOD, KEYWORD EXTRACTION DEVICE AND COMPUTER-READABLE STORAGE MEDIUM

Publication number: 20200250376

Abstract: A keyword extraction method includes: extracting candidate words from an original document to form a first word set; acquiring the first correlation degree between each candidate word in the first word set and the original document, and based on which determining a second word set; generating predicted words forming a third word set through a prediction model; determining a union set of the second and third word sets, acquiring the second correlation degree between each of the candidate keywords in the union set and the original document, acquiring a divergence of each candidate keyword in the union set; and selecting candidate keywords from the union set as keywords based on the second correlation degree and the divergence. Keyword redundancy can be avoided through the divergence of keywords. The final keywords are not affected by the frequency of candidate words, and the expression mode of keywords can be enriched.

Type: Application

Filed: April 22, 2020

Publication date: August 6, 2020

Applicant: Beijing Xiaomi Intelligent Technology Co., Ltd.

Inventors: Qun Guo, Xiao Lu, Erli Meng, Bin Wang, Liang Shi, Baoyuan Qi, Hongxu Ji
KEYWORD EXTRACTION METHOD, APPARATUS AND MEDIUM

Publication number: 20200226367

Abstract: A keyword extraction method includes: extracting candidate words from an original document to form a first word set; acquiring a first association degree between each first word thereof and the original document, and determining a second word set according to the first association degree; for each second word in the second word set, inquiring, in a word association topology, at least one node word satisfying a condition of association with the second word and forming a third word set, the word association topology indicating an association relation among multiple node words in a predetermined field; and determining a union set of the second and third word sets, acquiring a second association degree between each candidate keyword in the union set and the original document, and selecting, according to the second association degree, at least one candidate keyword from the union set, to form a keyword set of the original document.

Type: Application

Filed: March 24, 2020

Publication date: July 16, 2020

Applicant: Beijing Xiaomi Intelligent Technology Co., Ltd.

Inventors: Qun Guo, Xiao Lu, Erli Meng, Bin Wang, Liang Shi, Hongxu Ji, Baoyuan Qi