Patents by Inventor Hongxu Ji
Hongxu Ji has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11630954Abstract: A keyword extraction method includes: extracting candidate words from an original document to form a first word set; acquiring a first association degree between each first word thereof and the original document, and determining a second word set according to the first association degree; for each second word in the second word set, inquiring, in a word association topology, at least one node word satisfying a condition of association with the second word and forming a third word set, the word association topology indicating an association relation among multiple node words in a predetermined field; and determining a union set of the second and third word sets, acquiring a second association degree between each candidate keyword in the union set and the original document, and selecting, according to the second association degree, at least one candidate keyword from the union set, to form a keyword set of the original document.Type: GrantFiled: March 24, 2020Date of Patent: April 18, 2023Assignee: Beijing Xiaomi Intelligent Technology Co., Ltd.Inventors: Qun Guo, Xiao Lu, Erli Meng, Bin Wang, Liang Shi, Hongxu Ji, Baoyuan Qi
-
Patent number: 11580303Abstract: A method and device for keyword extraction and a storage medium. The method includes receiving, at a terminal, an original document, acquiring, at the terminal, a candidate set by extracting at least one candidate phrase from the original document, acquiring, at the terminal, an association degree between the at least one candidate phrase in the candidate set and the original document, acquiring, at the terminal, a divergence degree of the at least one candidate phrase in the candidate set, and updating, at the terminal, a key phrase set of the original document by selecting the at least one candidate phrase from the candidate set as at least one key phrase based on the association degree and the divergence degree.Type: GrantFiled: March 25, 2020Date of Patent: February 14, 2023Assignee: Beijing Xiaomi Mobile Software Co., Ltd.Inventors: Qun Guo, Xiao Lu, Erli Meng, Bin Wang, Liang Shi, Baoyuan Qi, Hongxu Ji
-
Patent number: 11507882Abstract: A method for optimizing a training set for text classification includes: the training set for text classification is acquired; part of samples are selected from the training set as a first initial training subset, and an incorrectly tagged sample in the first initial training subset is corrected to obtain a second initial training subset; a text classification model is trained according to the second initial training subset; the samples in the training set are predicted by the trained text classification model to obtain a prediction result; an incorrectly tagged sample set is generated according to the prediction result; a key incorrectly tagged sample is selected from the incorrectly tagged sample set, and a tag of the key incorrectly tagged sample is corrected to generate a correctly tagged sample corresponding to the key incorrectly tagged sample; and the training set is updated by using the correctly tagged sample.Type: GrantFiled: November 25, 2019Date of Patent: November 22, 2022Assignee: Beijing Xiaomi Intelligent Technology Co., Ltd.Inventors: Hongxu Ji, Qun Guo, Xiao Lu, Erli Meng
-
Patent number: 11475879Abstract: Text content is determined. The text content is input to a content classifying model. The content classifying model is adapted to determine a probability of the text content belonging to a category. An evaluated value of quality of the text content is determined according to the probability of the category and a weight of the category. The weight represents importance of the category.Type: GrantFiled: August 14, 2020Date of Patent: October 18, 2022Assignee: Beijing Xiaomi Pinecone Electronics Co., Ltd.Inventors: Xiao Lu, Qun Guo, Erli Meng, Bin Wang, Hongxu Ji, Lei Sun
-
Publication number: 20210295827Abstract: Text content is determined. The text content is input to a content classifying model. The content classifying model is adapted to determine a probability of the text content belonging to a category. An evaluated value of quality of the text content is determined according to the probability of the category and a weight of the category. The weight represents importance of the category.Type: ApplicationFiled: August 14, 2020Publication date: September 23, 2021Applicant: BEIJING XIAOMI PINECONE ELECTRONICS CO., LTD.Inventors: Xiao LU, Qun GUO, Erli MENG, Bin WANG, Hongxu JI, Lei SUN
-
Publication number: 20210182490Abstract: A method and device for keyword extraction and a storage medium. The method includes receiving, at a terminal, an original document, acquiring, at the terminal, a candidate set by extracting at least one candidate phrase from the original document, acquiring, at the terminal, an association degree between the at least one candidate phrase in the candidate set and the original document, acquiring, at the terminal, a divergence degree of the at least one candidate phrase in the candidate set, and updating, at the terminal, a key phrase set of the original document by selecting the at least one candidate phrase from the candidate set as at least one key phrase based on the association degree and the divergence degree.Type: ApplicationFiled: March 25, 2020Publication date: June 17, 2021Applicant: BEIJING XIAOMI MOBILE SOFTWARE CO., LTD.Inventors: Qun GUO, Xiao LU, Erli MENG, Bin WANG, Liang SHI, Baoyuan QI, Hongxu JI
-
Publication number: 20210081832Abstract: A method for optimizing a training set for text classification includes: the training set for text classification is acquired; part of samples are selected from the training set as a first initial training subset, and an incorrectly tagged sample in the first initial training subset is corrected to obtain a second initial training subset; a text classification model is trained according to the second initial training subset; the samples in the training set are predicted by the trained text classification model to obtain a prediction result; an incorrectly tagged sample set is generated according to the prediction result; a key incorrectly tagged sample is selected from the incorrectly tagged sample set, and a tag of the key incorrectly tagged sample is corrected to generate a correctly tagged sample corresponding to the key incorrectly tagged sample; and the training set is updated by using the correctly tagged sample.Type: ApplicationFiled: November 25, 2019Publication date: March 18, 2021Applicant: Beijing Xiaomi Intelligent Technology Co., Ltd.Inventors: Hongxu JI, Qun GUO, Xiao LU, Erli MENG
-
Publication number: 20200250376Abstract: A keyword extraction method includes: extracting candidate words from an original document to form a first word set; acquiring the first correlation degree between each candidate word in the first word set and the original document, and based on which determining a second word set; generating predicted words forming a third word set through a prediction model; determining a union set of the second and third word sets, acquiring the second correlation degree between each of the candidate keywords in the union set and the original document, acquiring a divergence of each candidate keyword in the union set; and selecting candidate keywords from the union set as keywords based on the second correlation degree and the divergence. Keyword redundancy can be avoided through the divergence of keywords. The final keywords are not affected by the frequency of candidate words, and the expression mode of keywords can be enriched.Type: ApplicationFiled: April 22, 2020Publication date: August 6, 2020Applicant: Beijing Xiaomi Intelligent Technology Co., Ltd.Inventors: Qun Guo, Xiao Lu, Erli Meng, Bin Wang, Liang Shi, Baoyuan Qi, Hongxu Ji
-
Publication number: 20200226367Abstract: A keyword extraction method includes: extracting candidate words from an original document to form a first word set; acquiring a first association degree between each first word thereof and the original document, and determining a second word set according to the first association degree; for each second word in the second word set, inquiring, in a word association topology, at least one node word satisfying a condition of association with the second word and forming a third word set, the word association topology indicating an association relation among multiple node words in a predetermined field; and determining a union set of the second and third word sets, acquiring a second association degree between each candidate keyword in the union set and the original document, and selecting, according to the second association degree, at least one candidate keyword from the union set, to form a keyword set of the original document.Type: ApplicationFiled: March 24, 2020Publication date: July 16, 2020Applicant: Beijing Xiaomi Intelligent Technology Co., Ltd.Inventors: Qun Guo, Xiao Lu, Erli Meng, Bin Wang, Liang Shi, Hongxu Ji, Baoyuan Qi