Patents by Inventor Xinyan Xiao

Xinyan Xiao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHOD FOR GENERATING QUERY STATEMENT, ELECTRONIC DEVICE AND STORAGE MEDIUM

Publication number: 20220179889

Abstract: The disclosure provides a method for generating a query statement. The method includes: determining a first vector representation based on known nodes in a first syntax tree corresponding to a query statement to be generated; determining a target generation strategy corresponding to a target node to be generated based on the first vector representation and a preset copy reference matrix; generating the target node based on the first vector representation or a second vector representation by performing the target generation strategy, in which the second vector representation is a vector representation corresponding to an adjacent query statement prior to the query statement to be generated; and generating the query statement based on the known nodes and a terminator in response to the target node being the terminator.

Type: Application

Filed: February 24, 2022

Publication date: June 9, 2022

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Ao ZHANG, Lijie WANG, Xinyan XIAO, Tingting LI
Cross-modality processing method and apparatus, and computer storage medium

Patent number: 11341366

Abstract: A cross-modality processing method is related to a field of natural language processing technologies. The method includes: obtaining a sample set, wherein the sample set includes a plurality of corpus and a plurality of images; generating a plurality of training samples according to the sample set, in which each of the plurality of the training samples is a combination of at least one of the plurality of the corpus and at least one of the plurality of the images corresponding to the at least one of the plurality of the corpus; adopting the plurality of the training samples to train a semantic model, so that the semantic model learns semantic vectors containing combinations of the corpus and the images.

Type: Grant

Filed: August 10, 2020

Date of Patent: May 24, 2022

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Guocheng Niu, Bolei He, Xinyan Xiao
METHOD FOR GENERATING SUMMARY, ELECTRONIC DEVICE AND STORAGE MEDIUM THEREOF

Publication number: 20220092252

Abstract: A method for generating a summary, an electronic device and a storage medium thereof, which relate to the natural language processing field, the deep learning field and the knowledge graph field, are disclosed. The method may include: acquiring a knowledge graph corresponding to a text to be processed, in the graph, nodes represent semantic concepts, and sides represent semantic relationships among the semantic concepts; encoding the text at a token level to obtain a context encoded representation of each token; determining an initial representation of each node in the knowledge graph according to the context encoded representation of each token; performing an encoding operation according to the initial representation of each node and the connection relationships among the nodes to obtain a node representation of each node; and performing a decoding operation according to the node representation of each node to obtain the summary of the text to be processed.

Type: Application

Filed: March 25, 2021

Publication date: March 24, 2022

Inventors: Wenhao Wu, Wei Li, Xinyan Xiao
Document recommendation method and device based on semantic tag

Patent number: 11216504

Abstract: A document recommendation method based on a semantic tag and a document recommendation device. The method includes: for each document, acquiring a first candidate tag set corresponding to the document, and processing each first candidate tag in the first candidate tag set corresponding to the document to obtain a second candidate tag set corresponding to the document; performing normalization processing on each second candidate tag in the second candidate tag set corresponding to the document to obtain a third candidate tag set corresponding to the document; performing expanding process on each third candidate tag in the third candidate tag set corresponding to the document, and acquiring a fourth candidate tag set corresponding to the document, to form a document library having semantic tags; and recommending a target document obtained from the document library having semantic tags to the user, according to historical semantic tag.

Type: Grant

Filed: December 6, 2019

Date of Patent: January 4, 2022

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Guocheng Niu, Bolei He, Chengxiang Liu, Xinyan Xiao, Yajuan Lyu
METHOD FOR TEXT GENERATION, DEVICE AND STORAGE MEDIUM

Publication number: 20210374349

Abstract: A method for text generation, relates to a field of natural language processing, including: obtaining corpus data; labeling the corpus data to obtain a first constraint element; obtaining a first generation target; and generating a first text matching the first generation target by inputting the corpus data and the first constraint element into a generation model.

Type: Application

Filed: August 9, 2021

Publication date: December 2, 2021

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Jiachen LIU, Xinyan XIAO, Hua WU, Haifeng WANG
METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM FOR TRAINING TEXT GENERATION MODEL

Publication number: 20210374359

Abstract: The disclosure may provide a method for obtaining a document layout, an electronic device, and a storage medium. The method may include: obtaining a plurality of pieces of first sample data; extracting structured information from each of the plurality of pieces of first sample data as target structured information corresponding to each of the plurality of pieces of first sample data; inputting the plurality of pieces of first sample data into an initial text generation model to generate predicted structured information corresponding to each of the plurality of pieces of first sample data; generating a first loss value based on a difference between the predicted structured information corresponding to each of the plurality of pieces of first sample data and the corresponding target structured information; and training a phrase generation ability of the initial text generation model based on the first loss value to generate the text generation model.

Type: Application

Filed: December 23, 2020

Publication date: December 2, 2021

Inventors: Wei LI, Xinyan XIAO, Hua WU, Haifeng WANG
METHOD AND DEVICE FOR PROCESSING SENTENCE, AND STORAGE MEDIUM

Publication number: 20210342379

Abstract: The disclosure discloses a method and a device for processing a sentence, and a storage medium. The detailed implementation includes: during processing a sentence, obtaining a dependency tree graph among respective segmented words in the sequence of segmented words by performing a dependency parsing on a sequence of segmented words of the sentence, inputting the dependency tree graph and a word vector corresponding to each segmented word into a preset graph neural network to obtain an intermediate word vector of each segmented word in the sequence of segmented words, and obtaining a processing result of the sentence by performing the downstream task on the intermediate word vector of each segmented word.

Type: Application

Filed: July 14, 2021

Publication date: November 4, 2021

Inventors: Shuai ZHANG, Lijie WANG, Ao ZHANG, Xinyan XIAO, Yue CHANG
CROSS-MODALITY PROCESSING METHOD AND APPARATUS, AND COMPUTER STORAGE MEDIUM

Publication number: 20210303921

Abstract: A cross-modality processing method is related to a field of natural language processing technologies. The method includes: obtaining a sample set, wherein the sample set includes a plurality of corpus and a plurality of images; generating a plurality of training samples according to the sample set, in which each of the plurality of the training samples is a combination of at least one of the plurality of the corpus and at least one of the plurality of the images corresponding to the at least one of the plurality of the corpus; adopting the plurality of the training samples to train a semantic model, so that the semantic model learns semantic vectors containing combinations of the corpus and the images.

Type: Application

Filed: August 10, 2020

Publication date: September 30, 2021

Inventors: Guocheng NIU, Bolei HE, Xinyan XIAO
IMPLEMENTING TEXT GENERATION

Publication number: 20210286934

Abstract: A method for implementing text generation, a device and a medium are provided. The method includes: determining a target task type of a target text generation task from multiple task types supported by a pre-trained general text generation model; determining, based on a requirement of the target text generation task for a target output text, a first target output text attribute for the target text generation task from multiple output text attributes supported by the general text generation model; and fine tuning the general text generation model based on a target training data set associated with the target text generation task to obtain a task-specific text generation model, by taking task indication information for the target task type and first attribute indication information for the first target output text attribute as at least part of an input of the general text generation model.

Type: Application

Filed: May 26, 2021

Publication date: September 16, 2021

Inventors: Jiachen LIU, Zhe HU, Xinyan XIAO, Hua WU
COMMENT INFORMATION PROCESSING METHOD AND APPARATUS, AND MEDIUM

Publication number: 20210200958

Abstract: The present disclosure discloses a comment information processing method and apparatus, and a medium. The specific implementation solution is: in response to a user operation, determining an opinion category corresponding to each opinion phrase in a comment opinion dictionary; obtaining a target corpus matching each opinion phrase from a plurality of comment corpora; for each opinion phrase, using a corresponding opinion category to label the target corpus matching each opinion phrase to obtain a first training sample; and training a classification model with the first training sample to identify the opinion category of a comment by using a trained classification model.

Type: Application

Filed: July 24, 2020

Publication date: July 1, 2021

Inventors: Hao LIU, Bolei HE, Xinyan XIAO
METHOD AND APPARATUS FOR OUTPUTTING INFORMATION

Publication number: 20210200951

Abstract: Embodiments of the present disclosure provide methods and apparatus for outputting information. The method may include: obtaining a sentence to be identified; Performing word segmentation on the to be identified sentence to obtain a word sequence; Inputting a word sequence into a pre-trained multi-task element recognition model based on sequence labeling and entity word prediction, and outputting the identified entity words, entity categories and entity word positions, where the multi-task element recognition model includes a sequence labeling network for performing sequence labeling tasks and an entity word predicting network for performing entity word predicting task, and the sequence labeling network is fused with the entity word predicting network through a fusion module.

Type: Application

Filed: June 9, 2020

Publication date: July 1, 2021

Inventors: Yuan GAO, Dai DAI, Xinyan XIAO
PRE-TRAINING METHOD FOR SENTIMENT ANALYSIS MODEL, AND ELECTRONIC DEVICE

Publication number: 20210200949

Abstract: The present disclosure provides a pre-training method for a sentiment analysis model and an electronic device, which relates to a field of artificial intelligence technologies. The method includes: based on a given seed sentiment dictionary, performing sentimental knowledge detection on a training corpus in a training corpus set, and determining a detection sentiment word and a detection word pair of the training corpus; according to preset mask processing rules, performing mask process on the training corpus to generate a masked corpus; performing encoding and decoding on the masked corpus by using a preset encoder and decoder to determine the detection sentiment word and the detection word pair of the training corpus; and updating the preset encoder and decoder according to a difference between prediction sentiment word and the detection sentiment word, and a difference between prediction word pair and the detection word pair.

Type: Application

Filed: July 21, 2020

Publication date: July 1, 2021

Inventors: Can GAO, Hao LIU, Bolei HE, Xinyan XIAO, Hao TIAN
METHOD AND APPARATUS FOR STRUCTURING DATA, RELATED COMPUTER DEVICE AND MEDIUM

Publication number: 20210191937

Abstract: A method and an apparatus for structuring data are related to information processing technologies in the field of natural language processing. By acquiring an unstructured text and inputting the unstructured text into an encoder-decoder model, an output sequence is obtained. The encoder-decoder model is trained using a training text marked with the attribute value of each attribute. A structured representation is generated based on the attributes corresponding to the attribute elements included in the output sequence and the attribute values comprised in the attribute elements.

Type: Application

Filed: July 28, 2020

Publication date: June 24, 2021

Inventors: Wei JIA, Dai DAI, Xinyan XIAO
SUMMARY GENERATION METHOD AND APPARATUS

Publication number: 20210182491

Abstract: Embodiments of the application disclose a summary generation method and apparatus. A specific embodiment of the method comprises: acquiring a target article including a headline and a body of the article; determining whether a question is included in the headline; determining, in the body of the article, an information-satisfied-paragraph including an answer to the question, in response to determining that the question is included in the headline; and generating a summary of the target article based on the determined information-satisfied-paragraph. The above embodiment may generate a summary that directly satisfies the users' demand for information.

Type: Application

Filed: June 2, 2020

Publication date: June 17, 2021

Inventors: Moye CHEN, Wei XU, Jiachen LIU, Xinyan XIAO, Qiaoqiao SHE
Method and device for generating text tag

Patent number: 10838997

Abstract: The present disclosure provides a method and a device for generating a text tag. The method includes: performing keyword extraction using strategies corresponding to respective tag types on a target text, to obtain one or more candidate tags of the respective tag types for the target text, wherein the tag type includes at least one of an entity word, a segment text and a topic; performing reduplication removing between different tag types on the one or more candidate tags of the respective tag types to obtain one or more validated candidate tags; and determining one or more target tags of the target text based on the one or more validated candidate tags.

Type: Grant

Filed: June 26, 2018

Date of Patent: November 17, 2020

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Jiachen Liu, Bolei He, Xinyan Xiao, Yajuan Lyu, Xiaoxu Fei
DOCUMENT RECOMMENDATION METHOD AND DEVICE BASED ON SEMANTIC TAG

Publication number: 20200210468

Abstract: The present disclosure provides a document recommendation method based on a semantic tag and a document recommendation device. The method includes: for each document, acquiring a first candidate tag set corresponding to the document, and processing each first candidate tag in the first candidate tag set corresponding to the document to obtain a second candidate tag set corresponding to the document; performing normalization processing on each second candidate tag in the second candidate tag set corresponding to the document to obtain a third candidate tag set corresponding to the document; performing expanding process on each third candidate tag in the third candidate tag set corresponding to the document, and acquiring a fourth candidate tag set corresponding to the document, to form a document library having semantic tags; and recommending a target document obtained from the document library having semantic tags to the user, according to historical semantic tag.

Type: Application

Filed: December 6, 2019

Publication date: July 2, 2020

Inventors: Guocheng NIU, Bolei HE, Chengxiang LIU, Xinyan XIAO, Yajuan LYU
Method for recommending text content based on concern, and computer device

Patent number: 10671656

Abstract: A method for recommending a text content based on a concern, a computer device, and a non-transitory computer readable storage medium are provided. The method includes: acquiring a query input by a user, and acquiring a reference text content selected by the user from search results corresponding to the query; generating a term vector of the query according to a term relative to the query in the reference text content; determining the concern of the user from a plurality of reference concerns according to similarities between the term vector of the query and term vectors of the plurality of reference concerns; and recommending the text content matched with the concern to the user.

Type: Grant

Filed: January 2, 2018

Date of Patent: June 2, 2020

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Chengxiang Liu, Xinyan Xiao
METHOD AND DEVICE FOR GENERATING TEXT TAG

Publication number: 20190012377

Abstract: The present disclosure provides a method and a device for generating a text tag. The method includes: performing keyword extraction using strategies corresponding to respective tag types on a target text, to obtain one or more candidate tags of the respective tag types for the target text, wherein the tag type includes at least one of an entity word, a segment text and a topic; performing reduplication removing between different tag types on the one or more candidate tags of the respective tag types to obtain one or more validated candidate tags; and determining one or more target tags of the target text based on the one or more validated candidate tags.

Type: Application

Filed: June 26, 2018

Publication date: January 10, 2019

Inventors: Jiachen LIU, Bolei HE, Xinyan XIAO, Yajuan LYU, Xiaoxu FEI
METHOD FOR RECOMMENDING TEXT CONTENT BASED ON CONCERN, AND COMPUTER DEVICE

Publication number: 20180373787

Abstract: A method for recommending a text content based on a concern, a computer device, and a non-transitory computer readable storage medium are provided. The method includes: acquiring a query input by a user, and acquiring a reference text content selected by the user from search results corresponding to the query; generating a term vector of the query according to a term relative to the query in the reference text content; determining the concern of the user from a plurality of reference concerns according to similarities between the term vector of the query and term vectors of the plurality of reference concerns; and recommending the text content matched with the concern to the user.

Type: Application

Filed: January 2, 2018

Publication date: December 27, 2018

Inventors: Chengxiang LIU, Xinyan XIAO
ARTIFICIAL INTELLIGENCE BASED METHOD AND APPARATUS FOR DETERMINING REGIONAL INFORMATION

Publication number: 20180341700

Abstract: The present disclosure discloses an artificial intelligence based method and apparatus for determining regional information. A specific embodiment of the method comprises: acquiring to-be-determined information, and extracting a keyword set of the to-be-determined information; inputting the keyword set of the to-be-determined information into a pre-trained topic classification model for classifying, to obtain a topic class of the to-be-determined information, wherein the topic classification model is used for representing a corresponding relation between the keyword set of the information and the topic class of the information; selecting, from a pre-stored place name set, a place name corresponding to the topic class of the to-be-determined information as a target place name set; matching, in the to-be-determined information, the target place name set; and determining, based on a matching result, whether the to-be-determined information belongs to the regional information.

Type: Application

Filed: March 30, 2018

Publication date: November 29, 2018

Inventors: Liangyu CHEN, Xinyan XIAO, Yajuan LV

prev 1 2 3 next