Patents by Inventor Chunguang Chai

Chunguang Chai has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11954084
    Abstract: A method and apparatus for processing a table, a device, a storage medium and a product. An implementation of the method comprise: receiving a content query request for a target table; acquiring a target tree structure of the target table according to the content query request; where, the target tree structure is obtained by performing absorbing processing and merging processing on at least one target cell in the target table; acquiring to-be-queried content in the content query request; and querying target content matching the to-be-queried content from the target tree structure.
    Type: Grant
    Filed: July 22, 2022
    Date of Patent: April 9, 2024
    Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
    Inventors: Yue Zhang, Yabing Shi, Ye Jiang, Chunguang Chai
  • Patent number: 11847164
    Abstract: A method, electronic device and storage medium for generating information are disclosed. The method includes: acquiring a plurality of tag entity words from a target video, the tag entity words including a person entity word, a work entity word, a video category entity word, and a video core entity word, the video core entity word including an entity word for characterizing a content related to the target video; linking, for a tag entity word among the plurality of tag entity words, the tag entity word to a node of a preset knowledge graph; determining semantic information of the target video based on a linking result of each of the tag entity words; and structuring the semantic information of the target video based on a relationship between the node and an edge of the knowledge graph, to obtain structured semantic information of the target video.
    Type: Grant
    Filed: March 26, 2021
    Date of Patent: December 19, 2023
    Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.
    Inventors: Shu Wang, Kexin Ren, Xiaohan Zhang, Zhifan Feng, Chunguang Chai, Yong Zhu
  • Patent number: 11755654
    Abstract: Provided by the present disclosure is a new category tag mining method, involving the field of knowledge graph technology, and including: obtaining a plurality of queries during a current preset time period; labeling a category tag on each query of the plurality of queries, by using a pre-trained sequence labeling model, to extract the category tag currently corresponding to the query from the query; and removing a category tag already existing in a preset current category tag library from category tags currently corresponding to all the queries, and determining a remaining category tag as a new category tag. The present disclosure also provides an electronic device and a non-transitory computer-readable storage medium.
    Type: Grant
    Filed: February 11, 2021
    Date of Patent: September 12, 2023
    Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
    Inventors: Qian Li, Yabing Shi, Ye Jiang, Chunguang Chai, Yong Zhu
  • Patent number: 11727216
    Abstract: A method, apparatus, device, and storage medium for linking an entity, relates to the technical fields of knowledge graph and deep learning are provided. The method may include: acquiring a target text; determining at least one entity mention included in the target text and a candidate entity corresponding to each entity mention; determining an embedding vector of each candidate entity based on the each candidate entity and a preset entity embedding vector determination model; determining context semantic information of the target text based on the target text and each embedding vector; determining type information of the at least one entity mention; and determining an entity linking result of the at least one entity mention, based on the each embedding vector, the context semantic information, and each type information.
    Type: Grant
    Filed: December 10, 2020
    Date of Patent: August 15, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Zhijie Liu, Qi Wang, Zhifan Feng, Chunguang Chai, Yong Zhu
  • Patent number: 11704492
    Abstract: A method, apparatus, device, and storage medium for entity linking is disclosed. The method includes: acquiring a target text; determining at least one entity mention included in the target text; determining a candidate entity corresponding to each of the entity mention based on a preset knowledge base; determining a reference text of each of the candidate entity and determining additional feature information of each of the candidate entity; and determining an entity linking result based on the target text, each of the reference text, and each piece of the additional feature information, wherein determining the entity linking result includes determining a probability of linking each of the candidate entity to the entity mention based on a splicing of a first embedding vector and a second embedding vector of the target text and a splicing of a first embedding vector and a second embedding vector of each respective reference text.
    Type: Grant
    Filed: March 26, 2021
    Date of Patent: July 18, 2023
    Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.
    Inventors: Qi Wang, Zhifan Feng, Zhijie Liu, Siqi Wang, Chunguang Chai, Yong Zhu
  • Publication number: 20230153337
    Abstract: A question answering method, a method of training a question answering model, a device, and a medium are provided, which relate to a field of artificial intelligence technology, in particular to fields of natural language processing technology, deep learning technology, and knowledge mapping technology. The question answering method includes: obtaining data to be processed, wherein the data to be processed includes a question and candidate answers; performing general semantic understanding on the data to be processed to obtain a general data feature; selecting a target question answering mode from candidate question answering modes based on the general data feature; and processing the general data feature by using the target question answering mode, to obtain a target answer for the question from the candidate answers.
    Type: Application
    Filed: January 20, 2023
    Publication date: May 18, 2023
    Inventors: Wenbin JIANG, Yajuan LV, Chunguang CHAI, Yong ZHU
  • Patent number: 11651164
    Abstract: The present disclosure provides a method, a device, an equipment and a storage medium for mining a topic concept. The method includes: acquiring a plurality of candidate topic concepts based on a query; performing word segmentation on the plurality of candidate topic concepts and performing part-of-speech tagging on words obtained after performing the word segmentation, to obtain a part-of-speech sequence of each of the plurality of candidate topic concepts; and filtering the plurality of candidate topic concepts based on the part-of-speech sequence, to filter out a topic concept corresponding to a target part-of-speech sequence among the plurality of candidate topic concepts, in which a proportion of accurate topic concepts in the target part-of-speech sequence is lower than or equal to a first preset threshold, or a proportion of inaccurate topic concepts in the target part-of-speech sequence is higher than or equal to a second preset threshold.
    Type: Grant
    Filed: September 29, 2020
    Date of Patent: May 16, 2023
    Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
    Inventors: Zhijie Liu, Qi Wang, Zhifan Feng, Zhou Fang, Chunguang Chai, Yong Zhu
  • Publication number: 20230133717
    Abstract: Disclosed are an information extraction method, an electronic device and a readable storage medium, which relate to the field of artificial intelligence technologies, and particularly to the field of knowledge graph technologies. The information extraction method includes: acquiring to-be-processed text to obtain a semantic vector of each token in the to-be-processed text; generating a relationship prediction matrix, an entity prediction matrix and an alignment matrix according to each token in the to-be-processed text and the semantic vector of each token; and extracting a target triplet in the to-be-processed text using the relationship prediction matrix, the entity prediction matrix and the alignment matrix, and taking the target triplet as an information extraction result of the to-be-processed text.
    Type: Application
    Filed: September 28, 2022
    Publication date: May 4, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Jiandong SUN, Yabing SHI, Ye JIANG, Chunguang CHAI
  • Publication number: 20230112385
    Abstract: A method of obtaining an event information, an electronic device, and a storage medium, which relate to a field of artificial intelligence, in particular to fields of knowledge graph and deep learning technologies. A specific implementation solution of the method of obtaining the event information includes: determining, according to a query information in data to be processed, a first key information describing an event; determining, according to multimedia data in the data to be processed, a second key information describing an event, wherein the multimedia data includes data obtained by querying based on the query information; and fusing the first key information and the second key information, so as to obtain an event information of a target event described by the data to be processed.
    Type: Application
    Filed: December 12, 2022
    Publication date: April 13, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Qi Wang, Zhifan Feng, Chunguang Chai, Yong Zhu
  • Publication number: 20230115737
    Abstract: A method of processing multimedia data, a device, and a medium, which relates to a field of an artificial intelligence technology, in particular to fields of knowledge graph and deep learning. The method of processing the multimedia data includes: recognizing the multimedia data so as to obtain at least one key information of the multimedia data; querying a predetermined knowledge base according to the at least one key information, so as to determine a multimedia name associated with the at least one key information and an association degree between the multimedia name and the at least one key information; and determining, in the multimedia name, a name of the multimedia data based on a similarity between alternative multimedia data for the multimedia name and the multimedia data, in response to the association degree being less than a first threshold value.
    Type: Application
    Filed: December 13, 2022
    Publication date: April 13, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Shuai CHEN, Qi WANG, Zhifan FENG, Chunguang CHAI, Yong ZHU
  • Publication number: 20230103728
    Abstract: A computer-implemented method for sample augmentation includes: acquiring a second sample corpus and second triplet information of the second sample corpus by performing data augmentation on a first sample corpus labeled with first triplet information; acquiring third triplet information of a third sample corpus by performing semi-supervised learning on the third sample corpus that is not labeled with triplet information; and generating a set of training corpora for a triplet information extraction network based on the first sample corpus and the first triplet information, the second sample corpus and the second triplet information, and the third sample corpus and the third triplet information.
    Type: Application
    Filed: December 8, 2022
    Publication date: April 6, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Jian Liu, Jiandong Sun, Yabing Shi, Ye Jiang, Chunguang Chai
  • Publication number: 20230038091
    Abstract: A method of extracting a table information, an electronic device, and a storage medium are provided, which relate to fields of artificial intelligence and big data, in particular to fields of machine learning, knowledge graph, intelligent search and intelligent recommendation, and may be used for an intelligent extraction of an information in a table and other scenarios. The method includes: performing a clustering based on features of a plurality of rows of cells and/or features of a plurality of columns of cells in a table, so as to determine candidate header cells in the table; and performing an information extraction on the table based on the candidate header cells, so as to extract attribute-attribute value pairs in the table.
    Type: Application
    Filed: September 30, 2022
    Publication date: February 9, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Yue ZHANG, Zhou FANG, Yabing SHI, Ye JIANG, Chunguang CHAI
  • Publication number: 20230016403
    Abstract: The present disclosure provides a method of processing triple data, a method of training a triple data processing model, an electronic device, and a storage medium. A specific implementation solution includes: performing a triple data extraction on text data to obtain a plurality of field data; normalizing the plurality of field data to determine target triple data, wherein the target triple data contains entity data, entity relationship data, and association entity data; and verifying a confidence level of the target triple data to obtain a verification result.
    Type: Application
    Filed: September 23, 2022
    Publication date: January 19, 2023
    Inventors: Zhaoji WANG, Fang HUANG, Ye JIANG, Yabing SHI, Chunguang CHAI, Yong ZHU
  • Patent number: 11557120
    Abstract: Technical solutions for video event recognition relate to the fields of knowledge graphs, deep learning and computer vision. A video event graph is constructed, and each event in the video event graph includes: M argument roles of the event and respective arguments of the argument roles, with M being a positive integer greater than one. For a to-be-recognized video, respective arguments of the M argument roles of a to-be-recognized event corresponding to the video are acquired. According to the arguments acquired, an event is selected from the video event graph as a recognized event corresponding to the video.
    Type: Grant
    Filed: June 17, 2021
    Date of Patent: January 17, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Qi Wang, Zhifan Feng, Hu Yang, Feng He, Chunguang Chai, Yong Zhu
  • Publication number: 20230010160
    Abstract: Disclosed are a method for processing multimodal data using a neural network, a device, and a medium, and relates to the field of artificial intelligence and, in particular to multimodal data processing, video classification, and deep learning. The neural network includes: an input subnetwork configured to receive the multimodal data to output respective first features of a plurality of modalities; a plurality of cross-modal feature subnetworks, each of which is configured to receive respective first features of two corresponding modalities to output a cross-modal feature corresponding to the two modalities; a plurality of cross-modal fusion subnetworks, each of which is configured to receive at least one cross-modal feature corresponding to a corresponding target modality and other modalities to output a second feature of the target modality; and an output subnetwork configured to receive respective second features of the plurality of modalities to output a processing result of the multimodal data.
    Type: Application
    Filed: September 15, 2022
    Publication date: January 12, 2023
    Inventors: Shuai CHEN, Qi WANG, Hu YANG, Feng HE, Zhifan FENG, Chunguang CHAI, Yong ZHU
  • Publication number: 20230005284
    Abstract: A computer-implemented method is provided. The method includes: obtaining a sample text and a sample image corresponding to the sample text; labeling a true semantic tag for the sample text according to a first preset rule; obtaining a text feature representation of the sample text and a predicted semantic tag output by a text coding sub-model; obtaining an image feature representation of the sample image output by an image coding sub-model; calculating a first loss based on the true semantic tag and the predicted semantic tag; calculating a contrast loss based on the text feature representation of the sample text and the image feature representation of the sample image; adjusting parameters of the text coding sub-model based on the first loss and the contrast loss; and adjusting parameters of the image coding sub-model based on the contrast loss.
    Type: Application
    Filed: September 13, 2022
    Publication date: January 5, 2023
    Inventors: Feng HE, Qi WANG, Hu YANG, Shuai CHEN, Zhifan FENG, Chunguang CHAI
  • Publication number: 20220358110
    Abstract: A method and apparatus for processing a table, a device, a storage medium and a product. An implementation of the method comprise: receiving a content query request for a target table; acquiring a target tree structure of the target table according to the content query request; where, the target tree structure is obtained by performing absorbing processing and merging processing on at least one target cell in the target table; acquiring to-be-queried content in the content query request; and querying target content matching the to-be-queried content from the target tree structure.
    Type: Application
    Filed: July 22, 2022
    Publication date: November 10, 2022
    Inventors: Yue ZHANG, Yabing SHI, Ye JIANG, Chunguang CHAI
  • Publication number: 20220350965
    Abstract: A method for generating a pre-trained language model, includes: obtaining sample files; obtaining typography structure information and text information of the sample files by parsing the sample files; obtaining a plurality of task models of a pre-trained language model; obtaining a trained pre-trained language model by jointly training the pre-trained language model and the plurality of task models according to the typography structure information and the text information; and generating a target pre-trained language model by fine-tuning the trained pre-trained language model according to the typography structure information and the text information.
    Type: Application
    Filed: July 14, 2022
    Publication date: November 3, 2022
    Inventors: Tongyang LIU, Shu WANG, Wanli CHANG, Wei ZHENG, Zhifan FENG, Chunguang CHAI, Yong ZHU
  • Patent number: 11490170
    Abstract: The disclosure provides a method for processing a video, an electronic device, and a computer storage medium. The method includes: determining a plurality of first identifiers related to a first object based on a plurality of frames including the first object in a target video; determining a plurality of attribute values associated with the plurality of first identifiers based on a knowledge base related to the first object; determining a set of frames from the plurality of frames, in which one or more attribute values associated with one or more first identifiers determined from each one of the set of frames are predetermined values; and splitting the target video into a plurality of video clips based on positions of the set of frames in the plurality of frames.
    Type: Grant
    Filed: April 28, 2021
    Date of Patent: November 1, 2022
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Hu Yang, Shu Wang, Xiaohan Zhang, Qi Wang, Zhifan Feng, Chunguang Chai
  • Publication number: 20220284246
    Abstract: The present disclosure discloses a method for training a cross-modal retrieval model, an electronic device and a storage medium, and relates to the field of computer technologies, and particularly to the field of artificial intelligence technologies, such as knowledge graph technologies, computer vision technologies, deep learning technologies, or the like. The method for training a cross-modal retrieval model includes: determining similarity of a cross-modal sample pair according to the cross-modal sample pair, the cross-modal sample pair including a sample of a first modal and a sample of a second modal, and the first modal being different from the second modal; determining a soft margin based on the similarity, and determining a soft margin loss function based on the soft margin; and determining a total loss function based on the soft margin loss function, and training a cross-modal retrieval model according to the total loss function.
    Type: Application
    Filed: October 15, 2021
    Publication date: September 8, 2022
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Feng HE, Qi WANG, Zhifan FENG, Hu YANG, Chunguang CHAI