Patents by Inventor Xiameng QIN

Xiameng QIN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHOD AND DEVICE FOR VISUAL QUESTION ANSWERING, COMPUTER APPARATUS AND MEDIUM

Publication number: 20210406468

Abstract: The present disclosure provides a method for visual question answering, which relates to a field of computer vision and natural language processing. The method includes: acquiring an input image and an input question; constructing a Visual Graph based on the input image, wherein the Visual Graph comprises a Node Feature and an Edge Feature; updating the Node Feature by using the Node Feature and the Edge Feature to obtain an updated Visual Graph; determining a question feature based on the input question; fusing the updated Visual Graph and the question feature to obtain a fused feature; and generating a predicted answer for the input image and the input question based on the fused feature. The present disclosure further provides an apparatus for visual question answering, a computer device and a non-transitory computer-readable storage medium.
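The node-update and fusion steps named in this abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the function names `update_nodes` and `fuse`, the use of scalar edge weights as the Edge Feature, and the mean-pool plus element-wise product used for fusion are all assumptions chosen for illustration.

```python
# Illustrative sketch only; names and update rule are assumptions,
# not taken from the patent application.

def update_nodes(nodes, edges):
    """One round of message passing.

    nodes: list of feature vectors (the Node Feature).
    edges: square matrix of scalar weights (a stand-in for the Edge Feature).
    Each node adds the edge-weighted mean of all node features to itself.
    """
    n = len(nodes)
    dim = len(nodes[0])
    updated = []
    for i in range(n):
        total_w = sum(edges[i]) or 1.0  # avoid division by zero for isolated nodes
        msg = [
            sum(edges[i][j] * nodes[j][d] for j in range(n)) / total_w
            for d in range(dim)
        ]
        updated.append([nodes[i][d] + msg[d] for d in range(dim)])
    return updated


def fuse(nodes, question):
    """Mean-pool the updated graph, then multiply element-wise with the question feature."""
    dim = len(question)
    pooled = [sum(node[d] for node in nodes) / len(nodes) for d in range(dim)]
    return [p * q for p, q in zip(pooled, question)]


nodes = [[1.0, 0.0], [0.0, 1.0]]
edges = [[0.0, 1.0], [1.0, 0.0]]
updated = update_nodes(nodes, edges)
fused = fuse(updated, [2.0, 0.5])
```

In a full system the fused feature would feed an answer classifier; that stage is omitted here.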

Type: Application

Filed: January 28, 2021

Publication date: December 30, 2021

Inventors: Xiameng QIN, Yulin LI, Qunyi XIE, Ju HUANG, Junyu HAN

Image Table Extraction Method And Apparatus, Electronic Device, And Storage Medium

Publication number: 20210390294

Abstract: Embodiments of the present disclosure disclose an image table extraction method and apparatus, an electronic device, a storage medium, and a training method for a table extraction model, which relate to the field of artificial intelligence technologies and cloud computing technologies, including: acquiring an image to be processed; generating a table of the image to be processed according to a table extraction model, where the table extraction model is obtained according to a field position feature, an image feature, and a text feature of a sample image; and filling text information of the image to be processed into the table.

Type: Application

Filed: December 31, 2020

Publication date: December 16, 2021

Inventors: Xiangkai Huang, Qiaoyi LI, Yulin LI, Ju Huang, Duohao Qin, Xiameng Qin, Minghao Liu, Junyu Han, Jiangliang Guo

METHOD, APPARATUS AND ELECTRONIC DEVICE FOR ANNOTATING INFORMATION OF STRUCTURED DOCUMENT

Publication number: 20210390133

Abstract: Disclosed are a method, apparatus and electronic device for annotating information of a structured document. A specific implementation is: obtaining a template image of a structured document and at least one piece of annotation information of a field to be filled in the template image, where the annotation information includes attribute value and historical content of the field to be filled, and historical position of the field to be filled in the template image; generating, according to the attribute value of the field to be filled, the historical content of the field to be filled and the historical position of the field to be filled in the template image, target filling information of the field to be filled; obtaining, according to the target filling information of the field to be filled, an image of an annotated structured document.

Type: Application

Filed: March 19, 2021

Publication date: December 16, 2021

Inventors: QIAOYI LI, XIANGKAI HUANG, YULIN LI, JU HUANG, XIAMENG QIN, DUOHAO QIN, MINGHAO LIU, JUNYU HAN

METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM FOR RECOGNIZING BILL IMAGE

Publication number: 20210383107

Abstract: A method, apparatus, device and storage medium for recognizing a bill image may include: performing text detection on a bill image, and determining an attribute information set and a relationship information set of each text box of at least two text boxes in the bill image; determining a type of the text box and an associated text box that has a structural relationship with the text box based on the attribute information set and the relationship information set of the text box; and extracting structured bill data of the bill image, based on the type of the text box and the associated text box that has the structural relationship with the text box.

Type: Application

Filed: March 15, 2021

Publication date: December 9, 2021

Inventors: Yulin LI, Ju HUANG, Xiameng QIN, Junyu HAN

METHOD AND APPARATUS FOR PROCESSING IMAGE, DEVICE AND STORAGE MEDIUM

Publication number: 20210312174

Abstract: A method and apparatus for processing an image, a device and a storage medium are provided. An implementation of the method includes: acquiring a template image, the template image including at least one region of interest; determining a first feature map corresponding to each region of interest in the template image; acquiring a target image; determining a second feature map of the target image; and determining at least one region of interest in the target image according to the first feature map and the second feature map.
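Locating a region of interest in the target image from two feature maps can be illustrated with a sliding-window comparison. This is a hedged sketch, not the method of the application: the function name `locate_roi` is hypothetical, the feature maps are plain 2D lists rather than learned features, and the sum of squared differences is a stand-in similarity measure chosen for simplicity.

```python
# Illustrative sketch only; `locate_roi` and the similarity measure
# are assumptions, not taken from the patent application.

def locate_roi(template_feat, target_feat):
    """Return (row, col) of the window in target_feat closest to template_feat.

    template_feat: 2D list, the first feature map (one region of interest).
    target_feat:   2D list, the second feature map (whole target image).
    Similarity is the negative sum of squared differences over the window.
    """
    th, tw = len(template_feat), len(template_feat[0])
    height, width = len(target_feat), len(target_feat[0])
    best_score, best_pos = float("-inf"), None
    for r in range(height - th + 1):
        for c in range(width - tw + 1):
            score = -sum(
                (target_feat[r + i][c + j] - template_feat[i][j]) ** 2
                for i in range(th)
                for j in range(tw)
            )
            if score > best_score:
                best_score, best_pos = score, (r, c)
    return best_pos


template_feat = [[1, 2], [3, 4]]
target_feat = [
    [0, 0, 0, 0],
    [0, 1, 2, 0],
    [0, 3, 4, 0],
]
position = locate_roi(template_feat, target_feat)
```

With several regions of interest in the template, the same search would be repeated once per first feature map.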

Type: Application

Filed: June 21, 2021

Publication date: October 7, 2021

Inventors: Chengquan ZHANG, Mengyi EN, Ju HUANG, Qunyi XIE, Xiameng QIN, Kun YAO, Junyu HAN, Jingtuo LIU, Errui DING

METHOD, APPARATUS AND DEVICE FOR RECOGNIZING BILL AND STORAGE MEDIUM

Publication number: 20210312173

Abstract: The present disclosure discloses a method, apparatus and device for recognizing a bill, and a storage medium. The method comprises: acquiring a bill image; inputting the bill image into a feature extraction network layer of a pre-trained bill recognition model, to obtain a bill key field feature map and a bill key field value feature map of the bill image; inputting the bill key field feature map into a first head network layer of the bill recognition model, to obtain a bill key field; processing the bill key field value feature map by a second head network layer of the bill recognition model, to obtain a bill key field value, the feature extraction network layer being respectively connected with the first head network layer and the second head network layer; and generating structured information of the bill image based on the bill key field and the bill key field value.

Type: Application

Filed: June 21, 2021

Publication date: October 7, 2021

Inventors: Ju HUANG, Qunyi XIE, Yulin LI, Xiameng QIN, Kun YAO, Junyu HAN

IMAGE QUESTIONING AND ANSWERING METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM

Publication number: 20210264190

Abstract: The present application discloses an image questioning and answering method, apparatus, device and storage medium, relating to the technical field of image processing, computer vision, deep learning and natural language processing. The specific implementation solution is as follows: constructing a question graph with a topological structure and extracting a question feature of a query sentence, according to the query sentence; constructing a visual graph with a topological structure and a text graph with a topological structure according to a target image corresponding to the query sentence; performing fusion on the visual graph, the text graph and the question graph by using a fusion model, to obtain a final fusion graph; and determining reply information of the query sentence according to a reasoning feature extracted from the final fusion graph and the question feature.

Type: Application

Filed: March 19, 2021

Publication date: August 26, 2021

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Xiameng Qin, Yulin Li, Ju Huang, Qunyi Xie, Junyu Han

METHOD AND APPARATUS FOR PERFORMING STRUCTURED EXTRACTION ON TEXT, DEVICE AND STORAGE MEDIUM

Publication number: 20210201182

Abstract: Embodiments of the present disclosure provide a method and apparatus for performing a structured extraction on a text, a device and a storage medium. The method may include: performing a text detection on an entity text image to obtain a position and content of a text line of the entity text image; extracting multivariate information of the text line based on the position and the content of the text line; performing a feature fusion on the multivariate information of the text line to obtain a multimodal fusion feature of the text line; performing category and relationship reasoning based on the multimodal fusion feature of the text line to obtain a category and a relationship probability matrix of the text line; and constructing structured information of the entity text image based on the category and the relationship probability matrix of the text line.
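The final step, building structured information from per-line categories and a relationship probability matrix, can be sketched as follows. The sketch is illustrative and rests on assumptions not stated in the abstract: the function name `build_structure`, the category labels "key" and "value", and the 0.5 threshold are all hypothetical.

```python
# Illustrative sketch only; labels, threshold, and function name are
# assumptions, not taken from the patent application.

def build_structure(lines, categories, rel_prob, threshold=0.5):
    """Pair each "key" text line with its most probable "value" text line.

    lines:      text content of each detected text line.
    categories: predicted category per line ("key" or "value" here).
    rel_prob:   rel_prob[i][j] is the predicted probability that
                line i is related to line j.
    """
    result = {}
    for i, cat in enumerate(categories):
        if cat != "key":
            continue
        candidates = [
            (rel_prob[i][j], j)
            for j, other in enumerate(categories)
            if other == "value" and rel_prob[i][j] >= threshold
        ]
        if candidates:
            _, best = max(candidates)  # highest probability wins
            result[lines[i]] = lines[best]
    return result


lines = ["Name", "Alice", "Date", "2021-03-12"]
categories = ["key", "value", "key", "value"]
rel_prob = [
    [0.0, 0.9, 0.0, 0.1],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.2, 0.0, 0.8],
    [0.0, 0.0, 0.0, 0.0],
]
structured = build_structure(lines, categories, rel_prob)
```

Keys with no candidate above the threshold are simply left out of the result.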

Type: Application

Filed: March 12, 2021

Publication date: July 1, 2021

Inventors: Yulin LI, Xiameng Qin, Chengquan Zhang, Junyu Han, Errui Ding, Tian Wu, Haifeng Wang

METHOD AND APPARATUS FOR CORRECTING DISTORTED DOCUMENT IMAGE

Publication number: 20210192696

Abstract: Embodiments of the present disclosure provide a method and apparatus for correcting a distorted document image, where the method for correcting a distorted document image includes: obtaining a distorted document image; and inputting the distorted document image into a correction model, and obtaining a corrected image corresponding to the distorted document image; where the correction model is a model obtained by training with a set of image samples as inputs and a corrected image corresponding to each image sample in the set of image samples as an output, and the image samples are distorted. By inputting the distorted document image to be corrected into the correction model, the corrected image corresponding to the distorted document image can be obtained through the correction model, which realizes document image correction end-to-end, improves accuracy of the document image correction, and extends application scenarios of the document image correction.

Type: Application

Filed: January 19, 2021

Publication date: June 24, 2021

Inventors: Qunyi XIE, Xiameng QIN, Yulin LI, Junyu HAN, Shengxian ZHU
