Patents by Inventor Xiameng QIN

Xiameng QIN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210406468
    Abstract: The present disclosure provides a method for visual question answering, which relates to a field of computer vision and natural language processing. The method includes: acquiring an input image and an input question; constructing a Visual Graph based on the input image, wherein the Visual Graph comprises a Node Feature and an Edge Feature; updating the Node Feature by using the Node Feature and the Edge Feature to obtain an updated Visual Graph; determining a question feature based on the input question; fusing the updated Visual Graph and the question feature to obtain a fused feature; and generating a predicted answer for the input image and the input question based on the fused feature. The present disclosure further provides an apparatus for visual question answering, a computer device and a non-transitory computer-readable storage medium.
    Type: Application
    Filed: January 28, 2021
    Publication date: December 30, 2021
    Inventors: Xiameng QIN, Yulin LI, Qunyi XIE, Ju HUANG, Junyu HAN
  • Publication number: 20210390294
    Abstract: Embodiments of the present disclosure disclose an image table extraction method and apparatus, an electronic device, a storage media, and a training method for a table extraction model, which relate to the field of artificial intelligence technologies and cloud computing technologies, including: acquiring an image to be processed; generating a table of the image to be processed according to a table extraction model, where the table extraction model is obtained according to a field position feature, an image feature, and a text feature of a sample image; and filling text information of the image to be processed into the table.
    Type: Application
    Filed: December 31, 2020
    Publication date: December 16, 2021
    Inventors: Xiangkai Huang, Qiaoyi LI, Yulin LI, Ju Huang, Duohao Qin, Xiameng Qin, Minghao Liu, Junyu Han, Jiangliang Guo
  • Publication number: 20210390133
    Abstract: Disclosed are a method, apparatus and electronic device for annotating information of a structured document. A specific implementation is: obtaining a template image of a structured document and at least one piece of annotation information of a field to be filled in the template image, where the annotation information includes attribute value and historical content of the field to be filled, and historical position of the field to be filled in the template image; generating, according to the attribute value of the field to be filled, the historical content of the field to be filled and the historical position of the field to be filled in the template image, target filling information of the field to be filled; obtaining, according to the target filling information of the field to be filled, an image of an annotated structured document.
    Type: Application
    Filed: March 19, 2021
    Publication date: December 16, 2021
    Inventors: QIAOYI LI, XIANGKAI HUANG, YULIN LI, JU HUANG, XIAMENG QIN, DUOHAO QIN, MINGHAO LIU, JUNYU HAN
  • Publication number: 20210383107
    Abstract: A method, apparatus, device and storage medium for recognizing a bill image may include: performing text detection on a bill image, and determining an attribute information set and a relationship information set of each text box of at least two text boxes in the bill image; determining a type of the text box and an associated text box that has a structural relationship with the text box based on the attribute information set and the relationship information set of the text box; and extracting structured bill data of the bill image, based on the type of the text box and the associated text box that has the structural relationship with the text box.
    Type: Application
    Filed: March 15, 2021
    Publication date: December 9, 2021
    Inventors: Yulin LI, Ju HUANG, Xiameng QIN, Junyu HAN
  • Publication number: 20210312174
    Abstract: A method and apparatus for processing an image, a device and a storage medium are provided. An implementation of the method includes: acquiring a template image, the template image including at least one region of interest; determining a first feature map corresponding to each region of interest in the template image; acquiring a target image; determining a second feature map of the target image; and determining at least one region of interest in the target image according to the first feature map and the second feature map.
    Type: Application
    Filed: June 21, 2021
    Publication date: October 7, 2021
    Inventors: Chengquan ZHANG, Mengyi EN, Ju HUANG, Qunyi XIE, Xiameng QIN, Kun YAO, Junyu HAN, Jingtuo LIU, Errui DING
  • Publication number: 20210312173
    Abstract: The present disclosure discloses a method, apparatus and device for recognizing a bill, and a storage medium. The method comprises: acquiring a bill image; inputting the bill image into a feature extraction network layer of a pre-trained bill recognition model, to obtain a bill key field feature map and a bill key field value feature map of the bill image; inputting the bill key field feature map into a first head network layer of the bill recognition model, to obtain a bill key field; processing the bill key field value feature map by a second head network layer of the bill recognition model, to obtain a bill key field value, the feature extraction network layer being respectively connected with the first head network layer and the second head network layer; and generating structured information of the bill image based on the bill key field and the bill key field value.
    Type: Application
    Filed: June 21, 2021
    Publication date: October 7, 2021
    Inventors: Ju HUANG, Qunyi XIE, Yulin LI, Xiameng QIN, Kun YAO, Junyu HAN
  • Publication number: 20210264190
    Abstract: The present application discloses an image questioning and answering method, apparatus, device and storage medium, relating to the technical field of image processing, computer vision, deep learning and natural language processing. The specific implementation solution is as follows: constructing a question graph with a topological structure and extracting a question feature of a query sentence, according to the query sentence; constructing a visual graph with a topological structure and a text graph with a topological structure according to a target image corresponding to the query sentence; performing fusion on the visual graph, the text graph and the question graph by using a fusion model, to obtain a final fusion graph; and determining reply information of the query sentence according to a reasoning feature extracted from the final fusion graph and the question feature.
    Type: Application
    Filed: March 19, 2021
    Publication date: August 26, 2021
    Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Xiameng Qin, Yulin Li, Ju Huang, Qunyi Xie, Junyu Han
  • Publication number: 20210201182
    Abstract: Embodiments of the present disclosure provide a method and apparatus for performing a structured extraction on a text, a device and a storage medium. The method may include: performing a text detection on an entity text image to obtain a position and content of a text line of the entity text image; extracting multivariate information of the text line based on the position and the content of the text line; performing a feature fusion on the multivariate information of the text line to obtain a multimodal fusion feature of the text line; performing category and relationship reasoning based on the multimodal fusion feature of the text line to obtain a category and a relationship probability matrix of the text line; and constructing structured information of the entity text image based on the category and the relationship probability matrix of the text line.
    Type: Application
    Filed: March 12, 2021
    Publication date: July 1, 2021
    Inventors: Yulin LI, Xiameng Qin, Chengquan Zhang, Junyu Han, Errui Ding, Tian Wu, Haifeng Wang
  • Publication number: 20210192696
    Abstract: Embodiments of the present disclosure provide a method and apparatus for correcting a distorted document image, where the method for correcting a distorted document image includes: obtaining a distorted document image; and inputting the distorted document image into a correction model, and obtaining a corrected image corresponding to the distorted document image; where the correction model is a model obtained by training with a set of image samples as inputs and a corrected image corresponding to each image sample in the set of image samples as an output, and the image samples are distorted. By inputting the distorted document image to be corrected into the correction model, the corrected image corresponding to the distorted document image can be obtained through the correction model, which realizes document image correction end-to-end, improves accuracy of the document image correction, and extends application scenarios of the document image correction.
    Type: Application
    Filed: January 19, 2021
    Publication date: June 24, 2021
    Inventors: Qunyi XIE, Xiameng QIN, Yulin LI, Junyu HAN, Shengxian ZHU