Patents by Inventor Chengquan Zhang

Chengquan Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for performing structured extraction on text, device and storage medium

Patent number: 12211304

Abstract: Embodiments of the present disclosure provide a method and apparatus for performing a structured extraction on a text, a device and a storage medium. The method may include: performing a text detection on an entity text image to obtain a position and content of a text line of the entity text image; extracting multivariate information of the text line based on the position and the content of the text line; performing a feature fusion on the multivariate information of the text line to obtain a multimodal fusion feature of the text line; performing category and relationship reasoning based on the multimodal fusion feature of the text line to obtain a category and a relationship probability matrix of the text line; and constructing structured information of the entity text image based on the category and the relationship probability matrix of the text line.

Type: Grant

Filed: March 12, 2021

Date of Patent: January 28, 2025

Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventors: Yulin Li, Xiameng Qin, Chengquan Zhang, Junyu Han, Errui Ding, Tian Wu, Haifeng Wang
METHOD OF TRAINING DEEP LEARNING MODEL FOR TEXT DETECTION AND TEXT DETECTION METHOD

Publication number: 20240304015

Abstract: The present disclosure provides a method of training a deep learning model for text detection and a text detection method, which relates to the technical field of artificial intelligence, and in particular, to the technical field of computer vision and deep learning and can be used in scenarios of OCR optical character recognition. A method of training a deep learning model for text detection is provided, in which a single character segmentation sub-network outputs a single character segmentation prediction result, a text line segmentation sub-network outputs a text line segmentation prediction result, the trained deep learning model can be used for detecting a text area; and, can at the same time achieve single character segmentation and text line segmentation, and thus is capable to perform text detection by combining two ways of text segmentation, which further improves the accuracy of text area detection.

Type: Application

Filed: April 21, 2022

Publication date: September 12, 2024

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventors: Sen FAN, Xiaoyan WANG, Pengyuan LV, Chengquan ZHANG, Kun YAO
TRAINING METHOD, METHOD OF DISPLAYING TRANSLATION, ELECTRONIC DEVICE AND STORAGE MEDIUM

Publication number: 20240282024

Abstract: A method of training a text erasure model, a method of display a translation, an electronic device, and a storage medium. The training method includes: processing a set of original text block images by using a generator of a generative adversarial network model to obtain a set of simulated text block-erased images; alternately training the generator and a discriminator of the generative adversarial network model by using a set of real text block-erased images and the set of simulated text block-erased images, so as to obtain a trained generator and a trained discriminator; and determining the trained generator as the text erasure model, wherein a pixel value of a text-erased region in a real text block-erased image contained in the set of real text block-erased images is determined based on a pixel value of another region in the real text block-erased image other than the text-erased region.

Type: Application

Filed: April 22, 2022

Publication date: August 22, 2024

Inventors: Liang WU, Shanshan LIU, Chengquan ZHANG, Kun YAO
METHOD OF TRAINING TEXT RECOGNITION MODEL, AND METHOD OF RECOGNIZING TEXT

Publication number: 20240281609

Abstract: The present application provides a method of training a text recognition model. The method includes: inputting a first sample image into the visual feature extraction sub-model to obtain a first visual feature and a first predicted text, the first sample image contains a text and a tag indicating a first actual text; obtaining, by using the semantic feature extraction sub-model, a first semantic feature based on the first predicted text; obtaining, by using the sequence sub-model, a second predicted text based on the first visual feature and the first semantic feature; and training the text recognition model based on the first predicted text, the second predicted text and the first actual text. The present disclosure further provides a method of recognizing a text, an electronic device, and a storage medium.

Type: Application

Filed: May 16, 2022

Publication date: August 22, 2024

Inventors: Pengyuan LV, Jingquan LI, Chengquan ZHANG, Kun YAO, Jingtuo LIU, Junyu HAN
METHOD OF TRAINING TEXT DETECTION MODEL, METHOD OF DETECTING TEXT, AND DEVICE

Publication number: 20240265718

Abstract: A method training a text detection model and a method of detecting a text. The training method includes: inputting a sample image into a text feature extraction sub-model of a text detection model to obtain a text feature of a text in the sample image, the sample image having a label indicating an actual position information and an actual category; inputting a predetermined text vector into a text encoding sub-model of the text detection model to obtain a text reference feature; inputting the text feature and the text reference feature into a decoding sub-model of the text detection model to obtain a text sequence vector; inputting the text sequence vector into an output sub-model of the text detection model to obtain a predicted position information and a predicted category; and training the text detection model based on the predicted and actual categories, the predicted and actual position information.

Type: Application

Filed: April 22, 2022

Publication date: August 8, 2024

Inventors: Xiaoqiang ZHANG, Xiameng QIN, Chengquan ZHANG, Kun YAO
Method and apparatus for processing image, device and storage medium

Patent number: 11881044

Abstract: A method and apparatus for processing an image, a device and a storage medium are provided. An implementation of the method includes: acquiring a template image, the template image including at least one region of interest; determining a first feature map corresponding to each region of interest in the template image; acquiring a target image; determining a second feature map of the target image; and determining at least one region of interest in the target image according to the first feature map and the second feature map.

Type: Grant

Filed: June 21, 2021

Date of Patent: January 23, 2024

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Chengquan Zhang, Mengyi En, Ju Huang, Qunyi Xie, Xiameng Qin, Kun Yao, Junyu Han, Jingtuo Liu, Errui Ding
Text recognition method and device, and electronic device

Patent number: 11861919

Abstract: A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.

Type: Grant

Filed: June 21, 2021

Date of Patent: January 2, 2024

Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventors: Chengquan Zhang, Pengyuan Lv, Kun Yao, Junyu Han, Jingtuo Liu
Method and apparatus for visual question answering, computer device and medium

Patent number: 11854283

Abstract: The present disclosure provides a method for visual question answering, which relates to fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; detecting visual information and position information of each of at least one text region in the input image; determining semantic information and attribute information of each of the at least one text region based on the visual information and the position information; determining a global feature of the input image based on the visual information, the position information, the semantic information, and the attribute information; determining a question feature based on the input question; and generating a predicted answer for the input image and the input question based on the global feature and the question feature. The present disclosure further provides a device for visual question answering, a computer device and a medium.

Type: Grant

Filed: February 5, 2021

Date of Patent: December 26, 2023

Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventors: Pengyuan Lv, Xiaoqiang Zhang, Shanshan Liu, Chengquan Zhang, Qiming Peng, Sijin Wu, Hua Lu, Yongfeng Chen
METHOD FOR TRAINING IMAGE RECOGNITION MODEL, ELECTRONIC DEVICE AND STORAGE MEDIUM

Publication number: 20230401828

Abstract: A method for training an image recognition model includes: obtaining a training data set, in which the training data set includes first text images of each vertical category in a non-target scene and second text images of each vertical category in a target scene, and a type of text content involved in the first text images is the same as a type of text content involved in the second text image; training an initial recognition model by using the first text images, to obtain a basic recognition model; and modifying the basic recognition model by using the second text images, to obtain an image recognition model corresponding to the target scene.

Type: Application

Filed: April 8, 2022

Publication date: December 14, 2023

Inventors: Meina QIAO, Shanshan LIU, Xiameng QIN, Chengquan ZHANG, Kun YAO
Method and apparatus for recognizing text

Patent number: 11836996

Abstract: The present disclosure discloses a method and apparatus for recognizing a text. The method comprises: acquiring images of a text area of an input image, the acquired images including a text centerline graph, a text direction offset graph, a text boundary offset graph, and a text character classification graph; extracting coordinates of feature points of a character center from the text centerline graph; sorting the extracted coordinates of the feature points based on the text direction offset graph to obtain a coordinate sequence of the feature points; determining a polygonal bounding box of the text area based on the coordinate sequence of the feature points of the character center and the text boundary offset graph; and determining a classification result of the feature points of the character center, based on the coordinate sequence of the feature points of the character center and the text character classification graph.

Type: Grant

Filed: March 23, 2021

Date of Patent: December 5, 2023

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Xiaoqiang Zhang, Pengyuan Lv, Shanshan Liu, Chengquan Zhang
METHOD AND APPARATUS FOR EDITING AN IMAGE AND METHOD AND APPARATUS FOR TRAINING AN IMAGE EDITING MODEL, DEVICE AND MEDIUM

Publication number: 20230377225

Abstract: A method for training an image editing model includes steps described below. Covering processing is performed on a region of interest determined in an original image so that a background image sample is formed, and content corresponding to the region of interest is determined as a sample of content of interest; the background image sample and the sample of the content of interest are input into an image editing model; fusion processing is performed on a background image feature and a feature of the region of interest by using the image editing model so that a fusion feature is formed; an image reconstruction operation is performed according to the fusion feature by using the image editing model so that a reconstructed image is output; and optimization training is performed on the image editing model according to a loss relationship between the reconstructed image and the original image.

Type: Application

Filed: March 14, 2023

Publication date: November 23, 2023

Inventors: Chengquan ZHANG, Yuechen YU, Liang WU
Method and apparatus for recognizing text content and electronic device

Patent number: 11810384

Abstract: The present application discloses a method and an apparatus for recognizing text content, and an electronic device, and relates to a text recognition technique in the field of computer technology. The specific implementation is as follows: acquiring a dial picture; detecting at least one text centerline and a bounding box corresponding to each text centerline in the dial picture; and recognizing text content in each line of text in the dial picture based on the at least one text centerline and the bounding box corresponding to each text centerline.

Type: Grant

Filed: February 9, 2021

Date of Patent: November 7, 2023

Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventors: Shanshan Liu, Chengquan Zhang, Xuan Li, Mengyi En, Hailun Xu, Xiaoqiang Zhang
Character recognition method and apparatus, electronic device and computer readable storage medium

Patent number: 11775845

Abstract: A character recognition method, a character recognition apparatus, an electronic device and a computer readable storage medium are disclosed. The character recognition method includes: determining semantic information and first position information of each individual character recognized from an image; constructing a graph network according to the semantic information and the first position information of each individual character; and determining a character recognition result of the image according to a feature of each individual character calculated by the graph network.

Type: Grant

Filed: March 23, 2021

Date of Patent: October 3, 2023

Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventors: Xiaoqiang Zhang, Chengquan Zhang, Shanshan Liu
METHOD FOR TRAINING ROI DETECTION MODEL, METHOD FOR DETECTING ROI, DEVICE, AND MEDIUM

Publication number: 20230290126

Abstract: Provided are a method for training a region of interest (ROI) detection model, a method for detecting an ROI, a device, and a medium. The specific implementation includes: performing feature extraction on a sample image to obtain a sample feature data; performing non-linear mapping on the sample feature data to obtain a first feature data and a second feature data; determining an inter-region difference data according to the second feature data and a third feature data of the first feature data in a region associated with a label ROI; and adjusting at least one of a to-be-trained feature extraction parameter and a to-be-trained feature enhancement parameter of the ROI detection model according to the inter-region difference data and the region associated with the label ROI.

Type: Application

Filed: February 28, 2023

Publication date: September 14, 2023

Inventors: Pengyuan LV, Sen FAN, Chengquan ZHANG, Kun YAO, Junyu HAN, Jingtuo LIU, Errui DING, Jingdong WANG
Method and Apparatus for Recognizing Document Image, Storage Medium and Electronic Device

Publication number: 20230260306

Abstract: A method and an apparatus is provided for recognizing a document image, a storage medium and an electronic device, relates to the technical field of artificial intelligent recognition, particularly relates to the technical fields of deep learning and computer vision. The method includes that a document image to be recognized is transformed into an image feature map, where the document image at least includes at least one text box and text information including multiple characters; a first recognition content of the document image to be recognized is predicted based on the image feature map, the multiple characters and the text box; the document image to be recognized is recognized based on an optical character recognition algorithm to obtain a second recognition content; and the first recognition content is matched with the second recognition content to obtain a target recognition content.

Type: Application

Filed: August 9, 2022

Publication date: August 17, 2023

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventors: Yuechen YU, Chengquan ZHANG, Kun YAO
CHARACTER RECOGNITION MODEL TRAINING METHOD AND APPARATUS, CHARACTER RECOGNITION METHOD AND APPARATUS, DEVICE AND STORAGE MEDIUM

Publication number: 20230215203

Abstract: The present disclosure provides a character recognition model training method and apparatus, a character recognition method and apparatus, a device and a medium, relating to the technical field of artificial intelligence, and specifically to the technical fields of deep learning, image processing and computer vision, which can be applied to scenarios such as character detection and recognition technology. The specific implementing solution is: partitioning an untagged training sample into at least two sub-sample images; dividing the at least two sub-sample images into a first training set and a second training set; where the first training set includes a first sub-sample image with a visible attribute, and the second training set includes a second sub-sample image with an invisible attribute; performing self-supervised training on a to-be-trained encoder by taking the second training set as a tag of the first training set, to obtain a target encoder.

Type: Application

Filed: February 14, 2023

Publication date: July 6, 2023

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Pengyuan LV, Chengquan ZHANG, Shanshan LIU, Meina QIAO, Yangliu XU, Liang WU, Xiaoyan WANG, Kun YAO, Junyu Han, Errui DING, Jingdong WANG, Tian WU, Haifeng WANG
Optical character recognition method and apparatus, electronic device and storage medium

Patent number: 11694461

Abstract: The present application discloses a method and an apparatus for optical character recognition, an electronic device and a storage medium, and relates to the fields of artificial intelligence and deep learning. The method may include: determining, for a to-be-recognized image, a text bounding box of a text area therein, and extracting a text area image from the to-be-recognized image according to the text bounding box; determining a bounding box of text lines in the text area image, and extracting a text-line image from the text area image according to the bounding box; and performing text sequence recognition on the text-line image, and obtaining a recognition result. The application of the solution in the present application can improve a recognition speed and the like.

Type: Grant

Filed: March 11, 2021

Date of Patent: July 4, 2023

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Mengyi En, Shanshan Liu, Xuan Li, Chengquan Zhang, Hailun Xu, Xiaoqiang Zhang
METHOD FOR RECOGNIZING TEXT, DEVICE, AND STORAGE MEDIUM

Publication number: 20230206667

Abstract: A method for recognizing text includes: obtaining a first feature map of an image; for each target feature unit, performing a feature enhancement process on a plurality of feature values of the target feature unit respectively based on the plurality of feature values of the target feature unit, in which the target feature unit is a feature unit in the first feature map along a feature enhancement direction; and performing a text recognition process on the image based on the first feature map after the feature enhancement process.

Type: Application

Filed: December 29, 2022

Publication date: June 29, 2023

Inventors: Pengyuan LV, Liang WU, Shanshan LIU, Meina QIAO, Chengquan ZHANG, Kun YAO, Junyu HAN
CHARACTER DETECTION METHOD AND APPARATUS , MODEL TRAINING METHOD AND APPARATUS, DEVICE AND STORAGE MEDIUM

Publication number: 20230196805

Abstract: The present disclosure provides a character detection method and apparatus, a model training method and apparatus, a device and a storage medium. The specific implementation is: acquiring a training sample, where the training sample includes a sample image and a marked image, and the marked image is an image obtained by marking a text instance in the sample image; inputting the sample image into a character detection model, to obtain segmented images and image types of the segmented images output by the character detection model, where the image type indicates that the segmented image includes a text instance, or the segmented image does not include a text instance; and adjusting a parameter of the character detection model according to the segmented images, the image types of the segmented images and the marked image.

Type: Application

Filed: February 13, 2023

Publication date: June 22, 2023

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Ju HUANG, Xiaoqiang ZHANG, Xiameng QIN, Chengquan ZHANG, Kun YAO
METHOD FOR TEXT RECOGNITION

Publication number: 20230186664

Abstract: A method for text recognition is disclosed. The method includes obtaining a whole-image scenario for an image to be processed and a text image in the image to be processed. The method further includes determining a first text recognition model corresponding to the whole-image scenario. The method further includes performing text recognition on the text image according to the first text recognition model to obtain text information.

Type: Application

Filed: February 14, 2023

Publication date: June 15, 2023

Inventors: Shanshan LIU, Meina QIAO, Liang WU, Pengyuan LV, Sen FAN, Chengquan ZHANG, Kun YAO

1 2 3 next