Patents by Inventor Chengquan Zhang

Chengquan Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11881044
    Abstract: A method and apparatus for processing an image, a device and a storage medium are provided. An implementation of the method includes: acquiring a template image, the template image including at least one region of interest; determining a first feature map corresponding to each region of interest in the template image; acquiring a target image; determining a second feature map of the target image; and determining at least one region of interest in the target image according to the first feature map and the second feature map.
    Type: Grant
    Filed: June 21, 2021
    Date of Patent: January 23, 2024
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Chengquan Zhang, Mengyi En, Ju Huang, Qunyi Xie, Xiameng Qin, Kun Yao, Junyu Han, Jingtuo Liu, Errui Ding
  • Patent number: 11861919
    Abstract: A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.
    Type: Grant
    Filed: June 21, 2021
    Date of Patent: January 2, 2024
    Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
    Inventors: Chengquan Zhang, Pengyuan Lv, Kun Yao, Junyu Han, Jingtuo Liu
  • Patent number: 11854283
    Abstract: The present disclosure provides a method for visual question answering, which relates to fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; detecting visual information and position information of each of at least one text region in the input image; determining semantic information and attribute information of each of the at least one text region based on the visual information and the position information; determining a global feature of the input image based on the visual information, the position information, the semantic information, and the attribute information; determining a question feature based on the input question; and generating a predicted answer for the input image and the input question based on the global feature and the question feature. The present disclosure further provides a device for visual question answering, a computer device and a medium.
    Type: Grant
    Filed: February 5, 2021
    Date of Patent: December 26, 2023
    Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
    Inventors: Pengyuan Lv, Xiaoqiang Zhang, Shanshan Liu, Chengquan Zhang, Qiming Peng, Sijin Wu, Hua Lu, Yongfeng Chen
  • Publication number: 20230401828
    Abstract: A method for training an image recognition model includes: obtaining a training data set, in which the training data set includes first text images of each vertical category in a non-target scene and second text images of each vertical category in a target scene, and a type of text content involved in the first text images is the same as a type of text content involved in the second text image; training an initial recognition model by using the first text images, to obtain a basic recognition model; and modifying the basic recognition model by using the second text images, to obtain an image recognition model corresponding to the target scene.
    Type: Application
    Filed: April 8, 2022
    Publication date: December 14, 2023
    Inventors: Meina QIAO, Shanshan LIU, Xiameng QIN, Chengquan ZHANG, Kun YAO
  • Patent number: 11836996
    Abstract: The present disclosure discloses a method and apparatus for recognizing a text. The method comprises: acquiring images of a text area of an input image, the acquired images including a text centerline graph, a text direction offset graph, a text boundary offset graph, and a text character classification graph; extracting coordinates of feature points of a character center from the text centerline graph; sorting the extracted coordinates of the feature points based on the text direction offset graph to obtain a coordinate sequence of the feature points; determining a polygonal bounding box of the text area based on the coordinate sequence of the feature points of the character center and the text boundary offset graph; and determining a classification result of the feature points of the character center, based on the coordinate sequence of the feature points of the character center and the text character classification graph.
    Type: Grant
    Filed: March 23, 2021
    Date of Patent: December 5, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Xiaoqiang Zhang, Pengyuan Lv, Shanshan Liu, Chengquan Zhang
  • Publication number: 20230377225
    Abstract: A method for training an image editing model includes steps described below. Covering processing is performed on a region of interest determined in an original image so that a background image sample is formed, and content corresponding to the region of interest is determined as a sample of content of interest; the background image sample and the sample of the content of interest are input into an image editing model; fusion processing is performed on a background image feature and a feature of the region of interest by using the image editing model so that a fusion feature is formed; an image reconstruction operation is performed according to the fusion feature by using the image editing model so that a reconstructed image is output; and optimization training is performed on the image editing model according to a loss relationship between the reconstructed image and the original image.
    Type: Application
    Filed: March 14, 2023
    Publication date: November 23, 2023
    Inventors: Chengquan ZHANG, Yuechen YU, Liang WU
  • Patent number: 11810384
    Abstract: The present application discloses a method and an apparatus for recognizing text content, and an electronic device, and relates to a text recognition technique in the field of computer technology. The specific implementation is as follows: acquiring a dial picture; detecting at least one text centerline and a bounding box corresponding to each text centerline in the dial picture; and recognizing text content in each line of text in the dial picture based on the at least one text centerline and the bounding box corresponding to each text centerline.
    Type: Grant
    Filed: February 9, 2021
    Date of Patent: November 7, 2023
    Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
    Inventors: Shanshan Liu, Chengquan Zhang, Xuan Li, Mengyi En, Hailun Xu, Xiaoqiang Zhang
  • Patent number: 11775845
    Abstract: A character recognition method, a character recognition apparatus, an electronic device and a computer readable storage medium are disclosed. The character recognition method includes: determining semantic information and first position information of each individual character recognized from an image; constructing a graph network according to the semantic information and the first position information of each individual character; and determining a character recognition result of the image according to a feature of each individual character calculated by the graph network.
    Type: Grant
    Filed: March 23, 2021
    Date of Patent: October 3, 2023
    Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.
    Inventors: Xiaoqiang Zhang, Chengquan Zhang, Shanshan Liu
  • Publication number: 20230290126
    Abstract: Provided are a method for training a region of interest (ROI) detection model, a method for detecting an ROI, a device, and a medium. The specific implementation includes: performing feature extraction on a sample image to obtain a sample feature data; performing non-linear mapping on the sample feature data to obtain a first feature data and a second feature data; determining an inter-region difference data according to the second feature data and a third feature data of the first feature data in a region associated with a label ROI; and adjusting at least one of a to-be-trained feature extraction parameter and a to-be-trained feature enhancement parameter of the ROI detection model according to the inter-region difference data and the region associated with the label ROI.
    Type: Application
    Filed: February 28, 2023
    Publication date: September 14, 2023
    Inventors: Pengyuan LV, Sen FAN, Chengquan ZHANG, Kun YAO, Junyu HAN, Jingtuo LIU, Errui DING, Jingdong WANG
  • Publication number: 20230260306
    Abstract: A method and an apparatus is provided for recognizing a document image, a storage medium and an electronic device, relates to the technical field of artificial intelligent recognition, particularly relates to the technical fields of deep learning and computer vision. The method includes that a document image to be recognized is transformed into an image feature map, where the document image at least includes at least one text box and text information including multiple characters; a first recognition content of the document image to be recognized is predicted based on the image feature map, the multiple characters and the text box; the document image to be recognized is recognized based on an optical character recognition algorithm to obtain a second recognition content; and the first recognition content is matched with the second recognition content to obtain a target recognition content.
    Type: Application
    Filed: August 9, 2022
    Publication date: August 17, 2023
    Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.
    Inventors: Yuechen YU, Chengquan ZHANG, Kun YAO
  • Publication number: 20230215203
    Abstract: The present disclosure provides a character recognition model training method and apparatus, a character recognition method and apparatus, a device and a medium, relating to the technical field of artificial intelligence, and specifically to the technical fields of deep learning, image processing and computer vision, which can be applied to scenarios such as character detection and recognition technology. The specific implementing solution is: partitioning an untagged training sample into at least two sub-sample images; dividing the at least two sub-sample images into a first training set and a second training set; where the first training set includes a first sub-sample image with a visible attribute, and the second training set includes a second sub-sample image with an invisible attribute; performing self-supervised training on a to-be-trained encoder by taking the second training set as a tag of the first training set, to obtain a target encoder.
    Type: Application
    Filed: February 14, 2023
    Publication date: July 6, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Pengyuan LV, Chengquan ZHANG, Shanshan LIU, Meina QIAO, Yangliu XU, Liang WU, Xiaoyan WANG, Kun YAO, Junyu Han, Errui DING, Jingdong WANG, Tian WU, Haifeng WANG
  • Patent number: 11694461
    Abstract: The present application discloses a method and an apparatus for optical character recognition, an electronic device and a storage medium, and relates to the fields of artificial intelligence and deep learning. The method may include: determining, for a to-be-recognized image, a text bounding box of a text area therein, and extracting a text area image from the to-be-recognized image according to the text bounding box; determining a bounding box of text lines in the text area image, and extracting a text-line image from the text area image according to the bounding box; and performing text sequence recognition on the text-line image, and obtaining a recognition result. The application of the solution in the present application can improve a recognition speed and the like.
    Type: Grant
    Filed: March 11, 2021
    Date of Patent: July 4, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Mengyi En, Shanshan Liu, Xuan Li, Chengquan Zhang, Hailun Xu, Xiaoqiang Zhang
  • Publication number: 20230206667
    Abstract: A method for recognizing text includes: obtaining a first feature map of an image; for each target feature unit, performing a feature enhancement process on a plurality of feature values of the target feature unit respectively based on the plurality of feature values of the target feature unit, in which the target feature unit is a feature unit in the first feature map along a feature enhancement direction; and performing a text recognition process on the image based on the first feature map after the feature enhancement process.
    Type: Application
    Filed: December 29, 2022
    Publication date: June 29, 2023
    Inventors: Pengyuan LV, Liang WU, Shanshan LIU, Meina QIAO, Chengquan ZHANG, Kun YAO, Junyu HAN
  • Publication number: 20230196805
    Abstract: The present disclosure provides a character detection method and apparatus, a model training method and apparatus, a device and a storage medium. The specific implementation is: acquiring a training sample, where the training sample includes a sample image and a marked image, and the marked image is an image obtained by marking a text instance in the sample image; inputting the sample image into a character detection model, to obtain segmented images and image types of the segmented images output by the character detection model, where the image type indicates that the segmented image includes a text instance, or the segmented image does not include a text instance; and adjusting a parameter of the character detection model according to the segmented images, the image types of the segmented images and the marked image.
    Type: Application
    Filed: February 13, 2023
    Publication date: June 22, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Ju HUANG, Xiaoqiang ZHANG, Xiameng QIN, Chengquan ZHANG, Kun YAO
  • Publication number: 20230186664
    Abstract: A method for text recognition is disclosed. The method includes obtaining a whole-image scenario for an image to be processed and a text image in the image to be processed. The method further includes determining a first text recognition model corresponding to the whole-image scenario. The method further includes performing text recognition on the text image according to the first text recognition model to obtain text information.
    Type: Application
    Filed: February 14, 2023
    Publication date: June 15, 2023
    Inventors: Shanshan LIU, Meina QIAO, Liang WU, Pengyuan LV, Sen FAN, Chengquan ZHANG, Kun YAO
  • Publication number: 20230123327
    Abstract: A method for recognizing text includes: obtaining an image sequence feature of an image to be recognized; obtaining a full text string of the image to be recognized by decoding the image sequence feature; obtaining a text sequence feature by performing a semantic enhancement process on the full text string, in which the image sequence feature, the full text string and the text sequence feature are of the same length; and determining text content of the image to be recognized based on the full text string and the text sequence feature.
    Type: Application
    Filed: December 19, 2022
    Publication date: April 20, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Chengquan Zhang, Pengyuan Lv, Kun Yao, Junyu Han, Jingtuo Liu
  • Publication number: 20230050079
    Abstract: Provided are a text recognition method, an electronic device, and a non-transitory computer-readable storage medium, which are applicable in an OCR scenario. In the particular solution, a text image to be recognized is acquired. Feature extraction is performed on the text image, to obtain an image feature corresponding to the text image, where a height-wise feature and a width-wise feature of the image feature each have a dimension greater than 1. According to the image feature, sampling features corresponding to multiple sampling points in the text image are determined. According to the sampling features corresponding to the multiple sampling points, a character recognition result corresponding to the text image is determined.
    Type: Application
    Filed: October 27, 2022
    Publication date: February 16, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Pengyuan LV, Xiaoyan WANG, Liang WU, Shanshan LIU, Yuechen YU, Meina QIAO, Jie LU, Chengquan ZHANG, Kun YAO
  • Publication number: 20230042234
    Abstract: A method for training a model includes: obtaining a scene image, second actual characters in the scene image and a second construct image; obtaining first features and first recognition characters of characters obtained by performing character recognition on the scene image using the model to be trained; obtaining second features of characters obtained by performing character recognition on the second construct image using the training auxiliary model; and obtaining a character recognition model by adjusting model parameters of the model to be trained based on the first recognition characters, the second actual characters, the first features and the second features.
    Type: Application
    Filed: October 24, 2022
    Publication date: February 9, 2023
    Inventors: Yangliu XU, Qunyi Xie, Yi Chen, Xiameng Qin, Chengquan Zhang, Kun Yao
  • Publication number: 20230045715
    Abstract: The present disclosure provides a text detection method, a text recognition method and an apparatus, which relate to the field of artificial intelligence technology, in particular to the field of deep learning and computer vision technologies, and can be applied to scenarios such as optical character recognition. The text detection method is: acquiring an image feature of a text strip in a to-be-recognized image; performing visual enhancement processing on the to-be-recognized image to obtain an enhanced feature map of the to-be-recognized image; comparing the image feature of the text strip with the enhanced feature map for similarity to obtain a target bounding box of the text strip on the enhanced feature map.
    Type: Application
    Filed: October 14, 2022
    Publication date: February 9, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Chengquan ZHANG, Pengyuan LV, Sen FAN, Kun YAO, Junyu HAN, Jingtuo LIU
  • Publication number: 20230020022
    Abstract: A method of recognizing a text, which relates to a field of an artificial intelligence technology, in particular to a field of computer vision and deep learning technology, and may be applied to optical character recognition or other applications. The method includes: acquiring a plurality of image sequences by continuously scanning a document; performing an image stitching, so as to obtain a plurality of successive frames of stitched images corresponding to the plurality of image sequences respectively, an overlapping region exists between each two successive frames of stitched images; performing a text recognition based on the plurality of successive frames of stitched images, so as to obtain a plurality of corresponding recognition results; and performing a de-duplication on the plurality of recognition results based on the overlapping region between each two successive frames of stitched images, so as to obtain a text recognition result for the document.
    Type: Application
    Filed: August 11, 2022
    Publication date: January 19, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Shanshan LIU, Meina QIAO, Liang WU, Chengquan ZHANG, Kun YAO