Patents by Inventor Mengyi En

Mengyi En has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11881044
    Abstract: A method and apparatus for processing an image, a device and a storage medium are provided. An implementation of the method includes: acquiring a template image, the template image including at least one region of interest; determining a first feature map corresponding to each region of interest in the template image; acquiring a target image; determining a second feature map of the target image; and determining at least one region of interest in the target image according to the first feature map and the second feature map.
    Type: Grant
    Filed: June 21, 2021
    Date of Patent: January 23, 2024
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Chengquan Zhang, Mengyi En, Ju Huang, Qunyi Xie, Xiameng Qin, Kun Yao, Junyu Han, Jingtuo Liu, Errui Ding
  • Patent number: 11810384
    Abstract: The present application discloses a method and an apparatus for recognizing text content, and an electronic device, and relates to a text recognition technique in the field of computer technology. The specific implementation is as follows: acquiring a dial picture; detecting at least one text centerline and a bounding box corresponding to each text centerline in the dial picture; and recognizing text content in each line of text in the dial picture based on the at least one text centerline and the bounding box corresponding to each text centerline.
    Type: Grant
    Filed: February 9, 2021
    Date of Patent: November 7, 2023
    Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
    Inventors: Shanshan Liu, Chengquan Zhang, Xuan Li, Mengyi En, Hailun Xu, Xiaoqiang Zhang
  • Patent number: 11694461
    Abstract: The present application discloses a method and an apparatus for optical character recognition, an electronic device and a storage medium, and relates to the fields of artificial intelligence and deep learning. The method may include: determining, for a to-be-recognized image, a text bounding box of a text area therein, and extracting a text area image from the to-be-recognized image according to the text bounding box; determining a bounding box of text lines in the text area image, and extracting a text-line image from the text area image according to the bounding box; and performing text sequence recognition on the text-line image, and obtaining a recognition result. The application of the solution in the present application can improve a recognition speed and the like.
    Type: Grant
    Filed: March 11, 2021
    Date of Patent: July 4, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Mengyi En, Shanshan Liu, Xuan Li, Chengquan Zhang, Hailun Xu, Xiaoqiang Zhang
  • Publication number: 20230134615
    Abstract: A method of processing a task, an electronic device, and a storage medium are provided, which relate to a field of artificial intelligence, in particular to fields of deep learning and computer vision, and may be applied to OCR optical character recognition and other scenarios. The method includes: parsing labeled data to be processed according to a task type identification, to obtain task labeled data, a tag information of the task labeled data is matched with the task type identification, and the task labeled data includes first task labeled data and second task labeled data; training a model using the first task labeled data, to obtain candidate models, the model is determined according to the task type identification; and determining a target model from the candidate models according to a performance evaluation result obtained by performing performance evaluation on the plurality of candidate models using the second task labeled data.
    Type: Application
    Filed: December 27, 2022
    Publication date: May 4, 2023
    Inventors: Qunyi XIE, Dongdong ZHANG, Xiameng QIN, Mengyi EN, Yangliu XU, Yi CHEN, Ju HUANG, Kun YAO
  • Publication number: 20230048495
    Abstract: A method and a platform of generating a document, an electronic device, and a storage medium are provided, which relate to a field of an artificial intelligence technology, in particular to fields of computer vision and deep learning technologies, and may be applied to a text recognition scenario and other scenarios. The method includes: performing a category recognition on a document picture to obtain a target category result; determining a target structured model matched with the target category result; and performing, by using the target structured model, a structure recognition on the document picture to obtain a structure recognition result, so as to generate an electronic document based on the structure recognition result, wherein the structure recognition result includes a field attribute recognition result and a field position recognition result.
    Type: Application
    Filed: October 26, 2022
    Publication date: February 16, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Qunyi XIE, Xiameng QIN, Mengyi EN, Dongdong ZHANG, Ju HUANG, Yangliu XU, Yi CHEN, Kun YAO
  • Patent number: 11482023
    Abstract: A method and apparatus for detecting text regions in an image, a device, and a medium are provided. The method may include: detecting, based on feature representation of an image, a first text region in the image, where the first text region covers a text in the image, a region occupied by the text being of a certain shape; determining, based on a feature block of the first text region, text geometry information associated with the text, where the text geometry information includes a text centerline of the text and distance information of the centerline from the upper and lower borders of the text; and adjusting, based on the text geometry information associated with the text, the first text region to a second text region, where the second text region also covers the text and is smaller than the first text region.
    Type: Grant
    Filed: December 11, 2019
    Date of Patent: October 25, 2022
    Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.
    Inventors: Chengquan Zhang, Zuming Huang, Mengyi En, Junyu Han, Errui Ding
  • Publication number: 20220092353
    Abstract: A computer-implemented method includes: acquiring training data, the training data includes training images for a preset vertical type, and the training images include a first training image containing real data of the preset vertical type and a second training image containing virtual data of the preset vertical type ; building a basic model, the basic model includes a deep learning network, and the deep learning network is configured to recognize the training images to extract text data in the training image; and training the basic model by using the training data to obtain the image recognition model.
    Type: Application
    Filed: December 1, 2021
    Publication date: March 24, 2022
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Ruixue Liu, Xiameng Qin, Mengyi En, Kun Yao, Chengquan Zhang, Shengxian Zhu, Yunhao Li, Junyu Han, Hao Sun
  • Publication number: 20210390296
    Abstract: The present application discloses a method and an apparatus for optical character recognition, an electronic device and a storage medium, and relates to the fields of artificial intelligence and deep learning. The method may include: determining, for a to-be-recognized image, a text bounding box of a text area therein, and extracting a text area image from the to-be-recognized image according to the text bounding box; determining a bounding box of text lines in the text area image, and extracting a text-line image from the text area image according to the bounding box; and performing text sequence recognition on the text-line image, and obtaining a recognition result. The application of the solution in the present application can improve a recognition speed and the like.
    Type: Application
    Filed: March 11, 2021
    Publication date: December 16, 2021
    Inventors: Mengyi En, Shanshan Liu, Xuan Li, Chengquan Zhang, Hailun Xu, Xiaoqiang Zhang
  • Publication number: 20210334602
    Abstract: The present application discloses a method and an apparatus for recognizing text content, and an electronic device, and relates to a text recognition technique in the field of computer technology. The specific implementation is as follows: acquiring a dial picture; detecting at least one text centerline and a bounding box corresponding to each text centerline in the dial picture; and recognizing text content in each line of text in the dial picture based on the at least one text centerline and the bounding box corresponding to each text centerline.
    Type: Application
    Filed: February 9, 2021
    Publication date: October 28, 2021
    Inventors: Shanshan Liu, Chengquan Zhang, Xuan Li, Mengyi En, Hailun Xu, Xiaoqiang Zhang
  • Publication number: 20210312174
    Abstract: A method and apparatus for processing an image, a device and a storage medium are provided. An implementation of the method includes: acquiring a template image, the template image including at least one region of interest; determining a first feature map corresponding to each region of interest in the template image; acquiring a target image; determining a second feature map of the target image; and determining at least one region of interest in the target image according to the first feature map and the second feature map.
    Type: Application
    Filed: June 21, 2021
    Publication date: October 7, 2021
    Inventors: Chengquan ZHANG, Mengyi EN, Ju HUANG, Qunyi XIE, Xiameng QIN, Kun YAO, Junyu HAN, Jingtuo LIU, Errui DING
  • Publication number: 20200327384
    Abstract: Embodiments of the present disclosure provide a method and apparatus for detecting text regions in an image, a device, and a medium. The method may include: detecting, based on feature representation of an image, a first text region in the image, where the first text region covers a text in the image, a region occupied by the text being of a certain shape; determining, based on a feature block of the first text region, text geometry information associated with the text, where the text geometry information includes a text centerline of the text and distance information of the centerline from the upper and lower borders of the text; and adjusting, based on the text geometry information associated with the text, the first text region to a second text region, where the second text region also covers the text and is smaller than the first text region.
    Type: Application
    Filed: December 11, 2019
    Publication date: October 15, 2020
    Inventors: Chengquan Zhang, Zuming Huang, Mengyi En, Junyu Han, Errui Ding