Patents by Inventor Pengyuan LV

Pengyuan LV has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Character recognition method and apparatus, computer device, and storage medium

Patent number: 12094229

Abstract: A computer device extracts an image feature of an image that includes one or more characters to be recognized. The image feature includes a plurality of image feature vectors. The device uses an attention mechanism to compute and output attention weight values corresponding to the target number of characters, based on the image feature vectors, through parallel computing. Each of the attention weight values corresponds to one or more respective characters and represents an importance of the plurality of image feature vectors for the respective characters. The device obtains at least one character according to the plurality of image feature vectors and the target number of attention weight values. Therefore, in a character recognition process, with recognition based on the foregoing attention mechanism, a character in any shape can be effectively recognized by using a simple procedure, thereby avoiding a cyclic operation process and greatly improving operation efficiency.

Type: Grant

Filed: September 15, 2021

Date of Patent: September 17, 2024

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Pengyuan Lv, Zhicheng Yang, Xinhang Leng, Ruiyu Li, Xiaoyong Shen, Yuwing Tai, Jiaya Jia
METHOD OF TRAINING DEEP LEARNING MODEL FOR TEXT DETECTION AND TEXT DETECTION METHOD

Publication number: 20240304015

Abstract: The present disclosure provides a method of training a deep learning model for text detection and a text detection method, which relates to the technical field of artificial intelligence, and in particular, to the technical field of computer vision and deep learning and can be used in scenarios of OCR optical character recognition. A method of training a deep learning model for text detection is provided, in which a single character segmentation sub-network outputs a single character segmentation prediction result, a text line segmentation sub-network outputs a text line segmentation prediction result, the trained deep learning model can be used for detecting a text area; and, can at the same time achieve single character segmentation and text line segmentation, and thus is capable to perform text detection by combining two ways of text segmentation, which further improves the accuracy of text area detection.

Type: Application

Filed: April 21, 2022

Publication date: September 12, 2024

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventors: Sen FAN, Xiaoyan WANG, Pengyuan LV, Chengquan ZHANG, Kun YAO
Adjusting method of yaw control strategy, yaw control system and medium

Patent number: 12071932

Abstract: The yaw control system may obtain time series data of a wind turbine generator set in response to a strategy adjustment request, the time series data of the wind turbine generator set comprising time series data for a wind facing angle; determine a generator set operating duration corresponding to each wind facing angle according to the time series data of the wind facing angle; determine a data distribution characteristic of the generator set operating duration corresponding to the wind facing angle according to generator set operating durations corresponding to multiple wind facing angles; and when identifying that the data distribution characteristic of the generator set operating duration corresponding to the wind facing angle meets a corresponding strategy adjustment condition, adjust a yaw control strategy, to perform yaw control on the wind turbine generator set according to an adjusted yaw control strategy.

Type: Grant

Filed: December 20, 2023

Date of Patent: August 27, 2024

Assignees: China Three Gorges Renewables (Group ) Co., LTD., Three Gorges New Energy Offshore Wind Power Operation and Maintenance Jiangsu Co., LTD., China Three Gorges New Energy (Group) Co., LTD. Liaoning Branch

Inventors: Haoning Xue, Pengyuan Lv, Jinjiang Lan, Yun Wang, Zhaorui Chai, Dongxing Gao, Long Jin, Mingzhe Liu, Chaoyue Geng, Xinyi Tan, Hongliang Song
METHOD OF TRAINING TEXT RECOGNITION MODEL, AND METHOD OF RECOGNIZING TEXT

Publication number: 20240281609

Abstract: The present application provides a method of training a text recognition model. The method includes: inputting a first sample image into the visual feature extraction sub-model to obtain a first visual feature and a first predicted text, the first sample image contains a text and a tag indicating a first actual text; obtaining, by using the semantic feature extraction sub-model, a first semantic feature based on the first predicted text; obtaining, by using the sequence sub-model, a second predicted text based on the first visual feature and the first semantic feature; and training the text recognition model based on the first predicted text, the second predicted text and the first actual text. The present disclosure further provides a method of recognizing a text, an electronic device, and a storage medium.

Type: Application

Filed: May 16, 2022

Publication date: August 22, 2024

Inventors: Pengyuan LV, Jingquan LI, Chengquan ZHANG, Kun YAO, Jingtuo LIU, Junyu HAN
Text recognition method and device, and electronic device

Patent number: 11861919

Abstract: A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.

Type: Grant

Filed: June 21, 2021

Date of Patent: January 2, 2024

Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventors: Chengquan Zhang, Pengyuan Lv, Kun Yao, Junyu Han, Jingtuo Liu
Method and apparatus for visual question answering, computer device and medium

Patent number: 11854283

Abstract: The present disclosure provides a method for visual question answering, which relates to fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; detecting visual information and position information of each of at least one text region in the input image; determining semantic information and attribute information of each of the at least one text region based on the visual information and the position information; determining a global feature of the input image based on the visual information, the position information, the semantic information, and the attribute information; determining a question feature based on the input question; and generating a predicted answer for the input image and the input question based on the global feature and the question feature. The present disclosure further provides a device for visual question answering, a computer device and a medium.

Type: Grant

Filed: February 5, 2021

Date of Patent: December 26, 2023

Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventors: Pengyuan Lv, Xiaoqiang Zhang, Shanshan Liu, Chengquan Zhang, Qiming Peng, Sijin Wu, Hua Lu, Yongfeng Chen
Method and apparatus for recognizing text

Patent number: 11836996

Abstract: The present disclosure discloses a method and apparatus for recognizing a text. The method comprises: acquiring images of a text area of an input image, the acquired images including a text centerline graph, a text direction offset graph, a text boundary offset graph, and a text character classification graph; extracting coordinates of feature points of a character center from the text centerline graph; sorting the extracted coordinates of the feature points based on the text direction offset graph to obtain a coordinate sequence of the feature points; determining a polygonal bounding box of the text area based on the coordinate sequence of the feature points of the character center and the text boundary offset graph; and determining a classification result of the feature points of the character center, based on the coordinate sequence of the feature points of the character center and the text character classification graph.

Type: Grant

Filed: March 23, 2021

Date of Patent: December 5, 2023

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Xiaoqiang Zhang, Pengyuan Lv, Shanshan Liu, Chengquan Zhang
METHOD FOR TRAINING ROI DETECTION MODEL, METHOD FOR DETECTING ROI, DEVICE, AND MEDIUM

Publication number: 20230290126

Abstract: Provided are a method for training a region of interest (ROI) detection model, a method for detecting an ROI, a device, and a medium. The specific implementation includes: performing feature extraction on a sample image to obtain a sample feature data; performing non-linear mapping on the sample feature data to obtain a first feature data and a second feature data; determining an inter-region difference data according to the second feature data and a third feature data of the first feature data in a region associated with a label ROI; and adjusting at least one of a to-be-trained feature extraction parameter and a to-be-trained feature enhancement parameter of the ROI detection model according to the inter-region difference data and the region associated with the label ROI.

Type: Application

Filed: February 28, 2023

Publication date: September 14, 2023

Inventors: Pengyuan LV, Sen FAN, Chengquan ZHANG, Kun YAO, Junyu HAN, Jingtuo LIU, Errui DING, Jingdong WANG
CHARACTER RECOGNITION MODEL TRAINING METHOD AND APPARATUS, CHARACTER RECOGNITION METHOD AND APPARATUS, DEVICE AND STORAGE MEDIUM

Publication number: 20230215203

Abstract: The present disclosure provides a character recognition model training method and apparatus, a character recognition method and apparatus, a device and a medium, relating to the technical field of artificial intelligence, and specifically to the technical fields of deep learning, image processing and computer vision, which can be applied to scenarios such as character detection and recognition technology. The specific implementing solution is: partitioning an untagged training sample into at least two sub-sample images; dividing the at least two sub-sample images into a first training set and a second training set; where the first training set includes a first sub-sample image with a visible attribute, and the second training set includes a second sub-sample image with an invisible attribute; performing self-supervised training on a to-be-trained encoder by taking the second training set as a tag of the first training set, to obtain a target encoder.

Type: Application

Filed: February 14, 2023

Publication date: July 6, 2023

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Pengyuan LV, Chengquan ZHANG, Shanshan LIU, Meina QIAO, Yangliu XU, Liang WU, Xiaoyan WANG, Kun YAO, Junyu Han, Errui DING, Jingdong WANG, Tian WU, Haifeng WANG
METHOD FOR RECOGNIZING TEXT, DEVICE, AND STORAGE MEDIUM

Publication number: 20230206667

Abstract: A method for recognizing text includes: obtaining a first feature map of an image; for each target feature unit, performing a feature enhancement process on a plurality of feature values of the target feature unit respectively based on the plurality of feature values of the target feature unit, in which the target feature unit is a feature unit in the first feature map along a feature enhancement direction; and performing a text recognition process on the image based on the first feature map after the feature enhancement process.

Type: Application

Filed: December 29, 2022

Publication date: June 29, 2023

Inventors: Pengyuan LV, Liang WU, Shanshan LIU, Meina QIAO, Chengquan ZHANG, Kun YAO, Junyu HAN
METHOD FOR TEXT RECOGNITION

Publication number: 20230186664

Abstract: A method for text recognition is disclosed. The method includes obtaining a whole-image scenario for an image to be processed and a text image in the image to be processed. The method further includes determining a first text recognition model corresponding to the whole-image scenario. The method further includes performing text recognition on the text image according to the first text recognition model to obtain text information.

Type: Application

Filed: February 14, 2023

Publication date: June 15, 2023

Inventors: Shanshan LIU, Meina QIAO, Liang WU, Pengyuan LV, Sen FAN, Chengquan ZHANG, Kun YAO
METHOD AND DEVICE FOR RECOGNIZING TEXT, AND METHOD AND DEVICE FOR TRAINING TEXT RECOGNITION MODEL

Publication number: 20230123327

Abstract: A method for recognizing text includes: obtaining an image sequence feature of an image to be recognized; obtaining a full text string of the image to be recognized by decoding the image sequence feature; obtaining a text sequence feature by performing a semantic enhancement process on the full text string, in which the image sequence feature, the full text string and the text sequence feature are of the same length; and determining text content of the image to be recognized based on the full text string and the text sequence feature.

Type: Application

Filed: December 19, 2022

Publication date: April 20, 2023

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Chengquan Zhang, Pengyuan Lv, Kun Yao, Junyu Han, Jingtuo Liu
TEXT RECOGNITION METHOD, ELECTRONIC DEVICE, AND NON-TRANSITORY STORAGE MEDIUM

Publication number: 20230050079

Abstract: Provided are a text recognition method, an electronic device, and a non-transitory computer-readable storage medium, which are applicable in an OCR scenario. In the particular solution, a text image to be recognized is acquired. Feature extraction is performed on the text image, to obtain an image feature corresponding to the text image, where a height-wise feature and a width-wise feature of the image feature each have a dimension greater than 1. According to the image feature, sampling features corresponding to multiple sampling points in the text image are determined. According to the sampling features corresponding to the multiple sampling points, a character recognition result corresponding to the text image is determined.

Type: Application

Filed: October 27, 2022

Publication date: February 16, 2023

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Pengyuan LV, Xiaoyan WANG, Liang WU, Shanshan LIU, Yuechen YU, Meina QIAO, Jie LU, Chengquan ZHANG, Kun YAO
TEXT DETECTION METHOD, TEXT RECOGNITION METHOD AND APPARATUS

Publication number: 20230045715

Abstract: The present disclosure provides a text detection method, a text recognition method and an apparatus, which relate to the field of artificial intelligence technology, in particular to the field of deep learning and computer vision technologies, and can be applied to scenarios such as optical character recognition. The text detection method is: acquiring an image feature of a text strip in a to-be-recognized image; performing visual enhancement processing on the to-be-recognized image to obtain an enhanced feature map of the to-be-recognized image; comparing the image feature of the text strip with the enhanced feature map for similarity to obtain a target bounding box of the text strip on the enhanced feature map.

Type: Application

Filed: October 14, 2022

Publication date: February 9, 2023

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Chengquan ZHANG, Pengyuan LV, Sen FAN, Kun YAO, Junyu HAN, Jingtuo LIU
TRAINING METHOD OF TEXT RECOGNITION MODEL, TEXT RECOGNITION METHOD, AND APPARATUS

Publication number: 20220415071

Abstract: The present disclosure provides a training method of a text recognition model, a text recognition method, and an apparatus, relating to the technical field of artificial intelligence, and specifically, to the technical field of deep learning and computer vision, which can be applied in scenarios such as optional character recognition, etc. The specific implementation solution is: performing mask prediction on visual features of an acquired sample image, to obtain a predicted visual feature; performing mask prediction on semantic features of acquired sample text, to obtain a predicted semantic feature, where the sample image includes text; determining a first loss value of the text of the sample image according to the predicted visual feature; determining a second loss value of the sample text according to the predicted semantic feature; training, according to the first loss value and the second loss value, to obtain the text recognition model.

Type: Application

Filed: August 31, 2022

Publication date: December 29, 2022

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Chengquan ZHANG, Pengyuan LV, Shanshan LIU, Meina QIAO, Yangliu XU, Liang WU, Jingtuo LIU, Junyu HAN, Errui DING, Jingdong WANG
CHARACTER RECOGNITION METHOD, MODEL TRAINING METHOD, RELATED APPARATUS AND ELECTRONIC DEVICE

Publication number: 20220139096

Abstract: A character recognition method, a model training method, a related apparatus and an electronic device are provided. The specific solution is: obtaining a target picture; performing feature encoding on the target picture to obtain a visual feature of the target picture; performing feature mapping on the visual feature to obtain a first target feature of the target picture, where the first target feature is a feature that has a matching space with a feature of character semantic information of the target picture; inputting the first target feature into a character recognition model for character recognition to obtain a first character recognition result of the target picture.

Type: Application

Filed: January 19, 2022

Publication date: May 5, 2022

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventors: Pengyuan Lv, Chengquan Zhang, Kun Yao, Junyu Han
CHARACTER RECOGNITION METHOD AND APPARATUS, COMPUTER DEVICE, AND STORAGE MEDIUM

Publication number: 20220004794

Abstract: A computer device extracts an image feature of an image that includes one or more characters to be recognized. The image feature includes a plurality of image feature vectors. The device uses an attention mechanism to compute and output attention weight values corresponding to the target number of characters, based on the image feature vectors, through parallel computing. Each of the attention weight values corresponds to one or more respective characters and represents an importance of the plurality of image feature vectors for the respective characters. The device obtains at least one character according to the plurality of image feature vectors and the target number of attention weight values. Therefore, in a character recognition process, with recognition based on the foregoing attention mechanism, a character in any shape can be effectively recognized by using a simple procedure, thereby avoiding a cyclic operation process and greatly improving operation efficiency.

Type: Application

Filed: September 15, 2021

Publication date: January 6, 2022

Inventors: Pengyuan LV, Zhicheng YANG, Xinhang LENG, Ruiyu LI, Xiaoyong SHEN, Yuwing TAI, Jiaya JIA
METHOD AND APPARATUS FOR VISUAL QUESTION ANSWERING, COMPUTER DEVICE AND MEDIUM

Publication number: 20210406619

Abstract: The present disclosure provides a method for visual question answering, which relates to fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; detecting visual information and position information of each of at least one text region in the input image; determining semantic information and attribute information of each of the at least one text region based on the visual information and the position information; determining a global feature of the input image based on the visual information, the position information, the semantic information, and the attribute information; determining a question feature based on the input question; and generating a predicted answer for the input image and the input question based on the global feature and the question feature. The present disclosure further provides a device for visual question answering, a computer device and a medium.

Type: Application

Filed: February 5, 2021

Publication date: December 30, 2021

Inventors: Pengyuan LV, Xiaoqiang ZHANG, Shanshan LIU, Chengquan ZHANG, Qiming PENG, Sijin WU, Hua LU, Yongfeng CHEN
TEXT RECOGNITION METHOD AND DEVICE, AND ELECTRONIC DEVICE

Publication number: 20210357710

Abstract: A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.

Type: Application

Filed: June 21, 2021

Publication date: November 18, 2021

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventors: Chengquan Zhang, Pengyuan Lv, Kun Yao, Junyu Han, Jingtuo Liu
METHOD AND APPARATUS FOR CHARACTER RECOGNITION AND PROCESSING

Publication number: 20210342621

Abstract: The disclosure provides a method and an apparatus for character recognition and processing. A character region is labelled for each character contained in each sample image of a sample image set. A character category and a character position code corresponding to each character region are labelled. A preset neural network model for character recognition is trained based on the sample image set having labelled character regions, character categories and character position codes corresponding to the character regions.

Type: Application

Filed: July 12, 2021

Publication date: November 4, 2021

Inventors: Pengyuan LV, Chengquan Zhang

1 2 next