Patents by Inventor Errui DING

Errui DING has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230213388
    Abstract: A method and an apparatus for measuring temperature, and a computer-readable storage medium includes detecting a target position of an object in an input image; determining key points of the target position and weight information of each key point based on a detection result of the target position, in which the weight information is configured to indicate a probability of each key point being covered; acquiring temperature information of each key point; and determining a temperature of the target position at least based on the temperature information and the weight information of each key point.
    Type: Application
    Filed: October 14, 2020
    Publication date: July 6, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Haocheng Feng, Haixiao Yue, Keyao Wang, Gang Zhang, Yanwen Fan, Xiyu Yu, Junyu Han, Jingtuo Liu, Errui Ding, Haifeng Wang
  • Publication number: 20230215203
    Abstract: The present disclosure provides a character recognition model training method and apparatus, a character recognition method and apparatus, a device and a medium, relating to the technical field of artificial intelligence, and specifically to the technical fields of deep learning, image processing and computer vision, which can be applied to scenarios such as character detection and recognition technology. The specific implementing solution is: partitioning an untagged training sample into at least two sub-sample images; dividing the at least two sub-sample images into a first training set and a second training set; where the first training set includes a first sub-sample image with a visible attribute, and the second training set includes a second sub-sample image with an invisible attribute; performing self-supervised training on a to-be-trained encoder by taking the second training set as a tag of the first training set, to obtain a target encoder.
    Type: Application
    Filed: February 14, 2023
    Publication date: July 6, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Pengyuan LV, Chengquan ZHANG, Shanshan LIU, Meina QIAO, Yangliu XU, Liang WU, Xiaoyan WANG, Kun YAO, Junyu Han, Errui DING, Jingdong WANG, Tian WU, Haifeng WANG
  • Publication number: 20230215136
    Abstract: The present disclosure provides a method and apparatus for training a multi-modal data matching degree calculation model, a method and apparatus for calculating a multi-modal data matching degree, an electronic device, a computer readable storage medium and a computer program product, and relates to the field of artificial intelligence technology such as deep learning, image processing and computer vision. The method comprises: acquiring first sample data and second sample data that are different in modalities; constructing a contrastive learning loss function comprising a semantic perplexity parameter, the semantic perplexity parameter being determined based on a semantic feature distance between the first sample data and the second sample data; and training, by using the contrastive learning loss function, an initial multi-modal data matching degree calculation model through a contrastive learning approach, to obtain a target multi-modal data matching degree calculation model.
    Type: Application
    Filed: February 24, 2023
    Publication date: July 6, 2023
    Inventors: Haoran WANG, Dongliang HE, Fu LI, Errui DING
  • Patent number: 11694436
    Abstract: The present application discloses a vehicle re-identification method and apparatus, a device and a storage medium, which relates to the field of computer vision, intelligent search, deep learning and intelligent transportation. The specific implementation scheme is: receiving a re-identification request from a terminal device, the re-identification request including a first image of a first vehicle shot by a first camera and information of the first camera; acquiring a first feature of the first vehicle and a first head orientation of the first vehicle according to the first image; determining a second image of the first vehicle from images of multiple vehicles according to the first feature, multiple second features extracted based on the images of the multiple vehicles in an image database, the first head orientation of the first vehicle, and the information of the first camera; and transmitting the second image to the terminal device.
    Type: Grant
    Filed: February 1, 2021
    Date of Patent: July 4, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Minyue Jiang, Xiao Tan, Hao Sun, Hongwu Zhang, Shilei Wen, Errui Ding
  • Publication number: 20230186486
    Abstract: A method for tracking vehicles includes: extracting a target image at a current moment from a video stream obtained during traveling of vehicles; performing instance segmentation on the target image to obtain detection boxes corresponding to individual vehicles in the target image; extracting, from the detection box for each vehicle, a set of pixel points corresponding to each vehicle; processing image features of each pixel point in the set of pixel points corresponding to each vehicle to determine features of each vehicle in the target image; and determining, according to the features of each vehicle in the target image and the degree of matching between the features of each vehicle in past images, movement trajectory of each vehicle in the target image, wherein the past images are n images adjacent to and before the target image in the video stream, and n is a positive integer.
    Type: Application
    Filed: October 30, 2020
    Publication date: June 15, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Wei Zhang, Xiao Tan, Hao Sun, Shilei Wen, Hongwu Zhang, Errui Ding
  • Publication number: 20230147550
    Abstract: A method for pre-training a semantic representation model includes: for each video-text pair in pre-training data, determining a mask image sequence, a mask character sequence, and a mask image-character sequence of the video-text pair; determining a plurality of feature sequences and mask position prediction results respectively corresponding to the plurality of feature sequences by inputting the mask image sequence, the mask character sequence, and the mask image-character sequence into an initial semantic representation model; and building a loss function based on the plurality of feature sequences, the mask position prediction results respectively corresponding to the plurality of feature sequences and true mask position results, and adjusting coefficients of the semantic representation model to realize training.
    Type: Application
    Filed: November 1, 2022
    Publication date: May 11, 2023
    Inventors: Dongliang HE, Errui DING
  • Publication number: 20230130006
    Abstract: The present application provides a method of processing a video, a method of querying a video, and a method of training a video processing model. A specific implementation solution of the method of processing the video includes: extracting, for a video to be processed, a plurality of video features under a plurality of receptive fields; extracting a local feature of the video to be processed according to a video feature under a target receptive field in the plurality of receptive fields; obtaining a global feature of the video to be processed according to a video feature under a largest receptive field in the plurality of receptive fields; and merging the local feature and the global feature to obtain a target feature of the video to be processed.
    Type: Application
    Filed: December 22, 2022
    Publication date: April 27, 2023
    Inventors: Dongliang HE, Errui DING, Haifeng WANG
  • Publication number: 20230124389
    Abstract: A model determination method and electronic device is provided, and relates to the technical field of artificial intelligence and, in particular, to the field of computer visions and deep learning, and can be applied to image processing, image identification and other scenarios. A specific implementation solution includes an image sample and a text sample are acquired, wherein text data in the text sample is used for performing text description to target image data in the image sample; at least one image feature in the image sample is stored to a first queue, and at least text feature in the text sample is stored to a second queue; the first queue and the second queue are trained to obtain a first target model; and the first target model is determined as an initialization model for a second target model.
    Type: Application
    Filed: August 15, 2022
    Publication date: April 20, 2023
    Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.
    Inventors: Longchao WANG, Yipeng SUN, Kun YAO, Junyu HAN, Jingtuo LIU, Errui DING
  • Publication number: 20230120253
    Abstract: A method and apparatus for generating a virtual character, an electronic device and a computer readable storage medium are provided. The method includes: performing mesh simplification on an initial model of a virtual character to obtain a mesh-simplified model; obtaining a first target model by performing white model mapping rendering on an area of each material type on the mesh-simplified model, and obtaining a second target model by performing hyper-realistic rendering on the area of each material type on the mesh-simplified model; and establishing a bidirectional mapping between the first target model and the second target model, and obtaining a target virtual character through iterative updating of the bidirectional mapping.
    Type: Application
    Filed: December 16, 2022
    Publication date: April 20, 2023
    Inventors: Jie Li, Haojie LIU, Yan ZHANG, Xuecen SHEN, Ruizhi CHEN, Chen ZHAO, Yuqiao TENG, Errui DING, Tian WU, Haifeng WANG
  • Publication number: 20230120985
    Abstract: A method for training a face recognition model includes: acquiring a plurality of first training images being uncovered face images, and acquiring a plurality of covering object images; generating a plurality of second training images by separately fusing the plurality of covering object images with the uncovered face images; and training the face recognition model by inputting the plurality of first training images and the plurality of second training images into the face recognition model.
    Type: Application
    Filed: December 16, 2022
    Publication date: April 20, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Yanwen Fan, Xiyu Yu, Gang Zhang, Jingtuo Liu, Haifeng Wang, Errui Ding, Junyu Han
  • Patent number: 11615140
    Abstract: A method includes screening, by a video-clip screening module in a video description model, a plurality of video proposal clips acquired from a video to be analyzed, to acquire a plurality of video clips suitable for description. The plural video proposal clips acquired from the video to be analyzed may be screened by the video-clip screening module to acquire the plural video clips suitable for description; and then, each video clip is described by a video-clip describing module, thus avoiding description of all the video proposal clips, only describing the screened video clips which have strong correlation with the video and are suitable for description, removing the interference of the description of the video clips which are not suitable for description in the description of the video, guaranteeing the accuracy of the final descriptions of the video clips, and improving the quality of the descriptions of the video clips.
    Type: Grant
    Filed: January 8, 2021
    Date of Patent: March 28, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Xiang Long, Dongliang He, Fu Li, Xiang Zhao, Tianwei Lin, Hao Sun, Shilei Wen, Errui Ding
  • Patent number: 11610388
    Abstract: The present application discloses a method and an apparatus for detecting wearing of a safety helmet, a device and a storage medium. The method for detecting wearing of a safety helmet includes: acquiring a first image collected by a camera device, where the first image includes at least one human body image; determining the at least one human body image and at least one head image in the first image; determining a human body image corresponding to each head image in the at least one human body image according to an area where the at least one human body image is located and an area where the at least one head image is located; and processing the human body image corresponding to the at least one head image according to a type of the at least one head image.
    Type: Grant
    Filed: February 1, 2021
    Date of Patent: March 21, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Mingyuan Mao, Yuan Feng, Ying Xin, Pengcheng Yuan, Bin Zhang, Shufei Lin, Xiaodi Wang, Shumin Han, Yingbo Xu, Jingwei Liu, Shilei Wen, Hongwu Zhang, Errui Ding
  • Patent number: 11610389
    Abstract: A method and apparatus for positioning a key point, a device, and a storage medium are provided. The method may include: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point.
    Type: Grant
    Filed: March 15, 2021
    Date of Patent: March 21, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Jian Wang, Zipeng Lu, Hao Sun, Hongwu Zhang, Shilei Wen, Errui Ding
  • Patent number: 11600069
    Abstract: A method and apparatus for detecting a temporal action of a video, an electronic device and a storage medium are disclosed, which relates to the field of video processing technologies. An implementation includes: acquiring an initial temporal feature sequence of a video to be detected; acquiring, by a pre-trained video-temporal-action detecting module, implicit features and explicit features of a plurality of configured temporal anchor boxes based on the initial temporal feature sequence; and acquiring, by the video-temporal-action detecting module, the starting position and the ending position of a video clip containing a specified action, the category of the specified action and the probability that the specified action belongs to the category from the plural temporal anchor boxes according to the explicit features and the implicit features of the plural temporal anchor boxes.
    Type: Grant
    Filed: January 8, 2021
    Date of Patent: March 7, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Tianwei Lin, Xin Li, Dongliang He, Fu Li, Hao Sun, Shilei Wen, Errui Ding
  • Publication number: 20230027813
    Abstract: An object detecting method includes: obtaining an object image of an object; obtaining an object feature map by performing feature extraction on the object image; obtaining decoded features by performing feature mapping on the object feature map by adopting a mapping network of an object recognition model; obtaining positions of prediction boxes by inputting the decoded features into a first prediction layer of the object recognition model to perform object regression prediction; and obtaining classes of objects within the prediction boxes by inputting the decoded features into a second prediction layer of the object recognition model to perform object class prediction.
    Type: Application
    Filed: September 29, 2022
    Publication date: January 26, 2023
    Inventors: Xipeng YANG, Xiao TAN, Hao SUN, Errui DING
  • Publication number: 20230009547
    Abstract: A method for detecting an object based on a video includes: obtaining a plurality of image frames of a video to be detected; obtaining initial feature maps by extracting features of the plurality of image frames; for each two adjacent image frames of the plurality of image frames, obtaining a target feature map of a latter image frame of the two adjacent image frames by performing feature fusing on the sub-feature maps of the first target dimensions included in the initial feature map of a former image frame of the two adjacent image frames and the sub-feature maps of the second target dimensions included in the initial feature map of the latter image frame; and performing object detection on the respective target feature map of each image frame.
    Type: Application
    Filed: September 19, 2022
    Publication date: January 12, 2023
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Xipeng Yang, Xiao Tan, Hao Sun, Errui Ding
  • Publication number: 20220415071
    Abstract: The present disclosure provides a training method of a text recognition model, a text recognition method, and an apparatus, relating to the technical field of artificial intelligence, and specifically, to the technical field of deep learning and computer vision, which can be applied in scenarios such as optional character recognition, etc. The specific implementation solution is: performing mask prediction on visual features of an acquired sample image, to obtain a predicted visual feature; performing mask prediction on semantic features of acquired sample text, to obtain a predicted semantic feature, where the sample image includes text; determining a first loss value of the text of the sample image according to the predicted visual feature; determining a second loss value of the sample text according to the predicted semantic feature; training, according to the first loss value and the second loss value, to obtain the text recognition model.
    Type: Application
    Filed: August 31, 2022
    Publication date: December 29, 2022
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Chengquan ZHANG, Pengyuan LV, Shanshan LIU, Meina QIAO, Yangliu XU, Liang WU, Jingtuo LIU, Junyu HAN, Errui DING, Jingdong WANG
  • Patent number: 11538286
    Abstract: A method and apparatus for vehicle damage assessment, an electronic device, and a computer-readable storage medium are provided. The method may include: extracting, from an input image, a first feature characterizing a part of a vehicle and a second feature characterizing a damage type of the vehicle; integrating the first feature and the second feature to generate a third feature characterizing a corresponding relation between the part and the damage type; converting the third feature into a characteristic vector; and determining a damage recognition result based on the characteristic vector. According to the technical solution of the disclosure, users can rapidly and accurately learn about the damage condition of the vehicle by providing pictures or videos of the damaged vehicle, thus providing an objective basis for subsequent damage assessment, claim settlement, and repair.
    Type: Grant
    Filed: December 11, 2019
    Date of Patent: December 27, 2022
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Wei Zhang, Xiao Tan, Hao Sun, Shilei Wen, Errui Ding
  • Publication number: 20220392205
    Abstract: Embodiments of the present disclosure provide a method and apparatus for training an image recognition model based on a semantic enhancement, a method and apparatus for recognizing an image, an electronic device, and a computer readable storage medium. The method for training an image recognition model based on a semantic enhancement comprises: extracting, from an inputted first image being unannotated and having no textual description, a first feature representation of the first image; calculating a first loss function based on the first feature representation; extracting, from an inputted second image being unannotated and having an original textual description, a second feature representation of the second image; calculating a second loss function based on the second feature representation, and training an image recognition model based on a fusion of the first loss function and the second loss function.
    Type: Application
    Filed: August 22, 2022
    Publication date: December 8, 2022
    Inventors: Yipeng SUN, Rongqiao AN, Xiang WEI, Longchao WANG, Kun YAO, Junyu HAN, Jingtuo LIU, Errui DING
  • Publication number: 20220392101
    Abstract: A training method, a method of detecting a target image, an electronic device and a medium, which relate to the field of artificial intelligence technology, and in particular to fields of computer vision and deep learning. The method can include: generating an expanded sample image set for a target scene by using a mask image set and an initial sample image set, wherein the mask image set is acquired by parsing a predetermined image set, a target object in the target scene is interfered by another object or the target object in the target scene is cut off, and an image in the predetermined image set includes the target object in the target scene or the another object; and training, by using the initial sample image set and the expanded sample image set, a detection model for detecting the target object.
    Type: Application
    Filed: August 15, 2022
    Publication date: December 8, 2022
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Zipeng LU, Jian WANG, Hao SUN, Errui DING