Patents by Inventor Errui DING

Errui DING has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210209731
    Abstract: Embodiments of the present disclosure provide a video processing method, a video processing device and a related non-transitory computer readable storage medium. The method includes the following. Frame sequence data of a low-resolution video to be converted is obtained. Pixel tensors of each frame in the frame sequence data are inputted into a pre-trained neural network model to obtain high-resolution video frame sequence data corresponding to the video to be converted output by the neural network model. The neural network model obtains the high-resolution video frame sequence data based on high-order pixel information of each frame in the frame sequence data.
    Type: Application
    Filed: September 17, 2020
    Publication date: July 8, 2021
    Inventors: Chao LI, Shilei WEN, Errui DING
  • Publication number: 20210201182
    Abstract: Embodiments of the present disclosure provide a method and apparatus for performing a structured extraction on a text, a device and a storage medium. The method may include: performing a text detection on an entity text image to obtain a position and content of a text line of the entity text image; extracting multivariate information of the text line based on the position and the content of the text line; performing a feature fusion on the multivariate information of the text line to obtain a multimodal fusion feature of the text line; performing category and relationship reasoning based on the multimodal fusion feature of the text line to obtain a category and a relationship probability matrix of the text line; and constructing structured information of the entity text image based on the category and the relationship probability matrix of the text line.
    Type: Application
    Filed: March 12, 2021
    Publication date: July 1, 2021
    Inventors: Yulin LI, Xiameng Qin, Chengquan Zhang, Junyu Han, Errui Ding, Tian Wu, Haifeng Wang
  • Publication number: 20210192214
    Abstract: The present application discloses a vehicle re-identification method and apparatus, a device and a storage medium, which relate to the fields of computer vision, intelligent search, deep learning and intelligent transportation. The specific implementation scheme is: receiving a re-identification request from a terminal device, the re-identification request including a first image of a first vehicle shot by a first camera and information of the first camera; acquiring a first feature of the first vehicle and a first head orientation of the first vehicle according to the first image; determining a second image of the first vehicle from images of multiple vehicles according to the first feature, multiple second features extracted based on the images of the multiple vehicles in an image database, the first head orientation of the first vehicle, and the information of the first camera; and transmitting the second image to the terminal device.
    Type: Application
    Filed: February 1, 2021
    Publication date: June 24, 2021
    Inventors: Minyue JIANG, Xiao TAN, Hao SUN, Hongwu ZHANG, Shilei WEN, Errui DING
  • Publication number: 20210192194
    Abstract: The present application discloses a video-based human behavior recognition method, apparatus, device and storage medium, and relates to the technical field of human recognition. The specific implementation scheme lies in: acquiring a human rectangle of each video frame of the video to be recognized, where each human rectangle includes a plurality of human key points, and each of the human key points has a key point feature; constructing a feature matrix according to the human rectangle of each video frame; convolving the feature matrix with respect to a video frame quantity dimension to obtain a first convolution result and convolving the feature matrix with respect to a key point quantity dimension to obtain a second convolution result; and inputting the first convolution result and the second convolution result into a preset classification model to obtain a human behavior category of the video to be recognized.
    Type: Application
    Filed: September 16, 2020
    Publication date: June 24, 2021
    Inventors: Zhizhen Chi, Fu Li, Hao Sun, Dongliang He, Xiang Long, Zhichao Zhou, Ping Wang, Shilei Wen, Errui Ding
  • Patent number: 11043000
    Abstract: Provided are a measuring method and apparatus for a damaged part of a vehicle. The method includes: acquiring an image to be processed of a vehicle; acquiring the damaged part of the vehicle in the image to be processed according to the image to be processed; acquiring first position information of key points in the image to be processed according to the image to be processed; determining a transformation relation between the image to be processed and a first fitting plane according to the key points included in the image to be processed and the first position information, where the first fitting plane is a fitting plane determined according to the key points included in the image to be processed on the 3D model; acquiring a projection area of the damaged part in the first fitting plane according to the transformation relation; and measuring the projection area to acquire a measuring result.
    Type: Grant
    Filed: September 10, 2019
    Date of Patent: June 22, 2021
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY CO., LTD.
    Inventors: Yongfeng Zhong, Xiao Tan, Feng Zhou, Hao Sun, Errui Ding
  • Publication number: 20210174537
    Abstract: Embodiments of the present disclosure provide a method and apparatus for detecting a target object in an image. The method includes: performing following prediction operations using a pre-trained neural network: detecting a target object in a two-dimensional image to determine a two-dimensional bounding box of the target object; and determining a relative position constraint relationship between the two-dimensional bounding box of the target object and a three-dimensional projection bounding box obtained by projecting a three-dimensional bounding box of the target object into the two-dimensional image; and the method further including: determining the three-dimensional projection bounding box of the target object, based on the two-dimensional bounding box of the target object and the relative position constraint relationship between the two-dimensional bounding box of the target object and the three-dimensional projection bounding box.
    Type: Application
    Filed: June 5, 2020
    Publication date: June 10, 2021
    Inventors: Xiaoqing Ye, Xiao Tan, Wei Zhang, Hao Sun, Errui Ding
  • Publication number: 20210110168
    Abstract: Embodiments of the present disclosure provide an object tracking method and an apparatus. The method includes: obtaining multiple frames of first images shot by a first camera apparatus and a first shooting moment of each frame of the first images, where the first images include a first object; obtaining multiple frames of second images shot by a second camera apparatus and a second shooting moment of each frame of the second images, where the second images include a second object; obtaining a distance between the first camera apparatus and the second camera apparatus; and judging whether the first object and the second object are the same object according to the multiple frames of the first images, the first shooting moment of each frame of the first images, the multiple frames of the second images, the second shooting moment of each frame of the second images and the distance.
    Type: Application
    Filed: May 7, 2020
    Publication date: April 15, 2021
    Inventors: XIPENG YANG, XIAO TAN, HAO SUN, SHILEI WEN, ERRUI DING
  • Patent number: 10970528
    Abstract: A method for human motion analysis, an apparatus for human motion analysis, a device, and a storage medium. The method includes: acquiring image information captured by a number of photographing devices, where at least one of the number of photographing devices is disposed above a shelf; performing human tracking according to the image information captured by the number of photographing devices, and determining position information in space of at least one human body and identification information of the at least one human body; acquiring, according to the position information in space of a target human body of the at least one human body, a target image captured by the photographing device above a shelf corresponding to the position information; and recognizing an action of the target human body according to the target image and detection data of a non-visual sensor corresponding to the position information.
    Type: Grant
    Filed: July 2, 2019
    Date of Patent: April 6, 2021
    Inventors: Jian Wang, Xubin Li, Le Kang, Zeyu Liu, Zhizhen Chi, Chengyue Zhang, Xiao Liu, Hao Sun, Shilei Wen, Yingze Bao, Mingyu Chen, Errui Ding
  • Patent number: 10963693
    Abstract: A method and apparatus for training a character detector based on weak supervision, a character detection system and a computer readable storage medium are provided, wherein the method includes: inputting coarse-grained annotation information of a to-be-processed object, wherein the coarse-grained annotation information includes a whole bounding outline of a word, text bar or line of the to-be-processed object; dividing the whole bounding outline of the coarse-grained annotation information, to obtain a coarse bounding box of a character of the to-be-processed object; obtaining a predicted bounding box of the character of the to-be-processed object through a neural network model from the coarse-grained annotation information; and determining a fine bounding box of the character of the to-be-processed object as character-based annotation of the to-be-processed object, according to the coarse bounding box and the predicted bounding box.
    Type: Grant
    Filed: April 21, 2020
    Date of Patent: March 30, 2021
    Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
    Inventors: Chengquan Zhang, Jiaming Liu, Junyu Han, Errui Ding
  • Publication number: 20210064919
    Abstract: Embodiments of the present disclosure disclose a method and apparatus for processing an image. A specific embodiment of the method includes: acquiring a feature map of a target image, where the target image contains a target object; determining a local feature map of a target size in the feature map; combining features of different channels in the local feature map to obtain a local texture feature map; and obtaining location information of the target object based on the local texture feature map.
    Type: Application
    Filed: May 7, 2020
    Publication date: March 4, 2021
    Inventors: Wei Zhang, Xiao Tan, Hao Sun, Shilei Wen, Errui Ding
  • Patent number: 10922804
    Abstract: The present disclosure provides a method and apparatus for evaluating image definition, a computer device and a storage medium, wherein the method comprises: obtaining an image to be processed; inputting the image to be processed to a pre-trained evaluation model; and obtaining a comprehensive image definition score outputted by the evaluation model, the comprehensive image definition score being obtained by the evaluation model by obtaining N image definition scores based on N different scales respectively, and then integrating the N image definition scores, N being a positive integer greater than one. The solution of the present invention can be applied to improve the accuracy of the evaluation result.
    Type: Grant
    Filed: December 5, 2018
    Date of Patent: February 16, 2021
    Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
    Inventors: Xiang Zhao, Xin Li, Xiao Liu, Xubin Li, Hao Sun, Shilei Wen, Errui Ding
  • Patent number: 10915980
    Abstract: Embodiments of the present disclosure disclose a method and apparatus for adding a digital watermark in a video. A specific embodiment of the method includes: performing target detection on each frame of image in a target video; determining, based on a target detection result, whether the target video includes at least one carrier, the carrier referring to an object for adding a digital watermark in each frame of image; and determining a target carrier from the at least one carrier, and adding the digital watermark to an area of the target carrier in each frame of image including the target carrier in the target video, in response to determining the target video including the at least one carrier.
    Type: Grant
    Filed: September 18, 2018
    Date of Patent: February 9, 2021
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Xubin Li, Errui Ding, Shilei Wen, Xiao Liu, Hao Sun
  • Publication number: 20210019531
    Abstract: A method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modality respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.
    Type: Application
    Filed: March 26, 2020
    Publication date: January 21, 2021
    Inventors: Xiang Long, Dongliang He, Fu Li, Zhizhen Chi, Zhichao Zhou, Xiang Zhao, Ping Wang, Hao Sun, Shilei Wen, Errui Ding
  • Publication number: 20210004629
    Abstract: The present disclosure proposes an end-to-end text recognition method and apparatus, computer device and readable medium. The method comprises: obtaining a to-be-recognized picture containing a text region; recognizing a position of the text region in the to-be-recognized picture and text content included in the text region with a pre-trained end-to-end text recognition model; the end-to-end text recognition model comprising a region of interest perspective transformation processing module for performing perspective transformation processing for the text region. The technical solution of the present disclosure does not need to serially arrange a plurality of steps, and may avoid introducing the accumulated errors and may effectively improve the accuracy of the text recognition.
    Type: Application
    Filed: March 18, 2020
    Publication date: January 7, 2021
    Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Yipeng SUN, Chengquan ZHANG, Zuming HUANG, Jiaming LIU, Junyu HAN, Errui DING
  • Patent number: 10861133
    Abstract: A super-resolution video reconstruction method, device, apparatus and a computer-readable storage medium are provided. The method includes: extracting a hypergraph from consecutive frames of an original video; inputting a hypergraph vector of the hypergraph into a residual convolutional neural network to obtain an output result of the residual convolutional neural network; and inputting the output result of the residual convolutional neural network into a spatial upsampling network to obtain a super-resolution frame, wherein a super-resolution video of the original video is formed by multiple super-resolution frames.
    Type: Grant
    Filed: March 6, 2020
    Date of Patent: December 8, 2020
    Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.
    Inventors: Chao Li, Dongliang He, Xiao Liu, Yukang Ding, Shilei Wen, Errui Ding, Henan Zhang, Hao Sun
  • Publication number: 20200372609
    Abstract: A super-resolution video reconstruction method, device, apparatus and a computer-readable storage medium are provided. The method includes: extracting a hypergraph from consecutive frames of an original video; inputting a hypergraph vector of the hypergraph into a residual convolutional neural network to obtain an output result of the residual convolutional neural network; and inputting the output result of the residual convolutional neural network into a spatial upsampling network to obtain a super-resolution frame, wherein a super-resolution video of the original video is formed by multiple super-resolution frames.
    Type: Application
    Filed: March 6, 2020
    Publication date: November 26, 2020
    Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.
    Inventors: Chao LI, Dongliang HE, Xiao LIU, Yukang DING, Shilei WEN, Errui DING, Henan ZHANG, Hao SUN
  • Publication number: 20200357196
    Abstract: A method and apparatus for vehicle damage assessment, an electronic device, and a computer-readable storage medium are provided. The method may include: extracting, from an input image, a first feature characterizing a part of a vehicle and a second feature characterizing a damage type of the vehicle; integrating the first feature and the second feature to generate a third feature characterizing a corresponding relation between the part and the damage type; converting the third feature into a characteristic vector; and determining a damage recognition result based on the characteristic vector. According to the technical solution of the disclosure, users can rapidly and accurately learn about the damage condition of the vehicle by providing pictures or videos of the damaged vehicle, thus providing an objective basis for subsequent damage assessment, claim settlement, and repair.
    Type: Application
    Filed: December 11, 2019
    Publication date: November 12, 2020
    Inventors: Wei ZHANG, Xiao TAN, Hao SUN, Shilei WEN, Errui DING
  • Publication number: 20200349349
    Abstract: The present disclosure provides a human body recognition method and apparatus, and a storage medium, the method comprising: determining a coordinate of a target person in a three-dimensional space according to images containing the target person collected by at least two cameras; calculating back-projection errors of the target person under different cameras respectively according to the coordinate of the target person in the three-dimensional space; determining whether the cameras have a human body recognition error according to the back-projection errors of the cameras; and, when a camera has the human body recognition error, performing re-recognition of the target person under the camera by using person re-identification (ReID), until the back-projection errors of all the cameras containing the target person are not greater than a preset threshold. The present disclosure can improve accuracy of the human body recognition result effectively.
    Type: Application
    Filed: July 21, 2020
    Publication date: November 5, 2020
    Applicant: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Jian WANG, Xubin LI, Le KANG, Zeyu LIU, Zhizhen CHI, Chengyue ZHANG, Xiao LIU, Hao SUN, Shilei WEN, Yingze BAO, Mingyu CHEN, Errui DING
  • Publication number: 20200342271
    Abstract: The present disclosure provides a pedestrian re-identification method and apparatus, computer device and readable medium. The method comprises: collecting a target image and a to-be-identified image including a pedestrian image; obtaining a feature expression of the target image and a feature expression of the to-be-identified image respectively, based on a pre-trained feature extraction model; wherein the feature extraction model is obtained by training based on a self-attention feature of a base image as well as a co-attention feature of the base image relative to a reference image; identifying whether a pedestrian in the to-be-identified image is the same pedestrian as that in the target image according to the feature expression of the target image and the feature expression of the to-be-identified image.
    Type: Application
    Filed: March 12, 2020
    Publication date: October 29, 2020
    Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Zhigang WANG, Jian WANG, Shilei WEN, Errui DING, Hao SUN
  • Publication number: 20200327384
    Abstract: Embodiments of the present disclosure provide a method and apparatus for detecting text regions in an image, a device, and a medium. The method may include: detecting, based on feature representation of an image, a first text region in the image, where the first text region covers a text in the image, a region occupied by the text being of a certain shape; determining, based on a feature block of the first text region, text geometry information associated with the text, where the text geometry information includes a text centerline of the text and distance information of the centerline from the upper and lower borders of the text; and adjusting, based on the text geometry information associated with the text, the first text region to a second text region, where the second text region also covers the text and is smaller than the first text region.
    Type: Application
    Filed: December 11, 2019
    Publication date: October 15, 2020
    Inventors: Chengquan Zhang, Zuming Huang, Mengyi En, Junyu Han, Errui Ding
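
The last abstract above (publication 20200327384) describes adjusting a coarse text region to a tighter one using a text centerline and its distances to the upper and lower text borders. The following is only an illustrative sketch of that geometric step, not the method claimed in the application: the function names `region_from_centerline` and `polygon_area` are hypothetical, the centerline is assumed to be given as sampled points in image coordinates (y grows downward), and the borders are assumed to be offset vertically rather than along the local normal.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def region_from_centerline(
    centerline: List[Point], up: List[float], down: List[float]
) -> List[Point]:
    """Build a closed polygon around a sampled text centerline.

    Assumption: each centerline point is shifted straight up by up[i]
    and straight down by down[i] (image coordinates, y grows downward).
    """
    if not (len(centerline) == len(up) == len(down)):
        raise ValueError("centerline, up and down must have the same length")
    upper = [(x, y - u) for (x, y), u in zip(centerline, up)]
    lower = [(x, y + d) for (x, y), d in zip(centerline, down)]
    # Upper border left to right, then lower border right to left.
    return upper + lower[::-1]


def polygon_area(poly: List[Point]) -> float:
    """Area of a simple polygon via the shoelace formula."""
    total = 0.0
    for i in range(len(poly)):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % len(poly)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


# Hypothetical input: a coarse 100 x 40 box and a slightly curved centerline.
first_region = [(0.0, 0.0), (100.0, 0.0), (100.0, 40.0), (0.0, 40.0)]
centerline = [(0.0, 20.0), (50.0, 22.0), (100.0, 20.0)]
second_region = region_from_centerline(centerline, [8.0] * 3, [8.0] * 3)

print(polygon_area(first_region), polygon_area(second_region))
```

In this made-up example the second region still covers the text band around the centerline while occupying less area than the first region, which is the relationship the abstract states between the two regions.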