Patents by Inventor Xiaocong Cai

Xiaocong Cai has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11481574
    Abstract: The present disclosure relates to an image processing method and device, an electronic apparatus and a storage medium. The method comprises: performing feature extraction on an image to be processed to obtain a first feature map of the image to be processed; splitting the first feature map into a plurality of first sub-feature maps according to dimension information of the first feature map and a preset splitting rule, wherein the dimension information of the first feature map comprises dimensions of the first feature map and size of each dimension; performing normalization on the plurality of first sub-feature maps respectively to obtain a plurality of second sub-feature maps; and splicing the plurality of second sub-feature maps to obtain a second feature map of the image to be processed. Embodiments of the present disclosure can reduce the statistical errors during normalization of a complete feature map.
    Type: Grant
    Filed: August 25, 2020
    Date of Patent: October 25, 2022
    Assignee: Beijing Sensetime Technology Development Co., Ltd.
    Inventors: Kunlin Yang, Jun Hou, Xiaocong Cai, Shuai Yi
  • Patent number: 11308351
    Abstract: The present disclosure relates to a method and apparatus for recognizing a sequence in an image, an electronic device, and a storage medium. The method includes: performing feature extraction on a to-be-processed image to obtain a first feature map of the to-be-processed image, where the to-be-processed image includes a sequence formed by stacking at least one object along a stacking direction; determining a region feature of each segmented region in the to-be-processed image based on the first feature map, where all segmented regions are obtained by dividing the to-be-processed image into k regions along the stacking direction, k is a set number of objects stacked along the stacking direction, and k is an integer greater than 1; and determining a category of each object in the sequence based on the region feature of each segmented region. Embodiments of the present disclosure may implement recognition of stacked objects in a sequence.
    Type: Grant
    Filed: April 14, 2020
    Date of Patent: April 19, 2022
    Assignee: SENSETIME INTERNATIONAL PTE. LTD.
    Inventors: Wenyang Hu, Xiaocong Cai, Jun Hou, Shuai Yi
  • Publication number: 20210097278
    Abstract: The present disclosure relates to a method and apparatus for recognizing stacked objects, an electronic device, and a storage medium. The method for recognizing stacked objects includes: obtaining a to-be-recognized image, wherein the to-be-recognized image includes a sequence formed by stacking at least one object along a stacking direction; performing feature extraction on the to-be-recognized image to obtain a feature map of the to-be-recognized image; and recognizing a category of the at least one object in the sequence according to the feature map. The embodiments of the present disclosure may implement accurate recognition of the category of stacked objects.
    Type: Application
    Filed: June 15, 2020
    Publication date: April 1, 2021
    Inventors: Yuan LIU, Jun HOU, Xiaocong CAI, Shuai YI
  • Publication number: 20210073578
    Abstract: The present disclosure relates to a method and apparatus for recognizing a sequence in an image, an electronic device, and a storage medium. The method includes: performing feature extraction on a to-be-processed image to obtain a first feature map of the to-be-processed image, where the to-be-processed image includes a sequence formed by stacking at least one object along a stacking direction; determining a region feature of each segmented region in the to-be-processed image based on the first feature map, where all segmented regions are obtained by dividing the to-be-processed image into k regions along the stacking direction, k is a set number of objects stacked along the stacking direction, and k is an integer greater than 1; and determining a category of each object in the sequence based on the region feature of each segmented region. Embodiments of the present disclosure may implement recognition of stacked objects in a sequence.
    Type: Application
    Filed: April 14, 2020
    Publication date: March 11, 2021
    Inventors: Wenyang HU, Xiaocong CAI, Jun HOU, Shuai YI
  • Publication number: 20210019560
    Abstract: The present disclosure relates to an image processing method and device, an electronic apparatus and a storage medium. The method comprises: performing feature extraction on an image to be processed to obtain a first feature map of the image to be processed; splitting the first feature map into a plurality of first sub-feature maps according to dimension information of the first feature map and a preset splitting rule, wherein the dimension information of the first feature map comprises dimensions of the first feature map and size of each dimension; performing normalization on the plurality of first sub-feature maps respectively to obtain a plurality of second sub-feature maps; and splicing the plurality of second sub-feature maps to obtain a second feature map of the image to be processed. Embodiments of the present disclosure can reduce the statistical errors during normalization of a complete feature map.
    Type: Application
    Filed: August 25, 2020
    Publication date: January 21, 2021
    Inventors: Kunlin Yang, Jun Hou, Xiaocong Cai, Shuai Yi
  • Publication number: 20210019562
    Abstract: The present disclosure relates to an image processing method and device, an electronic apparatus and a storage medium, the method comprising: performing, by a feature extraction network, feature extraction on an image to be processed to obtain a first feature map of the image to be processed; performing, by an M-level encoding network, scale-down and multi-scale fusion processing on the first feature map to obtain a plurality of feature maps which are encoded, each of the plurality of feature maps having a different scale; and performing, by an N-level decoding network, scale-up and multi-scale fusion processing on the plurality of feature maps which are encoded to obtain a prediction result of the image to be processed. Embodiments of the present disclosure are capable of improving the quality and robustness of the prediction result.
    Type: Application
    Filed: August 25, 2020
    Publication date: January 21, 2021
    Inventors: Kunlin YANG, Kun YAN, Jun HOU, Xiaocong CAI, Shual YI