Patents by Inventor Huiyan Wang

Huiyan Wang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11619593
    Abstract: The present disclosure provides a method for detecting a defect of a film. The method includes obtaining a film image, determining one or more pieces of scratch information corresponding to the film image through processing the film image using a recognition model, the recognition model includes a convolution layer, a regression layer, and a classification layer, determining whether each piece of scratch information in the one or more pieces of scratch information meets a preset condition, each piece of scratch information includes position information, angle information, and size information, in response to a determination that each piece of scratch information meets the preset condition, adding one or more pieces of annotation information to the one or more pieces of scratch information that meets the preset condition, and generating prompt information based on the one or more pieces of annotation information.
    Type: Grant
    Filed: June 13, 2022
    Date of Patent: April 4, 2023
    Assignee: ZHEJIANG GONGSHANG UNIVERSITY
    Inventors: Huiyan Wang, Zeyuan Shao
  • Patent number: 11615523
    Abstract: The present disclosure provides a method for recognizing a small target based on a deep learning network. The method comprises: determining, based on a collected image, spot defect information through a recognition model including a first feature determination layer, a second feature determination layer, and a spot defect determination layer, determining, based on the collected image, a first feature map, determining, based on the first feature map, a second feature map by fusing with positional encoding, determining, based on the second feature map, a third feature map through the second feature determination layer, and obtaining, based on the third feature map, positional information of the spot defect through a first determination layer, and determining, based on the third feature map, classification information of the spot defect through a second determination layer.
    Type: Grant
    Filed: July 3, 2022
    Date of Patent: March 28, 2023
    Assignee: ZHEJIANG GONGSHANG UNIVERSITY
    Inventors: Huiyan Wang, Huan Jiang
  • Publication number: 20230055146
    Abstract: The present disclosure provides a method for recognizing a small target based on a deep learning network. The method comprises: determining, based on a collected image, spot defect information through a recognition model including a first feature determination layer, a second feature determination layer, and a spot defect determination layer, determining, based on the collected image, a first feature map, determining, based on the first feature map, a second feature map by fusing with positional encoding, determining, based on the second feature map, a third feature map through the second feature determination layer, and obtaining, based on the third feature map, positional information of the spot defect through a first determination layer, and determining, based on the third feature map, classification information of the spot defect through a second determination layer.
    Type: Application
    Filed: July 3, 2022
    Publication date: February 23, 2023
    Applicant: ZHEJIANG GONGSHANG UNIVERSITY
    Inventors: Huiyan WANG, Huan JIANG
  • Patent number: 11544969
    Abstract: An end-to-end multimodal gait recognition method based on deep learning includes: first extracting gait appearance features (color, texture and the like) through RGB video frames, and obtaining a mask by semantic segmentation of the RGB video frames; then extracting gait mask features (contour and the like) through the mask; and finally performing fusion and recognition on the two kinds of features. The method is configured for extracting gait appearance feature and mask feature by improving GaitSet, improving semantic segmentation speed on the premise of ensuring accuracy through simplified FCN, and fusing the gait appearance feature and the mask feature to obtain a more complete information representation.
    Type: Grant
    Filed: March 7, 2022
    Date of Patent: January 3, 2023
    Assignee: Zhejiang Gongshang University
    Inventors: Huiyan Wang, Huayang Li, Jun Luo, Zeyuan Shao
  • Publication number: 20220381699
    Abstract: The present disclosure provides a method for detecting a defect of a film. The method includes obtaining a film image, determining one or more pieces of scratch information corresponding to the film image through processing the film image using a recognition model, the recognition model includes a convolution layer, a regression layer, and a classification layer, determining whether each piece of scratch information in the one or more pieces of scratch information meets a preset condition, each piece of scratch information includes position information, angle information, and size information, in response to a determination that each piece of scratch information meets the preset condition, adding one or more pieces of annotation information to the one or more pieces of scratch information that meets the preset condition, and generating prompt information based on the one or more pieces of annotation information.
    Type: Application
    Filed: June 13, 2022
    Publication date: December 1, 2022
    Applicant: ZHEJIANG GONGSHANG UNIVERSITY
    Inventors: Huiyan WANG, Zeyuan SHAO
  • Publication number: 20220343686
    Abstract: An end-to-end multimodal gait recognition method based on deep learning includes: first extracting gait appearance features (color, texture and the like) through RGB video frames, and obtaining a mask by semantic segmentation of the RGB video frames; then extracting gait mask features (contour and the like) through the mask; and finally performing fusion and recognition on the two kinds of features. The method is configured for extracting gait appearance feature and mask feature by improving GaitSet, improving semantic segmentation speed on the premise of ensuring accuracy through simplified FCN, and fusing the gait appearance feature and the mask feature to obtain a more complete information representation.
    Type: Application
    Filed: March 7, 2022
    Publication date: October 27, 2022
    Applicant: Zhejiang Gongshang University
    Inventors: Huiyan WANG, Huayang LI, Jun LUO, Zeyuan SHAO
  • Patent number: 10542249
    Abstract: A stereoscopic video generation method based on 3D convolution neural network is disclosed, which is able to convert existing 2D video sources into stereoscopic videos. The method includes preparing the training data, dividing the training video sources into left eye view sequences and right eye view sequences; and then processing the left eye image sequences through shot segmentation via fuzzy C-means clustering method, calculating a mean image of all left eye images, subtracting the mean image from the left eye images, taking the right eye images as a training target; training the obtained 3D convolution neural network through the training data; processing the 2D video sources which need to be converted into stereoscopic videos in the same way as training set, inputting to the trained 3D convolution neural network to obtain the right eye view image sequences of the 2D videos; and finally combining the two to be stereoscopic videos.
    Type: Grant
    Filed: December 29, 2016
    Date of Patent: January 21, 2020
    Assignee: ZHEJIANG GONGSHANG UNIVERSITY
    Inventors: Xun Wang, Leqing Zhu, Huiyan Wang
  • Publication number: 20190379883
    Abstract: A stereoscopic video generation method based on 3D convolution neural network is disclosed, which is able to convert existing 2D video sources into stereoscopic videos. The method includes preparing the training data, dividing the training video sources into left eye view sequences and right eye view sequences; and then processing the left eye image sequences through shot segmentation via fuzzy C-means clustering method, calculating a mean image of all left eye images, subtracting the mean image from the left eye images, taking the right eye images as a training target; training the obtained 3D convolution neural network through the training data; processing the 2D video sources which need to be converted into stereoscopic videos in the same way as training set, inputting to the trained 3D convolution neural network to obtain the right eye view image sequences of the 2D videos; and finally combining the two to be stereoscopic videos.
    Type: Application
    Filed: December 29, 2016
    Publication date: December 12, 2019
    Inventors: Xun Wang, Leqing Zhu, Huiyan Wang
  • Patent number: 10353271
    Abstract: A depth estimation method for a monocular image based on a multi-scale CNN and a continuous CRF is disclosed in this invention. A CRF module is adopted to calculate a unary potential energy according to the output depth map of a DCNN, and the pairwise sparse potential energy according to input RGB images. MAP (maximum a posteriori estimation) algorithm is used to infer the optimized depth map at last. The present invention integrates optimization theories of the multi-scale CNN with that of the continuous CRF. High accuracy and a clear contour are both achieved in the estimated depth map; the depth estimated by the present invention has a high resolution and detailed contour information can be kept for all objects in the scene, which provides better visual effects.
    Type: Grant
    Filed: December 14, 2016
    Date of Patent: July 16, 2019
    Assignee: ZHEJIANG GONGSHANG UNIVERSITY
    Inventors: Xun Wang, Leqing Zhu, Huiyan Wang
  • Publication number: 20180231871
    Abstract: A depth estimation method for a monocular image based on a multi-scale CNN and a continuous CRF is disclosed in this invention. A CRF module is adopted to calculate a unary potential energy according to the output depth map of a DCNN, and the pairwise sparse potential energy according to input RGB images. MAP (maximum a posteriori estimation) algorithm is used to infer the optimized depth map at last. The present invention integrates optimization theories of the multi-scale CNN with that of the continuous CRF. High accuracy and a clear contour are both achieved in the estimated depth map; the depth estimated by the present invention has a high resolution and detailed contour information can be kept for all objects in the scene, which provides better visual effects.
    Type: Application
    Filed: December 14, 2016
    Publication date: August 16, 2018
    Inventors: Xun Wang, Leqing Zhu, Huiyan Wang