Patents by Inventor Sihan WEN

Sihan WEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11468602
    Abstract: Embodiments of this disclosure provide an image encoding method and apparatus and image decoding method and apparatus. The image encoding includes performing convolutional neural network (CNN) encoding on image data to generate feature vectors or feature maps; quantizing the feature vectors or feature maps to generate discrete symbols to be encoded; and estimating probabilities of the symbols to be encoded by using a multi-scale context model including multiple mask convolution layers of different scales. An entropy encoding of the image data is performed according to the probabilities of the symbols to be encoded.
    Type: Grant
    Filed: January 23, 2020
    Date of Patent: October 11, 2022
    Assignee: FUJITSU LIMITED
    Inventors: Jing Zhou, Sihan Wen, Zhiming Tan
  • Patent number: 11386583
    Abstract: Embodiments of this disclosure provide an image coding apparatus, a probability model generating apparatus and an image decoding apparatus. A processor is to perform feature extraction on an input image to obtain first feature maps of N channels; to perform feature extraction on the input image with a size of the input image being adjusted K times, to respectively obtain second feature maps of N channels; and to concatenate the first feature maps of the K×N channels with the second feature maps of K×N channels to output a concatenated feature maps of channels. Hence, features of images may be accurately extracted and more competitive latent representations may be obtained.
    Type: Grant
    Filed: May 15, 2020
    Date of Patent: July 12, 2022
    Assignee: FUJITSU LIMITED
    Inventors: Sihan Wen, Jing Zhou, Zhiming Tan
  • Patent number: 11375152
    Abstract: Embodiments of this disclosure provide a video frame interpolation apparatus and method. The method includes calculating a bidirectional optical flow between a first frame and a second frame; an performing kernel and weight estimation according to the first frame and the second frame. An adaptive local convolutional kernel is generated by using a convolutional layer and a weight coefficient is generated by using another convolutional layer. A conversion on the first frame and the second frame is performed by using an adaptive conversion layer according to the bidirectional optical flow, the weight coefficient and the adaptive local convolutional kernel, so as to generate a conversion frame. A frame synthesis on the first frame, the second frame and the conversion frame is performed to generate an interpolation frame between the first frame and the second frame.
    Type: Grant
    Filed: April 9, 2021
    Date of Patent: June 28, 2022
    Assignee: FUJITSU LIMITED
    Inventors: Sihan Wen, Jing Zhou, Zhiming Tan
  • Patent number: 11330264
    Abstract: Embodiments of this disclosure provide a training method, an image encoding method, an image decoding method and apparatuses thereof. The image encoding apparatus includes: an image encoder configured to encode input image data to obtain a latent variable; a quantizer configured to perform quantizing processing on the latent variable according to a quantization step to generate a quantized latent variable; and an entropy encoder configured to perform entropy coding on the quantized latent variable by using an entropy model to form a bit stream.
    Type: Grant
    Filed: February 23, 2021
    Date of Patent: May 10, 2022
    Assignee: FUJITSU LIMITED
    Inventors: Jing Zhou, Akira Nakagawa, Sihan Wen, Zhiming Tan
  • Patent number: 11257252
    Abstract: Embodiments of this disclosure provide an image coding method and apparatus and an image compression system. The image coding apparatus includes a memory and a processor. The processor is configured to perform feature extraction on an input image to obtain feature maps of N channels; assign a weight to a feature map of each channel among the N channels; perform down-dimension processing on weighted feature maps processed in association with the N channels, to obtain feature maps of M channels and output the feature maps of M channels, M being smaller than N. Hence, by multiplying different feature maps by a weight to obtain corresponding importance and then performing down-dimension processing on the feature maps processed according to the weighting, time for decoding may be reduced.
    Type: Grant
    Filed: May 14, 2020
    Date of Patent: February 22, 2022
    Assignee: FUJITSU LIMITED
    Inventors: Sihan Wen, Jing Zhou, Zhiming Tan
  • Publication number: 20210368131
    Abstract: Embodiments of this disclosure provide a video frame interpolation apparatus and method. The method includes calculating a bidirectional optical flow between a first frame and a second frame; an performing kernel and weight estimation according to the first frame and the second frame. An adaptive local convolutional kernel is generated by using a convolutional layer and a weight coefficient is generated by using another convolutional layer. A conversion on the first frame and the second frame is performed by using an adaptive conversion layer according to the bidirectional optical flow, the weight coefficient and the adaptive local convolutional kernel, so as to generate a conversion frame. A frame synthesis on the first frame, the second frame and the conversion frame is performed to generate an interpolation frame between the first frame and the second frame.
    Type: Application
    Filed: April 9, 2021
    Publication date: November 25, 2021
    Applicant: FUJITSU LIMITED
    Inventors: Sihan WEN, Jing ZHOU, Zhiming TAN
  • Patent number: 11184615
    Abstract: Embodiments of this disclosure provide an image coding method and apparatus and an image decoding method and apparatus. The image coding method includes: performing feature extraction on to-be-processed image data by using a convolutional neural network, to generate feature maps of the image data; quantizing the feature maps to generate discrete feature maps; preprocessing the discrete feature maps to generate preprocessed data, an amount of data of the preprocessed data being less than an amount of data of the discrete feature maps; calculating probabilities of to-be-coded data in the discrete feature maps according to the preprocessed data; and performing entropy coding on the to-be-coded data according to the probabilities of the to-be-coded data.
    Type: Grant
    Filed: April 22, 2020
    Date of Patent: November 23, 2021
    Assignee: FUJITSU LIMITED
    Inventors: Jing Zhou, Akira Nakagawa, Sihan Wen, Zhiming Tan
  • Publication number: 20210297667
    Abstract: Embodiments of this disclosure provide a training method, an image encoding method, an image decoding method and apparatuses thereof. The image encoding apparatus includes: an image encoder configured to encode input image data to obtain a latent variable; a quantizer configured to perform quantizing processing on the latent variable according to a quantization step to generate a quantized latent variable; and an entropy encoder configured to perform entropy coding on the quantized latent variable by using an entropy model to form a bit stream.
    Type: Application
    Filed: February 23, 2021
    Publication date: September 23, 2021
    Applicant: Fujitsu Limited
    Inventors: Jing ZHOU, Akira NAKAGAWA, Sihan WEN, Zhiming TAN
  • Publication number: 20200372686
    Abstract: Embodiments of this disclosure provide an image coding apparatus, a probability model generating apparatus and an image decoding apparatus. A processor is to perform feature extraction on an input image to obtain first feature maps of N channels; to perform feature extraction on the input image with a size of the input image being adjusted K times, to respectively obtain second feature maps of N channels; and to concatenate the first feature maps of the K×N channels with the second feature maps of K×N channels to output a concatenated feature maps of channels. Hence, features of images may be accurately extracted and more competitive latent representations may be obtained.
    Type: Application
    Filed: May 15, 2020
    Publication date: November 26, 2020
    Applicant: FUJITSU LIMITED
    Inventors: Sihan WEN, Jing ZHOU, Zhiming TAN
  • Publication number: 20200374522
    Abstract: Embodiments of this disclosure provide an image coding method and apparatus and an image decoding method and apparatus. The image coding method includes: performing feature extraction on to-be-processed image data by using a convolutional neural network, to generate feature maps of the image data; quantizing the feature maps to generate discrete feature maps; preprocessing the discrete feature maps to generate preprocessed data, an amount of data of the preprocessed data being less than an amount of data of the discrete feature maps; calculating probabilities of to-be-coded data in the discrete feature maps according to the preprocessed data; and performing entropy coding on the to-be-coded data according to the probabilities of the to-be-coded data.
    Type: Application
    Filed: April 22, 2020
    Publication date: November 26, 2020
    Applicant: Fujitsu Limited
    Inventors: Jing Zhou, Akira Nakagawa, Sihan Wen, Zhiming Tan
  • Publication number: 20200372684
    Abstract: Embodiments of this disclosure provide an image coding method and apparatus and an image compression system. The image coding apparatus includes a memory and a processor. The processor is configured to perform feature extraction on an input image to obtain feature maps of N channels; assign a weight to a feature map of each channel among the N channels; perform down-dimension processing on weighted feature maps processed in association with the N channels, to obtain feature maps of M channels and output the feature maps of M channels, M being smaller than N. Hence, by multiplying different feature maps by a weight to obtain corresponding importance and then performing down-dimension processing on the feature maps processed according to the weighting, time for decoding may be reduced.
    Type: Application
    Filed: May 14, 2020
    Publication date: November 26, 2020
    Applicant: FUJITSU LIMITED
    Inventors: Sihan Wen, Jing Zhou, Zhiming Tan
  • Publication number: 20200327701
    Abstract: Embodiments of this disclosure provide an image encoding method and apparatus and image decoding method and apparatus. The image encoding includes performing convolutional neural network (CNN) encoding on image data to generate feature vectors or feature maps; quantizing the feature vectors or feature maps to generate discrete symbols to be encoded; and estimating probabilities of the symbols to be encoded by using a multi-scale context model including multiple mask convolution layers of different scales. An entropy encoding of the image data is performed according to the probabilities of the symbols to be encoded.
    Type: Application
    Filed: January 23, 2020
    Publication date: October 15, 2020
    Applicant: FUJITSU LIMITED
    Inventors: Jing ZHOU, Sihan WEN, Zhiming TAN