Patents by Inventor Sihan WEN
Sihan WEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11468602Abstract: Embodiments of this disclosure provide an image encoding method and apparatus and image decoding method and apparatus. The image encoding includes performing convolutional neural network (CNN) encoding on image data to generate feature vectors or feature maps; quantizing the feature vectors or feature maps to generate discrete symbols to be encoded; and estimating probabilities of the symbols to be encoded by using a multi-scale context model including multiple mask convolution layers of different scales. An entropy encoding of the image data is performed according to the probabilities of the symbols to be encoded.Type: GrantFiled: January 23, 2020Date of Patent: October 11, 2022Assignee: FUJITSU LIMITEDInventors: Jing Zhou, Sihan Wen, Zhiming Tan
-
Patent number: 11386583Abstract: Embodiments of this disclosure provide an image coding apparatus, a probability model generating apparatus and an image decoding apparatus. A processor is to perform feature extraction on an input image to obtain first feature maps of N channels; to perform feature extraction on the input image with a size of the input image being adjusted K times, to respectively obtain second feature maps of N channels; and to concatenate the first feature maps of the K×N channels with the second feature maps of K×N channels to output a concatenated feature maps of channels. Hence, features of images may be accurately extracted and more competitive latent representations may be obtained.Type: GrantFiled: May 15, 2020Date of Patent: July 12, 2022Assignee: FUJITSU LIMITEDInventors: Sihan Wen, Jing Zhou, Zhiming Tan
-
Patent number: 11375152Abstract: Embodiments of this disclosure provide a video frame interpolation apparatus and method. The method includes calculating a bidirectional optical flow between a first frame and a second frame; an performing kernel and weight estimation according to the first frame and the second frame. An adaptive local convolutional kernel is generated by using a convolutional layer and a weight coefficient is generated by using another convolutional layer. A conversion on the first frame and the second frame is performed by using an adaptive conversion layer according to the bidirectional optical flow, the weight coefficient and the adaptive local convolutional kernel, so as to generate a conversion frame. A frame synthesis on the first frame, the second frame and the conversion frame is performed to generate an interpolation frame between the first frame and the second frame.Type: GrantFiled: April 9, 2021Date of Patent: June 28, 2022Assignee: FUJITSU LIMITEDInventors: Sihan Wen, Jing Zhou, Zhiming Tan
-
Patent number: 11330264Abstract: Embodiments of this disclosure provide a training method, an image encoding method, an image decoding method and apparatuses thereof. The image encoding apparatus includes: an image encoder configured to encode input image data to obtain a latent variable; a quantizer configured to perform quantizing processing on the latent variable according to a quantization step to generate a quantized latent variable; and an entropy encoder configured to perform entropy coding on the quantized latent variable by using an entropy model to form a bit stream.Type: GrantFiled: February 23, 2021Date of Patent: May 10, 2022Assignee: FUJITSU LIMITEDInventors: Jing Zhou, Akira Nakagawa, Sihan Wen, Zhiming Tan
-
Patent number: 11257252Abstract: Embodiments of this disclosure provide an image coding method and apparatus and an image compression system. The image coding apparatus includes a memory and a processor. The processor is configured to perform feature extraction on an input image to obtain feature maps of N channels; assign a weight to a feature map of each channel among the N channels; perform down-dimension processing on weighted feature maps processed in association with the N channels, to obtain feature maps of M channels and output the feature maps of M channels, M being smaller than N. Hence, by multiplying different feature maps by a weight to obtain corresponding importance and then performing down-dimension processing on the feature maps processed according to the weighting, time for decoding may be reduced.Type: GrantFiled: May 14, 2020Date of Patent: February 22, 2022Assignee: FUJITSU LIMITEDInventors: Sihan Wen, Jing Zhou, Zhiming Tan
-
Publication number: 20210368131Abstract: Embodiments of this disclosure provide a video frame interpolation apparatus and method. The method includes calculating a bidirectional optical flow between a first frame and a second frame; an performing kernel and weight estimation according to the first frame and the second frame. An adaptive local convolutional kernel is generated by using a convolutional layer and a weight coefficient is generated by using another convolutional layer. A conversion on the first frame and the second frame is performed by using an adaptive conversion layer according to the bidirectional optical flow, the weight coefficient and the adaptive local convolutional kernel, so as to generate a conversion frame. A frame synthesis on the first frame, the second frame and the conversion frame is performed to generate an interpolation frame between the first frame and the second frame.Type: ApplicationFiled: April 9, 2021Publication date: November 25, 2021Applicant: FUJITSU LIMITEDInventors: Sihan WEN, Jing ZHOU, Zhiming TAN
-
Patent number: 11184615Abstract: Embodiments of this disclosure provide an image coding method and apparatus and an image decoding method and apparatus. The image coding method includes: performing feature extraction on to-be-processed image data by using a convolutional neural network, to generate feature maps of the image data; quantizing the feature maps to generate discrete feature maps; preprocessing the discrete feature maps to generate preprocessed data, an amount of data of the preprocessed data being less than an amount of data of the discrete feature maps; calculating probabilities of to-be-coded data in the discrete feature maps according to the preprocessed data; and performing entropy coding on the to-be-coded data according to the probabilities of the to-be-coded data.Type: GrantFiled: April 22, 2020Date of Patent: November 23, 2021Assignee: FUJITSU LIMITEDInventors: Jing Zhou, Akira Nakagawa, Sihan Wen, Zhiming Tan
-
Publication number: 20210297667Abstract: Embodiments of this disclosure provide a training method, an image encoding method, an image decoding method and apparatuses thereof. The image encoding apparatus includes: an image encoder configured to encode input image data to obtain a latent variable; a quantizer configured to perform quantizing processing on the latent variable according to a quantization step to generate a quantized latent variable; and an entropy encoder configured to perform entropy coding on the quantized latent variable by using an entropy model to form a bit stream.Type: ApplicationFiled: February 23, 2021Publication date: September 23, 2021Applicant: Fujitsu LimitedInventors: Jing ZHOU, Akira NAKAGAWA, Sihan WEN, Zhiming TAN
-
Publication number: 20200372686Abstract: Embodiments of this disclosure provide an image coding apparatus, a probability model generating apparatus and an image decoding apparatus. A processor is to perform feature extraction on an input image to obtain first feature maps of N channels; to perform feature extraction on the input image with a size of the input image being adjusted K times, to respectively obtain second feature maps of N channels; and to concatenate the first feature maps of the K×N channels with the second feature maps of K×N channels to output a concatenated feature maps of channels. Hence, features of images may be accurately extracted and more competitive latent representations may be obtained.Type: ApplicationFiled: May 15, 2020Publication date: November 26, 2020Applicant: FUJITSU LIMITEDInventors: Sihan WEN, Jing ZHOU, Zhiming TAN
-
Publication number: 20200374522Abstract: Embodiments of this disclosure provide an image coding method and apparatus and an image decoding method and apparatus. The image coding method includes: performing feature extraction on to-be-processed image data by using a convolutional neural network, to generate feature maps of the image data; quantizing the feature maps to generate discrete feature maps; preprocessing the discrete feature maps to generate preprocessed data, an amount of data of the preprocessed data being less than an amount of data of the discrete feature maps; calculating probabilities of to-be-coded data in the discrete feature maps according to the preprocessed data; and performing entropy coding on the to-be-coded data according to the probabilities of the to-be-coded data.Type: ApplicationFiled: April 22, 2020Publication date: November 26, 2020Applicant: Fujitsu LimitedInventors: Jing Zhou, Akira Nakagawa, Sihan Wen, Zhiming Tan
-
Publication number: 20200372684Abstract: Embodiments of this disclosure provide an image coding method and apparatus and an image compression system. The image coding apparatus includes a memory and a processor. The processor is configured to perform feature extraction on an input image to obtain feature maps of N channels; assign a weight to a feature map of each channel among the N channels; perform down-dimension processing on weighted feature maps processed in association with the N channels, to obtain feature maps of M channels and output the feature maps of M channels, M being smaller than N. Hence, by multiplying different feature maps by a weight to obtain corresponding importance and then performing down-dimension processing on the feature maps processed according to the weighting, time for decoding may be reduced.Type: ApplicationFiled: May 14, 2020Publication date: November 26, 2020Applicant: FUJITSU LIMITEDInventors: Sihan Wen, Jing Zhou, Zhiming Tan
-
Publication number: 20200327701Abstract: Embodiments of this disclosure provide an image encoding method and apparatus and image decoding method and apparatus. The image encoding includes performing convolutional neural network (CNN) encoding on image data to generate feature vectors or feature maps; quantizing the feature vectors or feature maps to generate discrete symbols to be encoded; and estimating probabilities of the symbols to be encoded by using a multi-scale context model including multiple mask convolution layers of different scales. An entropy encoding of the image data is performed according to the probabilities of the symbols to be encoded.Type: ApplicationFiled: January 23, 2020Publication date: October 15, 2020Applicant: FUJITSU LIMITEDInventors: Jing ZHOU, Sihan WEN, Zhiming TAN