Patents by Inventor Jiefu Zhai

Jiefu Zhai has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210227236
    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
    Type: Application
    Filed: April 2, 2021
    Publication date: July 22, 2021
    Inventors: Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
  • Publication number: 20210185361
    Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.
    Type: Application
    Filed: March 1, 2021
    Publication date: June 17, 2021
    Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
  • Patent number: 10999583
    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
    Type: Grant
    Filed: September 14, 2018
    Date of Patent: May 4, 2021
    Assignee: Apple Inc.
    Inventors: Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
  • Patent number: 10999602
    Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
    Type: Grant
    Filed: December 23, 2016
    Date of Patent: May 4, 2021
    Assignee: Apple Inc.
    Inventors: Jae Hoon Kim, Xiaosong Zhou, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 10992919
    Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
    Type: Grant
    Filed: December 23, 2019
    Date of Patent: April 27, 2021
    Assignee: Apple Inc.
    Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
  • Patent number: 10939139
    Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.
    Type: Grant
    Filed: November 29, 2018
    Date of Patent: March 2, 2021
    Assignee: Apple Inc.
    Inventors: Xiaohua Yang, Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
  • Patent number: 10924747
    Abstract: Techniques are disclosed for coding and decoding video captured as cube map images. According to these techniques, padded reference images are generated for use during predicting input data. A reference image is stored in a cube map format. A padded reference image is generated from the reference image in which image data of a first view contained in reference image is replicated and placed adjacent to a second view contained in the cube map image. When coding a pixel block of an input image, a prediction search may be performed between the input pixel block and content of the padded reference image. When the prediction search identifies a match, the pixel block may be coded with respect to matching data from the padded reference image. Presence of replicated data in the padded reference image is expected to increase the likelihood that adequate prediction matches will be identified for input pixel block data, which will increase overall efficiency of the video coding.
    Type: Grant
    Filed: February 27, 2017
    Date of Patent: February 16, 2021
    Assignee: Apple Inc.
    Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jiefu Zhai, Xiaosong Zhou
  • Patent number: 10812832
    Abstract: Coding techniques for image data may cause a still image to be converted to a “phantom” video sequence, which is coded by motion compensated prediction techniques. Thus, coded video data obtained from the coding operation may include temporal prediction references between frames of the video sequence. Metadata may be generated that identifies allocations of content from the still image to the frames of the video sequence. The coded data and the metadata may be transmitted to another device, whereupon it may be decoded by motion compensated prediction techniques and converted back to a still image data. Other techniques may involve coding an image in both a base layer representation and at least one coded enhancement layer representation. The enhancement layer representation may be coded predictively with reference to the base layer representation. The coded base layer representation may be partitioned into a plurality of individually-transmittable segments and stored.
    Type: Grant
    Filed: June 5, 2015
    Date of Patent: October 20, 2020
    Assignee: APPLE INC.
    Inventors: Hang Yuan, Chris Y. Chung, Jae Hoon Kim, Yeping Su, Jiefu Zhai, Xiaosong Zhou, Hsi-Jung Wu
  • Publication number: 20200296426
    Abstract: Chroma deblock filtering of reconstructed video samples may be performed to remove blockiness artifacts and reduce color artifacts without over-smoothing. In a first method, chroma deblocking may be performed for boundary samples of a smallest transform size, regardless of partitions and coding modes. In a second method, chroma deblocking may be performed when a boundary strength is greater than 0. In a third method, chroma deblocking may be performed regardless of boundary strengths. In a fourth method, the type of chroma deblocking to be performed may be signaled in a slice header by a flag. Furthermore, luma deblock filtering techniques may be applied to chroma deblock filtering.
    Type: Application
    Filed: June 2, 2020
    Publication date: September 17, 2020
    Inventors: Jiefu Zhai, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu, Peikang Song, David R. Conrad, Jae Hoon Kim, Yunfei Zheng
  • Patent number: 10754242
    Abstract: Techniques are described for implementing format configurations for multi-directional video and for switching between them. Source images may be assigned to formats that may change during a coding session. When a change occurs between formats, video coders and decoder may transform decoded reference frames from the first format to the second format. Thereafter, new frames in the second configuration may be coded or decoded predictively using transformed reference frame(s) as source(s) of prediction. In this manner, video coders and decoders may use intra-coding techniques and achieve high efficiency in coding.
    Type: Grant
    Filed: June 30, 2017
    Date of Patent: August 25, 2020
    Assignee: Apple Inc.
    Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
  • Publication number: 20200236349
    Abstract: Systems and methods disclosed for video compression, utilizing neural networks for predictive video coding. Processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.
    Type: Application
    Filed: January 22, 2019
    Publication date: July 23, 2020
    Inventors: Jiefu ZHAI, Xingyu ZHANG, Xiaosong ZHOU, Jun XIN, Hsi-Jung WU, Yeping SU
  • Patent number: 10708623
    Abstract: Chroma deblock filtering of reconstructed video samples may be performed to remove blockiness artifacts and reduce color artifacts without over-smoothing. In a first method, chroma deblocking may be performed for boundary samples of a smallest transform size, regardless of partitions and coding modes. In a second method, chroma deblocking may be performed when a boundary strength is greater than 0. In a third method, chroma deblocking may be performed regardless of boundary strengths. In a fourth method, the type of chroma deblocking to be performed may be signaled in a slice header by a flag. Furthermore, luma deblock filtering techniques may be applied to chroma deblock filtering.
    Type: Grant
    Filed: July 31, 2018
    Date of Patent: July 7, 2020
    Assignee: Apple Inc.
    Inventors: Jiefu Zhai, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu, Peikang Song, David R. Conrad, Jae Hoon Kim, Yunfei Zheng
  • Publication number: 20200213571
    Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
    Type: Application
    Filed: December 23, 2019
    Publication date: July 2, 2020
    Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
  • Publication number: 20200177927
    Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.
    Type: Application
    Filed: November 29, 2018
    Publication date: June 4, 2020
    Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
  • Patent number: 10652578
    Abstract: Image processing techniques may accelerate coding of viewport data contained within multi-view image data. According to such techniques, an encoder may shifting content of a multi-directional image data according to the viewport location data provided by a decoder. The encoder may code the shifted multi-directional image data by predictive coding, and transmit to the decoder, the coded multi-directional image data and data identifying an amount of the shift. Doing so may move the viewport location to positions in the image data that are coded earlier than the positions that the viewport location naturally occupies and, thereby, may accelerate coding. On decode, a decoder may compare its present viewport location with viewport location data provided by the encoder with coded video data. The decoder may decode the coded video data and extract a portion of the decoded video data corresponding to a present viewport location for display.
    Type: Grant
    Filed: February 5, 2018
    Date of Patent: May 12, 2020
    Assignee: APPLE INC.
    Inventors: Jae Hoon Kim, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Ming Chen, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 10616498
    Abstract: Systems and methods are provided for capturing high quality video data, including data having a high dynamic range, for use with conventional encoders and decoders. High dynamic range data is captured using multiple groups of pixels where each group is captured using different exposure times to create groups of pixels. The pixels that are captured at different exposure times may be determined adaptively based on the content of the image, the parameters of the encoding system, or on the available resources within the encoding system. The transition from single exposure to using two different exposure times may be implemented gradually.
    Type: Grant
    Filed: May 17, 2019
    Date of Patent: April 7, 2020
    Assignee: Apple Inc.
    Inventors: Jiefu Zhai, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Publication number: 20200092571
    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
    Type: Application
    Filed: September 14, 2018
    Publication date: March 19, 2020
    Inventors: Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
  • Patent number: 10536731
    Abstract: Systems and methods are provided for processing high quality video data, such as data having a higher than standard bit depth, a high dynamic range, or a wide or custom color gamut, to be compatible with conventional encoders and decoders without significant loss of quality. High quality data is encoded into a plurality of layers with a base layer having the standard quality data and one or more higher quality layers. Decoding systems and methods may map the base layer to the dynamic range or color gamut of the enhancement layer, combine the layers, and map the combined layers to a dynamic range or color gamut appropriate for the target display. Each of the standard quality and the high quality data may be encoded as a plurality of tiers of increasing quality and reference lower level tiers as sources of prediction during predictive coding.
    Type: Grant
    Filed: May 12, 2015
    Date of Patent: January 14, 2020
    Assignee: Apple Inc.
    Inventors: Yeping Su, Jiefu Zhai, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu, Chris Y. Chung
  • Patent number: 10523913
    Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
    Type: Grant
    Filed: June 30, 2017
    Date of Patent: December 31, 2019
    Assignee: APPLE INC.
    Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
  • Publication number: 20190349509
    Abstract: Systems and methods are provided for capturing high quality video data, including data having a high dynamic range, for use with conventional encoders and decoders. High dynamic range data is captured using multiple groups of pixels where each group is captured using different exposure times to create groups of pixels. The pixels that are captured at different exposure times may be determined adaptively based on the content of the image, the parameters of the encoding system, or on the available resources within the encoding system. The transition from single exposure to using two different exposure times may be implemented gradually.
    Type: Application
    Filed: May 17, 2019
    Publication date: November 14, 2019
    Inventors: Jiefu ZHAI, Xiaosong ZHOU, Chris Y. CHUNG, Hsi-Jung WU