Patents by Inventor Xiaosong Zhou

Xiaosong Zhou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20200380260
    Abstract: Techniques disclosed for managing video captured by an imaging device. Methods disclosed capture a video in response to a capture command received at the imaging device. Following a video capture, techniques for classifying the captured video based on feature(s) extracted therefrom, for marking the captured video based on the classification, and for generating a media item from the captured video according to the marking are disclosed. Accordingly, the captured video may be classified as representing a static event, and, as a result, a media item of a still image may be generated. Otherwise, the captured video may be classified as representing a dynamic event, and, as a result, a media item of a video may be generated.
    Type: Application
    Filed: May 26, 2020
    Publication date: December 3, 2020
    Inventors: Bartlomiej RYMKOWSKI, Robert BAILEY, Ethan TIRA-THOMPSON, Shuang GAO, Ben ENGLERT, Emilie KIM, Shujie LIU, Ke ZHANG, Vinay SHARMA, Xiaosong ZHOU
  • Patent number: 10812832
    Abstract: Coding techniques for image data may cause a still image to be converted to a “phantom” video sequence, which is coded by motion compensated prediction techniques. Thus, coded video data obtained from the coding operation may include temporal prediction references between frames of the video sequence. Metadata may be generated that identifies allocations of content from the still image to the frames of the video sequence. The coded data and the metadata may be transmitted to another device, whereupon it may be decoded by motion compensated prediction techniques and converted back to a still image data. Other techniques may involve coding an image in both a base layer representation and at least one coded enhancement layer representation. The enhancement layer representation may be coded predictively with reference to the base layer representation. The coded base layer representation may be partitioned into a plurality of individually-transmittable segments and stored.
    Type: Grant
    Filed: June 5, 2015
    Date of Patent: October 20, 2020
    Assignee: APPLE INC.
    Inventors: Hang Yuan, Chris Y. Chung, Jae Hoon Kim, Yeping Su, Jiefu Zhai, Xiaosong Zhou, Hsi-Jung Wu
  • Publication number: 20200304837
    Abstract: An encoding system may include a video source that captures video image, a video coder, and a controller to manage operation of the system. The video coder may encode the video image into encoded video data using a plurality of subgroup parameters corresponding to a plurality of subgroups of pixels within a group. The controller may set the subgroup parameters for at least one of the subgroups of pixels in the video coder, based upon at least one parameters corresponding to the group. A decoding system may decode the video data based upon the motion prediction parameters.
    Type: Application
    Filed: June 8, 2020
    Publication date: September 24, 2020
    Inventors: Yunfei Zheng, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Publication number: 20200296426
    Abstract: Chroma deblock filtering of reconstructed video samples may be performed to remove blockiness artifacts and reduce color artifacts without over-smoothing. In a first method, chroma deblocking may be performed for boundary samples of a smallest transform size, regardless of partitions and coding modes. In a second method, chroma deblocking may be performed when a boundary strength is greater than 0. In a third method, chroma deblocking may be performed regardless of boundary strengths. In a fourth method, the type of chroma deblocking to be performed may be signaled in a slice header by a flag. Furthermore, luma deblock filtering techniques may be applied to chroma deblock filtering.
    Type: Application
    Filed: June 2, 2020
    Publication date: September 17, 2020
    Inventors: Jiefu Zhai, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu, Peikang Song, David R. Conrad, Jae Hoon Kim, Yunfei Zheng
  • Patent number: 10757428
    Abstract: Systems and methods are disclosed for reshaping HDR video content to improve compression efficiency while using standard encoding/decoding techniques. Input HDR video frames, e.g., represented in an IPT color space, may be reshaped before the encoding/decoding process and the corresponding reconstructed HDR video frames may then be reverse reshaped. The disclosed reshaping methods may be combinations of scene-based or segment-based methods.
    Type: Grant
    Filed: October 10, 2018
    Date of Patent: August 25, 2020
    Assignee: APPLE INC.
    Inventors: Mei Guo, Jun Xin, Jun Xu, Yeping Su, Chris Chung, Xiaosong Zhou, Hsi-Jung Wu
  • Patent number: 10754242
    Abstract: Techniques are described for implementing format configurations for multi-directional video and for switching between them. Source images may be assigned to formats that may change during a coding session. When a change occurs between formats, video coders and decoder may transform decoded reference frames from the first format to the second format. Thereafter, new frames in the second configuration may be coded or decoded predictively using transformed reference frame(s) as source(s) of prediction. In this manner, video coders and decoders may use intra-coding techniques and achieve high efficiency in coding.
    Type: Grant
    Filed: June 30, 2017
    Date of Patent: August 25, 2020
    Assignee: Apple Inc.
    Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
  • Patent number: 10749923
    Abstract: Methods and apparatus for contextual video content adaptation are disclosed. Video content is adapted based on any number of criteria such as a target device type, viewing conditions, network conditions or various use cases, for example. A target adaptation of content may be defined for a specified video source. For example, based on receiving a request from a portable device for a live sports feed, a shortened and reduced resolution version of the live sport feed video may be defined for the portable device. The source content may be accessed and adapted (e.g., adapted temporally, spatially, etc.) and an adapted version of content generated. For example, the source content may be cropped to a particular spatial region of interest and/or reduced in length to a particular scene. The generated adaptation may be transmitted to a device in response to the request, or stored to a storage device.
    Type: Grant
    Filed: May 31, 2016
    Date of Patent: August 18, 2020
    Assignee: Apple Inc.
    Inventors: Chris Y. Chung, Hsi-Jung Wu, Xiaosong Zhou, Jae Hoon Kim, Jingteng Xue
  • Patent number: 10735773
    Abstract: Techniques for coding video data are described that maintain high precision coding for low motion video content. Such techniques include determining whether a source video sequence to be coded has low motion content. When the source video sequence contains low motion content, the video sequence may be coded as a plurality of coded frames using a chain of temporal prediction references among the coded frames. Thus, a single frame in the source video sequence is coded as a plurality of frames. Because the coded frames each represent identical content, the quality of coding should improve across the plurality of frames. Optionally, the disclosed techniques may increase the resolution at which video is coded to improve precision and coding quality.
    Type: Grant
    Filed: June 4, 2015
    Date of Patent: August 4, 2020
    Assignee: Apple Inc.
    Inventors: Peikang Song, Jae Hoon Kim, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu, Dazhong Zhang
  • Publication number: 20200236349
    Abstract: Systems and methods disclosed for video compression, utilizing neural networks for predictive video coding. Processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.
    Type: Application
    Filed: January 22, 2019
    Publication date: July 23, 2020
    Inventors: Jiefu ZHAI, Xingyu ZHANG, Xiaosong ZHOU, Jun XIN, Hsi-Jung WU, Yeping SU
  • Patent number: 10715833
    Abstract: An encoding system may include a video source that captures video image, a video coder, and a controller to manage operation of the system. The video coder may encode the video image into encoded video data using a plurality of subgroup parameters corresponding to a plurality of subgroups of pixels within a group. The controller may set the subgroup parameters for at least one of the subgroups of pixels in the video coder, based upon at least one parameters corresponding to the group. A decoding system may decode the video data based upon the motion prediction parameters.
    Type: Grant
    Filed: May 28, 2014
    Date of Patent: July 14, 2020
    Assignee: Apple Inc.
    Inventors: Yunfei Zheng, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 10708623
    Abstract: Chroma deblock filtering of reconstructed video samples may be performed to remove blockiness artifacts and reduce color artifacts without over-smoothing. In a first method, chroma deblocking may be performed for boundary samples of a smallest transform size, regardless of partitions and coding modes. In a second method, chroma deblocking may be performed when a boundary strength is greater than 0. In a third method, chroma deblocking may be performed regardless of boundary strengths. In a fourth method, the type of chroma deblocking to be performed may be signaled in a slice header by a flag. Furthermore, luma deblock filtering techniques may be applied to chroma deblock filtering.
    Type: Grant
    Filed: July 31, 2018
    Date of Patent: July 7, 2020
    Assignee: Apple Inc.
    Inventors: Jiefu Zhai, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu, Peikang Song, David R. Conrad, Jae Hoon Kim, Yunfei Zheng
  • Publication number: 20200213571
    Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
    Type: Application
    Filed: December 23, 2019
    Publication date: July 2, 2020
    Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
  • Publication number: 20200177927
    Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.
    Type: Application
    Filed: November 29, 2018
    Publication date: June 4, 2020
    Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
  • Patent number: 10652578
    Abstract: Image processing techniques may accelerate coding of viewport data contained within multi-view image data. According to such techniques, an encoder may shifting content of a multi-directional image data according to the viewport location data provided by a decoder. The encoder may code the shifted multi-directional image data by predictive coding, and transmit to the decoder, the coded multi-directional image data and data identifying an amount of the shift. Doing so may move the viewport location to positions in the image data that are coded earlier than the positions that the viewport location naturally occupies and, thereby, may accelerate coding. On decode, a decoder may compare its present viewport location with viewport location data provided by the encoder with coded video data. The decoder may decode the coded video data and extract a portion of the decoded video data corresponding to a present viewport location for display.
    Type: Grant
    Filed: February 5, 2018
    Date of Patent: May 12, 2020
    Assignee: APPLE INC.
    Inventors: Jae Hoon Kim, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Ming Chen, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 10652567
    Abstract: Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.
    Type: Grant
    Filed: March 28, 2018
    Date of Patent: May 12, 2020
    Assignee: APPLE INC.
    Inventors: Xing Wen, Dazhong Zhang, Peikang Song, Xiaosong Zhou, Sudeng Hu, Hsi-Jung Wu, Jae Hoon Kim
  • Patent number: 10638169
    Abstract: A video streaming method for transitioning between multiple sequences of coded video data may include receiving and decoding transmission units from a first sequence of coded video data. In response to a request to transition to a second sequence of coded video data, the method may determine whether a time to transition to the second sequence of coded video data can be reduced by transitioning to the second sequence of coded video data via an intermediate sequence of coded video data. If the time can be reduced, the method may include receiving at least one transmission unit from an intermediate sequence of coded video data that corresponds to the request to transition, decoding the transmission unit from the intermediate sequence, and transitioning from the first sequence to the second sequence via the decoded transmission unit from the intermediate sequence.
    Type: Grant
    Filed: December 18, 2017
    Date of Patent: April 28, 2020
    Assignee: Apple Inc.
    Inventors: Yeping Su, Chris Y. Chung, Xiaosong Zhou, James Oliver Normile, Hsi-Jung Wu, Thomas Jansen, Hyeonkuk Jeong, Joe S. Abuan, Douglas Scott Price
  • Patent number: 10638147
    Abstract: Techniques are disclosed for managing reference frames for gradual coder refresh (GDR) operation. A GDR frame may be partitioned into a plurality of units, at least one of which is coded by instantaneous decoder refresh (IDR) techniques and other(s) of which are coded by other techniques such as inter-coding. The coded GDR frame may be exchanged between an encoder and a decoder. The encoder and decoder both may decode the GDR frame. The encoder and decoder may store the IDR-coded portion of the GDR frame in a reference picture buffer in a modified frame that includes, for the other portion(s) of the GDR frame, replacement content instead of the content obtained by decoding. The modified reference frame are expected by bias prediction search operations performed on later frame toward selection of the IDR-coded content as opposed to the replacement content.
    Type: Grant
    Filed: June 1, 2018
    Date of Patent: April 28, 2020
    Assignee: APPLE INC.
    Inventors: Sudeng Hu, Dazhong Zhang, Xing Wen, Peikang Song, Jae Hoon Kim, Hang Yuan, Xiaosong Zhou, Hsi-Jung Wu, Jingteng Xue
  • Publication number: 20200120345
    Abstract: Systems and methods are disclosed for reshaping HDR video content to improve compression efficiency while using standard encoding/decoding techniques. Input HDR video frames, e.g., represented in an IPT color space, may be reshaped before the encoding/decoding process and the corresponding reconstructed HDR video frames may then be reverse reshaped. The disclosed reshaping methods may be combinations of scene-based or segment-based methods.
    Type: Application
    Filed: October 10, 2018
    Publication date: April 16, 2020
    Inventors: Mei GUO, Jun XIN, Jun XU, Yeping SU, Chris CHUNG, Xiaosong ZHOU, Hsi-Jung WU
  • Patent number: 10623744
    Abstract: The present disclosure describes techniques for coding video data in a manner that provides consistency to portions of the video that have similar content. According to such techniques, a video sequence may be parsed into partitions and content of the partitions may be analyzed. Partitions may be grouped together based on detected similarities in content. Coding parameters may be selected for each partition based on the partition's membership in the groups. Thus, when the video sequence is coded, coding parameters for frames of two commonly-grouped partitions may be similar, which causes coded video data to have similar presentation.
    Type: Grant
    Filed: October 4, 2017
    Date of Patent: April 14, 2020
    Assignee: APPLE INC.
    Inventors: Mei Guo, Jun Xin, Yeping Su, Chris Y. Chung, Xiaosong Zhou, Hsi-Jung Wu
  • Patent number: 10616498
    Abstract: Systems and methods are provided for capturing high quality video data, including data having a high dynamic range, for use with conventional encoders and decoders. High dynamic range data is captured using multiple groups of pixels where each group is captured using different exposure times to create groups of pixels. The pixels that are captured at different exposure times may be determined adaptively based on the content of the image, the parameters of the encoding system, or on the available resources within the encoding system. The transition from single exposure to using two different exposure times may be implemented gradually.
    Type: Grant
    Filed: May 17, 2019
    Date of Patent: April 7, 2020
    Assignee: Apple Inc.
    Inventors: Jiefu Zhai, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu