Patents by Inventor Xiaosong Zhou

Xiaosong Zhou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

AUTOMATED MEDIA EDITING OPERATIONS IN CONSUMER DEVICES

Publication number: 20200380260

Abstract: Techniques disclosed for managing video captured by an imaging device. Methods disclosed capture a video in response to a capture command received at the imaging device. Following a video capture, techniques for classifying the captured video based on feature(s) extracted therefrom, for marking the captured video based on the classification, and for generating a media item from the captured video according to the marking are disclosed. Accordingly, the captured video may be classified as representing a static event, and, as a result, a media item of a still image may be generated. Otherwise, the captured video may be classified as representing a dynamic event, and, as a result, a media item of a video may be generated.

Type: Application

Filed: May 26, 2020

Publication date: December 3, 2020

Inventors: Bartlomiej RYMKOWSKI, Robert BAILEY, Ethan TIRA-THOMPSON, Shuang GAO, Ben ENGLERT, Emilie KIM, Shujie LIU, Ke ZHANG, Vinay SHARMA, Xiaosong ZHOU
Efficient still image coding with video compression techniques

Patent number: 10812832

Abstract: Coding techniques for image data may cause a still image to be converted to a “phantom” video sequence, which is coded by motion compensated prediction techniques. Thus, coded video data obtained from the coding operation may include temporal prediction references between frames of the video sequence. Metadata may be generated that identifies allocations of content from the still image to the frames of the video sequence. The coded data and the metadata may be transmitted to another device, whereupon it may be decoded by motion compensated prediction techniques and converted back to a still image data. Other techniques may involve coding an image in both a base layer representation and at least one coded enhancement layer representation. The enhancement layer representation may be coded predictively with reference to the base layer representation. The coded base layer representation may be partitioned into a plurality of individually-transmittable segments and stored.

Type: Grant

Filed: June 5, 2015

Date of Patent: October 20, 2020

Assignee: APPLE INC.

Inventors: Hang Yuan, Chris Y. Chung, Jae Hoon Kim, Yeping Su, Jiefu Zhai, Xiaosong Zhou, Hsi-Jung Wu
ADAPTIVE SYNTAX GROUPING AND COMPRESSION IN VIDEO DATA

Publication number: 20200304837

Abstract: An encoding system may include a video source that captures video image, a video coder, and a controller to manage operation of the system. The video coder may encode the video image into encoded video data using a plurality of subgroup parameters corresponding to a plurality of subgroups of pixels within a group. The controller may set the subgroup parameters for at least one of the subgroups of pixels in the video coder, based upon at least one parameters corresponding to the group. A decoding system may decode the video data based upon the motion prediction parameters.

Type: Application

Filed: June 8, 2020

Publication date: September 24, 2020

Inventors: Yunfei Zheng, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
IN LOOP CHROMA DEBLOCKING FILTER

Publication number: 20200296426

Abstract: Chroma deblock filtering of reconstructed video samples may be performed to remove blockiness artifacts and reduce color artifacts without over-smoothing. In a first method, chroma deblocking may be performed for boundary samples of a smallest transform size, regardless of partitions and coding modes. In a second method, chroma deblocking may be performed when a boundary strength is greater than 0. In a third method, chroma deblocking may be performed regardless of boundary strengths. In a fourth method, the type of chroma deblocking to be performed may be signaled in a slice header by a flag. Furthermore, luma deblock filtering techniques may be applied to chroma deblock filtering.

Type: Application

Filed: June 2, 2020

Publication date: September 17, 2020

Inventors: Jiefu Zhai, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu, Peikang Song, David R. Conrad, Jae Hoon Kim, Yunfei Zheng
Luma and chroma reshaping of HDR video encoding

Patent number: 10757428

Abstract: Systems and methods are disclosed for reshaping HDR video content to improve compression efficiency while using standard encoding/decoding techniques. Input HDR video frames, e.g., represented in an IPT color space, may be reshaped before the encoding/decoding process and the corresponding reconstructed HDR video frames may then be reverse reshaped. The disclosed reshaping methods may be combinations of scene-based or segment-based methods.

Type: Grant

Filed: October 10, 2018

Date of Patent: August 25, 2020

Assignee: APPLE INC.

Inventors: Mei Guo, Jun Xin, Jun Xu, Yeping Su, Chris Chung, Xiaosong Zhou, Hsi-Jung Wu
Adaptive resolution and projection format in multi-direction video

Patent number: 10754242

Abstract: Techniques are described for implementing format configurations for multi-directional video and for switching between them. Source images may be assigned to formats that may change during a coding session. When a change occurs between formats, video coders and decoder may transform decoded reference frames from the first format to the second format. Thereafter, new frames in the second configuration may be coded or decoded predictively using transformed reference frame(s) as source(s) of prediction. In this manner, video coders and decoders may use intra-coding techniques and achieve high efficiency in coding.

Type: Grant

Filed: June 30, 2017

Date of Patent: August 25, 2020

Assignee: Apple Inc.

Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
Contextual video content adaptation based on target device

Patent number: 10749923

Abstract: Methods and apparatus for contextual video content adaptation are disclosed. Video content is adapted based on any number of criteria such as a target device type, viewing conditions, network conditions or various use cases, for example. A target adaptation of content may be defined for a specified video source. For example, based on receiving a request from a portable device for a live sports feed, a shortened and reduced resolution version of the live sport feed video may be defined for the portable device. The source content may be accessed and adapted (e.g., adapted temporally, spatially, etc.) and an adapted version of content generated. For example, the source content may be cropped to a particular spatial region of interest and/or reduced in length to a particular scene. The generated adaptation may be transmitted to a device in response to the request, or stored to a storage device.

Type: Grant

Filed: May 31, 2016

Date of Patent: August 18, 2020

Assignee: Apple Inc.

Inventors: Chris Y. Chung, Hsi-Jung Wu, Xiaosong Zhou, Jae Hoon Kim, Jingteng Xue
Video coding techniques for high quality coding of low motion content

Patent number: 10735773

Abstract: Techniques for coding video data are described that maintain high precision coding for low motion video content. Such techniques include determining whether a source video sequence to be coded has low motion content. When the source video sequence contains low motion content, the video sequence may be coded as a plurality of coded frames using a chain of temporal prediction references among the coded frames. Thus, a single frame in the source video sequence is coded as a plurality of frames. Because the coded frames each represent identical content, the quality of coding should improve across the plurality of frames. Optionally, the disclosed techniques may increase the resolution at which video is coded to improve precision and coding quality.

Type: Grant

Filed: June 4, 2015

Date of Patent: August 4, 2020

Assignee: Apple Inc.

Inventors: Peikang Song, Jae Hoon Kim, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu, Dazhong Zhang
PREDICTIVE CODING WITH NEURAL NETWORKS

Publication number: 20200236349

Abstract: Systems and methods disclosed for video compression, utilizing neural networks for predictive video coding. Processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.

Type: Application

Filed: January 22, 2019

Publication date: July 23, 2020

Inventors: Jiefu ZHAI, Xingyu ZHANG, Xiaosong ZHOU, Jun XIN, Hsi-Jung WU, Yeping SU
Adaptive syntax grouping and compression in video data using a default value and an exception value

Patent number: 10715833

Abstract: An encoding system may include a video source that captures video image, a video coder, and a controller to manage operation of the system. The video coder may encode the video image into encoded video data using a plurality of subgroup parameters corresponding to a plurality of subgroups of pixels within a group. The controller may set the subgroup parameters for at least one of the subgroups of pixels in the video coder, based upon at least one parameters corresponding to the group. A decoding system may decode the video data based upon the motion prediction parameters.

Type: Grant

Filed: May 28, 2014

Date of Patent: July 14, 2020

Assignee: Apple Inc.

Inventors: Yunfei Zheng, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
In loop chroma deblocking filter

Patent number: 10708623

Abstract: Chroma deblock filtering of reconstructed video samples may be performed to remove blockiness artifacts and reduce color artifacts without over-smoothing. In a first method, chroma deblocking may be performed for boundary samples of a smallest transform size, regardless of partitions and coding modes. In a second method, chroma deblocking may be performed when a boundary strength is greater than 0. In a third method, chroma deblocking may be performed regardless of boundary strengths. In a fourth method, the type of chroma deblocking to be performed may be signaled in a slice header by a flag. Furthermore, luma deblock filtering techniques may be applied to chroma deblock filtering.

Type: Grant

Filed: July 31, 2018

Date of Patent: July 7, 2020

Assignee: Apple Inc.

Inventors: Jiefu Zhai, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu, Peikang Song, David R. Conrad, Jae Hoon Kim, Yunfei Zheng
Packed Image Format for Multi-Directional Video

Publication number: 20200213571

Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.

Type: Application

Filed: December 23, 2019

Publication date: July 2, 2020

Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
ADAPTIVE CODING AND STREAMING OF MULTI-DIRECTIONAL VIDEO

Publication number: 20200177927

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Application

Filed: November 29, 2018

Publication date: June 4, 2020

Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
Processing of multi-directional images in spatially-ordered video coding applications

Patent number: 10652578

Abstract: Image processing techniques may accelerate coding of viewport data contained within multi-view image data. According to such techniques, an encoder may shifting content of a multi-directional image data according to the viewport location data provided by a decoder. The encoder may code the shifted multi-directional image data by predictive coding, and transmit to the decoder, the coded multi-directional image data and data identifying an amount of the shift. Doing so may move the viewport location to positions in the image data that are coded earlier than the positions that the viewport location naturally occupies and, thereby, may accelerate coding. On decode, a decoder may compare its present viewport location with viewport location data provided by the encoder with coded video data. The decoder may decode the coded video data and extract a portion of the decoded video data corresponding to a present viewport location for display.

Type: Grant

Filed: February 5, 2018

Date of Patent: May 12, 2020

Assignee: APPLE INC.

Inventors: Jae Hoon Kim, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Ming Chen, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
Applications for decoder-side modeling of objects identified in decoded video data

Patent number: 10652567

Abstract: Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.

Type: Grant

Filed: March 28, 2018

Date of Patent: May 12, 2020

Assignee: APPLE INC.

Inventors: Xing Wen, Dazhong Zhang, Peikang Song, Xiaosong Zhou, Sudeng Hu, Hsi-Jung Wu, Jae Hoon Kim
Codec techniquest for fast switching without a synchronization frame

Patent number: 10638169

Abstract: A video streaming method for transitioning between multiple sequences of coded video data may include receiving and decoding transmission units from a first sequence of coded video data. In response to a request to transition to a second sequence of coded video data, the method may determine whether a time to transition to the second sequence of coded video data can be reduced by transitioning to the second sequence of coded video data via an intermediate sequence of coded video data. If the time can be reduced, the method may include receiving at least one transmission unit from an intermediate sequence of coded video data that corresponds to the request to transition, decoding the transmission unit from the intermediate sequence, and transitioning from the first sequence to the second sequence via the decoded transmission unit from the intermediate sequence.

Type: Grant

Filed: December 18, 2017

Date of Patent: April 28, 2020

Assignee: Apple Inc.

Inventors: Yeping Su, Chris Y. Chung, Xiaosong Zhou, James Oliver Normile, Hsi-Jung Wu, Thomas Jansen, Hyeonkuk Jeong, Joe S. Abuan, Douglas Scott Price
Gradual decoder refresh techniques with management of reference pictures

Patent number: 10638147

Abstract: Techniques are disclosed for managing reference frames for gradual coder refresh (GDR) operation. A GDR frame may be partitioned into a plurality of units, at least one of which is coded by instantaneous decoder refresh (IDR) techniques and other(s) of which are coded by other techniques such as inter-coding. The coded GDR frame may be exchanged between an encoder and a decoder. The encoder and decoder both may decode the GDR frame. The encoder and decoder may store the IDR-coded portion of the GDR frame in a reference picture buffer in a modified frame that includes, for the other portion(s) of the GDR frame, replacement content instead of the content obtained by decoding. The modified reference frame are expected by bias prediction search operations performed on later frame toward selection of the IDR-coded content as opposed to the replacement content.

Type: Grant

Filed: June 1, 2018

Date of Patent: April 28, 2020

Assignee: APPLE INC.

Inventors: Sudeng Hu, Dazhong Zhang, Xing Wen, Peikang Song, Jae Hoon Kim, Hang Yuan, Xiaosong Zhou, Hsi-Jung Wu, Jingteng Xue
LUMA AND CHROMA RESHAPING OF HDR VIDEO ENCODING

Publication number: 20200120345

Abstract: Systems and methods are disclosed for reshaping HDR video content to improve compression efficiency while using standard encoding/decoding techniques. Input HDR video frames, e.g., represented in an IPT color space, may be reshaped before the encoding/decoding process and the corresponding reconstructed HDR video frames may then be reverse reshaped. The disclosed reshaping methods may be combinations of scene-based or segment-based methods.

Type: Application

Filed: October 10, 2018

Publication date: April 16, 2020

Inventors: Mei GUO, Jun XIN, Jun XU, Yeping SU, Chris CHUNG, Xiaosong ZHOU, Hsi-Jung WU
Scene based rate control for video compression and video streaming

Patent number: 10623744

Abstract: The present disclosure describes techniques for coding video data in a manner that provides consistency to portions of the video that have similar content. According to such techniques, a video sequence may be parsed into partitions and content of the partitions may be analyzed. Partitions may be grouped together based on detected similarities in content. Coding parameters may be selected for each partition based on the partition's membership in the groups. Thus, when the video sequence is coded, coding parameters for frames of two commonly-grouped partitions may be similar, which causes coded video data to have similar presentation.

Type: Grant

Filed: October 4, 2017

Date of Patent: April 14, 2020

Assignee: APPLE INC.

Inventors: Mei Guo, Jun Xin, Yeping Su, Chris Y. Chung, Xiaosong Zhou, Hsi-Jung Wu
High dynamic range video capture control for video transmission

Patent number: 10616498

Abstract: Systems and methods are provided for capturing high quality video data, including data having a high dynamic range, for use with conventional encoders and decoders. High dynamic range data is captured using multiple groups of pixels where each group is captured using different exposure times to create groups of pixels. The pixels that are captured at different exposure times may be determined adaptively based on the content of the image, the parameters of the encoding system, or on the available resources within the encoding system. The transition from single exposure to using two different exposure times may be implemented gradually.

Type: Grant

Filed: May 17, 2019

Date of Patent: April 7, 2020

Assignee: Apple Inc.

Inventors: Jiefu Zhai, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu

prev 1 2 3 4 5 6 7 8 … next