Patents by Inventor Xiaosong Zhou

Xiaosong Zhou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SCALABILITY OF MULTI-DIRECTIONAL VIDEO STREAMING

Publication number: 20200092571

Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponds to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, the techniques include retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.

Type: Application

Filed: September 14, 2018

Publication date: March 19, 2020

Inventors: Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou

Bandwidth management in devices with simultaneous download of multiple data streams

Patent number: 10575008

Abstract: A method of managing resources on a terminal includes determining a number of downloaded video streams active at the terminal, prioritizing the active video streams, assigning a decoding quality level to each active video stream based on a priority assignment for each active video stream, and apportioning reception bandwidth to each active video stream based on an assigned quality level of each active video stream.

Type: Grant

Filed: May 31, 2016

Date of Patent: February 25, 2020

Assignee: APPLE INC.

Inventors: Dazhong Zhang, Hsi-Jung Wu, Peikang Song, Yunfei Zheng, Chris Y. Chung, Jae Hoon Kim, Xiaosong Zhou
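
Illustration for patent 10575008 above: the abstract describes prioritizing active streams, assigning each a quality level from its priority, and splitting reception bandwidth by quality level. The following is a minimal sketch of that flow, not the patented method. The names `Stream`, `assign_quality`, and `apportion_bandwidth`, the three quality tiers, and the weight values are all hypothetical choices made for this example.

```python
from dataclasses import dataclass

# Hypothetical quality tiers and relative bandwidth weights (illustrative values).
QUALITY_WEIGHTS = {"high": 4, "medium": 2, "low": 1}


@dataclass
class Stream:
    name: str
    priority: int  # assumed convention: larger value means more important


def assign_quality(streams):
    """Rank active streams by priority and map rank to a quality level."""
    ranked = sorted(streams, key=lambda s: s.priority, reverse=True)
    levels = {}
    for rank, stream in enumerate(ranked):
        if rank == 0:
            levels[stream.name] = "high"
        elif rank == 1:
            levels[stream.name] = "medium"
        else:
            levels[stream.name] = "low"
    return levels


def apportion_bandwidth(streams, total_kbps):
    """Split total_kbps among streams in proportion to their quality weights."""
    levels = assign_quality(streams)
    total_weight = sum(QUALITY_WEIGHTS[q] for q in levels.values())
    return {
        name: total_kbps * QUALITY_WEIGHTS[q] / total_weight
        for name, q in levels.items()
    }
```

With three streams of priority 3, 1, and 2 and 7000 kbps available, this sketch gives the top-priority stream four sevenths of the bandwidth, the next two sevenths, and the last one seventh.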

ESTABLISHING A VIDEO CONFERENCE DURING A PHONE CALL

Publication number: 20200059628

Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.

Type: Application

Filed: October 28, 2019

Publication date: February 20, 2020

Inventors: Elizabeth C. Cranfill, Stephen O. Lemay, Joe S. Abuan, Hsi-Jung Wu, Xiaosong Zhou, Roberto Garcia, Jr.

Techniques for HDR/WCR video coding

Patent number: 10536731

Abstract: Systems and methods are provided for processing high quality video data, such as data having a higher than standard bit depth, a high dynamic range, or a wide or custom color gamut, to be compatible with conventional encoders and decoders without significant loss of quality. High quality data is encoded into a plurality of layers with a base layer having the standard quality data and one or more higher quality layers. Decoding systems and methods may map the base layer to the dynamic range or color gamut of the enhancement layer, combine the layers, and map the combined layers to a dynamic range or color gamut appropriate for the target display. Each of the standard quality and the high quality data may be encoded as a plurality of tiers of increasing quality and reference lower level tiers as sources of prediction during predictive coding.

Type: Grant

Filed: May 12, 2015

Date of Patent: January 14, 2020

Assignee: Apple Inc.

Inventors: Yeping Su, Jiefu Zhai, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu, Chris Y. Chung

Pixel patch collection for prediction in video coding system

Patent number: 10536726

Abstract: In a video coding system, an encoder may include a coding engine to predictively code input video, a decoder to reconstruct reference pictures generated by the coding engine, a reference picture cache to store the reconstructed reference pictures, a patch cache to store prediction patches generated from other sources, and a prediction search unit to search among the reference picture cache and the patch cache to generate prediction references for use by the coding engine while coding input video. The prediction patches may be assembled from a variety of sources including: predefined image content, reference pictures being evicted from the reference picture cache, image content of prior coding sessions and image data stored by applications on a common terminal where the encoder resides. A decoder may store prediction patches in its own patch cache for synchronous decoding.

Type: Grant

Filed: May 3, 2012

Date of Patent: January 14, 2020

Assignee: APPLE INC.

Inventors: Xiaosong Zhou, James Oliver Normile, Hsi-Jung Wu

Packed image format for multi-directional video

Patent number: 10523913

Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.

Type: Grant

Filed: June 30, 2017

Date of Patent: December 31, 2019

Assignee: APPLE INC.

Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung

Gradual Decoder Refresh Techniques with Management of Reference Pictures

Publication number: 20190373276

Abstract: Techniques are disclosed for managing reference frames for gradual decoder refresh (GDR) operation. A GDR frame may be partitioned into a plurality of units, at least one of which is coded by instantaneous decoder refresh (IDR) techniques and other(s) of which are coded by other techniques such as inter-coding. The coded GDR frame may be exchanged between an encoder and a decoder. The encoder and decoder both may decode the GDR frame. The encoder and decoder may store the IDR-coded portion of the GDR frame in a reference picture buffer in a modified frame that includes, for the other portion(s) of the GDR frame, replacement content instead of the content obtained by decoding. The modified reference frame is expected to bias prediction search operations performed on later frames toward selection of the IDR-coded content as opposed to the replacement content.

Type: Application

Filed: June 1, 2018

Publication date: December 5, 2019

Inventors: Sudeng Hu, Dazhong Zhang, Xing Wen, Peikang Song, Jae Hoon Kim, Hang Yuan, Xiaosong Zhou, Hsi-Jung Wu, Jingteng Xue

EFFICIENT CODING OF VIDEO DATA IN THE PRESENCE OF VIDEO ANNOTATIONS

Publication number: 20190373270

Abstract: Systems and methods for coding a video to be overlaid by annotations are devised. A motion compensated predictive coding is employed, wherein coding parameters of video pixel blocks are determined based on the pixel blocks' relation to the annotations. A decoder decodes the video and annotates it based on metadata, obtained from the coder or other sources, describing the annotations' appearance and rendering mode.

Type: Application

Filed: May 23, 2019

Publication date: December 5, 2019

Inventors: Sudeng HU, Xing WEN, Jae Hoon KIM, Peikang SONG, Hang YUAN, Dazhong ZHANG, Xiaosong ZHOU, Hsi-Jung WU, Christopher GARRIDO, Ming JIN, Patrick MIAUTON, Karthick SANTHANAM

HIGH DYNAMIC RANGE VIDEO CAPTURE CONTROL FOR VIDEO TRANSMISSION

Publication number: 20190349509

Abstract: Systems and methods are provided for capturing high quality video data, including data having a high dynamic range, for use with conventional encoders and decoders. High dynamic range data is captured using multiple groups of pixels, where each group is captured using a different exposure time. The pixels that are captured at different exposure times may be determined adaptively based on the content of the image, the parameters of the encoding system, or the available resources within the encoding system. The transition from a single exposure to two different exposure times may be implemented gradually.

Type: Application

Filed: May 17, 2019

Publication date: November 14, 2019

Inventors: Jiefu ZHAI, Xiaosong ZHOU, Chris Y. CHUNG, Hsi-Jung WU

MEDIA FEED PRIORITIZATION FOR MULTI-PARTY CONFERENCING

Publication number: 20190342351

Abstract: Techniques presented herein provide an improved relay user experience and improved management of scarce computing and network resources as the number of relay endpoints increases. A sourcing endpoint device may generate a media feed, such as video and/or audio feed, representing contribution from a conference participant. The sourcing endpoint device may generate a priority value for the media feed, and the priority value may be transmitted to other members of the relay along with the input feed. Priority values of the different relay participants may be used by other devices, for example, intermediate servers or receiving endpoint devices, to manage aspects of the relay. For example, a relay server may prune streams from select endpoint devices based on relative priority values received from those devices. Alternatively, receiving endpoint devices may alter presentation of received feeds based on their associated priority values.

Type: Application

Filed: May 7, 2019

Publication date: November 7, 2019

Inventors: Christopher M. Garrido, Dazhong Zhang, Karthick Santhanam, Patrick Miauton, Xiaoxiao Zheng, Bess Chan, Peter Shiang, Sudeng Hu, Peikang Song, Xiaosong Zhou

Camera-assisted motion estimation for application control

Patent number: 10466809

Abstract: Embodiments of the present invention generate estimates of device motion from two data sources on a computing device—a motion sensor and a camera. The device may compare the estimates to each other to determine if they agree. If they agree, the device may confirm that device motion estimates based on the motion sensor are accurate and may output those estimates to an application within the device. If the device motion estimates disagree, the device may alter the motion estimates obtained from the motion sensor before outputting them to the application.

Type: Grant

Filed: October 20, 2017

Date of Patent: November 5, 2019

Assignee: Apple Inc.

Inventors: Yao-Chung Lin, Xiaosong Zhou, Hsi-Jung Wu
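
Illustration for patent 10466809 above: the abstract describes comparing a motion-sensor estimate with a camera-based estimate, passing the sensor estimate through when the two agree, and altering it when they disagree. The sketch below shows one way that decision could look. The abstract does not say how agreement is tested or how the estimate is altered, so the tolerance check and the linear blend are assumptions, as are the name `fuse_motion` and its parameters.

```python
def fuse_motion(sensor_est, camera_est, tolerance=0.1, blend=0.5):
    """Return the motion estimate to hand to the application.

    sensor_est and camera_est are equal-length sequences of per-axis motion.
    Assumption: the estimates "agree" when every axis differs by at most
    `tolerance`. Assumption: on disagreement, the sensor estimate is altered
    by blending it toward the camera estimate with weight `blend`.
    """
    largest_gap = max(abs(s - c) for s, c in zip(sensor_est, camera_est))
    if largest_gap <= tolerance:
        # Estimates agree: the sensor-based estimate is confirmed as-is.
        return tuple(sensor_est)
    # Estimates disagree: alter the sensor-based estimate before output.
    return tuple((1 - blend) * s + blend * c
                 for s, c in zip(sensor_est, camera_est))
```

For example, sensor and camera estimates of (1.0, 0.0) and (1.05, 0.0) fall within the default tolerance, so the sensor values pass through unchanged, while (1.0, 0.0) against (2.0, 0.0) yields the blended value (1.5, 0.0).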

Establishing a video conference during a phone call

Patent number: 10462420

Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.

Type: Grant

Filed: October 5, 2017

Date of Patent: October 29, 2019

Assignee: APPLE INC.

Inventors: Elizabeth C. Cranfill, Stephen O. Lemay, Joe S. Abuan, Hsi-Jung Wu, Xiaosong Zhou, Roberto Garcia, Jr.

Video analysis techniques for improved editing, navigation, and summarization

Patent number: 10452713

Abstract: Systems and processes for improved video editing, summarization and navigation based on generation and analysis of metadata are described. The metadata may be content-based (e.g., differences between neighboring frames, exposure data, key frame identification data, motion data, or face detection data) or non-content-based (e.g., exposure, focus, location, time) and used to prioritize and/or classify portions of video. The metadata may be generated at the time of image capture or during post-processing. Prioritization information, such as a score for various portions of the image data, may be based on the metadata and/or image data. Classification information, such as the type or quality of a scene, may be determined based on the metadata and/or image data. The classification and prioritization information may be metadata and may be used to automatically remove undesirable portions of the video, generate suggestions during editing, or automatically generate a summary video.

Type: Grant

Filed: December 3, 2014

Date of Patent: October 22, 2019

Assignee: Apple Inc.

Inventors: Shujie Liu, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu, Chris Y. Chung, James O. Normile, Douglas S. Price, Albert E. Keinath

Applications for Decoder-Side Modeling of Objects Identified in Decoded Video Data

Publication number: 20190306524

Abstract: Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.

Type: Application

Filed: March 28, 2018

Publication date: October 3, 2019

Inventors: Xing Wen, Dazhong Zhang, Peikang Song, Xiaosong Zhou, Sudeng Hu, Hsi-Jung Wu, Jae Hoon Kim

De-juddering techniques for coded video

Patent number: 10432946

Abstract: Judder artifacts are remedied in a video coding system by employing frame rate conversion at an encoder. A source video sequence may be coded as base layer coded video at a first frame rate. An encoder may identify portions of the coded video sequence that likely will exhibit judder effects when decoded. For those portions, video data representing the corresponding portion of the source video may be coded as enhancement layer data at a higher frame rate than the frame rate of the coded base layer data. Moreover, an encoder may generate metadata representing “FRC hints,” which are techniques that a decoder should employ when performing decoder-side frame rate conversion. An encoding terminal may transmit the base layer coded video and either the enhancement layer coded video or the FRC hints to a decoder. Thus, encoder infrastructure may mitigate judder artifacts that may arise during decoding.

Type: Grant

Filed: December 10, 2015

Date of Patent: October 1, 2019

Assignee: Apple Inc.

Inventors: Yeping Su, Chris Y. Chung, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai

REAL-TIME FACE AND OBJECT MANIPULATION

Publication number: 20190279681

Abstract: Techniques are presented for modifying images of an object in video, for example to correct for lens distortion, or to beautify a face. These techniques include extracting and validating features of an object from a source video frame, tracking those features over time, estimating a pose of the object, modifying a 3D model of the object based on the features, and rendering a modified video frame based on the modified 3D model and modified intrinsic and extrinsic matrices. These techniques may be applied in real-time to an object in a sequence of video frames.

Type: Application

Filed: March 9, 2018

Publication date: September 12, 2019

Inventors: Hang Yuan, Jiefu Zhai, Ming Chen, Jae Hoon Kim, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu

Media analysis and processing framework on a resource restricted device

Patent number: 10402226

Abstract: A system for processing media on a resource restricted device, the system including a memory to store data representing media assets and associated descriptors, and program instructions representing an application and a media processing system, and a processor to execute the program instructions, wherein the program instructions representing the media processing system, in response to a call from an application defining a plurality of services to be performed on an asset, determine a tiered schedule of processing operations to be performed upon the asset based on a processing budget associated therewith, and iteratively execute the processing operations on a tier-by-tier basis, unless interrupted.

Type: Grant

Filed: June 3, 2016

Date of Patent: September 3, 2019

Assignee: Apple Inc.

Inventors: Albert Keinath, Ke Zhang, Yunfei Zheng, Shujie Liu, Jiefu Zhai, Chris Y. Chung, Xiaosong Zhou, Hsi-Jung Wu

Hierarchical sharpness evaluation

Patent number: 10402677

Abstract: Techniques are disclosed for estimating quality of images in an automated fashion. According to these techniques, a source image may be downsampled to generate at least two downsampled images at different levels of downsampling. Blurriness of the images may be estimated starting with a most-heavily downsampled image. Blocks of a given image may be evaluated for blurriness and, when a block of a given image is estimated to be blurry, the block of the image and co-located blocks of higher resolution image(s) may be designated as blurry. Thereafter, a blurriness score may be calculated for the source image from the number of blocks of the source image designated as blurry.

Type: Grant

Filed: June 9, 2017

Date of Patent: September 3, 2019

Assignee: Apple Inc.

Inventors: Jiefu Zhai, Ke Zhang, Yunfei Zheng, Shujie Liu, Albert Keinath, Xiaosong Zhou, Chris Chung, Hsi-Jung Wu
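
Illustration for patent 10402677 above: the abstract describes a coarse-to-fine pass in which a block found blurry at a heavily downsampled level marks its co-located blocks at higher resolutions as blurry, and the final score counts blurry blocks in the source image. The sketch below follows that outline under several assumptions: images are 2D lists of grayscale values whose dimensions divide evenly by `block * 2**levels`, downsampling is a 2x2 average, and a block counts as blurry when its mean absolute horizontal gradient falls below `threshold`. The abstract specifies none of these details, and all function names are hypothetical.

```python
def downsample(img):
    """Halve resolution by averaging each 2x2 neighborhood."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [[(img[2 * r][2 * c] + img[2 * r][2 * c + 1]
              + img[2 * r + 1][2 * c] + img[2 * r + 1][2 * c + 1]) / 4.0
             for c in range(w)] for r in range(h)]


def block_is_blurry(img, r0, c0, size, threshold):
    """Assumed metric: mean absolute horizontal gradient below threshold."""
    total, count = 0.0, 0
    for r in range(r0, r0 + size):
        for c in range(c0, c0 + size - 1):
            total += abs(img[r][c + 1] - img[r][c])
            count += 1
    return total / count < threshold


def blurriness_score(source, levels=2, block=4, threshold=1.0):
    """Fraction of source blocks designated blurry after a coarse-to-fine pass."""
    pyramid = [source]
    for _ in range(levels):
        pyramid.append(downsample(pyramid[-1]))

    inherited = set()  # blocks already designated blurry by a coarser level
    for img in reversed(pyramid):  # most heavily downsampled image first
        rows, cols = len(img) // block, len(img[0]) // block
        blurry = set(inherited)
        for r in range(rows):
            for c in range(cols):
                if (r, c) not in blurry and block_is_blurry(
                        img, r * block, c * block, block, threshold):
                    blurry.add((r, c))
        # Each block corresponds to four co-located blocks one level finer.
        inherited = {(2 * r + dr, 2 * c + dc)
                     for (r, c) in blurry for dr in (0, 1) for dc in (0, 1)}
    # After the loop, blurry/rows/cols describe the source image itself.
    return len(blurry) / (rows * cols)
```

A simple gradient threshold like this one can misjudge fine detail that averages away at coarse levels, so a practical system would likely need a more robust per-block metric.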

Trick play in digital video streaming

Patent number: 10382830

Abstract: Systems and methods for improved playback of a video stream are presented. Video snippets are identified that include a number of consecutive frames for playback. Snippets may be evenly temporally spaced in the video stream or may be content adaptive. The first frame of a snippet may then be selected as the first frame of a scene or other appropriate stopping point. Scene detection, object detection, motion detection, video metadata, or other information generated during encoding or decoding of the video stream may aid in appropriate snippet selection.

Type: Grant

Filed: October 18, 2016

Date of Patent: August 13, 2019

Assignee: Apple Inc.

Inventors: David L. Biderman, Xiaosong Zhou, Douglas Scott Price, Yeping Su, Chris Y. Chung, Hsi-Jung Wu, Zhenheng Li, John Su, James D. Batson

Processing of Multi-Directional Images in Spatially-Ordered Video Coding Applications

Publication number: 20190246141

Abstract: Image processing techniques may accelerate coding of viewport data contained within multi-view image data. According to such techniques, an encoder may shift content of multi-directional image data according to viewport location data provided by a decoder. The encoder may code the shifted multi-directional image data by predictive coding and transmit to the decoder the coded multi-directional image data and data identifying an amount of the shift. Doing so may move the viewport location to positions in the image data that are coded earlier than the positions that the viewport location naturally occupies and, thereby, may accelerate coding. On decode, a decoder may compare its present viewport location with viewport location data provided by the encoder with coded video data. The decoder may decode the coded video data and extract a portion of the decoded video data corresponding to a present viewport location for display.

Type: Application

Filed: February 5, 2018

Publication date: August 8, 2019

Inventors: Jae Hoon Kim, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Ming Chen, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
