Patents by Inventor Xiaosong Zhou

Xiaosong Zhou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20200092571
    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
    Type: Application
    Filed: September 14, 2018
    Publication date: March 19, 2020
    Inventors: Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
  • Patent number: 10575008
    Abstract: A method of managing resources on a terminal includes determining a number of downloaded video streams active at the terminal, prioritizing the active video streams, assigning a decoding quality level to each active video stream based on a priority assignment for each active video stream, and apportioning reception bandwidth to each active video stream based on an assigned quality level of each active video stream.
    Type: Grant
    Filed: May 31, 2016
    Date of Patent: February 25, 2020
    Assignee: APPLE INC.
    Inventors: Dazhong Zhang, Hsi-Jung Wu, Peikang Song, Yunfei Zheng, Chris Y. Chung, Jae Hoon Kim, Xiaosong Zhou
  • Publication number: 20200059628
    Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.
    Type: Application
    Filed: October 28, 2019
    Publication date: February 20, 2020
    Inventors: Elizabeth C. Cranfill, Stephen O. Lemay, Joe S. Abuan, Hsi-Jung Wu, Xiaosong Zhou, Roberto Garcia, JR.
  • Patent number: 10536731
    Abstract: Systems and methods are provided for processing high quality video data, such as data having a higher than standard bit depth, a high dynamic range, or a wide or custom color gamut, to be compatible with conventional encoders and decoders without significant loss of quality. High quality data is encoded into a plurality of layers with a base layer having the standard quality data and one or more higher quality layers. Decoding systems and methods may map the base layer to the dynamic range or color gamut of the enhancement layer, combine the layers, and map the combined layers to a dynamic range or color gamut appropriate for the target display. Each of the standard quality and the high quality data may be encoded as a plurality of tiers of increasing quality and reference lower level tiers as sources of prediction during predictive coding.
    Type: Grant
    Filed: May 12, 2015
    Date of Patent: January 14, 2020
    Assignee: Apple Inc.
    Inventors: Yeping Su, Jiefu Zhai, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu, Chris Y. Chung
  • Patent number: 10536726
    Abstract: In a video coding system, an encoder may include a coding engine to predictively code input video, a decoder to reconstruct reference pictures generated by the coding engine, a reference picture cache to store the reconstructed reference pictures, a patch cache to store prediction patches generated from other sources, and a prediction search unit to search among the reference picture cache and the patch cache to generate prediction references for use by the coding engine while coding input video. The prediction patches may be assembled from a variety of sources including: predefined image content, reference pictures being evicted from the reference picture cache, image content of prior coding sessions and image data stored by applications on a common terminal where the encoder resides. A decoder may store prediction patches in its own patch cache for synchronous decoding.
    Type: Grant
    Filed: May 3, 2012
    Date of Patent: January 14, 2020
    Assignee: APPLE INC.
    Inventors: Xiaosong Zhou, James Oliver Normile, Hsi-Jung Wu
  • Patent number: 10523913
    Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
    Type: Grant
    Filed: June 30, 2017
    Date of Patent: December 31, 2019
    Assignee: APPLE INC.
    Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
  • Publication number: 20190373276
    Abstract: Techniques are disclosed for managing reference frames for gradual coder refresh (GDR) operation. A GDR frame may be partitioned into a plurality of units, at least one of which is coded by instantaneous decoder refresh (IDR) techniques and other(s) of which are coded by other techniques such as inter-coding. The coded GDR frame may be exchanged between an encoder and a decoder. The encoder and decoder both may decode the GDR frame. The encoder and decoder may store the IDR-coded portion of the GDR frame in a reference picture buffer in a modified frame that includes, for the other portion(s) of the GDR frame, replacement content instead of the content obtained by decoding. The modified reference frame are expected by bias prediction search operations performed on later frame toward selection of the IDR-coded content as opposed to the replacement content.
    Type: Application
    Filed: June 1, 2018
    Publication date: December 5, 2019
    Inventors: Sudeng Hu, Dazhong Zhang, Xing Wen, Peikang Song, Jae Hoon Kim, Hang Yuan, Xiaosong Zhou, Hsi-Jung Wu, Jingteng Xue
  • Publication number: 20190373270
    Abstract: Systems and methods for coding a video to be overlaid by annotations are devised. A motion compensated predictive coding is employed, wherein coding parameters of video pixel blocks are determined based on the pixel blocks' relation to the annotations. A decoder decodes the video and annotates it based on metadata, obtained from the coder or other sources, describing the annotations' appearance and rendering mode.
    Type: Application
    Filed: May 23, 2019
    Publication date: December 5, 2019
    Inventors: Sudeng HU, Xing WEN, Jae Hoon KIM, Peikang SONG, Hang YUAN, Dazhong ZHANG, Xiaosong ZHOU, Hsi-Jung WU, Christopher GARRIDO, Ming JIN, Patrick MIAUTON, Karthick SANTHANAM
  • Publication number: 20190349509
    Abstract: Systems and methods are provided for capturing high quality video data, including data having a high dynamic range, for use with conventional encoders and decoders. High dynamic range data is captured using multiple groups of pixels where each group is captured using different exposure times to create groups of pixels. The pixels that are captured at different exposure times may be determined adaptively based on the content of the image, the parameters of the encoding system, or on the available resources within the encoding system. The transition from single exposure to using two different exposure times may be implemented gradually.
    Type: Application
    Filed: May 17, 2019
    Publication date: November 14, 2019
    Inventors: Jiefu ZHAI, Xiaosong ZHOU, Chris Y. CHUNG, Hsi-Jung WU
  • Publication number: 20190342351
    Abstract: Techniques presented herein provide an improved relay user experience and improved management of scarce computing and network resources as the number of relay endpoints increases. A sourcing endpoint device may generate a media feed, such as video and/or audio feed, representing contribution from a conference participant. The sourcing endpoint device may generate a priority value for the media feed, and the priority value may be transmitted to other members of the relay along with the input feed. Priority values of the different relay participants may be used by other devices, for example, intermediate servers or receiving endpoint devices, to manage aspects of the relay. For example, a relay server may prune streams from select endpoint devices based on relative priority values received from those devices. Alternatively, receiving endpoint devices may alter presentation of received feeds based on their associated priority values.
    Type: Application
    Filed: May 7, 2019
    Publication date: November 7, 2019
    Inventors: Christopher M. Garrido, Dazhong Zhang, Karthick Santhanam, Patrick Miauton, Xiaoxiao Zheng, Bess Chan, Peter Shiang, Sudeng Hu, Peikang Song, Xiaosong Zhou
  • Patent number: 10466809
    Abstract: Embodiments of the present invention generate estimates of device motion from two data sources on a computing device—a motion sensor and a camera. The device may compare the estimates to each other to determine if they agree. If they agree, the device may confirm that device motion estimates based on the motion sensor are accurate and may output those estimates to an application within the device. If the device motion estimates disagree, the device may alter the motion estimates obtained from the motion sensor before outputting them to the application.
    Type: Grant
    Filed: October 20, 2017
    Date of Patent: November 5, 2019
    Assignee: Apple Inc.
    Inventors: Yao-Chung Lin, Xiaosong Zhou, Hsi-Jung Wu
  • Patent number: 10462420
    Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.
    Type: Grant
    Filed: October 5, 2017
    Date of Patent: October 29, 2019
    Assignee: APPLE INC.
    Inventors: Elizabeth C. Cranfill, Stephen O. Lemay, Joe S. Abuan, Hsi-Jung Wu, Xiaosong Zhou, Roberto Garcia, Jr.
  • Patent number: 10452713
    Abstract: Systems and processes for improved video editing, summarization and navigation based on generation and analysis of metadata are described. The metadata may be content-based (e.g., differences between neighboring frames, exposure data, key frame identification data, motion data, or face detection data) or non-content-based (e.g., exposure, focus, location, time) and used to prioritize and/or classify portions of video. The metadata may be generated at the time of image capture or during post-processing. Prioritization information, such as a score for various portions of the image data may be based on the metadata and/or image data. Classification information such as the type or quality of a scene may be determined based on the metadata and/or image data. The classification and prioritization information may be metadata and may be used to automatically remove undesirable portions of the video, generate suggestions during editing or automatically generate summary video.
    Type: Grant
    Filed: December 3, 2014
    Date of Patent: October 22, 2019
    Assignee: Apple Inc.
    Inventors: Shujie Liu, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu, Chris Y. Chung, James O. Normile, Douglas S. Price, Albert E. Keinath
  • Publication number: 20190306524
    Abstract: Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.
    Type: Application
    Filed: March 28, 2018
    Publication date: October 3, 2019
    Inventors: Xing Wen, Dazhong Zhang, Peikang Song, Xiaosong Zhou, Sudeng Hu, Hsi-Jung Wu, Jae Hoon Kim
  • Patent number: 10432946
    Abstract: Judder artifacts are remedied in video coding system by employing frame rate conversion at an encoder. A source video sequence may be coded as base layer coded video at a first frame rate. An encoder may identify a portion of the coded video sequence that likely will exhibit judder effects when decoded. For those portions that likely will exhibit judder effects, video data representing the portion of the source video may be coded at a higher frame rate than a frame rate of the coded base layer data as enhancement layer data. Moreover, an encoder may generate metadata representing “FRC hints”—techniques that a decoder should employ when performing decoder-side frame rate conversion. An encoding terminal may transmit the base layer coded video and either the enhancement layer coded video or the FRC hints to a decoder. Thus, encoder infrastructure may mitigate against judder artifacts that may arise during decoding.
    Type: Grant
    Filed: December 10, 2015
    Date of Patent: October 1, 2019
    Assignee: Apple Inc.
    Inventors: Yeping Su, Chris Y. Chung, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
  • Publication number: 20190279681
    Abstract: Techniques are presented for modifying images of an object in video, for example to correct for lens distortion, or to beautify a face. These techniques include extracting and validating features of an object from a source video frame, tracking those features over time, estimating a pose of the object, modifying a 3D model of the object based on the features, and rendering a modified video frame based on the modified 3D model and modified intrinsic and extrinsic matrices. These techniques may be applied in real-time to an object in a sequence of video frames.
    Type: Application
    Filed: March 9, 2018
    Publication date: September 12, 2019
    Inventors: Hang Yuan, Jiefu Zhai, Ming Chen, Jae Hoon Kim, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 10402226
    Abstract: A system for processing media on a resource restricted device, the system including a memory to store data representing media assets and associated descriptors, and program instructions representing an application and a media processing system, and a processor to execute the program instructions, wherein the program instructions represent the media processing system, in response to a call from an application defining a plurality of services to be performed on an asset, determine a tiered schedule of processing operations to be performed upon the asset based on a processing budget associated therewith, and iteratively execute the processing operations on a tier-by-tier basis, unless interrupted.
    Type: Grant
    Filed: June 3, 2016
    Date of Patent: September 3, 2019
    Assignee: Apple Inc.
    Inventors: Albert Keinath, Ke Zhang, Yunfei Zheng, Shujie Liu, Jiefu Zhai, Chris Y. Chung, Xiaosong Zhou, Hsi-Jung Wu
  • Patent number: 10402677
    Abstract: Techniques are disclosed for estimating quality of images in an automated fashion. According to these techniques, a source image may be downsampled to generate at least two downsampled images at different levels of downsampling. Blurriness of the images may be estimated starting with a most-heavily downsampled image. Blocks of a given image may be evaluated for blurriness and, when a block of a given image is estimated to be blurry, the block of the image and co-located blocks of higher resolution image(s) may be designated as blurry. Thereafter, a blurriness score may be calculated for the source image from the number of blocks of the source image designated as blurry.
    Type: Grant
    Filed: June 9, 2017
    Date of Patent: September 3, 2019
    Assignee: Apple Inc.
    Inventors: Jiefu Zhai, Ke Zhang, Yunfei Zheng, Shujie Liu, Albert Keinath, Xiaosong Zhou, Chris Chung, Hsi-Jung Wu
  • Patent number: 10382830
    Abstract: System and methods for improved playback of a video stream are presented. Video snippets are identified that include a number of consecutive frames for playback. Snippets may be evenly temporally spaced in the video stream or may be content adaptive. Then the first frame of a snippet may be selected as the first frame of a scene or other appropriate stopping point. Scene detection, object detection, motion detection, video metadata, or other information generated during encoding or decoding of the video stream may aid in appropriate snippet selection.
    Type: Grant
    Filed: October 18, 2016
    Date of Patent: August 13, 2019
    Assignee: Apple Inc.
    Inventors: David L. Biderman, Xiaosong Zhou, Douglas Scott Price, Yeping Su, Chris Y. Chung, Hsi-Jung Wu, Zhenheng Li, John Su, James D. Batson
  • Publication number: 20190246141
    Abstract: Image processing techniques may accelerate coding of viewport data contained within multi-view image data. According to such techniques, an encoder may shifting content of a multi-directional image data according to the viewport location data provided by a decoder. The encoder may code the shifted multi-directional image data by predictive coding, and transmit to the decoder, the coded multi-directional image data and data identifying an amount of the shift. Doing so may move the viewport location to positions in the image data that are coded earlier than the positions that the viewport location naturally occupies and, thereby, may accelerate coding. On decode, a decoder may compare its present viewport location with viewport location data provided by the encoder with coded video data. The decoder may decode the coded video data and extract a portion of the decoded video data corresponding to a present viewport location for display.
    Type: Application
    Filed: February 5, 2018
    Publication date: August 8, 2019
    Inventors: Jae Hoon Kim, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Ming Chen, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu