Patents by Inventor Hsi-Jung Wu

Hsi-Jung Wu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10607373
    Abstract: A system comprises an encoder configured to compress attribute information and/or spatial for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image based representation of a point cloud. A closed-loop color conversion process is used to improve compression while taking into consideration distortion introduced throughout the point cloud compression process.
    Type: Grant
    Filed: November 21, 2018
    Date of Patent: March 31, 2020
    Assignee: Apple Inc.
    Inventors: Khaled Mammou, Yeping Su, David W. Singer, Alexandros Tourapis, Fabrice A. Robinet, Hsi-Jung Wu
  • Publication number: 20200099942
    Abstract: Support for additional components may be specified in a coding scheme for image data. A layer of a coding scheme that specifies color components may also specify additional components. Characteristics of the components may be specified in the same layer or a different layer of the coding scheme. An encoder or decoder may identify the specified components and determine the respective characteristics to perform encoding and decoding of image data.
    Type: Application
    Filed: September 25, 2019
    Publication date: March 26, 2020
    Applicant: Apple Inc.
    Inventors: Alexandros Tourapis, Yeping Su, Khaled Mammou, Jungsun Kim, David W. Singer, Fabrice A. Robinet, Guy Cote, Jim C. Chou, Hsi-Jung Wu
  • Patent number: 10602025
    Abstract: Image and video processing techniques are disclosed for processing components of a color space individually by determining limits for each component based on the relationship between each component in a color space. These limits may then be used to clip each component such that the component values are within the determined range for that component. In this manner, more efficient processing of images and/or video may be achieved.
    Type: Grant
    Filed: July 12, 2018
    Date of Patent: March 24, 2020
    Assignee: Apple Inc.
    Inventors: Alexandros Michael Tourapis, Yeping Su, David William Singer, Hsi-Jung Wu
  • Publication number: 20200092571
    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
    Type: Application
    Filed: September 14, 2018
    Publication date: March 19, 2020
    Inventors: Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
  • Patent number: 10575008
    Abstract: A method of managing resources on a terminal includes determining a number of downloaded video streams active at the terminal, prioritizing the active video streams, assigning a decoding quality level to each active video stream based on a priority assignment for each active video stream, and apportioning reception bandwidth to each active video stream based on an assigned quality level of each active video stream.
    Type: Grant
    Filed: May 31, 2016
    Date of Patent: February 25, 2020
    Assignee: APPLE INC.
    Inventors: Dazhong Zhang, Hsi-Jung Wu, Peikang Song, Yunfei Zheng, Chris Y. Chung, Jae Hoon Kim, Xiaosong Zhou
  • Patent number: 10574997
    Abstract: Systems and methods are disclosed for coding pixel blocks of an input frame in which coding costs of a plurality of candidate coding modes are estimated. A coding cost of a candidate coding mode may be estimated based on noise estimate associated with the candidate coding mode. A coding mode for the input pixel block may be selected based on a comparison of the estimated coding costs of the plurality of candidate coding modes. The input pixel block may then be coded according to the selected coding mode.
    Type: Grant
    Filed: October 27, 2017
    Date of Patent: February 25, 2020
    Assignee: APPLE INC.
    Inventors: Chris Y. Chung, Mei Guo, Hsi-Jung Wu, Jingteng Xue, Jun Xin
  • Publication number: 20200059628
    Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.
    Type: Application
    Filed: October 28, 2019
    Publication date: February 20, 2020
    Inventors: Elizabeth C. Cranfill, Stephen O. Lemay, Joe S. Abuan, Hsi-Jung Wu, Xiaosong Zhou, Roberto Garcia, JR.
  • Patent number: 10536731
    Abstract: Systems and methods are provided for processing high quality video data, such as data having a higher than standard bit depth, a high dynamic range, or a wide or custom color gamut, to be compatible with conventional encoders and decoders without significant loss of quality. High quality data is encoded into a plurality of layers with a base layer having the standard quality data and one or more higher quality layers. Decoding systems and methods may map the base layer to the dynamic range or color gamut of the enhancement layer, combine the layers, and map the combined layers to a dynamic range or color gamut appropriate for the target display. Each of the standard quality and the high quality data may be encoded as a plurality of tiers of increasing quality and reference lower level tiers as sources of prediction during predictive coding.
    Type: Grant
    Filed: May 12, 2015
    Date of Patent: January 14, 2020
    Assignee: Apple Inc.
    Inventors: Yeping Su, Jiefu Zhai, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu, Chris Y. Chung
  • Patent number: 10536726
    Abstract: In a video coding system, an encoder may include a coding engine to predictively code input video, a decoder to reconstruct reference pictures generated by the coding engine, a reference picture cache to store the reconstructed reference pictures, a patch cache to store prediction patches generated from other sources, and a prediction search unit to search among the reference picture cache and the patch cache to generate prediction references for use by the coding engine while coding input video. The prediction patches may be assembled from a variety of sources including: predefined image content, reference pictures being evicted from the reference picture cache, image content of prior coding sessions and image data stored by applications on a common terminal where the encoder resides. A decoder may store prediction patches in its own patch cache for synchronous decoding.
    Type: Grant
    Filed: May 3, 2012
    Date of Patent: January 14, 2020
    Assignee: APPLE INC.
    Inventors: Xiaosong Zhou, James Oliver Normile, Hsi-Jung Wu
  • Publication number: 20200014953
    Abstract: A system comprises an encoder configured to compress attribute and/or spatial information for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image based representation of a point cloud. In some embodiments, an encoder performs downscaling of an image frame prior to video encoding and a decoder performs upscaling of an image frame subsequent to video decoding.
    Type: Application
    Filed: July 5, 2019
    Publication date: January 9, 2020
    Applicant: Apple Inc.
    Inventors: Khaled Mammou, Yeping Su, Jungsun Kim, Valery G. Valentin, David W. Singer, Fabrice A. Robinet, Hsi-Jung Wu, Alexandros Tourapis
  • Patent number: 10523913
    Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
    Type: Grant
    Filed: June 30, 2017
    Date of Patent: December 31, 2019
    Assignee: APPLE INC.
    Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
  • Publication number: 20190373270
    Abstract: Systems and methods for coding a video to be overlaid by annotations are devised. A motion compensated predictive coding is employed, wherein coding parameters of video pixel blocks are determined based on the pixel blocks' relation to the annotations. A decoder decodes the video and annotates it based on metadata, obtained from the coder or other sources, describing the annotations' appearance and rendering mode.
    Type: Application
    Filed: May 23, 2019
    Publication date: December 5, 2019
    Inventors: Sudeng HU, Xing WEN, Jae Hoon KIM, Peikang SONG, Hang YUAN, Dazhong ZHANG, Xiaosong ZHOU, Hsi-Jung WU, Christopher GARRIDO, Ming JIN, Patrick MIAUTON, Karthick SANTHANAM
  • Publication number: 20190373276
    Abstract: Techniques are disclosed for managing reference frames for gradual coder refresh (GDR) operation. A GDR frame may be partitioned into a plurality of units, at least one of which is coded by instantaneous decoder refresh (IDR) techniques and other(s) of which are coded by other techniques such as inter-coding. The coded GDR frame may be exchanged between an encoder and a decoder. The encoder and decoder both may decode the GDR frame. The encoder and decoder may store the IDR-coded portion of the GDR frame in a reference picture buffer in a modified frame that includes, for the other portion(s) of the GDR frame, replacement content instead of the content obtained by decoding. The modified reference frame are expected by bias prediction search operations performed on later frame toward selection of the IDR-coded content as opposed to the replacement content.
    Type: Application
    Filed: June 1, 2018
    Publication date: December 5, 2019
    Inventors: Sudeng Hu, Dazhong Zhang, Xing Wen, Peikang Song, Jae Hoon Kim, Hang Yuan, Xiaosong Zhou, Hsi-Jung Wu, Jingteng Xue
  • Publication number: 20190356911
    Abstract: A method for decoding a compressed video data sequence containing one or more coded pixel blocks. The compressed video sequence is buffered. Prediction information for each of the coded pixel blocks is reviewed. One or more groups of coded pixel blocks are formed based on the reviewed prediction information such that the coded pixel blocks within a given group have similar prediction dependencies and/or at least do not depend on a reconstructed pixel within a group of received pixel blocks to enable parallel decoding. The formed groups are scheduled for processing and subsequently decoded to produce a decoded video data sequence.
    Type: Application
    Filed: August 1, 2019
    Publication date: November 21, 2019
    Inventors: Thomas Pun, Paul Chang, Hsi-Jung Wu
  • Publication number: 20190349509
    Abstract: Systems and methods are provided for capturing high quality video data, including data having a high dynamic range, for use with conventional encoders and decoders. High dynamic range data is captured using multiple groups of pixels where each group is captured using different exposure times to create groups of pixels. The pixels that are captured at different exposure times may be determined adaptively based on the content of the image, the parameters of the encoding system, or on the available resources within the encoding system. The transition from single exposure to using two different exposure times may be implemented gradually.
    Type: Application
    Filed: May 17, 2019
    Publication date: November 14, 2019
    Inventors: Jiefu ZHAI, Xiaosong ZHOU, Chris Y. CHUNG, Hsi-Jung WU
  • Patent number: 10477249
    Abstract: A video decoder system includes a video decoding engine, noise database, artifact estimator and post-processing unit. The video coder may generate recovered video from a data stream of coded video data, which may have visually-perceptible artifacts introduced as a byproduct of compression. The noise database may store a plurality of previously developed noise patches. The artifact estimator may estimate the location of coding artifacts present in the recovered video and select noise patches from the database to mask the artifacts and the post-processing unit may integrate the selected noise patches into the recovered video. In this manner, the video decoder may generate post-processed noise which may mask artifacts that otherwise would be generated by a video coding process.
    Type: Grant
    Filed: June 5, 2009
    Date of Patent: November 12, 2019
    Assignee: APPLE INC.
    Inventors: Yuxin Liu, Hsi-Jung Wu, Xiaojin Shi, Chris Yoochang Chung
  • Patent number: 10466809
    Abstract: Embodiments of the present invention generate estimates of device motion from two data sources on a computing deviceā€”a motion sensor and a camera. The device may compare the estimates to each other to determine if they agree. If they agree, the device may confirm that device motion estimates based on the motion sensor are accurate and may output those estimates to an application within the device. If the device motion estimates disagree, the device may alter the motion estimates obtained from the motion sensor before outputting them to the application.
    Type: Grant
    Filed: October 20, 2017
    Date of Patent: November 5, 2019
    Assignee: Apple Inc.
    Inventors: Yao-Chung Lin, Xiaosong Zhou, Hsi-Jung Wu
  • Patent number: 10462420
    Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.
    Type: Grant
    Filed: October 5, 2017
    Date of Patent: October 29, 2019
    Assignee: APPLE INC.
    Inventors: Elizabeth C. Cranfill, Stephen O. Lemay, Joe S. Abuan, Hsi-Jung Wu, Xiaosong Zhou, Roberto Garcia, Jr.
  • Patent number: 10452713
    Abstract: Systems and processes for improved video editing, summarization and navigation based on generation and analysis of metadata are described. The metadata may be content-based (e.g., differences between neighboring frames, exposure data, key frame identification data, motion data, or face detection data) or non-content-based (e.g., exposure, focus, location, time) and used to prioritize and/or classify portions of video. The metadata may be generated at the time of image capture or during post-processing. Prioritization information, such as a score for various portions of the image data may be based on the metadata and/or image data. Classification information such as the type or quality of a scene may be determined based on the metadata and/or image data. The classification and prioritization information may be metadata and may be used to automatically remove undesirable portions of the video, generate suggestions during editing or automatically generate summary video.
    Type: Grant
    Filed: December 3, 2014
    Date of Patent: October 22, 2019
    Assignee: Apple Inc.
    Inventors: Shujie Liu, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu, Chris Y. Chung, James O. Normile, Douglas S. Price, Albert E. Keinath
  • Publication number: 20190306524
    Abstract: Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.
    Type: Application
    Filed: March 28, 2018
    Publication date: October 3, 2019
    Inventors: Xing Wen, Dazhong Zhang, Peikang Song, Xiaosong Zhou, Sudeng Hu, Hsi-Jung Wu, Jae Hoon Kim