Patents by Inventor Hsi-Jung Wu

Hsi-Jung Wu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10432961
    Abstract: A video coding system may include an encoder performs motion-compensated prediction on a video signal in a second format converted from an input format of the video signal. The video coding system may also include a decoder to decode portions of the encoded video, and a filtering system that filters portions of the decoded video, for example, by deblocking filtering or SAO filtering, using parameters derived from the video signal in the input format. A prediction system may include another format converter that converts the decoded video to the input format. The prediction system may select parameters of the motion-compensated prediction based at least in part on a comparison of the video signal in the input format to decoded video in the input format.
    Type: Grant
    Filed: March 10, 2016
    Date of Patent: October 1, 2019
    Assignee: Apple Inc.
    Inventors: Alexandros Michael Tourapis, Yeping Su, David William Singer, Hsi-Jung Wu
  • Patent number: 10432946
    Abstract: Judder artifacts are remedied in video coding system by employing frame rate conversion at an encoder. A source video sequence may be coded as base layer coded video at a first frame rate. An encoder may identify a portion of the coded video sequence that likely will exhibit judder effects when decoded. For those portions that likely will exhibit judder effects, video data representing the portion of the source video may be coded at a higher frame rate than a frame rate of the coded base layer data as enhancement layer data. Moreover, an encoder may generate metadata representing “FRC hints”—techniques that a decoder should employ when performing decoder-side frame rate conversion. An encoding terminal may transmit the base layer coded video and either the enhancement layer coded video or the FRC hints to a decoder. Thus, encoder infrastructure may mitigate against judder artifacts that may arise during decoding.
    Type: Grant
    Filed: December 10, 2015
    Date of Patent: October 1, 2019
    Assignee: Apple Inc.
    Inventors: Yeping Su, Chris Y. Chung, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
  • Publication number: 20190297338
    Abstract: A method of adaptive chroma downsampling is presented. The method comprises converting a source image to a converted image in an output color format, applying a plurality of downsample filters to the converted image and estimating a distortion for each filter chose the filter that produces the minimum distortion. The distortion estimation includes applying an upsample filter, and a pixel is output based on the chosen filter. Methods for closed loop conversions are also presented.
    Type: Application
    Filed: June 7, 2019
    Publication date: September 26, 2019
    Inventors: Alexandros Michael Tourapis, Yeping Su, David William Singer, Hsi-Jung Wu
  • Publication number: 20190279681
    Abstract: Techniques are presented for modifying images of an object in video, for example to correct for lens distortion, or to beautify a face. These techniques include extracting and validating features of an object from a source video frame, tracking those features over time, estimating a pose of the object, modifying a 3D model of the object based on the features, and rendering a modified video frame based on the modified 3D model and modified intrinsic and extrinsic matrices. These techniques may be applied in real-time to an object in a sequence of video frames.
    Type: Application
    Filed: March 9, 2018
    Publication date: September 12, 2019
    Inventors: Hang Yuan, Jiefu Zhai, Ming Chen, Jae Hoon Kim, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 10402677
    Abstract: Techniques are disclosed for estimating quality of images in an automated fashion. According to these techniques, a source image may be downsampled to generate at least two downsampled images at different levels of downsampling. Blurriness of the images may be estimated starting with a most-heavily downsampled image. Blocks of a given image may be evaluated for blurriness and, when a block of a given image is estimated to be blurry, the block of the image and co-located blocks of higher resolution image(s) may be designated as blurry. Thereafter, a blurriness score may be calculated for the source image from the number of blocks of the source image designated as blurry.
    Type: Grant
    Filed: June 9, 2017
    Date of Patent: September 3, 2019
    Assignee: Apple Inc.
    Inventors: Jiefu Zhai, Ke Zhang, Yunfei Zheng, Shujie Liu, Albert Keinath, Xiaosong Zhou, Chris Chung, Hsi-Jung Wu
  • Patent number: 10402226
    Abstract: A system for processing media on a resource restricted device, the system including a memory to store data representing media assets and associated descriptors, and program instructions representing an application and a media processing system, and a processor to execute the program instructions, wherein the program instructions represent the media processing system, in response to a call from an application defining a plurality of services to be performed on an asset, determine a tiered schedule of processing operations to be performed upon the asset based on a processing budget associated therewith, and iteratively execute the processing operations on a tier-by-tier basis, unless interrupted.
    Type: Grant
    Filed: June 3, 2016
    Date of Patent: September 3, 2019
    Assignee: Apple Inc.
    Inventors: Albert Keinath, Ke Zhang, Yunfei Zheng, Shujie Liu, Jiefu Zhai, Chris Y. Chung, Xiaosong Zhou, Hsi-Jung Wu
  • Patent number: 10382750
    Abstract: A method for decoding a compressed video data sequence containing one or more coded pixel blocks. The compressed video sequence is buffered. Prediction information for each of the coded pixel blocks is reviewed. One or more groups of coded pixel blocks are formed based on the reviewed prediction information such that the coded pixel blocks within a given group have similar prediction dependencies and/or at least do not depend on a reconstructed pixel within a group of received pixel blocks to enable parallel decoding. The formed groups are scheduled for processing and subsequently decoded to produce a decoded video data sequence.
    Type: Grant
    Filed: June 9, 2017
    Date of Patent: August 13, 2019
    Assignee: Apple Inc.
    Inventors: Thomas Pun, Paul Chang, Hsi-Jung Wu
  • Patent number: 10382830
    Abstract: System and methods for improved playback of a video stream are presented. Video snippets are identified that include a number of consecutive frames for playback. Snippets may be evenly temporally spaced in the video stream or may be content adaptive. Then the first frame of a snippet may be selected as the first frame of a scene or other appropriate stopping point. Scene detection, object detection, motion detection, video metadata, or other information generated during encoding or decoding of the video stream may aid in appropriate snippet selection.
    Type: Grant
    Filed: October 18, 2016
    Date of Patent: August 13, 2019
    Assignee: Apple Inc.
    Inventors: David L. Biderman, Xiaosong Zhou, Douglas Scott Price, Yeping Su, Chris Y. Chung, Hsi-Jung Wu, Zhenheng Li, John Su, James D. Batson
  • Publication number: 20190246141
    Abstract: Image processing techniques may accelerate coding of viewport data contained within multi-view image data. According to such techniques, an encoder may shifting content of a multi-directional image data according to the viewport location data provided by a decoder. The encoder may code the shifted multi-directional image data by predictive coding, and transmit to the decoder, the coded multi-directional image data and data identifying an amount of the shift. Doing so may move the viewport location to positions in the image data that are coded earlier than the positions that the viewport location naturally occupies and, thereby, may accelerate coding. On decode, a decoder may compare its present viewport location with viewport location data provided by the encoder with coded video data. The decoder may decode the coded video data and extract a portion of the decoded video data corresponding to a present viewport location for display.
    Type: Application
    Filed: February 5, 2018
    Publication date: August 8, 2019
    Inventors: Jae Hoon Kim, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Ming Chen, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Publication number: 20190246114
    Abstract: The present disclosure describes techniques for coding and decoding video in which a plurality of coding hypotheses are developed for an input pixel block of frame content. Each coding hypothesis may include generation of prediction data for the input pixel block according to a respective prediction search. The input pixel block may be coded with reference to a prediction block formed from prediction data derived according to plurality of hypotheses. Data of the coded pixel block may be transmitted to a decoder along with data identifying a number of the hypotheses used during the coding to a channel. At a decoder, an inverse process may be performed, which may include generation of a counterpart prediction block from prediction data derived according to the hypothesis identified with the coded pixel block data, then decoding of the coded pixel block according to the prediction data.
    Type: Application
    Filed: January 25, 2019
    Publication date: August 8, 2019
    Inventors: Alexandros Michael TOURAPIS, Yeping SU, David SINGER, Hsi-Jung WU
  • Patent number: 10366497
    Abstract: Techniques for cropping images containing an occlusion are presented. A method for image editing is presented comprising, when an occlusion is detected in an original digital image, determining an area occupied by the occlusion, assigning importance scores to different content elements of the original digital image, defining a cropping window around an area of the original digital image that does not include the area occupied by the occlusion and that is based on the importance scores, and cropping the original digital image to the cropping window.
    Type: Grant
    Filed: June 9, 2017
    Date of Patent: July 30, 2019
    Assignee: Apple Inc.
    Inventors: Ke Zhang, Jiefu Zhai, Yunfei Zheng, Shujie Liu, Albert E. Keinath, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 10349064
    Abstract: A method of adaptive chroma downsampling is presented. The method comprises converting a source image to a converted image in an output color format, applying a plurality of downsample filters to the converted image and estimating a distortion for each filter chose the filter that produces the minimum distortion. The distortion estimation includes applying an upsample filter, and a pixel is output based on the chosen filter. Methods for closed loop conversions are also presented.
    Type: Grant
    Filed: March 10, 2016
    Date of Patent: July 9, 2019
    Assignee: Apple Inc.
    Inventors: Alexandros Michael Tourapis, Yeping Su, David William Singer, Hsi-Jung Wu
  • Publication number: 20190156520
    Abstract: A system comprises an encoder configured to compress attribute information and/or spatial for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image based representation of a point cloud. A block/sub-block organization scheme is used to encode blocks and sub-blocks of an occupancy map used in compressing the point cloud. Binary values are assigned to blocks/sub-blocks based on whether they contain patches projected on the point cloud. A traversal path is chosen that takes advantage of run-length encoding strategies to reduce a size of an encoded occupancy map. Also, auxiliary information is used to further improve occupancy map compression.
    Type: Application
    Filed: November 21, 2018
    Publication date: May 23, 2019
    Applicant: Apple Inc.
    Inventors: Khaled Mammou, Yeping Su, David W. Singer, Alexandros Tourapis, Fabrice A. Robinet, Hsi-Jung Wu
  • Publication number: 20190156518
    Abstract: A system comprises an encoder configured to compress attribute information and/or spatial for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image based representation of a point cloud. A closed-loop color conversion process is used to improve compression while taking into consideration distortion introduced throughout the point cloud compression process.
    Type: Application
    Filed: November 21, 2018
    Publication date: May 23, 2019
    Applicant: Apple Inc.
    Inventors: Khaled Mammou, Yeping Su, David W. Singer, Alexandros Tourapis, Fabrice A. Robinet, Hsi-Jung Wu
  • Publication number: 20190156519
    Abstract: A system comprises an encoder configured to compress attribute information and/or spatial for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image based representation of a point cloud. The encoder is configured project the point cloud on to patch planes to compress the point cloud, and supports multiple layered patch planes. For example, some point clouds may have a depth, and points at different depths may be assigned to different layered patch planes.
    Type: Application
    Filed: November 21, 2018
    Publication date: May 23, 2019
    Applicant: Apple Inc.
    Inventors: Khaled Mammou, Yeping Su, David W. Singer, Alexandros Tourapis, Fabrice A. Robinet, Hsi-Jung Wu
  • Patent number: 10298854
    Abstract: Systems and methods are provided for capturing high quality video data, including data having a high dynamic range, for use with conventional encoders and decoders. High dynamic range data is captured using multiple groups of pixels where each group is captured using different exposure times to create groups of pixels. The pixels that are captured at different exposure times may be determined adaptively based on the content of the image, the parameters of the encoding system, or on the available resources within the encoding system. The transition from single exposure to using two different exposure times may be implemented gradually.
    Type: Grant
    Filed: May 29, 2015
    Date of Patent: May 21, 2019
    Assignee: APPLE INC.
    Inventors: Jiefu Zhai, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 10282633
    Abstract: A method for processing media assets includes, given a first media asset, deriving characteristics from the first media asset, searching for other media assets having characteristics that correlate to the characteristics of the first media asset, when a match is found, deriving content corrections for the first media asset or a matching media asset from the other of the first media asset or the matching media asset, and correcting content of the first media asset or the matching media asset based on the content corrections.
    Type: Grant
    Filed: October 20, 2016
    Date of Patent: May 7, 2019
    Assignee: Apple Inc.
    Inventors: Shujie Liu, Jiefu Zhai, Chris Y. Chung, Hsi-Jung Wu, Yunfei Zheng, Albert E. Keinath, Xiaosong Zhou, Ke Zhang
  • Publication number: 20190132594
    Abstract: Techniques are disclosed for coding pixel blocks of an input frame in which coding costs of a plurality of candidate coding modes are estimated, in which estimates of distortion include an estimate of noise of each respective candidate coding mode. Thus, evaluation of each coding mode may involve searching for a prediction reference for a pixel block according to the mode, and estimating coding cost of the respective candidate coding mode based at least in part on an estimate of noise associated with the respective coding mode. A coding mode for the input pixel block may be selected based on a comparison of the coding costs of the candidate coding modes, and the input pixel block may be coded according to the selected coding mode. By estimating noise associated with the candidate coding modes and, in particular, employing techniques to maintain consistent levels of noise across different frames of a coded video sequence, it is expected that perceived quality of coding will be improved.
    Type: Application
    Filed: October 27, 2017
    Publication date: May 2, 2019
    Inventors: Chris Y. Chung, Mei Guo, Hsi-Jung Wu, Jingteng Xue, Jun Xin
  • Publication number: 20190104315
    Abstract: The present disclosure describes techniques for coding video data in a manner that provides consistency to portions of the video that have similar content. According to such techniques, a video sequence may be parsed into partitions and content of the partitions may be analyzed. Partitions may be grouped together based on detected similarities in content. Coding parameters may be selected for each partition based on the partition's membership in the groups. Thus, when the video sequence is coded, coding parameters for frames of two commonly-grouped partitions may be similar, which causes coded video data to have similar presentation.
    Type: Application
    Filed: October 4, 2017
    Publication date: April 4, 2019
    Inventors: Mei Guo, Jun Xin, Yeping Su, Chris Y. Chung, Xiaosong Zhou, Hsi-Jung Wu
  • Publication number: 20190075342
    Abstract: A video streaming method for transitioning between multiple sequences of coded video data may include receiving and decoding transmission units from a first sequence of coded video data. In response to a request to transition to a second sequence of coded video data, the method may determine whether a time to transition to the second sequence of coded video data can be reduced by transitioning to the second sequence of coded video data via an intermediate sequence of coded video data. If the time can be reduced, the method may include receiving at least one transmission unit from an intermediate sequence of coded video data that corresponds to the request to transition, decoding the transmission unit from the intermediate sequence, and transitioning from the first sequence to the second sequence via the decoded transmission unit from the intermediate sequence.
    Type: Application
    Filed: November 5, 2018
    Publication date: March 7, 2019
    Inventors: Yeping Su, Chris Y. Chung, Xiaosong Zhou, James Oliver Normile, Hsi-Jung Wu, Thomas Jansen, Hyeonkuk Jeong, Joe S. Abuan, Douglas Scott Price