Patents by Inventor Hsi-Jung Wu

Hsi-Jung Wu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210360192
    Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.
    Type: Application
    Filed: May 27, 2021
    Publication date: November 18, 2021
    Inventors: Elizabeth C. Cranfill, Stephen O. Lemay, Joe S. Abuan, Hsi-Jung Wu, Xiaosong Zhou, Roberto Garcia, JR.
  • Patent number: 11166027
    Abstract: Techniques are disclosed by which a coding parameter is determined to encode video data resulting in encoded video data possessing a highest possible video quality. Features may be extracted from an input video sequence. The extracted features may be compared to features described in a model of coding parameters generated by a machine learning algorithm from reviews of previously-coded videos, extracted features of the previously-coded videos, and coding parameters of the previously-coded videos. When a match is detected between the extracted features of the input video sequence and extracted features represented in the model, a determination may be made as to whether coding parameters that correspond to the matching extracted feature correspond to a tier of service to which the input video sequence is to be coded.
    Type: Grant
    Filed: June 6, 2017
    Date of Patent: November 2, 2021
    Assignee: Apple Inc.
    Inventors: Yeping Su, Xingyu Zhang, Chris Chung, Jun Xin, Hsi-Jung Wu
  • Patent number: 11165989
    Abstract: Techniques are presented for managing for visual prominence of participants in a video conference, including conferences where participants communicate visually, such as with sign language. According to these techniques, a visual prominence indication of a participant in a video conference may be estimated, a video stream of the participant may be encoded, and the encoded video stream may be transmitted along with an indication of the estimated visual prominence to a receiving device in the video conference.
    Type: Grant
    Filed: November 20, 2019
    Date of Patent: November 2, 2021
    Assignee: Apple Inc.
    Inventors: Johnny Trenh, Hsi-Jung Wu, Sarah K. Herrlinger, Xiaoxia Sun, Ian J. Baird, Dazhong Zhang, Xiaosong Zhou, Christopher M. Garrido
  • Patent number: 11153594
    Abstract: Video compression and decompression techniques are disclosed that provide improved bandwidth control for video compression and decompression systems. In particular, video coding and decoding techniques quantize input video in multiple dimensions. According to these techniques, pixel residuals may be generated from a comparison of an array of input data to an array of prediction data. The pixel residuals may be quantized in a first dimension. After the quantization, the quantized pixel residuals may be transformed to an array of transform coefficients. The transform coefficients may be quantized in a second dimension and entropy coded. Decoding techniques invert these processes. In still other embodiments, multiple quantizers may be provided upstream of the transform stage, either in parallel or in cascade, which provide greater flexibility to video coders to quantize data in different dimensions in an effort to balance the competing interest in compression efficiency and quality of reconstructed video.
    Type: Grant
    Filed: August 29, 2016
    Date of Patent: October 19, 2021
    Assignee: Apple Inc.
    Inventors: Alexandros Tourapis, Yeping Su, David Singer, Hsi-Jung Wu
  • Publication number: 20210321133
    Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
    Type: Application
    Filed: March 19, 2021
    Publication date: October 14, 2021
    Inventors: Jae Hoon Kim, Xiaosong Zhou, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 11109042
    Abstract: Systems and methods for coding a video to be overlaid by annotations are devised. A motion compensated predictive coding is employed, wherein coding parameters of video pixel blocks are determined based on the pixel blocks' relation to the annotations. A decoder decodes the video and annotates it based on metadata, obtained from the coder or other sources, describing the annotations' appearance and rendering mode.
    Type: Grant
    Filed: May 23, 2019
    Date of Patent: August 31, 2021
    Assignee: Apple Inc.
    Inventors: Sudeng Hu, Xing Wen, Jae Hoon Kim, Peikang Song, Hang Yuan, Dazhong Zhang, Xiaosong Zhou, Hsi-Jung Wu, Christopher Garrido, Ming Jin, Patrick Miauton, Karthick Santhanam
  • Patent number: 11102515
    Abstract: Chroma deblock filtering of reconstructed video samples may be performed to remove blockiness artifacts and reduce color artifacts without over-smoothing. In a first method, chroma deblocking may be performed for boundary samples of a smallest transform size, regardless of partitions and coding modes. In a second method, chroma deblocking may be performed when a boundary strength is greater than 0. In a third method, chroma deblocking may be performed regardless of boundary strengths. In a fourth method, the type of chroma deblocking to be performed may be signaled in a slice header by a flag. Furthermore, luma deblock filtering techniques may be applied to chroma deblock filtering.
    Type: Grant
    Filed: June 2, 2020
    Date of Patent: August 24, 2021
    Assignee: Apple Inc.
    Inventors: Jiefu Zhai, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu, Peikang Song, David R. Conrad, Jae Hoon Kim, Yunfei Zheng
  • Publication number: 20210256735
    Abstract: A system comprises an encoder configured to compress attribute information and/or spatial for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image based representation of a point cloud. A processing/filtering element utilizes occupancy map information and/or auxiliary patch information to determine relationships between patches in image frames and adjusts encoding/decoding and/or filtering or pre/post-processing parameters based on the determined relationships.
    Type: Application
    Filed: May 3, 2021
    Publication date: August 19, 2021
    Applicant: Apple Inc.
    Inventors: Alexandros Tourapis, Khaled Mammou, Yeping Su, Jungsun Kim, Valery G. Valentin, David W. Singer, Fabrice A. Robinet, Hsi-Jung Wu
  • Patent number: 11093752
    Abstract: Techniques are disclosed for managing display of content from multi-view video data. According to these techniques, an object may be identified from content of the multi-view video. The object's location may be tracked across a sequence of multi-view video. The technique may extract a sub-set of video that is contained within a view window that is shifted in an image space of the multi-view video in correspondence to the tracked object's location. These techniques may be implemented either in an image source device or an image sink device.
    Type: Grant
    Filed: June 2, 2017
    Date of Patent: August 17, 2021
    Assignee: Apple Inc.
    Inventors: Jae Hoon Kim, Ming Chen, Hang Yuan, Jiefu Zhai, Dazhong Zhang, Xiaosong Zhou, Chris Chung, Hsi-Jung Wu
  • Publication number: 20210243451
    Abstract: Techniques are disclosed for deriving prediction pixel blocks for use in intra-coding video and combined inter- and intra-coding video. In a first aspect, the techniques may include deriving value(s) for pixel location(s) of the prediction pixel block by, when a prediction direction vector assigned to the prediction vector points to quadrants I or III of a Cartesian plane, deriving the pixel location's value from pixel values in two regions of previously-decoded pixel data intercepted by extending the prediction direction vector in two opposite directions through the pixel location. When the prediction direction vector points toward quadrants II of the Cartesian plane, deriving the pixel location's value from pixel values in one region intercepted by the prediction direction vector through the pixel location, and from a second region intercepted by a vector that is orthogonal to the prediction direction vector.
    Type: Application
    Filed: March 30, 2020
    Publication date: August 5, 2021
    Inventors: Mei Guo, Jun Xin, Hsi-Jung Wu, Alexandros Tourapis, Krishnakanth Rapaka
  • Publication number: 20210243409
    Abstract: Computing devices may implement instant video communication connections for video communications. Connection information for mobile computing devices may be maintained. A request to initiate an instant video communication may be received, and if authorized, the connection information for the particular recipient mobile computing device may be accessed. Video communication data may then be sent to the recipient mobile computing device according to the connection information so that the video communication data may be displayed at the recipient device as it is received. New connection information for different mobile computing devices may be added, or updates to existing connection information may also be performed. Connection information for some mobile computing devices may be removed.
    Type: Application
    Filed: February 12, 2021
    Publication date: August 5, 2021
    Inventors: Xiaosong Zhou, Hsi-Jung Wu, Chris Y. Chung, James Normile, Joe S. Abuan, Hyeonkuk Jeong, Yan Yang, Gobind Johar, Thomas Christopher Jansen
  • Publication number: 20210227236
    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
    Type: Application
    Filed: April 2, 2021
    Publication date: July 22, 2021
    Inventors: Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
  • Publication number: 20210185361
    Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.
    Type: Application
    Filed: March 1, 2021
    Publication date: June 17, 2021
    Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
  • Patent number: 11025933
    Abstract: Techniques are disclosed for managing memory allocations when coding video data according to multiple codec configurations. According to these techniques, devices may negotiate parameters of a coding session that include parameters of a plurality of different codec configurations that may be used during the coding session. A device may estimate sizes of decoded picture buffers for each of the negotiated codec configurations and allocate in its memory a portion of memory sized according to a largest size of the estimated decoded picture buffers. Thereafter, the devices may exchange coded video data. The exchange may involve decoding coded data of reference pictures and storing the decoded reference pictures in the allocated memory. During the coding session, the devices may toggle among the different negotiated codec configurations. As they do, reallocations of memory may be avoided.
    Type: Grant
    Filed: May 3, 2017
    Date of Patent: June 1, 2021
    Assignee: Apple Inc.
    Inventors: Dazhong Zhang, Hang Yuan, Peikang Song, Jae Hoon Kim, Xing Wen, Sudeng Hu, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 11025861
    Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.
    Type: Grant
    Filed: October 28, 2019
    Date of Patent: June 1, 2021
    Assignee: Apple Inc.
    Inventors: Elizabeth C. Cranfill, Stephen O. Lemay, Joe S. Abuan, Hsi-Jung Wu, Xiaosong Zhou, Roberto Garcia, Jr.
  • Patent number: 11017566
    Abstract: A system comprises an encoder configured to compress attribute information and/or spatial for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image based representation of a point cloud. A processing/filtering element utilizes occupancy map information and/or auxiliary patch information to determine relationships between patches in image frames and adjusts encoding/decoding and/or filtering or pre/post-processing parameters based on the determined relationships.
    Type: Grant
    Filed: July 1, 2019
    Date of Patent: May 25, 2021
    Assignee: Apple Inc.
    Inventors: Alexandros Tourapis, Khaled Mammou, Yeping Su, Jungsun Kim, Valery G. Valentin, David W. Singer, Fabrice A. Robinet, Hsi-Jung Wu
  • Patent number: 10999583
    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
    Type: Grant
    Filed: September 14, 2018
    Date of Patent: May 4, 2021
    Assignee: Apple Inc.
    Inventors: Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
  • Patent number: 10999602
    Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
    Type: Grant
    Filed: December 23, 2016
    Date of Patent: May 4, 2021
    Assignee: Apple Inc.
    Inventors: Jae Hoon Kim, Xiaosong Zhou, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung, Hsi-Jung Wu
  • Patent number: 10992919
    Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
    Type: Grant
    Filed: December 23, 2019
    Date of Patent: April 27, 2021
    Assignee: Apple Inc.
    Inventors: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
  • Patent number: 10972753
    Abstract: Techniques are disclosed for coding and delivering multi-view video in which the video is represented as a manifest file identifying a plurality of segments of the video available for download. The multi-view video may be partitioned spatially into a plurality of tiles that, in aggregate, encompass the entire spatial area of the video. The tiles are coded as segments contains coded video representing content contained within its respective tile. Tiles may be given different sizes based on saliency of the content within their respective regions. In this manner, tiles with high levels of interest may have relatively large spatial areas, which can lead to efficient coding in the presence of content motion.
    Type: Grant
    Filed: September 13, 2019
    Date of Patent: April 6, 2021
    Assignee: Apple Inc.
    Inventors: Fanyi Duanmu, Eduardo Asbun, Xiaosong Zhou, Jun Xin, Hsi-Jung Wu, John Su, Samir Gehani, Christopher Flick, Shalini Sahoo