Patents by Inventor Jiefu Zhai

Jiefu Zhai has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Neural network based residual coding and prediction for predictive coding

Patent number: 12192440

Abstract: Systems and methods disclosed for video compression, utilizing neural networks for predictive video coding. Processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.

Type: Grant

Filed: January 4, 2022

Date of Patent: January 7, 2025

Assignee: APPLE INC.

Inventors: Jiefu Zhai, Xingyu Zhang, Xiaosong Zhou, Jun Xin, Hsi-Jung Wu, Yeping Su
ADAPTIVE CODING AND STREAMING OF MULTI-DIRECTIONAL VIDEO

Publication number: 20240397119

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Application

Filed: August 7, 2024

Publication date: November 28, 2024

Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
Adaptive coding and streaming of multi-directional video

Patent number: 12096044

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Grant

Filed: March 9, 2023

Date of Patent: September 17, 2024

Assignee: APPLE INC.

Inventors: Xiaohua Yang, Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
Object and keypoint detection system with low spatial jitter, low latency and low power usage

Patent number: 11847823

Abstract: Video object and keypoint location detection techniques are presented. The system includes a detection system for generation locations of an object's keypoints along with probabilities associated with the locations, and a stability system for stabilizing keypoint locations of the detected objects. In some aspects, the generated probabilities are two-dimensional array correspond locations within input images, and stability system fits the generated probabilities to a two-dimensional probability distribution function.

Type: Grant

Filed: June 4, 2021

Date of Patent: December 19, 2023

Assignee: APPLE INC.

Inventors: Xiaoxia Sun, Jiefu Zhai, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu
VIDEO CLASSIFICATION AND SEARCH SYSTEM TO SUPPORT CUSTOMIZABLE VIDEO HIGHLIGHTS

Publication number: 20230394081

Abstract: A video classification, indexing, and retrieval system is disclosed that classifies and retrieves video along multiple indexing dimensions. A search system may field queries identifying desired parameters of video, search an indexed database for videos that match the query parameters, and create clips extracted from responsive videos that are provided in response. In this manner, different queries may cause different clips to be created from a single video, each clip tailored to the parameters of the query that is received.

Type: Application

Filed: June 1, 2023

Publication date: December 7, 2023

Inventors: Shujie LIU, Xiaosong ZHOU, Hsi-Jung WU, Jiefu ZHAI, Ke ZHANG, Ming CHEN
ANALYTIC- AND APPLICATION-AWARE VIDEO DERIVATIVE GENERATION TECHNIQUES

Publication number: 20230396819

Abstract: A video delivery system generates and stores reduced bandwidth videos from source video. The system may include a track generator that executes functionality of application(s) to be used at sink devices, in which the track generator generates tracks from execution of the application(s) on source video and generates tracks having a reduced data size as compared to the source video. The track generator may execute a first instance of application functionality on the source video, which identifies region(s) of interest from the source video. The track generator further may downsample the source video according to downsampling parameters, and execute a second instance of application functionality on the downsampled video. The track generator may determine, from a comparison of outputs from the first and second instances of the application, whether the output from the second instance of application functionality is within an error tolerance of the output from the first instance of application functionality.

Type: Application

Filed: June 1, 2023

Publication date: December 7, 2023

Inventors: Ke ZHANG, Xiaoxia SUN, Shujie LIU, Xiaosong ZHOU, Jian LI, Xun SHI, Jiefu ZHAI, Albert E KEINATH, Hsi-Jung WU, Jingteng XUE, Xingyu ZHANG, Jun XIN
Systems and methods for perspective shifting in video conferencing session

Patent number: 11818502

Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.

Type: Grant

Filed: June 22, 2022

Date of Patent: November 14, 2023

Assignee: APPLE INC.

Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
Sphere projected motion estimation/compensation and mode decision

Patent number: 11818394

Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.

Type: Grant

Filed: March 19, 2021

Date of Patent: November 14, 2023

Assignee: APPLE INC.

Inventors: Jae Hoon Kim, Xiaosong Zhou, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung, Hsi-Jung Wu
ADAPTIVE CODING AND STREAMING OF MULTI-DIRECTIONAL VIDEO

Publication number: 20230269400

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Application

Filed: March 9, 2023

Publication date: August 24, 2023

Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
Modular Machine Learning Architecture

Publication number: 20230147442

Abstract: In an example method, a system accesses first input data and a machine learning architecture. The machine learning architecture includes a first module having a first neural network, a second module having a second neural network, and a third module having a third neural network. The system generates a first feature set representing a first portion of the first input data using the first neural network, and a second feature set representing a second portion of the first input data using the second neural network. The system generates, using the third neural network, first output data based on the first feature set and the second feature set.

Type: Application

Filed: June 3, 2022

Publication date: May 11, 2023

Inventors: Shujie Liu, Jiefu Zhai, Xiaosong Zhou, Hsi-Jung Wu, Ke Zhang, Xiaoxia Sun, Jian Li
Adaptive coding and streaming of multi-directional video

Patent number: 11627343

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Grant

Filed: March 1, 2021

Date of Patent: April 11, 2023

Assignee: APPLE INC.

Inventors: Xiaohua Yang, Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
HYBRID NEURAL NETWORK BASED END-TO-END IMAGE AND VIDEO CODING METHOD

Publication number: 20230096567

Abstract: Improved neural-network-based image and video coding techniques are presented, including hybrid techniques that include both tools of a host codec and neural-network-based tools. In these improved techniques, the host coding tools may include conventional video coding standards such H.266 (VVC). In an aspects, source frames may be partitioned and either host or neural-network-based tools may be selected per partition. Coding parameter decisions for a partition may be constrained based on the partitioning and coding tool selection. Rate control for host and neural network tools may be combined. Multi-stage processing of neural network output may use a checkerboard prediction pattern.

Type: Application

Filed: September 23, 2022

Publication date: March 30, 2023

Inventors: Alican NALCI, Alexandros TOURAPIS, Hsi-Jung WU, Jiefu ZHAI, Jingteng XUE, Jun XIN, Mei GUO, Xingyu ZHANG, Yeqing WU, Yunfei ZHENG, Jean Begaint
SYSTEMS AND METHODS FOR PERSPECTIVE SHIFTING IN VIDEO CONFERENCING SESSION

Publication number: 20220329756

Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.

Type: Application

Filed: June 22, 2022

Publication date: October 13, 2022

Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
Systems and methods for perspective shifting in video conferencing session

Patent number: 11394921

Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.

Type: Grant

Filed: March 10, 2017

Date of Patent: July 19, 2022

Assignee: Apple Inc.

Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
Signal generation for LED/LCD-based high dynamic range displays

Patent number: 11380270

Abstract: A method of operating a high dynamic range display device comprises the steps of: accessing an image signal; generating an intermediate backlighting driver signal for individual backlight elements for a backlighting unit responsive to the image signal; convoluting the intermediate backlighting driver signals with a point spread function of the backlighting unit; deriving at least one new backlighting driver signal responsive to the convoluting step; determining display error associated with a plurality of available light shutter signals of a front-end unit having individual light shutters and associated with the at least one new backlighting driver signal, the front-end unit having a higher resolution than the backlighting unit; driving the display device with a combination of shutter signals and new backlighting driver signals that causes a reduction in the display error with respect to other generated intermediate backlighting driver signals and other available light shutter signals.

Type: Grant

Filed: February 9, 2010

Date of Patent: July 5, 2022

Assignee: INTERDIGITAL MADISON PATENT HOLDINGS

Inventors: Jiefu Zhai, Joan Llach
NEURAL NETWORK BASED RESIDUAL CODING AND PREDICTION FOR PREDICTIVE CODING

Publication number: 20220191473

Abstract: Systems and methods disclosed for video compression, utilizing neural networks for predictive video coding. Processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.

Type: Application

Filed: January 4, 2022

Publication date: June 16, 2022

Inventors: Jiefu ZHAI, Xingyu ZHANG, Xiaosong ZHOU, Jun XIN, Hsi-Jung WU, Yeping SU
Real-time face and object manipulation

Patent number: 11282543

Abstract: Techniques are presented for modifying images of an object in video, for example to correct for lens distortion, or to beautify a face. These techniques include extracting and validating features of an object from a source video frame, tracking those features over time, estimating a pose of the object, modifying a 3D model of the object based on the features, and rendering a modified video frame based on the modified 3D model and modified intrinsic and extrinsic matrices. These techniques may be applied in real-time to an object in a sequence of video frames.

Type: Grant

Filed: March 9, 2018

Date of Patent: March 22, 2022

Assignee: Apple Inc.

Inventors: Hang Yuan, Jiefu Zhai, Ming Chen, Jae Hoon Kim, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
Processing of equirectangular object data to compensate for distortion by spherical projections

Patent number: 11259046

Abstract: Methods and Systems disclosed to counteract spatial distortions introduced by imaging processes of multi-directional video frames, where objects may be projected to spherical or equirectangular representations. Techniques provided to invert the spatial distortions in video frames used as reference picture data in predictive coding, by spatially transforming the image content of the reference picture data before this image content is being used for the prediction of input video data in prediction-based coders and decoders.

Type: Grant

Filed: February 15, 2017

Date of Patent: February 22, 2022

Assignee: Apple Inc.

Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jiefu Zhai, Xiaosong Zhou
Neural network based residual coding and prediction for predictive coding

Patent number: 11240492

Abstract: Systems and methods disclosed for video compression, utilizing neural networks for predictive video coding. Processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.

Type: Grant

Filed: January 22, 2019

Date of Patent: February 1, 2022

Assignee: Apple Inc.

Inventors: Jiefu Zhai, Xingyu Zhang, Xiaosong Zhou, Jun Xin, Hsi-Jung Wu, Yeping Su
OBJECT AND KEYPOINT DETECTION SYSTEM WITH LOW SPATIAL JITTER, LOW LATENCY AND LOW POWER USAGE

Publication number: 20210397826

Abstract: Video object and keypoint location detection techniques are presented. The system includes a detection system for generation locations of an object's keypoints along with probabilities associated with the locations, and a stability system for stabilizing keypoint locations of the detected objects. In some aspects, the generated probabilities are two-dimensional array correspond locations within input images, and stability system fits the generated probabilities to a two-dimensional probability distribution function.

Type: Application

Filed: June 4, 2021

Publication date: December 23, 2021

Inventors: Xiaoxia SUN, Jiefu ZHAI, Ke ZHANG, Xiaosong ZHOU, Hsi-Jung WU

1 2 3 4 5 … next