Patents by Inventor Dazhong Zhang

Dazhong Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

BACKWARD REFERENCE UPDATING FOR VIDEO CODING

Publication number: 20250080759

Abstract: Techniques are disclosed for coding video in applications where regions of video are inactive on a frame to frame basis. According to the techniques, coding processes update and reconstruct only a subset of pixel blocks within a frame, while other pixel blocks are retained from a previously coded frame stored in a coder's or decoder's reference frame buffer. The technique is called Backward Reference Updating (or “BRU”) for convenience. At a desired pixel block granularity, based on the activity between a current frame to be coded and its reference frame(s), BRU will only perform prediction, transform, quantization, and reconstruction on selected regions that are determined to be active. The reconstructed pixels in these active regions are directly placed onto a specified reference frame in memory instead of creating a new frame. Therefore, fewer memory transfers need to be performed.

Type: Application

Filed: August 5, 2024

Publication date: March 6, 2025

Inventors: Guoxin JIN, Yeqing WU, Yunfei ZHENG, Dazhong ZHANG, Dzung T. HOANG, Felix C. FERNANDES, Van Luong PHAM, Jiancong LUO, Yixin DU, Hilmi Enes EGILMEZ, Alican NALCI, Jun XIN, Hsi-Jung WU
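
The in-place update described in the abstract above can be illustrated with a minimal sketch. It is not the patented method: the activity measure (sum of absolute differences), the `threshold` parameter, and the function name `bru_update` are assumptions made for illustration, and the prediction, transform, quantization, and reconstruction steps are replaced by a plain pixel copy.

```python
def bru_update(reference, current, block=2, threshold=0):
    """Overwrite only the active blocks of `reference`, in place.

    Frames are lists of rows of integer pixel values. Activity is
    measured by sum of absolute differences (SAD), which is an
    assumption; the abstract does not name a metric.
    """
    height, width = len(current), len(current[0])
    active = []
    for by in range(0, height, block):
        for bx in range(0, width, block):
            rows = range(by, min(by + block, height))
            cols = range(bx, min(bx + block, width))
            sad = sum(abs(current[y][x] - reference[y][x])
                      for y in rows for x in cols)
            if sad > threshold:
                # Stand-in for predict/transform/quantize/reconstruct:
                # write the current pixels straight into the reference frame.
                for y in rows:
                    for x in cols:
                        reference[y][x] = current[y][x]
                active.append((by, bx))
    return active


reference = [[0] * 4 for _ in range(4)]
current = [row[:] for row in reference]
current[0][0] = 9  # only the top-left block changes
active = bru_update(reference, current, block=2, threshold=0)
```

Because inactive blocks are never touched, no new frame is allocated and only the changed block is written back, which is the memory-transfer saving the abstract points to.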
Optimizing media experience in conferencing with diverse participants

Patent number: 12200401

Abstract: Participant systems in audiovisual (AV) conferencing can, in one embodiment, use methods to ensure that all participants have a common platform to support AV conferencing, such as a common codec, and also attempt to provide higher quality AV conferencing (e.g., better than the common codec) based on available bandwidth and other criteria. The participant systems can use the common codec as a fallback platform when bandwidth or other criteria dictate a reduction in the quality of the AV conferencing. Other embodiments are also disclosed.

Type: Grant

Filed: May 4, 2022

Date of Patent: January 14, 2025

Assignee: APPLE INC.

Inventors: Gurtej Singh G. Chandok, Christopher M. Garrido, Chieh Lu, Daniel B. Pollack, Karthick Santhanam, David L. Biderman, Jinbo Qui, Dazhong Zhang, Jose A. Lozano Hinojosa, Igor Kobzar
ADAPTIVE CODING AND STREAMING OF MULTI-DIRECTIONAL VIDEO

Publication number: 20240397119

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques are disclosed that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Application

Filed: August 7, 2024

Publication date: November 28, 2024

Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
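
The tile selection described in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the claimed technique: the viewport is reduced to a yaw angle, its future position is estimated by linear extrapolation, and the image is split into equal-width yaw tiles. The names `predict_yaw` and `tiers_for_tiles` are hypothetical.

```python
def predict_yaw(yaw_now, yaw_rate, lookahead):
    """Linearly extrapolate the viewport centre yaw in degrees, wrapped to [0, 360)."""
    return (yaw_now + yaw_rate * lookahead) % 360.0


def tiers_for_tiles(center_yaw, fov, num_tiles):
    """Return one tier per tile: 1 where the viewport overlaps the tile, 2 elsewhere."""
    tile_width = 360.0 / num_tiles
    tiers = []
    for i in range(num_tiles):
        tile_center = (i + 0.5) * tile_width
        # Shortest angular distance between tile centre and viewport centre.
        delta = abs((tile_center - center_yaw + 180.0) % 360.0 - 180.0)
        overlaps = delta < (fov + tile_width) / 2.0
        tiers.append(1 if overlaps else 2)
    return tiers


# Viewer at 350 degrees turning at 20 degrees/s, predicted one second ahead.
center = predict_yaw(350.0, 20.0, 1.0)
tiers = tiers_for_tiles(center, fov=90.0, num_tiles=4)
```

Here the predicted viewport straddles the 0 degree seam, so the first and last tiles are requested at the first (higher) tier and the two tiles behind the viewer at the second tier.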
Adaptive coding and streaming of multi-directional video

Patent number: 12096044

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques are disclosed that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Grant

Filed: March 9, 2023

Date of Patent: September 17, 2024

Assignee: APPLE INC.

Inventors: Xiaohua Yang, Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
MULTI-DEVICE COMMUNICATION MANAGEMENT

Publication number: 20240306046

Abstract: A device implementing the subject technology may include at least one processor configured to receive downlink condition reports from devices, each downlink condition report indicating a downlink channel condition of a respective device. The at least one processor is further configured to determine an uplink channel condition for each of the devices. The at least one processor is further configured to determine, for each respective device and based at least in part on the downlink condition reports and the uplink channel conditions, quality tiers, each of the quality tiers indicating a quality of at least one of an audio stream or a video stream. The at least one processor is further configured to provide for transmission, to each respective device, the quality tiers determined for the respective device.

Type: Application

Filed: May 20, 2024

Publication date: September 12, 2024

Inventors: Joe S. ABUAN, Ian J. BAIRD, Xiaosong ZHOU, Christopher M. GARRIDO, Dazhong ZHANG, Keith W. RAUENBUEHLER, Yan YANG, Patrick MIAUTON, Eric L. CHIEN, Berkat S. TUNG, Karthick SANTHANAM
Gaze-Based Copresence System

Publication number: 20240305682

Abstract: A technique for transmitting data in a copresence environment includes initiating a virtual communication session between a local device and remote devices in a shared copresence environment, where each of a plurality of sending devices is transmitting a sending quality data stream in the virtual communication session. A region of interest for the local device is determined that includes a portion of the copresence environment. The local device subscribes to a first quality data stream for the remote devices represented in the region of interest, and a second quality data stream for the remote devices not represented in the region of interest.

Type: Application

Filed: March 8, 2024

Publication date: September 12, 2024

Inventors: Jay Mayur Khandhar, Borna Ghavam, Jinbo Qiu, Christopher M. Garrido, Karthick Santhanam, Patrick Miauton, Xiaosong Zhou, Dazhong Zhang, Kristian D. Pereira, Dan Miao
MULTI-DEVICE COMMUNICATION MANAGEMENT

Publication number: 20240306047

Abstract: A device implementing the subject technology may include at least one processor configured to establish a group communication session for two or more electronic devices utilizing a first communication modality. The at least one processor may be further configured to determine to utilize a second communication modality for the group communication session. The at least one processor may be further configured to transition the group communication session from the first communication modality to the second communication modality.

Type: Application

Filed: May 20, 2024

Publication date: September 12, 2024

Inventors: Joe S. ABUAN, Ian J. BAIRD, Xiaosong ZHOU, Christopher M. GARRIDO, Dazhong ZHANG, Keith W. RAUENBUEHLER, Yan YANG, Patrick MIAUTON, Eric L. CHIEN, Berkat S. TUNG, Karthick SANTHANAM
Multi-device communication management

Patent number: 11991566

Abstract: A device implementing the subject technology may include at least one processor configured to transmit an allocation request requesting allocation of a group communication session with a plurality of devices and receive an allocation response in response to the allocation request, the allocation response including credential information for the device to use to join the group communication session. The at least one processor may be further configured to transmit an allocation bind request with the credential information to join the group communication session using the credential information and receive an allocation bind success response in response to the allocation bind request, the allocation bind success response indicating that the device has joined the group communication session. The at least one processor may be further configured to provide a join notification to the plurality of devices via an intermediary device to notify that the device has joined the group communication session.

Type: Grant

Filed: September 27, 2018

Date of Patent: May 21, 2024

Assignee: Apple Inc.

Inventors: Joe S. Abuan, Ian J. Baird, Xiaosong Zhou, Christopher M. Garrido, Dazhong Zhang, Keith W. Rauenbuehler, Yan Yang, Patrick Miauton, Eric L. Chien, Berkat S. Tung, Karthick Santhanam
Sphere projected motion estimation/compensation and mode decision

Patent number: 11818394

Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.

Type: Grant

Filed: March 19, 2021

Date of Patent: November 14, 2023

Assignee: APPLE INC.

Inventors: Jae Hoon Kim, Xiaosong Zhou, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung, Hsi-Jung Wu
Systems and methods for perspective shifting in video conferencing session

Patent number: 11818502

Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.

Type: Grant

Filed: June 22, 2022

Date of Patent: November 14, 2023

Assignee: APPLE INC.

Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
ADAPTIVE CODING AND STREAMING OF MULTI-DIRECTIONAL VIDEO

Publication number: 20230269400

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques are disclosed that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Application

Filed: March 9, 2023

Publication date: August 24, 2023

Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
Encoding and decoding video content

Patent number: 11677934

Abstract: In an example method, a system receives a plurality of frames of a video, and generates a data structure representing the video and representing a plurality of temporal layers. Generating the data structure includes: (i) determining a plurality of quality levels for presenting the video, where each of the quality levels corresponds to a different respective sampling period for sampling the frames of the video, (ii) assigning, based on the sampling periods, each of the frames to a respective one of the temporal layers of the data structure, and (iii) indicating, in the data structure, one or more relationships between (a) at least one of the frames assigned to at least one of the temporal layers of the data structure, and (b) at least another one of the frames assigned to at least another one of the temporal layers of the data structure. Further, the system outputs the data structure.

Type: Grant

Filed: September 24, 2021

Date of Patent: June 13, 2023

Assignee: Apple Inc.

Inventors: Sudeng Hu, David L. Biderman, Christopher M. Garrido, Hsi-Jung Wu, Xiaosong Zhou, Dazhong Zhang, Jinbo Qiu, Karthick Santhanam, Hang Yuan, Joshua L. Hare, Luciano M. Verger, Kevin Arthur Robertson, Sasanka Vemuri
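
Steps (ii) and (iii) of the abstract above can be illustrated with a small sketch. The dyadic sampling periods `(4, 2, 1)`, the dependency rule (nearest earlier frame in the same or a lower layer), and the function names are assumptions chosen for illustration; the abstract does not specify them.

```python
def assign_layers(num_frames, periods=(4, 2, 1)):
    """Map frame index -> temporal layer.

    Layer k samples every periods[k] frames. The last period must be 1
    so that every frame lands in some layer.
    """
    layers = {}
    for i in range(num_frames):
        for layer, period in enumerate(periods):
            if i % period == 0:
                layers[i] = layer
                break
    return layers


def dependencies(layers):
    """For each frame, pick the nearest earlier frame in the same or a lower layer."""
    deps = {}
    for i in sorted(layers):
        candidates = [j for j in layers if j < i and layers[j] <= layers[i]]
        deps[i] = max(candidates) if candidates else None
    return deps


layers = assign_layers(8)
deps = dependencies(layers)
```

With this assignment, dropping the highest layer leaves frames 0, 2, 4, 6 and dropping the top two leaves frames 0 and 4. Because no frame depends on a higher layer, each reduced-rate stream remains decodable on its own.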
Adaptive coding and streaming of multi-directional video

Patent number: 11627343

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques are disclosed that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Grant

Filed: March 1, 2021

Date of Patent: April 11, 2023

Assignee: APPLE INC.

Inventors: Xiaohua Yang, Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
Encoding and Decoding Video Content

Publication number: 20230098082

Abstract: In an example method, a system receives a plurality of frames of a video, and generates a data structure representing the video and representing a plurality of temporal layers. Generating the data structure includes: (i) determining a plurality of quality levels for presenting the video, where each of the quality levels corresponds to a different respective sampling period for sampling the frames of the video, (ii) assigning, based on the sampling periods, each of the frames to a respective one of the temporal layers of the data structure, and (iii) indicating, in the data structure, one or more relationships between (a) at least one of the frames assigned to at least one of the temporal layers of the data structure, and (b) at least another one of the frames assigned to at least another one of the temporal layers of the data structure. Further, the system outputs the data structure.

Type: Application

Filed: September 24, 2021

Publication date: March 30, 2023

Inventors: Sudeng Hu, David L. Biderman, Christopher M. Garrido, Hsi-Jung Wu, Xiaosong Zhou, Dazhong Zhang, Jinbo Qiu, Karthick Santhanam, Hang Yuan, Joshua L. Hare, Luciano M. Verger, Kevin Arthur Robertson, Sasanka Vemuri
Efficient coding of source video sequences partitioned into tiles

Patent number: 11606574

Abstract: Techniques are disclosed for coding video data in which frames from a video source are partitioned into a plurality of tiles of common size, and the tiles are coded as a virtual video sequence according to motion-compensated prediction, each tile treated as having respective temporal location of the virtual video sequence. The coding scheme permits relative allocation of coding resources to tiles that are likely to have greater significance in a video coding session, which may lead to certain tiles that have low complexity or low motion content to be skipped during coding of the tiles for select source frames. Moreover, coding of the tiles may be ordered to achieve low coding latencies during a coding session.

Type: Grant

Filed: May 26, 2020

Date of Patent: March 14, 2023

Assignee: APPLE INC.

Inventors: Dazhong Zhang, Peikang Song, Beibei Wang, Giribalan Gopalan, Albert E. Keinath, Christopher M. Garrido, David R. Conrad, Hsi-Jung Wu, Ming Jin, Hang Yuan, Xiaohua Yang, Xiaosong Zhou, Vikrant Kasarabada, Davide Concion, Eric L. Chien, Bess C. Chan, Karthick Santhanam, Gurtej Singh Chandok
Applications for decoder-side modeling of objects identified in decoded video data

Patent number: 11553200

Abstract: Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.

Type: Grant

Filed: May 11, 2020

Date of Patent: January 10, 2023

Assignee: APPLE INC.

Inventors: Xing Wen, Dazhong Zhang, Peikang Song, Xiaosong Zhou, Sudeng Hu, Hsi-Jung Wu, Jae Hoon Kim
OPTIMIZING MEDIA EXPERIENCE IN CONFERENCING WITH DIVERSE PARTICIPANTS

Publication number: 20220394212

Abstract: Participant systems in audiovisual (AV) conferencing can, in one embodiment, use methods to ensure that all participants have a common platform to support AV conferencing, such as a common codec, and also attempt to provide higher quality AV conferencing (e.g., better than the common codec) based on available bandwidth and other criteria. The participant systems can use the common codec as a fallback platform when bandwidth or other criteria dictate a reduction in the quality of the AV conferencing. Other embodiments are also disclosed.

Type: Application

Filed: May 4, 2022

Publication date: December 8, 2022

Inventors: Gurtej Singh G. Chandok, Christopher M. Garrido, Chieh Lu, Daniel B. Pollack, Karthick Santhanam, David L. Biderman, Jinbo Qui, Dazhong Zhang, Jose A. Lozano Hinojosa, Igor Kobzar
SYSTEMS AND METHODS FOR PERSPECTIVE SHIFTING IN VIDEO CONFERENCING SESSION

Publication number: 20220329756

Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.

Type: Application

Filed: June 22, 2022

Publication date: October 13, 2022

Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
Systems and methods for perspective shifting in video conferencing session

Patent number: 11394921

Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.

Type: Grant

Filed: March 10, 2017

Date of Patent: July 19, 2022

Assignee: Apple Inc.

Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
Real-time face and object manipulation

Patent number: 11282543

Abstract: Techniques are presented for modifying images of an object in video, for example to correct for lens distortion, or to beautify a face. These techniques include extracting and validating features of an object from a source video frame, tracking those features over time, estimating a pose of the object, modifying a 3D model of the object based on the features, and rendering a modified video frame based on the modified 3D model and modified intrinsic and extrinsic matrices. These techniques may be applied in real-time to an object in a sequence of video frames.

Type: Grant

Filed: March 9, 2018

Date of Patent: March 22, 2022

Assignee: Apple Inc.

Inventors: Hang Yuan, Jiefu Zhai, Ming Chen, Jae Hoon Kim, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
