Patents by Inventor Dazhong Zhang
Dazhong Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250080759Abstract: Techniques are disclosed for coding video in applications where regions of video are inactive on a frame to frame basis. According to the techniques, coding processes update and reconstruct only a subset of pixel blocks of pixels within a frame, while other pixel blocks are retained from a previously coded frame stored in a coder's or decoder's reference frame buffer. The technique is called Backward Reference Updating (or “BRU”) for convenience. At a desired pixel block granularity, based on the activity between a current frame to be coded and its reference frame(s), BRU will only perform prediction, transform, quantization, and reconstruction on selected regions that are determined to be active. The reconstructed pixels in these active regions are directly placed onto a specified reference frame in memory instead of creating a new frame. Therefore, fewer memory transfers need to be performed.Type: ApplicationFiled: August 5, 2024Publication date: March 6, 2025Inventors: Guoxin JIN, Yeqing WU, Yunfei ZHENG, Dazhong ZHANG, Dzung T. HOANG, Felix C. FERNANDES, Van Luong PHAM, Jiancong LUO, Yixin DU, Hilmi Enes EGILMEZ, Alican NALCI, Jun XIN, Hsi-Jung WU
-
Patent number: 12200401Abstract: Participant systems in an audiovisual (AV) conferencing can, in one embodiment, use methods to ensure that all participants have a common platform to support AV conferencing, such as a common codec, and also attempt to provide higher quality AV conferencing (e.g., better than the common codec) based on available bandwidth and other criteria. The participant systems can use the common codec as a fallback platform when bandwidth or other criteria dictate a reduction in the quality of the AV conferencing. Other embodiments are also disclosed.Type: GrantFiled: May 4, 2022Date of Patent: January 14, 2025Assignee: APPLE INC.Inventors: Gurtej Singh G. Chandok, Christopher M. Garrido, Chieh Lu, Daniel B. Pollack, Karthick Santhanam, David L. Biderman, Jinbo Qui, Dazhong Zhang, Jose A. Lozano Hinojosa, Igor Kobzar
-
Publication number: 20240397119Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.Type: ApplicationFiled: August 7, 2024Publication date: November 28, 2024Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
-
Patent number: 12096044Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.Type: GrantFiled: March 9, 2023Date of Patent: September 17, 2024Assignee: APPLE INC.Inventors: Xiaohua Yang, Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
-
Publication number: 20240306047Abstract: A device implementing the subject technology may include at least one processor configured to establish a group communication session for two or more electronic devices utilizing a first communication modality. The at least one processor may be further configured to determine to utilize a second communication modality for the group communication session. The at least one processor may be further configured to transition the group communication session from the first communication modality to the second communication modality.Type: ApplicationFiled: May 20, 2024Publication date: September 12, 2024Inventors: Joe S. ABUAN, Ian J. BAIRD, Xiaosong ZHOU, Christopher M. GARRIDO, Dazhong ZHANG, Keith W. RAUENBUEHLER, Yan YANG, Patrick MIAUTON, Eric L. CHIEN, Berkat S. TUNG, Karthick SANTHANAM
-
Publication number: 20240305682Abstract: A technique for transmitting data in a copresence environment includes initiating a virtual communication session between a local device and remote devices in a shared copresence environment, where each of the plurality of sending devices are transmitting a sending quality data stream in the virtual communication session. A region of interest for the local device is determined that includes a portion of the copresence environment. The local device subscribes to a first quality data stream for the remote devices represented in the region of interest, and a second quality data stream for the remote devices not represented in the region of interest.Type: ApplicationFiled: March 8, 2024Publication date: September 12, 2024Inventors: Jay Mayur Khandhar, Borna Ghavam, Jinbo Qiu, Christopher M. Garrido, Karthick Santhanam, Patrick Miauton, Xiaosong Zhou, Dazhong Zhang, Kristian D. Pereira, Dan Miao
-
Publication number: 20240306046Abstract: A device implementing the subject technology may include at least one processor configured to receive downlink condition reports from device, each downlink condition report indicating a downlink channel condition of a respective device. The at least one processor is further configured to determine an uplink channel condition for each of the devices. The at least one processor is further configured to determine, for each respective device and based at least in part on the downlink condition reports and the uplink channel conditions, quality tiers, each of the quality tiers indicating a quality of at least one of an audio stream or a video stream. The at least one processor is further configured to provide for transmission, to each respective device, the quality tiers determined for the respective device.Type: ApplicationFiled: May 20, 2024Publication date: September 12, 2024Inventors: Joe S. ABUAN, Ian J. BAIRD, Xiaosong ZHOU, Christopher M. GARRIDO, Dazhong ZHANG, Keith W. RAUENBUEHLER, Yan YANG, Patrick MIAUTON, Eric L. CHIEN, Berkat S. TUNG, Karthick SANTHANAM
-
Patent number: 11991566Abstract: A device implementing the subject technology may include at least one processor configured to transmit an allocation request requesting allocation of a group communication session with a plurality of devices and receive an allocation response in response to the allocation request, the allocation response including credential information for the device to use to join the group communication session. The at least one processor may be further configured to transmit an allocation bind request with the credential information to join the group communication session using the credential information and receive an allocation bind success response in response to the allocation bind request, the allocation bind success response indicating that the device has joined the group communication session. The at least one processor may be further configured to provide a join notification to the plurality of devices via an intermediary device to notify that the device has joined the group communication session.Type: GrantFiled: September 27, 2018Date of Patent: May 21, 2024Assignee: Apple Inc.Inventors: Joe S. Abuan, Ian J. Baird, Xiaosong Zhou, Christopher M. Garrido, Dazhong Zhang, Keith W. Rauenbuehler, Yan Yang, Patrick Miauton, Eric L. Chien, Berkat S. Tung, Karthick Santhanam
-
Patent number: 11818502Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.Type: GrantFiled: June 22, 2022Date of Patent: November 14, 2023Assignee: APPLE INC.Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
-
Patent number: 11818394Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.Type: GrantFiled: March 19, 2021Date of Patent: November 14, 2023Assignee: APPLE INC.Inventors: Jae Hoon Kim, Xiaosong Zhou, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung, Hsi-Jung Wu
-
Publication number: 20230269400Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.Type: ApplicationFiled: March 9, 2023Publication date: August 24, 2023Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
-
Patent number: 11677934Abstract: In an example method, a system receives a plurality of frames of a video, and generates a data structure representing the video and representing a plurality of temporal layers. Generating the data structure includes: (i) determining a plurality of quality levels for presenting the video, where each of the quality levels corresponds to a different respective sampling period for sampling the frames of the video, (ii) assigning, based on the sampling periods, each of the frames to a respective one of the temporal layers of the data structure, and (iii) indicating, in the data structure, one or more relationships between (a) at least one the frames assigned to at least one of the temporal layers of the data structure, and (b) at least another one of the frames assigned to at least another one of the temporal layers of the data structure. Further, the system outputs the data structure.Type: GrantFiled: September 24, 2021Date of Patent: June 13, 2023Assignee: Apple Inc.Inventors: Sudeng Hu, David L. Biderman, Christopher M. Garrido, Hsi-Jung Wu, Xiaosong Zhou, Dazhong Zhang, Jinbo Qiu, Karthick Santhanam, Hang Yuan, Joshua L. Hare, Luciano M. Verger, Kevin Arthur Robertson, Sasanka Vemuri
-
Patent number: 11627343Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.Type: GrantFiled: March 1, 2021Date of Patent: April 11, 2023Assignee: APPLE INC.Inventors: Xiaohua Yang, Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
-
Publication number: 20230098082Abstract: In an example method, a system receives a plurality of frames of a video, and generates a data structure representing the video and representing a plurality of temporal layers. Generating the data structure includes: (i) determining a plurality of quality levels for presenting the video, where each of the quality levels corresponds to a different respective sampling period for sampling the frames of the video, (ii) assigning, based on the sampling periods, each of the frames to a respective one of the temporal layers of the data structure, and (iii) indicating, in the data structure, one or more relationships between (a) at least one the frames assigned to at least one of the temporal layers of the data structure, and (b) at least another one of the frames assigned to at least another one of the temporal layers of the data structure. Further, the system outputs the data structure.Type: ApplicationFiled: September 24, 2021Publication date: March 30, 2023Inventors: Sudeng Hu, David L. Biderman, Christopher M. Garrido, Hsi-Jung Wu, Xiaosong Zhou, Dazhong Zhang, Jinbo Qiu, Karthick Santhanam, Hang Yuan, Joshua L. Hare, Luciano M. Verger, Kevin Arthur Robertson, Sasanka Vemuri
-
Patent number: 11606574Abstract: Techniques are disclosed for coding video data in which frames from a video source are partitioned into a plurality of tiles of common size, and the tiles are coded as a virtual video sequence according to motion-compensated prediction, each tile treated as having respective temporal location of the virtual video sequence. The coding scheme permits relative allocation of coding resources to tiles that are likely to have greater significance in a video coding session, which may lead to certain tiles that have low complexity or low motion content to be skipped during coding of the tiles for select source frames. Moreover, coding of the tiles may be ordered to achieve low coding latencies during a coding session.Type: GrantFiled: May 26, 2020Date of Patent: March 14, 2023Assignee: APPLE INC.Inventors: Dazhong Zhang, Peikang Song, Beibei Wang, Giribalan Gopalan, Albert E. Keinath, Christopher M. Garrido, David R. Conrad, Hsi-Jung Wu, Ming Jin, Hang Yuan, Xiaohua Yang, Xiaosong Zhou, Vikrant Kasarabada, Davide Concion, Eric L. Chien, Bess C. Chan, Karthick Santhanam, Gurtej Singh Chandok
-
Patent number: 11553200Abstract: Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.Type: GrantFiled: May 11, 2020Date of Patent: January 10, 2023Assignee: APPLE INC.Inventors: Xing Wen, Dazhong Zhang, Peikang Song, Xiaosong Zhou, Sudeng Hu, Hsi-Jung Wu, Jae Hoon Kim
-
Publication number: 20220394212Abstract: Participant systems in an audiovisual (AV) conferencing can, in one embodiment, use methods to ensure that all participants have a common platform to support AV conferencing, such as a common codec, and also attempt to provide higher quality AV conferencing (e.g., better than the common codec) based on available bandwidth and other criteria. The participant systems can use the common codec as a fallback platform when bandwidth or other criteria dictate a reduction in the quality of the AV conferencing. Other embodiments are also disclosed.Type: ApplicationFiled: May 4, 2022Publication date: December 8, 2022Inventors: Gurtej Singh G. Chandok, Christopher M. Garrido, Chieh Lu, Daniel B. Pollack, Karthick Santhanam, David L. Biderman, Jinbo Qui, Dazhong Zhang, Jose A. Lozano Hinojosa, Igor Kobzar
-
Publication number: 20220329756Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.Type: ApplicationFiled: June 22, 2022Publication date: October 13, 2022Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
-
Patent number: 11394921Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.Type: GrantFiled: March 10, 2017Date of Patent: July 19, 2022Assignee: Apple Inc.Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
-
Patent number: 11282543Abstract: Techniques are presented for modifying images of an object in video, for example to correct for lens distortion, or to beautify a face. These techniques include extracting and validating features of an object from a source video frame, tracking those features over time, estimating a pose of the object, modifying a 3D model of the object based on the features, and rendering a modified video frame based on the modified 3D model and modified intrinsic and extrinsic matrices. These techniques may be applied in real-time to an object in a sequence of video frames.Type: GrantFiled: March 9, 2018Date of Patent: March 22, 2022Assignee: Apple Inc.Inventors: Hang Yuan, Jiefu Zhai, Ming Chen, Jae Hoon Kim, Dazhong Zhang, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu