Patents by Inventor Xiaosong Zhou

Xiaosong Zhou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Neural network based residual coding and prediction for predictive coding

Patent number: 12192440

Abstract: Systems and methods disclosed for video compression, utilizing neural networks for predictive video coding. Processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.

Type: Grant

Filed: January 4, 2022

Date of Patent: January 7, 2025

Assignee: APPLE INC.

Inventors: Jiefu Zhai, Xingyu Zhang, Xiaosong Zhou, Jun Xin, Hsi-Jung Wu, Yeping Su
ADAPTIVE CODING AND STREAMING OF MULTI-DIRECTIONAL VIDEO

Publication number: 20240397119

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Application

Filed: August 7, 2024

Publication date: November 28, 2024

Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
Immersive video streaming using view-adaptive prefetching and buffer control

Patent number: 12137199

Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.

Type: Grant

Filed: January 8, 2024

Date of Patent: November 5, 2024

Assignee: Apple Inc.

Inventors: Fanyi Duanmu, Jun Xin, Hsi-Jung Wu, Xiaosong Zhou
Adaptive coding and streaming of multi-directional video

Patent number: 12096044

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Grant

Filed: March 9, 2023

Date of Patent: September 17, 2024

Assignee: APPLE INC.

Inventors: Xiaohua Yang, Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
MULTI-DEVICE COMMUNICATION MANAGEMENT

Publication number: 20240306046

Abstract: A device implementing the subject technology may include at least one processor configured to receive downlink condition reports from device, each downlink condition report indicating a downlink channel condition of a respective device. The at least one processor is further configured to determine an uplink channel condition for each of the devices. The at least one processor is further configured to determine, for each respective device and based at least in part on the downlink condition reports and the uplink channel conditions, quality tiers, each of the quality tiers indicating a quality of at least one of an audio stream or a video stream. The at least one processor is further configured to provide for transmission, to each respective device, the quality tiers determined for the respective device.

Type: Application

Filed: May 20, 2024

Publication date: September 12, 2024

Inventors: Joe S. ABUAN, Ian J. BAIRD, Xiaosong ZHOU, Christopher M. GARRIDO, Dazhong ZHANG, Keith W. RAUENBUEHLER, Yan YANG, Patrick MIAUTON, Eric L. CHIEN, Berkat S. TUNG, Karthick SANTHANAM
MULTI-DEVICE COMMUNICATION MANAGEMENT

Publication number: 20240306047

Abstract: A device implementing the subject technology may include at least one processor configured to establish a group communication session for two or more electronic devices utilizing a first communication modality. The at least one processor may be further configured to determine to utilize a second communication modality for the group communication session. The at least one processor may be further configured to transition the group communication session from the first communication modality to the second communication modality.

Type: Application

Filed: May 20, 2024

Publication date: September 12, 2024

Inventors: Joe S. ABUAN, Ian J. BAIRD, Xiaosong ZHOU, Christopher M. GARRIDO, Dazhong ZHANG, Keith W. RAUENBUEHLER, Yan YANG, Patrick MIAUTON, Eric L. CHIEN, Berkat S. TUNG, Karthick SANTHANAM
Gaze-Based Copresence System

Publication number: 20240305682

Abstract: A technique for transmitting data in a copresence environment includes initiating a virtual communication session between a local device and remote devices in a shared copresence environment, where each of the plurality of sending devices are transmitting a sending quality data stream in the virtual communication session. A region of interest for the local device is determined that includes a portion of the copresence environment. The local device subscribes to a first quality data stream for the remote devices represented in the region of interest, and a second quality data stream for the remote devices not represented in the region of interest.

Type: Application

Filed: March 8, 2024

Publication date: September 12, 2024

Inventors: Jay Mayur Khandhar, Borna Ghavam, Jinbo Qiu, Christopher M. Garrido, Karthick Santhanam, Patrick Miauton, Xiaosong Zhou, Dazhong Zhang, Kristian D. Pereira, Dan Miao
Multi-device communication management

Patent number: 11991566

Abstract: A device implementing the subject technology may include at least one processor configured to transmit an allocation request requesting allocation of a group communication session with a plurality of devices and receive an allocation response in response to the allocation request, the allocation response including credential information for the device to use to join the group communication session. The at least one processor may be further configured to transmit an allocation bind request with the credential information to join the group communication session using the credential information and receive an allocation bind success response in response to the allocation bind request, the allocation bind success response indicating that the device has joined the group communication session. The at least one processor may be further configured to provide a join notification to the plurality of devices via an intermediary device to notify that the device has joined the group communication session.

Type: Grant

Filed: September 27, 2018

Date of Patent: May 21, 2024

Assignee: Apple Inc.

Inventors: Joe S. Abuan, Ian J. Baird, Xiaosong Zhou, Christopher M. Garrido, Dazhong Zhang, Keith W. Rauenbuehler, Yan Yang, Patrick Miauton, Eric L. Chien, Berkat S. Tung, Karthick Santhanam
Immersive Video Streaming Using View-Adaptive Prefetching and Buffer Control

Publication number: 20240146892

Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.

Type: Application

Filed: January 8, 2024

Publication date: May 2, 2024

Inventors: Fanyi Duanmu, Jun Xin, Hsi-Jung Wu, Xiaosong Zhou
Client-end enhanced view prediction for multi-view video streaming exploiting pre-fetched data and side information

Patent number: 11956295

Abstract: Techniques for multi-view video streaming are described in the present disclosure, wherein a viewport prediction may be employed at a client-end based on analysis of pre-fetched media item data and ancillary information. A streaming method may first prefetch a portion of content of a multi-view media item. The method may next identify a salient region from the prefetched content and may then download additional content of the media item that corresponds to the identified salient region.

Type: Grant

Filed: March 20, 2020

Date of Patent: April 9, 2024

Assignee: APPLE INC.

Inventors: Fanyi Duanmu, Alexandros Tourapis, Jun Xin, Hsi-Jung Wu, Xiaosong Zhou
METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM FOR RECOMMENDING INFORMATION

Publication number: 20240086685

Abstract: A method, apparatus, device and storage medium for recommending information. The method includes determining, based on a set of feature representations of a plurality of features associated with information recommendation, a first set of weights indicating importance of the plurality of features. The method also includes determining a second set of weights based on the set of feature representations and the first set of weights. The method further includes recommending the information to a user based on the set of feature representations, the first set of weights and the second set of weights. The importance of respective features associated with the information recommendation can be accurately determined through this method, which further improves the effectiveness of information recommendation and improves the user experience.

Type: Application

Filed: September 8, 2023

Publication date: March 14, 2024

Inventors: Xiaosong ZHOU, Qingliang CAI, Shu CHEN, Zhe WANG, Haiqian HE
Immersive video streaming using view-adaptive prefetching and buffer control

Patent number: 11924391

Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.

Type: Grant

Filed: December 16, 2022

Date of Patent: March 5, 2024

Assignee: Apple Inc.

Inventors: Fanyi Duanmu, Jun Xin, Hsi-Jung Wu, Xiaosong Zhou
METHOD FOR REPAIRING WALL DISEASES OF EARTHEN ARCHITECTURE

Publication number: 20230417074

Abstract: A method for repairing wall diseases of an earthen architecture includes the following specific steps: S1: selecting raw material components, and mixing the raw material components to prepare a mixed material; S2: mixing the mixed material with plain soil at a certain ratio to form a repair material; and S3: mixing the repair material with water at a certain ratio to prepare slurry, adjusting the ratio of the repair material to water to prepare the slurry based on diseases of the earthen site, putting the slurry into a pressure grouting machine, and repairing the diseases of the earthen site by using the pressure grouting machine. The repair material prepared in the method features excellent performance, stable volume and good compatibility with a site. The method based on the repair material has certain practical significance in repairing diseases of the earthen site.

Type: Application

Filed: April 5, 2023

Publication date: December 28, 2023

Applicant: Xi’an University of Technology

Inventors: Caihui ZHU, Song QIU, Zhuqing LI, Sen PENG, Junlian LI, Yifan CHEN, Miaomiao GE, Jian XU, Zhenghong LIU, Xiaosong ZHOU, Yunfeng MA, Yubo LI, Changsong DONG, Ning LI
Object and keypoint detection system with low spatial jitter, low latency and low power usage

Patent number: 11847823

Abstract: Video object and keypoint location detection techniques are presented. The system includes a detection system for generation locations of an object's keypoints along with probabilities associated with the locations, and a stability system for stabilizing keypoint locations of the detected objects. In some aspects, the generated probabilities are two-dimensional array correspond locations within input images, and stability system fits the generated probabilities to a two-dimensional probability distribution function.

Type: Grant

Filed: June 4, 2021

Date of Patent: December 19, 2023

Assignee: APPLE INC.

Inventors: Xiaoxia Sun, Jiefu Zhai, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu
VIDEO CLASSIFICATION AND SEARCH SYSTEM TO SUPPORT CUSTOMIZABLE VIDEO HIGHLIGHTS

Publication number: 20230394081

Abstract: A video classification, indexing, and retrieval system is disclosed that classifies and retrieves video along multiple indexing dimensions. A search system may field queries identifying desired parameters of video, search an indexed database for videos that match the query parameters, and create clips extracted from responsive videos that are provided in response. In this manner, different queries may cause different clips to be created from a single video, each clip tailored to the parameters of the query that is received.

Type: Application

Filed: June 1, 2023

Publication date: December 7, 2023

Inventors: Shujie LIU, Xiaosong ZHOU, Hsi-Jung WU, Jiefu ZHAI, Ke ZHANG, Ming CHEN
ANALYTIC- AND APPLICATION-AWARE VIDEO DERIVATIVE GENERATION TECHNIQUES

Publication number: 20230396819

Abstract: A video delivery system generates and stores reduced bandwidth videos from source video. The system may include a track generator that executes functionality of application(s) to be used at sink devices, in which the track generator generates tracks from execution of the application(s) on source video and generates tracks having a reduced data size as compared to the source video. The track generator may execute a first instance of application functionality on the source video, which identifies region(s) of interest from the source video. The track generator further may downsample the source video according to downsampling parameters, and execute a second instance of application functionality on the downsampled video. The track generator may determine, from a comparison of outputs from the first and second instances of the application, whether the output from the second instance of application functionality is within an error tolerance of the output from the first instance of application functionality.

Type: Application

Filed: June 1, 2023

Publication date: December 7, 2023

Inventors: Ke ZHANG, Xiaoxia SUN, Shujie LIU, Xiaosong ZHOU, Jian LI, Xun SHI, Jiefu ZHAI, Albert E KEINATH, Hsi-Jung WU, Jingteng XUE, Xingyu ZHANG, Jun XIN
Sphere projected motion estimation/compensation and mode decision

Patent number: 11818394

Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.

Type: Grant

Filed: March 19, 2021

Date of Patent: November 14, 2023

Assignee: APPLE INC.

Inventors: Jae Hoon Kim, Xiaosong Zhou, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung, Hsi-Jung Wu
Systems and methods for perspective shifting in video conferencing session

Patent number: 11818502

Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.

Type: Grant

Filed: June 22, 2022

Date of Patent: November 14, 2023

Assignee: APPLE INC.

Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
ADAPTIVE CODING AND STREAMING OF MULTI-DIRECTIONAL VIDEO

Publication number: 20230269400

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.

Type: Application

Filed: March 9, 2023

Publication date: August 24, 2023

Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
ESTABLISHING A VIDEO CONFERENCE DURING A PHONE CALL

Publication number: 20230262196

Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.

Type: Application

Filed: April 27, 2023

Publication date: August 17, 2023

Inventors: Elizabeth C. CRANFILL, Stephen O. LEMAY, Joe S. ABUAN, Hsi-Jung WU, Xiaosong ZHOU, Roberto GARCIA, JR.

1 2 3 4 5 … next