Patents by Inventor Xiaosong Zhou
Xiaosong Zhou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240146892Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.Type: ApplicationFiled: January 8, 2024Publication date: May 2, 2024Inventors: Fanyi Duanmu, Jun Xin, Hsi-Jung Wu, Xiaosong Zhou
-
Patent number: 11956295Abstract: Techniques for multi-view video streaming are described in the present disclosure, wherein a viewport prediction may be employed at a client-end based on analysis of pre-fetched media item data and ancillary information. A streaming method may first prefetch a portion of content of a multi-view media item. The method may next identify a salient region from the prefetched content and may then download additional content of the media item that corresponds to the identified salient region.Type: GrantFiled: March 20, 2020Date of Patent: April 9, 2024Assignee: APPLE INC.Inventors: Fanyi Duanmu, Alexandros Tourapis, Jun Xin, Hsi-Jung Wu, Xiaosong Zhou
-
Publication number: 20240086685Abstract: A method, apparatus, device and storage medium for recommending information. The method includes determining, based on a set of feature representations of a plurality of features associated with information recommendation, a first set of weights indicating importance of the plurality of features. The method also includes determining a second set of weights based on the set of feature representations and the first set of weights. The method further includes recommending the information to a user based on the set of feature representations, the first set of weights and the second set of weights. The importance of respective features associated with the information recommendation can be accurately determined through this method, which further improves the effectiveness of information recommendation and improves the user experience.Type: ApplicationFiled: September 8, 2023Publication date: March 14, 2024Inventors: Xiaosong ZHOU, Qingliang CAI, Shu CHEN, Zhe WANG, Haiqian HE
-
Patent number: 11924391Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.Type: GrantFiled: December 16, 2022Date of Patent: March 5, 2024Assignee: Apple Inc.Inventors: Fanyi Duanmu, Jun Xin, Hsi-Jung Wu, Xiaosong Zhou
-
Patent number: 11915962Abstract: The present invention provides a display panel and manufacturing method thereof, the method including following steps: providing a driving backplane and a light-emitting substrate, and bonding the driving backplane and the light-emitting substrate; patterning the light-emitting substrate to form a pixel array; forming a thin film packaging layer on an outside of the pixel array, the thin film packaging layer completely covering the pixel array; forming quantum dots on top of the thin film packaging layer to form a multi-color display; forming a reflective array between two adjacent quantum dots to avoid optical crosstalk between the pixel arrays. The display panel and the method of the present invention break through the physical limit of the high PPI, high-precision metal mask, which can realize the display of 2000 and higher PPI, and can prevent the optical crosstalk between the pixel arrays.Type: GrantFiled: April 30, 2020Date of Patent: February 27, 2024Assignee: KUNSHAN FANTAVIEW ELECTRONIC TECHNOLOGY CO., LTD.Inventors: Xiaosong Du, Xiaolong Yang, Wenbin Zhou, Feng Zhang, Jian Sun, Yudi Gao
-
Publication number: 20230417074Abstract: A method for repairing wall diseases of an earthen architecture includes the following specific steps: S1: selecting raw material components, and mixing the raw material components to prepare a mixed material; S2: mixing the mixed material with plain soil at a certain ratio to form a repair material; and S3: mixing the repair material with water at a certain ratio to prepare slurry, adjusting the ratio of the repair material to water to prepare the slurry based on diseases of the earthen site, putting the slurry into a pressure grouting machine, and repairing the diseases of the earthen site by using the pressure grouting machine. The repair material prepared in the method features excellent performance, stable volume and good compatibility with a site. The method based on the repair material has certain practical significance in repairing diseases of the earthen site.Type: ApplicationFiled: April 5, 2023Publication date: December 28, 2023Applicant: Xi’an University of TechnologyInventors: Caihui ZHU, Song QIU, Zhuqing LI, Sen PENG, Junlian LI, Yifan CHEN, Miaomiao GE, Jian XU, Zhenghong LIU, Xiaosong ZHOU, Yunfeng MA, Yubo LI, Changsong DONG, Ning LI
-
Patent number: 11847823Abstract: Video object and keypoint location detection techniques are presented. The system includes a detection system for generation locations of an object's keypoints along with probabilities associated with the locations, and a stability system for stabilizing keypoint locations of the detected objects. In some aspects, the generated probabilities are two-dimensional array correspond locations within input images, and stability system fits the generated probabilities to a two-dimensional probability distribution function.Type: GrantFiled: June 4, 2021Date of Patent: December 19, 2023Assignee: APPLE INC.Inventors: Xiaoxia Sun, Jiefu Zhai, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu
-
Publication number: 20230396819Abstract: A video delivery system generates and stores reduced bandwidth videos from source video. The system may include a track generator that executes functionality of application(s) to be used at sink devices, in which the track generator generates tracks from execution of the application(s) on source video and generates tracks having a reduced data size as compared to the source video. The track generator may execute a first instance of application functionality on the source video, which identifies region(s) of interest from the source video. The track generator further may downsample the source video according to downsampling parameters, and execute a second instance of application functionality on the downsampled video. The track generator may determine, from a comparison of outputs from the first and second instances of the application, whether the output from the second instance of application functionality is within an error tolerance of the output from the first instance of application functionality.Type: ApplicationFiled: June 1, 2023Publication date: December 7, 2023Inventors: Ke ZHANG, Xiaoxia SUN, Shujie LIU, Xiaosong ZHOU, Jian LI, Xun SHI, Jiefu ZHAI, Albert E KEINATH, Hsi-Jung WU, Jingteng XUE, Xingyu ZHANG, Jun XIN
-
Publication number: 20230394081Abstract: A video classification, indexing, and retrieval system is disclosed that classifies and retrieves video along multiple indexing dimensions. A search system may field queries identifying desired parameters of video, search an indexed database for videos that match the query parameters, and create clips extracted from responsive videos that are provided in response. In this manner, different queries may cause different clips to be created from a single video, each clip tailored to the parameters of the query that is received.Type: ApplicationFiled: June 1, 2023Publication date: December 7, 2023Inventors: Shujie LIU, Xiaosong ZHOU, Hsi-Jung WU, Jiefu ZHAI, Ke ZHANG, Ming CHEN
-
Patent number: 11818502Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.Type: GrantFiled: June 22, 2022Date of Patent: November 14, 2023Assignee: APPLE INC.Inventors: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
-
Patent number: 11818394Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.Type: GrantFiled: March 19, 2021Date of Patent: November 14, 2023Assignee: APPLE INC.Inventors: Jae Hoon Kim, Xiaosong Zhou, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung, Hsi-Jung Wu
-
Publication number: 20230269400Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.Type: ApplicationFiled: March 9, 2023Publication date: August 24, 2023Inventors: Xiaohua YANG, Alexandros TOURAPIS, Dazhong ZHANG, Hang YUAN, Hsi-Jung WU, Jae Hoon KIM, Jiefu ZHAI, Ming CHEN, Xiaosong ZHOU
-
Publication number: 20230262196Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.Type: ApplicationFiled: April 27, 2023Publication date: August 17, 2023Inventors: Elizabeth C. CRANFILL, Stephen O. LEMAY, Joe S. ABUAN, Hsi-Jung WU, Xiaosong ZHOU, Roberto GARCIA, JR.
-
Patent number: 11677934Abstract: In an example method, a system receives a plurality of frames of a video, and generates a data structure representing the video and representing a plurality of temporal layers. Generating the data structure includes: (i) determining a plurality of quality levels for presenting the video, where each of the quality levels corresponds to a different respective sampling period for sampling the frames of the video, (ii) assigning, based on the sampling periods, each of the frames to a respective one of the temporal layers of the data structure, and (iii) indicating, in the data structure, one or more relationships between (a) at least one the frames assigned to at least one of the temporal layers of the data structure, and (b) at least another one of the frames assigned to at least another one of the temporal layers of the data structure. Further, the system outputs the data structure.Type: GrantFiled: September 24, 2021Date of Patent: June 13, 2023Assignee: Apple Inc.Inventors: Sudeng Hu, David L. Biderman, Christopher M. Garrido, Hsi-Jung Wu, Xiaosong Zhou, Dazhong Zhang, Jinbo Qiu, Karthick Santhanam, Hang Yuan, Joshua L. Hare, Luciano M. Verger, Kevin Arthur Robertson, Sasanka Vemuri
-
Publication number: 20230147442Abstract: In an example method, a system accesses first input data and a machine learning architecture. The machine learning architecture includes a first module having a first neural network, a second module having a second neural network, and a third module having a third neural network. The system generates a first feature set representing a first portion of the first input data using the first neural network, and a second feature set representing a second portion of the first input data using the second neural network. The system generates, using the third neural network, first output data based on the first feature set and the second feature set.Type: ApplicationFiled: June 3, 2022Publication date: May 11, 2023Inventors: Shujie Liu, Jiefu Zhai, Xiaosong Zhou, Hsi-Jung Wu, Ke Zhang, Xiaoxia Sun, Jian Li
-
Publication number: 20230117742Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.Type: ApplicationFiled: December 16, 2022Publication date: April 20, 2023Inventors: Fanyi Duanmu, Jun Xin, Hsi-Jung Wu, Xiaosong Zhou
-
Patent number: 11627343Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.Type: GrantFiled: March 1, 2021Date of Patent: April 11, 2023Assignee: APPLE INC.Inventors: Xiaohua Yang, Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
-
Publication number: 20230098082Abstract: In an example method, a system receives a plurality of frames of a video, and generates a data structure representing the video and representing a plurality of temporal layers. Generating the data structure includes: (i) determining a plurality of quality levels for presenting the video, where each of the quality levels corresponds to a different respective sampling period for sampling the frames of the video, (ii) assigning, based on the sampling periods, each of the frames to a respective one of the temporal layers of the data structure, and (iii) indicating, in the data structure, one or more relationships between (a) at least one the frames assigned to at least one of the temporal layers of the data structure, and (b) at least another one of the frames assigned to at least another one of the temporal layers of the data structure. Further, the system outputs the data structure.Type: ApplicationFiled: September 24, 2021Publication date: March 30, 2023Inventors: Sudeng Hu, David L. Biderman, Christopher M. Garrido, Hsi-Jung Wu, Xiaosong Zhou, Dazhong Zhang, Jinbo Qiu, Karthick Santhanam, Hang Yuan, Joshua L. Hare, Luciano M. Verger, Kevin Arthur Robertson, Sasanka Vemuri
-
Patent number: 11606574Abstract: Techniques are disclosed for coding video data in which frames from a video source are partitioned into a plurality of tiles of common size, and the tiles are coded as a virtual video sequence according to motion-compensated prediction, each tile treated as having respective temporal location of the virtual video sequence. The coding scheme permits relative allocation of coding resources to tiles that are likely to have greater significance in a video coding session, which may lead to certain tiles that have low complexity or low motion content to be skipped during coding of the tiles for select source frames. Moreover, coding of the tiles may be ordered to achieve low coding latencies during a coding session.Type: GrantFiled: May 26, 2020Date of Patent: March 14, 2023Assignee: APPLE INC.Inventors: Dazhong Zhang, Peikang Song, Beibei Wang, Giribalan Gopalan, Albert E. Keinath, Christopher M. Garrido, David R. Conrad, Hsi-Jung Wu, Ming Jin, Hang Yuan, Xiaohua Yang, Xiaosong Zhou, Vikrant Kasarabada, Davide Concion, Eric L. Chien, Bess C. Chan, Karthick Santhanam, Gurtej Singh Chandok
-
Patent number: 11605224Abstract: Techniques disclosed for managing video captured by an imaging device. Methods disclosed capture a video in response to a capture command received at the imaging device. Following a video capture, techniques for classifying the captured video based on feature(s) extracted therefrom, for marking the captured video based on the classification, and for generating a media item from the captured video according to the marking are disclosed. Accordingly, the captured video may be classified as representing a static event, and, as a result, a media item of a still image may be generated. Otherwise, the captured video may be classified as representing a dynamic event, and, as a result, a media item of a video may be generated.Type: GrantFiled: May 26, 2020Date of Patent: March 14, 2023Assignee: APPLE INC.Inventors: Bartlomiej Rymkowski, Robert Bailey, Ethan Tira-Thompson, Shuang Gao, Ben Englert, Emilie Kim, Shujie Liu, Ke Zhang, Vinay Sharma, Xiaosong Zhou