Patents by Inventor Wen Gao
Wen Gao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11417030
Abstract: A method for coding information of a point cloud comprises obtaining the point cloud including a set of points in a three-dimensional space; partitioning the point cloud into a plurality of objects and generating occupancy information for each of the plurality of objects; and encoding the occupancy information by taking into account the distance between the plurality of objects.
Type: Grant
Filed: October 26, 2020
Date of Patent: August 16, 2022
Assignee: TENCENT AMERICA LLC
Inventors: Xiang Zhang, Wen Gao, Shan Liu
-
Patent number: 11417029
Abstract: Aspects of the disclosure provide methods and apparatuses for point cloud compression. In some examples, an apparatus for point cloud compression includes processing circuitry. In some embodiments, the processing circuitry determines one or more original points in a point cloud that are associated with a reconstructed position. Positions of the one or more original points can be reconstructed, according to a geometry quantization, to the reconstructed position. The processing circuitry then determines an attribute value for the reconstructed position based on attribute information of the one or more original points, and encodes texture of the point cloud with the reconstructed position having the determined attribute value.
Type: Grant
Filed: October 6, 2020
Date of Patent: August 16, 2022
Assignee: Tencent America LLC
Inventors: Xiang Zhang, Wen Gao, Shan Liu
-
Publication number: 20220254057
Abstract: Provided are a point cloud attribute prediction method and prediction device based on a filter. The method comprises a coding method and a decoding method, and the device comprises a coding device and a decoding device. The method comprises: determining K nearest neighbor points of the current point; determining a filter matrix; and determining an attribute prediction value of the current point according to the filter matrix. Therefore, the compression performance of a point cloud attribute can be improved by means of selecting an appropriate filter.
Type: Application
Filed: May 11, 2020
Publication date: August 11, 2022
Inventors: Ge LI, Chuang MA, Jing WANG, Yi Ting SHAO, Wen GAO
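
The three steps named in this abstract (find the K nearest neighbors, determine a filter matrix, predict the attribute from the filter matrix) can be illustrated with a minimal Python sketch. The abstract does not say how the filter matrix is built or applied, so everything below is an assumption made for illustration: the function names `k_nearest` and `predict_attribute` are hypothetical, the K-by-K filter matrix is assumed to smooth the neighbor attributes, and the smoothed values are assumed to be combined with normalized inverse-distance weights. It is not the method claimed in the application.

```python
import math


def k_nearest(points, current, k):
    """Return the indices of the k points closest to `current` (Euclidean)."""
    order = sorted(range(len(points)), key=lambda i: math.dist(points[i], current))
    return order[:k]


def predict_attribute(points, attrs, current, k, filter_matrix):
    """Predict a scalar attribute for `current` from its k nearest neighbors.

    Assumption (not taken from the abstract): the k-by-k filter matrix first
    smooths the neighbor attributes, and the smoothed values are then combined
    with normalized inverse-distance weights.
    """
    idx = k_nearest(points, current, k)
    neigh = [attrs[i] for i in idx]
    # Apply the filter matrix: filtered[r] = sum over c of F[r][c] * neigh[c]
    filtered = [sum(f * a for f, a in zip(row, neigh)) for row in filter_matrix]
    # Inverse-distance weights; the small epsilon avoids division by zero
    weights = [1.0 / (math.dist(points[i], current) + 1e-9) for i in idx]
    total = sum(weights)
    return sum(w * a for w, a in zip(weights, filtered)) / total


# Illustrative data: four points with one scalar attribute each
points = [(0, 0, 0), (1, 0, 0), (0, 2, 0), (5, 5, 5)]
attrs = [10.0, 20.0, 30.0, 100.0]
averaging = [[1 / 3] * 3 for _ in range(3)]  # hypothetical smoothing filter
prediction = predict_attribute(points, attrs, (0, 0, 1), 3, averaging)
```

In this sketch, swapping the averaging matrix for an identity matrix reduces the prediction to a plain inverse-distance weighted average, which is one way to picture what "selecting an appropriate filter" could mean in practice.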
-
Publication number: 20220248053
Abstract: Aspects of the disclosure provide methods, apparatuses, and a non-transitory computer-readable medium for point cloud coding. In a method, when parallel octree coding is enabled for occupancy codes of nodes in an octree partitioning structure of the point cloud, syntax information of the point cloud is decoded from a coded bitstream and a bitstream offset of an octree depth is determined. The syntax information indicates a bitstream length of the octree depth at which the parallel octree coding is enabled. Parallel decoding is performed on the occupancy codes of the nodes of the octree depth based on the bitstream offset and the bitstream length of the octree depth. Further, the point cloud is reconstructed based on the occupancy codes of the nodes.
Type: Application
Filed: April 19, 2022
Publication date: August 4, 2022
Applicant: Tencent America LLC
Inventors: Xiang ZHANG, Wen GAO, Shan LIU
-
Publication number: 20220248054
Abstract: Aspects of the disclosure include a method for point cloud coding. In the method, whether decoding of occupancy codes of nodes in a range of octree partition depths in an octree partitioning structure of a point cloud reaches a minimum octree partition depth at which parallel decoding is enabled is determined. Arithmetic coding information for decoding the occupancy codes of the nodes in the minimum octree partition depth is stored based on the decoding of the occupancy codes of the nodes in the range of octree partition depths reaching the minimum octree partition depth. The parallel decoding is performed on occupancy codes of the nodes in each of the at least one remaining octree partition depth based on the stored arithmetic coding information. The point cloud is reconstructed based on the occupancy codes of the nodes in the range of octree partition depths in the octree partitioning structure.
Type: Application
Filed: April 19, 2022
Publication date: August 4, 2022
Applicant: Tencent America LLC
Inventors: Xiang ZHANG, Wen GAO, Shan LIU
-
Publication number: 20220245775
Abstract: A tone mapping method is provided. This method includes: obtaining one or a plurality of high dynamic range images, and determining a storage format of each high dynamic range image; performing an image decomposition on the high dynamic range image to obtain a first component, a second component and a third component, when the storage format of the high dynamic range image is determined as a predetermined storage format; inputting the first component and the second component into a predetermined deep neural network, and using the deep neural network to perform mapping on the first component and the second component respectively to obtain a first mapped component and a second mapped component; and fusing the first mapped component and the second mapped component with the third component to obtain a fused low dynamic range image corresponding to the high dynamic range image.
Type: Application
Filed: April 20, 2022
Publication date: August 4, 2022
Inventors: Ronggang WANG, Ning ZHANG, Wen GAO
-
Patent number: 11399192
Abstract: A method, a non-transitory computer readable medium, and a computer system are provided for encoding or decoding video data. The method may include: receiving an entropy coded bitstream comprising compressed video data including point cloud occupancy codes; generating one or more dequantized dimensions of a boundary box of a point cloud; based on determining that the node or node depth has attribute information, using the attribute information for the node or node depth; and based on determining that the node or node depth does not have attribute information, obtaining attribute information for the node or node depth by inheriting attribute information from a node or node depth that is at least one depth level in an octree of the point cloud above a depth level of the node or node depth in the octree.
Type: Grant
Filed: December 17, 2020
Date of Patent: July 26, 2022
Assignee: TENCENT AMERICA LLC
Inventors: Wen Gao, Xiang Zhang, Shan Liu
-
Patent number: 11397890
Abstract: The present application discloses a cross-media retrieval method based on deep semantic space, which includes a feature generation stage and a semantic space learning stage. In the feature generation stage, a CNN visual feature vector and an LSTM language description vector of an image are generated by simulating a perception process of a person for the image; and topic information about a text is explored by using an LDA topic model, thus extracting an LDA text topic vector. In the semantic space learning stage, a training set image is trained to obtain a four-layer Multi-Sensory Fusion Deep Neural Network, and a training set text is trained to obtain a three-layer text semantic network, respectively. Finally, a test image and a text are respectively mapped to an isomorphic semantic space by using two networks, so as to realize cross-media retrieval. The disclosed method can significantly improve the performance of cross-media retrieval.
Type: Grant
Filed: August 16, 2017
Date of Patent: July 26, 2022
Assignee: Peking University Shenzhen Graduate School
Inventors: Wenmin Wang, Mengdi Fan, Peilei Dong, Ronggang Wang, Ge Li, Shengfu Dong, Zhenyu Wang, Ying Li, Hui Zhao, Wen Gao
-
Publication number: 20220232253
Abstract: In some examples, an apparatus for point cloud compression/decompression includes processing circuitry. The processing circuitry determines a flag that indicates an enable/disable control for saving coding state in a largest coding unit (LCU) based coding of a point cloud. In some examples, the processing circuitry stores coding state information before a coding of a first LCU; and in response to the flag indicating an enable control, the processing circuitry restores a coding state according to the stored coding state information before a coding of a second LCU. In some examples, in response to the flag indicating an enable control, the processing circuitry stores the coding state information before the coding of the first LCU. In some examples, in response to the flag indicating a disable control, the processing circuitry skips the storing/restoring of the coding state information.
Type: Application
Filed: August 17, 2021
Publication date: July 21, 2022
Applicant: TENCENT AMERICA LLC
Inventors: Wen GAO, Xiang ZHANG, Shan LIU
-
Publication number: 20220219189
Abstract: The present invention discloses a shower device comprising a shower and a base. The base is provided with a convex boss, and the back of the shower is provided with a groove matching the convex boss. The base is also provided with an elastic buckle, the shower is provided with a clamping slot that can be matched and clamped with the elastic buckle, and another socket for hooking up the shower is also provided on the base. Through this structure, the invention uses the cooperation between the clamping slot and the groove to form a locking position, so as to prevent the shower from falling off the base and accidentally injuring a child. At the same time, a socket is arranged at the height of the side end of the base, which meets the needs of people who need the shower mounted higher up and enhances diversity and flexibility.
Type: Application
Filed: December 30, 2021
Publication date: July 14, 2022
Applicant: RUNNER (XIAMEN) CORP.
Inventors: Bi Fu Dai, Xin Zhan Hu, Wen Gao, Tao Yan Zhang, Gang Xu, Zhi Liang Lin, Wei Zhen Chen, Yong Lin, Lu Lu
-
Publication number: 20220224943
Abstract: Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for video encoding/decoding. An apparatus includes processing circuitry that receives metadata associated with a coded video bitstream. The metadata includes labeling information of one or more objects detected in a first picture that is coded in the coded video bitstream. The processing circuitry decodes the labeling information of the one or more objects in the first picture that is coded in the coded video bitstream. The processing circuitry applies the labeling information to the one or more objects in the first picture.
Type: Application
Filed: August 27, 2021
Publication date: July 14, 2022
Applicant: TENCENT AMERICA LLC
Inventors: Shan LIU, Xiaozhong Xu, Wen Gao
-
Patent number: 11381812
Abstract: Disclosed is a boundary filtering method for intra prediction, relating to the video encoding technology field. Whether boundary filtering is performed on an intra prediction block or not is adaptively selected by means of a rate distortion optimization decision; during filtering, a filter coefficient exponentially attenuated relative to distance to boundary is adopted to perform filtering on the first N rows or the first N columns of the intra prediction block by means of an intra prediction block filter, and different filtering strengths are used according to different sizes of the prediction blocks. Therefore, the boundary distortion problem of intra prediction block is solved, the intra prediction precision is improved, and the encoding efficiency of intra prediction block is increased; and the practicability and the robustness of the boundary filtering technology are improved.
Type: Grant
Filed: September 25, 2018
Date of Patent: July 5, 2022
Assignee: PEKING UNIVERSITY SHENZHEN GRADUATE SCHOOL
Inventors: Ronggang Wang, Kui Fan, Ge Li, Wen Gao
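
The filtering step described in this abstract, a coefficient that decays exponentially with distance from the block boundary and is applied to the first N rows, can be pictured with a short Python sketch. The abstract gives no formulas, so the details are assumptions made for illustration only: the function names `boundary_filter_rows` and `strength_for_block` are hypothetical, the filter is assumed to blend each predicted sample toward the reconstructed reference row above the block, and the decay base, strength values and size threshold are placeholders. The rate distortion optimization decision that switches the filter on or off is left out.

```python
def boundary_filter_rows(pred, top_ref, n_rows, strength=0.5, decay=0.5):
    """Blend the first n_rows of `pred` toward the reference row above the block.

    The weight for row r is strength * decay**r, so it attenuates exponentially
    with distance from the top boundary. All parameter values are illustrative
    assumptions, not values from the patent.
    """
    out = [row[:] for row in pred]  # copy so the input block is left untouched
    for r in range(min(n_rows, len(pred))):
        w = strength * decay ** r
        for c in range(len(pred[r])):
            out[r][c] = (1.0 - w) * pred[r][c] + w * top_ref[c]
    return out


def strength_for_block(size):
    """Hypothetical size-dependent strength: weaker filtering for larger blocks."""
    return 0.5 if size <= 8 else 0.25


# Illustrative 4x4 prediction block and the reference row above it
pred = [[100.0] * 4 for _ in range(4)]
top_ref = [60.0] * 4
filtered = boundary_filter_rows(pred, top_ref, 2, strength_for_block(4))
```

Filtering the first N columns against a left reference column would follow the same pattern with rows and columns exchanged.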
-
Patent number: 11379711
Abstract: A video action detection method based on a convolutional neural network (CNN) is disclosed in the field of computer vision recognition technologies. A temporal-spatial pyramid pooling layer is added to a network structure, which eliminates limitations on input by a network, speeds up training and detection, and improves performance of video action classification and time location. The disclosed convolutional neural network includes a convolutional layer, a common pooling layer, a temporal-spatial pyramid pooling layer and a full connection layer. The outputs of the convolutional neural network include a category classification output layer and a time localization calculation result output layer. The disclosed method does not require down-sampling to obtain video clips of different durations, but instead utilizes direct input of the whole video at once, improving efficiency.
Type: Grant
Filed: August 16, 2017
Date of Patent: July 5, 2022
Assignee: Peking University Shenzhen Graduate School
Inventors: Wenmin Wang, Zhihao Li, Ronggang Wang, Ge Li, Shengfu Dong, Zhenyu Wang, Ying Li, Hui Zhao, Wen Gao
-
Patent number: 11381840
Abstract: A method of point cloud geometry decoding in a point cloud decoder can include receiving a bitstream including a slice of a coded point cloud frame, and reconstructing an octree representing a geometry of points in a bounding box of the slice where a current node of the octree is partitioned with a quadtree (QT) partition or a binary tree (BT) partition.
Type: Grant
Filed: June 23, 2020
Date of Patent: July 5, 2022
Assignee: TENCENT AMERICA LLC
Inventors: Xiang Zhang, Wen Gao, Sehoon Yea, Shan Liu
-
Publication number: 20220210472
Abstract: A method, a non-transitory computer readable medium, and a computer system are provided for encoding or decoding video data. The method may include: receiving an entropy coded bitstream comprising compressed video data including point cloud occupancy codes; generating one or more dequantized dimensions of a boundary box of a point cloud; based on determining that the compressed video data was predicted by using the attribute-based predictor, determining a predictor for decoding is the attribute-based predictor; based on determining that the compressed video data was predicted by using the attribute-based predictor, determining the predictor for decoding is the geometry-based predictor; and building an octree structure by using the determined predictor.
Type: Application
Filed: March 17, 2022
Publication date: June 30, 2022
Applicant: TENCENT AMERICA LLC
Inventors: Wen GAO, Xiang ZHANG, Shan LIU
-
Publication number: 20220201321
Abstract: Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for video encoding/decoding. An apparatus includes processing circuitry that decodes prediction information of a current block in a current picture that is a part of a video bitstream. The processing circuitry determines that a first plurality of coding tools is enabled and a second plurality of coding tools is disabled for the current block based on a syntax element included in the video bitstream. The first plurality of coding tools includes a deblocking filter. The second plurality of coding tools includes at least one of a sample adaptive offset filter, an intra sub-partitioning, and a matrix based intra prediction. The processing circuitry reconstructs the current block based on the first plurality of coding tools being enabled and the second plurality of coding tools being disabled.
Type: Application
Filed: September 3, 2021
Publication date: June 23, 2022
Applicant: TENCENT AMERICA LLC
Inventors: Wen GAO, Xiaozhong Xu, Shan Liu
-
Publication number: 20220196580
Abstract: The embodiments herein relate to defect inspection methods of semiconductor wafers during the manufacturing process. According to an aspect of the present disclosure, a defect inspection system is provided. The defect inspection system includes a first inspection system, pattern simulator software, and a second inspection system. The first inspection system is capable of determining a plurality of defect locations on an article. The pattern simulator software is capable of generating a set of simulated pattern features from the plurality of defect locations. The second inspection system is capable of providing a higher graphical resolution of defects than the first inspection system at the defect locations corresponding to the set of simulated pattern features.
Type: Application
Filed: December 21, 2020
Publication date: June 23, 2022
Inventors: HAIZHOU YIN, CHENLONG MIAO, SHAO WEN GAO, MICHAEL WOJTOWECZ, TAMER DESOUKY
-
Publication number: 20220201331
Abstract: A method and apparatus for coding information of a point cloud that includes obtaining the point cloud including a set of points in a three-dimensional space; determining whether a current node in the set of points is isolated; and coding the current node in isolation mode based on a determination that the current node is isolated and coding the current node in non-isolation mode, based on a determination that the current node is not isolated.
Type: Application
Filed: March 14, 2022
Publication date: June 23, 2022
Applicant: TENCENT AMERICA LLC
Inventors: Xiang ZHANG, Wen Gao, Shan Liu
-
Patent number: 11368717
Abstract: Aspects of the disclosure provide methods, apparatuses, and a non-transitory computer-readable medium for point cloud compression and decompression. In a method, syntax information of a bounding box of a point cloud is decoded from a coded bitstream. The syntax information indicates an octree partitioning structure for the bounding box of the point cloud. Whether the syntax information indicates that parallel decoding is to be performed on occupancy codes of nodes in a range of one or more partitioning depths in the octree partitioning structure is determined. The parallel decoding is performed on the occupancy codes of the nodes in response to the syntax information indicating that the parallel decoding is to be performed on the occupancy codes of the nodes in the range of the one or more partitioning depths in the octree partitioning structure. The bounding box is reconstructed based on the occupancy codes of the nodes.
Type: Grant
Filed: September 2, 2020
Date of Patent: June 21, 2022
Assignee: TENCENT AMERICA LLC
Inventors: Xiang Zhang, Wen Gao, Shan Liu
-
Patent number: 11368661
Abstract: The embodiment of the present specification discloses an image synthesis method, apparatus and device for free-viewpoint.
Type: Grant
Filed: February 14, 2019
Date of Patent: June 21, 2022
Assignee: PEKING UNIVERSITY SHENZHEN GRADUATE SCHOOL
Inventors: Ronggang Wang, Sheng Wang, Zhenyu Wang, Wen Gao