Patents by Inventor Wen Gao

Wen Gao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11417030
    Abstract: A method for coding information of a point cloud comprises obtaining the point cloud including a set of points in a three-dimensional space; partitioning the point cloud into a plurality of objects and generating occupancy information for each of the plurality of objects; and encoding the occupancy information by taking into account the distance between the plurality of objects.
    Type: Grant
    Filed: October 26, 2020
    Date of Patent: August 16, 2022
    Assignee: TENCENT AMERICA LLC
    Inventors: Xiang Zhang, Wen Gao, Shan Liu
  • Patent number: 11417029
    Abstract: Aspects of the disclosure provide methods and apparatuses for point cloud compression. In some examples, an apparatus for point cloud compression includes processing circuitry. In some embodiments, the processing circuitry determines one or more original points in a point cloud that are associated with a reconstructed position. Positions of the one or more original points can be reconstructed, according to a geometry quantization, to the reconstructed position. The processing circuitry then determines an attribute value for the reconstructed position based on attribute information of the one or more original points, and encodes texture of the point cloud with the reconstructed position having the determined attribute value.
    Type: Grant
    Filed: October 6, 2020
    Date of Patent: August 16, 2022
    Assignee: Tencent America LLC
    Inventors: Xiang Zhang, Wen Gao, Shan Liu
  • Publication number: 20220254057
    Abstract: Provided are a point cloud attribute prediction method and prediction device based on a filter. The method comprises a coding method and a decoding method, and the device comprises a coding device and a decoding device. The method comprises: determining K nearest neighbor points of the current point; determining a filter matrix; and determining an attribute prediction value of the current point according to the filter matrix. Therefore, the compression performance of a point cloud attribute can be improved by means of selecting an appropriate filter.
    Type: Application
    Filed: May 11, 2020
    Publication date: August 11, 2022
    Inventors: Ge LI, Chuang MA, Jing WANG, Yi Ting SHAO, Wen GAO
  • Publication number: 20220248053
    Abstract: Aspects of the disclosure provide methods, apparatuses, and a non-transitory computer-readable medium for point cloud coding. In a method, when parallel octree coding is enabled for occupancy codes of nodes in an octree partitioning structure of the point cloud, syntax information of the point cloud is decoded from a coded bitstream and a bitstream offset of an octree depth is determined. The syntax information indicates a bitstream length of the octree depth at which the parallel octree coding is enabled. Parallel decoding is performed on the occupancy codes of the nodes of the octree depth based on the bitstream offset and the bitstream length of the octree depth. Further, the point cloud is reconstructed based on the occupancy codes of the nodes.
    Type: Application
    Filed: April 19, 2022
    Publication date: August 4, 2022
    Applicant: Tencent America LLC
    Inventors: Xiang ZHANG, Wen GAO, Shan LIU
  • Publication number: 20220248054
    Abstract: Aspects of the disclosure include a method for point cloud coding. In the method, whether decoding of occupancy codes of nodes in a range of octree partition depths in an octree partitioning structure of a point cloud reaches a minimum octree partition depth at which parallel decoding is enabled is determined. Arithmetic coding information for decoding the occupancy codes of the nodes in the minimum octree partition depth is stored based on the decoding of the occupancy codes of the nodes in the range of octree partition depths reaching the minimum octree partition depth. The parallel decoding is performed on occupancy codes of the nodes in each of the at least one remaining octree partitions depth based on the stored arithmetic coding information. The point cloud is reconstructed based on the occupancy codes of the nodes in the range of octree partition depths in the octree partitioning structure.
    Type: Application
    Filed: April 19, 2022
    Publication date: August 4, 2022
    Applicant: Tencent America LLC
    Inventors: Xiang ZHANG, Wen GAO, Shan LIU
  • Publication number: 20220245775
    Abstract: A tone mapping method is provided. This method includes: obtaining one or a plurality of high dynamic range images, and determining a storage format of each high dynamic range image; performing an image decomposition on the high dynamic range image to obtain a first component, a second component and a third component, when the storage format of the high dynamic range image is determined as a predetermined storage format; inputting the first component and the second component into a predetermined deep neural network, and using the deep neural network to perform mapping on the first component and the second component respectively to obtain a first mapped component and a second mapped component; and fusing the first mapped component and the second mapped component with the third component to obtain a fused low dynamic range image corresponding to the high dynamic range image.
    Type: Application
    Filed: April 20, 2022
    Publication date: August 4, 2022
    Inventors: Ronggang WANG, Ning ZHANG, Wen GAO
  • Patent number: 11399192
    Abstract: A method, a non-transitory computer readable medium, and a computer system is provided for encoding or decoding video data. The method may include: receiving an entropy coded bitstream comprising compressed video data including point cloud occupancy codes; generating one or more dequantized dimensions of a boundary box of a point cloud; based on determining that the node or node depth has attribute information, using the attribute information for the node or node depth; and based on determining that the node or node depth does not have attribute information, obtaining attribute information for the node or node depth by inheriting attribute information from a node or node depth that is at least one depth level in an octree of the point cloud above a depth level of the node or node depth in the octree.
    Type: Grant
    Filed: December 17, 2020
    Date of Patent: July 26, 2022
    Assignee: TENCENT AMERICA LLC
    Inventors: Wen Gao, Xiang Zhang, Shan Liu
  • Patent number: 11397890
    Abstract: The present application discloses a cross-media retrieval method based on deep semantic space, which includes a feature generation stage and a semantic space learning stage. In the feature generation stage, a CNN visual feature vector and an LSTM language description vector of an image are generated by simulating a perception process of a person for the image; and topic information about a text is explored by using an LDA topic model, thus extracting an LDA text topic vector. In the semantic space learning phase, a training set image is trained to obtain a four-layer Multi-Sensory Fusion Deep Neural Network, and a training set text is trained to obtain a three-layer text semantic network, respectively. Finally, a test image and a text are respectively mapped to an isomorphic semantic space by using two networks, so as to realize cross-media retrieval. The disclosed method can significantly improve the performance of cross-media retrieval.
    Type: Grant
    Filed: August 16, 2017
    Date of Patent: July 26, 2022
    Assignee: Peking University Shenzhen Graduate School
    Inventors: Wenmin Wang, Mengdi Fan, Peilei Dong, Ronggang Wang, Ge Li, Shengfu Dong, Zhenyu Wang, Ying Li, Hui Zhao, Wen Gao
  • Publication number: 20220232253
    Abstract: In some examples, an apparatus for point cloud compression/decompression includes processing circuitry. The processing circuitry determines a flag that indicates an enable/disable control for saving coding state in a largest coding unit (LCU) based coding of a point cloud. In some examples, the processing circuitry stores coding state information before a coding of a first LCU; and in response to the flag indicating an enable control, the processing circuitry restores, a coding state according to the stored coding state information before a coding of a second LCU. In some examples, in response to the flag indicating an enable control, the processing circuitry stores the coding state information before the coding of the first LCU. In some examples, in response to the flag indicating a disable control, the processing circuitry skip the storing/restoring of the coding state information.
    Type: Application
    Filed: August 17, 2021
    Publication date: July 21, 2022
    Applicant: TENCENT AMERICA LLC
    Inventors: Wen GAO, Xiang ZHANG, Shan LIU
  • Publication number: 20220219189
    Abstract: The present invention discloses a shower device, comprising a shower and a base, the base is provided with a convex boss, the back of the shower 1 is provided with a groove matching the convex boss, and the base is also provided with an elastic buckle, the shower is provided with a clamping slot that can be matched and clamped with the elastic buckle, and another socket for hooking up the shower is also provided on the base. Through the design of the above-mentioned structure, the invention specifically uses the cooperation relationship between the clamping slot and the groove to form a lock position in a certain, so as to avoid the accident that the shower falls off from the base and accidentally injures the child. At the same time, a socket is arranged at the height of the side end of the base, which meets the needs of people who need high-altitude shower, and enhance diversity and flexibility.
    Type: Application
    Filed: December 30, 2021
    Publication date: July 14, 2022
    Applicant: RUNNER(XIAMEN) CORP.
    Inventors: bi fu Dai, xin zhan HU, wen Gao, tao yan Zhang, gang Xu, zhi liang Lin, wei zhen Chen, yong Lin, lu Lu
  • Publication number: 20220224943
    Abstract: Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for video encoding/decoding. An apparatus includes processing circuitry that receives metadata associated with a coded video bitstream. The metadata includes labeling information of one or more objects detected in a first picture that is coded in the coded video bitstream. The processing circuitry decodes the labeling information of the one or more objects in the first picture that is coded in the coded video bitstream. The processing circuitry applies the labeling information to the one or more objects in the first picture.
    Type: Application
    Filed: August 27, 2021
    Publication date: July 14, 2022
    Applicant: TENCENT AMERICA LLC
    Inventors: Shan LIU, Xiaozhong Xu, Wen Gao
  • Patent number: 11381812
    Abstract: Disclosed is a boundary filtering method for intra prediction, relating to the video encoding technology filed. Whether boundary filtering is performed on an intra prediction block or not is adaptively selected by means of a rate distortion optimization decision; during filtering, a filter coefficient exponentially attenuated relative to distance to boundary is adopted to perform filtering on the first N rows or the first N columns of the intra prediction block by means of an intra prediction block filter, and different filtering strengths are used according to different sizes of the prediction blocks. Therefore, the boundary distortion problem of intra prediction block is solved, the intra prediction precision is improved, and the encoding efficiency of intra prediction block is increased; and the practicability and the robustness of the boundary filtering technology are improved.
    Type: Grant
    Filed: September 25, 2018
    Date of Patent: July 5, 2022
    Assignee: PEKING UNIVERSITY SHENZHEN GRADUATE SCHOOL
    Inventors: Ronggang Wang, Kui Fan, Ge Li, Wen Gao
  • Patent number: 11379711
    Abstract: A video action detection method based on a convolutional neural network (CNN) is disclosed in the field of computer vision recognition technologies. A temporal-spatial pyramid pooling layer is added to a network structure, which eliminates limitations on input by a network, speeds up training and detection, and improves performance of video action classification and time location. The disclosed convolutional neural network includes a convolutional layer, a common pooling layer, a temporal-spatial pyramid pooling layer and a full connection layer. The outputs of the convolutional neural network include a category classification output layer and a time localization calculation result output layer. The disclosed method does not require down-sampling to obtain video clips of different durations, but instead utilizes direct input of the whole video at once, improving efficiency.
    Type: Grant
    Filed: August 16, 2017
    Date of Patent: July 5, 2022
    Assignee: Peking University Shenzhen Graduate School
    Inventors: Wenmin Wang, Zhihao Li, Ronggang Wang, Ge Li, Shengfu Dong, Zhenyu Wang, Ying Li, Hui Zhao, Wen Gao
  • Patent number: 11381840
    Abstract: A method of point cloud geometry decoding in a point cloud decoder can include receiving a bitstream including a slice of a coded point cloud frame, and reconstructing an octree representing a geometry of points in a bounding box of the slice where a current node of the octree is partitioned with a quadtree (QT) partition or a binary tree (BT) partition.
    Type: Grant
    Filed: June 23, 2020
    Date of Patent: July 5, 2022
    Assignee: TENCENT AMERICA LLC
    Inventors: Xiang Zhang, Wen Gao, Sehoon Yea, Shan Liu
  • Publication number: 20220210472
    Abstract: A method, a non-transitory computer readable medium, and a computer system is provided for encoding or decoding video data. The method may include: receiving an entropy coded bitstream comprising compressed video data including point cloud occupancy codes; generating one or more dequantized dimensions of a boundary box of a point cloud; based on determining that the compressed video data was predicted by using the attribute-based predictor, determining a predictor for decoding is the attribute-based predictor; based on determining that the compressed video data was predicted by using the attribute-based predictor, determining the predictor for decoding is the geometry-based predictor; and building an octree structure by using the determined predictor.
    Type: Application
    Filed: March 17, 2022
    Publication date: June 30, 2022
    Applicant: TENCENT AMERICA LLC
    Inventors: Wen GAO, Xiang ZHANG, Shan LIU
  • Publication number: 20220201321
    Abstract: Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for video encoding/decoding. An apparatus includes processing circuitry that decodes prediction information of a current block in a current picture that is a part of a video bitstream. The processing circuitry determines that a first plurality of coding tools is enabled and a second plurality of coding tools is disabled for the current block based on a syntax element included in the video bitstream. The first plurality of coding tools includes a deblocking filter. The second plurality of coding tools includes at least one of a sample adaptive offset filter, an intra sub-partitioning, and a matrix based intra prediction. The processing circuitry reconstructs the current block based on the first plurality of coding tools being enabled and the second plurality of coding tools being disabled.
    Type: Application
    Filed: September 3, 2021
    Publication date: June 23, 2022
    Applicant: TENCENT AMERICA LLC
    Inventors: Wen GAO, Xiaozhong Xu, Shan Liu
  • Publication number: 20220196580
    Abstract: The embodiments herein relate to defect inspection methods of semiconductor wafers during the manufacturing process. According to an aspect of the present disclosure, a defect inspection system is provided. The defect inspection system includes a first inspection system, pattern simulator software, and a second inspection system. The first inspection system is capable of determining a plurality of defect locations on an article. The pattern simulator software is capable of generating a set of simulated pattern features from the plurality of defect locations. The second inspection system is capable of providing a higher graphical resolution of defects than the first inspection at the defect locations corresponding to the set of simulated pattern features.
    Type: Application
    Filed: December 21, 2020
    Publication date: June 23, 2022
    Inventors: HAIZHOU YIN, CHENLONG MIAO, SHAO WEN GAO, MICHAEL WOJTOWECZ, TAMER DESOUKY
  • Publication number: 20220201331
    Abstract: A method and apparatus for coding information of a point cloud that includes obtaining the point cloud including a set of points in a three-dimensional space; determining whether a current node in the set of points is isolated; and coding the current node in isolation mode based on a determination that the current node is isolated and coding the current node in non-isolation mode, based on a determination that the current node is not isolated.
    Type: Application
    Filed: March 14, 2022
    Publication date: June 23, 2022
    Applicant: TENCENT AMERICA LLC
    Inventors: Xiang ZHANG, Wen Gao, Shan Liu
  • Patent number: 11368717
    Abstract: Aspects of the disclosure provide methods, apparatuses, and a non-transitory computer-readable medium for point cloud compression and decompression. In a method, syntax information of a bounding box of a point cloud is decoded from a coded bitstream. The syntax information indicates an octree partitioning structure for the bounding box of the point cloud. Whether the syntax information indicates that parallel decoding is to be performed on occupancy codes of nodes in a range of one or more partitioning depths in the octree partitioning structure is determined. The parallel decoding is performed on the occupancy codes of the nodes in response to the syntax information indicating that the parallel decoding is to be performed on the occupancy codes of the nodes in the range of the one or more partitioning depths in the octree partitioning structure. The bounding box is reconstructed based on the occupancy codes of the nodes.
    Type: Grant
    Filed: September 2, 2020
    Date of Patent: June 21, 2022
    Assignee: TENCENT AMERICA LLC
    Inventors: Xiang Zhang, Wen Gao, Shan Liu
  • Patent number: 11368661
    Abstract: The embodiment of the present specification discloses an image synthesis method, apparatus and device for free-viewpoint.
    Type: Grant
    Filed: February 14, 2019
    Date of Patent: June 21, 2022
    Assignee: PEKING UNIVERSITY SHENZHEN GRADUATE SCHOOL
    Inventors: Ronggang Wang, Sheng Wang, Zhenyu Wang, Wen Gao