Patents by Inventor Wen Gao

Wen Gao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

TEMPORAL DE-NOISING FOR VIDEO

Publication number: 20210385503

Abstract: A method, computer program, and computer system is provided for video coding. Video data including one or more frames is received. A static background is estimated for each of the one or more frames based on a temporal average of the one or more frames. Pixels from among the one or more frames are identified as corresponding to the static background. Noise is removed in the static background based on the identified pixels.

Type: Application

Filed: May 7, 2021

Publication date: December 9, 2021

Applicant: TENCENT AMERICA LLC

Inventors: Jun TIAN, Wen Gao, Shan Liu
An Image Synthesis Method, Apparatus and Device for Free-viewpoint

Publication number: 20210368153

Abstract: The embodiment of the present specification discloses an image synthesis method, apparatus and device for free-viewpoint.

Type: Application

Filed: February 14, 2019

Publication date: November 25, 2021

Inventors: Ronggang WANG, Sheng WANG, Zhenyu WANG, Wen GAO
Geometry model for point cloud coding

Patent number: 11158116

Abstract: A method, computer program, and computer system for point cloud coding is provided. Data corresponding to a point cloud is received, and one or more geometric features are detected from among the data corresponding to the point cloud. A representation is determined for one or more of the detected geometric features, and the received data is encoded or decoded based on the determined representations whereby the point cloud is reconstructed based on the decoded data.

Type: Grant

Filed: July 10, 2020

Date of Patent: October 26, 2021

Assignee: TENCENT AMERICA LLC

Inventors: Xiang Zhang, Wen Gao, Shan Liu
METHOD AND APPARATUS FOR POINT CLOUD CODING

Publication number: 20210329270

Abstract: In a method of point cloud geometry decoding in a point cloud decoder, chroma prediction residual information of a point in a set of points is received from a coded bitstream for a point cloud that includes the set of points. The chroma prediction residual information includes a Cb component and a Cr component. Further, a type of correlation between the Cb component and the Cr component of the chroma prediction residual information is determined by processing circuitry and from the coded bitstream for the point cloud. The chroma prediction residual information is decoded based on the type of the correlation between the Cb component and the Cr component of the chroma prediction residual information.

Type: Application

Filed: March 24, 2021

Publication date: October 21, 2021

Applicant: Tencent America LLC

Inventors: Sehoon YEA, Wen GAO, Shan LIU
Methods of intra picture block prediction for multi-view video compression

Patent number: 11146818

Abstract: A method, computer program, and computer system is provided for coding video data. Video data including a reference view and a current view is received. A co-located block in the reference view is identified for a current block in the current view. A predicted block vector is calculated based on an offset vector between the current block and the co-located block, and a disparity vector between the co-located block and the reference block in the reference view. The video data is encoded/decoded based on the calculated predicted block vector.

Type: Grant

Filed: September 21, 2020

Date of Patent: October 12, 2021

Assignee: TENCENT AMERICA LLC

Inventors: Jun Tian, Shan Liu, Xiaozhong Xu, Weiwei Feng, Wen Gao
METHOD AND APPARATUS FOR POINT CLOUD CODING

Publication number: 20210312667

Abstract: A method of point cloud geometry decoding in a point cloud decoder is provided. In the method, first signaling information is received from a coded bitstream for a point cloud that includes a set of points in a three-dimensional (3D) space. The first signaling information indicates partition information of the point cloud. Second signaling information is determined based on the first signaling information indicating a first value. The second signaling information is indicative of a partition mode of the set of points in the 3D space. Further, the partition mode of the set of points in the 3D space is determined based on the second signaling information. The point cloud is reconstructed subsequently based on the partition mode.

Type: Application

Filed: March 16, 2021

Publication date: October 7, 2021

Applicant: Tencent America LLC

Inventors: Sehoon YEA, Wen GAO, Xiang ZHANG, Shan LIU
Method and apparatus for temporal smoothing for video

Patent number: 11140416

Abstract: Aspects of the disclosure provide methods and apparatuses for video processing. In some examples, an apparatus for video processing includes processing circuitry. For example, processing circuitry determines a frame interval for a current block in a current frame within a sequence of frames. The frame interval indicates a group of frames in the sequence of frames with collocated blocks of the current block that satisfy an error metric requirement comparing to the current block. Further, the processing circuitry determines a replacement block based on the collocated blocks in the group of frames, and replaces the current block in the current frame with the replacement block.

Type: Grant

Filed: November 11, 2020

Date of Patent: October 5, 2021

Assignee: Tencent America LLC

Inventors: Jun Tian, Wen Gao, Shan Liu
METHOD OF CODING ATTRIBUTES FOR POINT CLOUD CODING

Publication number: 20210306664

Abstract: A method, a non-transitory computer readable medium, and a computer system is provided for encoding or decoding video data. The method may include: receiving an entropy coded bitstream comprising compressed video data including point cloud occupancy codes; generating one or more dequantized dimensions of a boundary box of a point cloud; based on determining that the compressed video data was predicted by using the attribute-based predictor, determining a predictor for decoding is the attribute-based predictor; based on determining that the compressed video data was predicted by using the attribute-based predictor, determining the predictor for decoding is the geometry-based predictor; and building an octree structure by using the determined predictor.

Type: Application

Filed: December 31, 2020

Publication date: September 30, 2021

Applicant: TENCENT AMERICA LLC

Inventors: Wen GAO, Xiang ZHANG, Shan LIU
METHODS OF CODING DUPLICATE AND ISOLATED POINTS FOR POINT CLOUD CODING

Publication number: 20210306663

Abstract: A method and apparatus for coding information of a point cloud that includes obtaining the point cloud including a set of points in a three-dimensional space; determining whether a current node in the set of points is isolated; and coding the current node in isolation mode based on a determination that the current node is isolated and coding the current node in non-isolation mode, based on a determination that the current node is not isolated.

Type: Application

Filed: October 29, 2020

Publication date: September 30, 2021

Applicant: TENCENT AMERICA LLC

Inventors: Xiang ZHANG, Wen GAO, Shan LIU
METHOD AND APPARATUS FOR TEMPORAL SMOOTHING FOR VIDEO

Publication number: 20210306668

Abstract: Aspects of the disclosure provide methods and apparatuses for video processing. In some examples, an apparatus for video processing includes processing circuitry. For example, processing circuitry determines a frame interval for a current block in a current frame within a sequence of frames. The frame interval indicates a group of frames in the sequence of frames with collocated blocks of the current block that satisfy an error metric requirement comparing to the current block. Further, the processing circuitry determines a replacement block based on the collocated blocks in the group of frames, and replaces the current block in the current frame with the replacement block.

Type: Application

Filed: November 11, 2020

Publication date: September 30, 2021

Applicant: Tencent America LLC

Inventors: Jun TIAN, Wen GAO, Shan LIU
MECHANICAL AND ELECTRONIC DUAL CONTROL WATER TAP

Publication number: 20210278009

Abstract: The invention discloses A mechanical and electronic dual control water tap, including faucet group, control box and sensor, the sensor is arranged on the faucet group, and the faucet group is provided with a mixing valve; the mixing valve is internally arranged a cold water inlet, a hot water inlet, a first mixed water outlet and a second mixed water outlet are arranged; the second mixed water outlet is a normal water outlet channel; the control box includes a detection control unit, a first inflow runner, a second inflow runner and a mixed outlet channel; the first mixed water outlet is connected with the first inflow runner, and the second mixed water outlet is connected with the second inflow runner. The first inflow runner and the second inflow are respectively provided with inductor and solenoid valves.

Type: Application

Filed: December 31, 2020

Publication date: September 9, 2021

Inventors: Yongsheng WANG, Yonglong ZHANG, Dingjun WANG, Wen GAO
Method of bidirectional image-text retrieval based on multi-view joint embedding space

Patent number: 11106951

Abstract: A bidirectional image-text retrieval method based on a multi-view joint embedding space includes: performing retrieval with reference to a semantic association relationship at a global level and a local level, obtaining the semantic association relationship at the global level and the local level in a frame-sentence view and a region-phrase view, and obtaining semantic association information in a global level subspace of frame and sentence in the frame-sentence view, obtaining semantic association information in a local level subspace of region and phrase in the region-phrase view, processing data by a dual-branch neural network in the two views to obtain an isomorphic feature and embedding the same in a common space, and using a constraint condition to reserve an original semantic relationship of the data during training, and merging the two semantic association relationships using multi-view merging and sorting to obtain a more accurate semantic similarity between data.

Type: Grant

Filed: January 29, 2018

Date of Patent: August 31, 2021

Assignee: Peking University Shenzhen Graduate Sohool

Inventors: Wenmin Wang, Lu Ran, Ronggang Wang, Ge Li, Shengfu Dong, Zhenyu Wang, Ying Li, Hui Zhao, Wen Gao
Method of using deep discriminate network model for person re-identification in image or video

Patent number: 11100370

Abstract: Disclosed is a deep discriminative network for person re-identification in an image or a video. Concatenation are carried out on different input images on a color channel by constructing a deep discriminative network, and an obtained splicing result is defined as an original difference space of different images. The original difference space is sent into a convolutional network. The network outputs the similarity between two input images by learning difference information in the original difference space, thereby realizing person re-identification. The features of an individual image are not learnt, and concatenation are carried out on input images on a color channel at the beginning, and difference information is learnt on an original space of the images by using a designed network. By introducing an Inception module and embedding the same into a model, the learning ability of a network can be improved, and a better differentiation effect can be achieved.

Type: Grant

Filed: January 23, 2018

Date of Patent: August 24, 2021

Assignee: Peking University Shenzhen Graduate School

Inventors: Wenmin Wang, Yihao Zhang, Ronggang Wang, Ge Li, Shengfu Dong, Zhenyu Wang, Ying Li, Hui Zhao, Wen Gao
Cross-media retrieval method based on deep semantic space

Publication number: 20210256365

Abstract: The present application discloses a cross-media retrieval method based on deep semantic space, which includes a feature generation stage and a semantic space learning stage. In the feature generation stage, a CNN visual feature vector and an LSTM language description vector of an image are generated by simulating a perception process of a person for the image; and topic information about a text is explored by using an LDA topic model, thus extracting an LDA text topic vector. In the semantic space learning phase, a training set image is trained to obtain a four-layer Multi-Sensory Fusion Deep Neural Network, and a training set text is trained to obtain a three-layer text semantic network, respectively. Finally, a test image and a text are respectively mapped to an isomorphic semantic space by using two networks, so as to realize cross-media retrieval. The disclosed method can significantly improve the performance of cross-media retrieval.

Type: Application

Filed: August 16, 2017

Publication date: August 19, 2021

Inventors: Wenmin Wang, Mengdi Fan, Peilei Dong, Ronggang Wang, Ge Li, Shengfu Dong, Zhenyu Wang, Ying Li, Hui Zhao, Wen Gao
CONTEXT MODELING OF OCCUPANCY CODING FOR POINT CLOUD CODING

Publication number: 20210256737

Abstract: A method for coding information of a point cloud comprises obtaining the point cloud including a set of points in a three-dimensional space; partitioning the point cloud into a plurality of objects and generating occupancy information for each of the plurality of objects; and encoding the occupancy information by taking into account the distance between the plurality of objects.

Type: Application

Filed: October 26, 2020

Publication date: August 19, 2021

Applicant: TENCENT AMERICA LLC

Inventors: Xiang Zhang, Wen Gao, Shan Liu
SPATIAL SCALABLE CODING FOR POINT CLOUD CODING

Publication number: 20210250594

Abstract: A method, a non-transitory computer readable medium, and a computer system is provided for encoding or decoding video data. The method may include: receiving an entropy coded bitstream comprising compressed video data including point cloud occupancy codes; generating one or more dequantized dimensions of a boundary box of a point cloud; based on determining that the node or node depth has attribute information, using the attribute information for the node or node depth; and based on determining that the node or node depth does not have attribute information, obtaining attribute information for the node or node depth by inheriting attribute information from a node or node depth that is at least one depth level in an octree of the point cloud above a depth level of the node or node depth in the octree.

Type: Application

Filed: December 17, 2020

Publication date: August 12, 2021

Applicant: TENCENT AMERICA LLC

Inventors: Wen GAO, Xiang ZHANG, Shan LIU
FLEXIBLE TREE PARTITION AND REPRESENTATION FOR POINT CLOUD CODING

Publication number: 20210250618

Abstract: A method of decoding encoded information of a point cloud may be performed by at least one processor and comprises: obtaining an encoded bitstream, the encoded bitstream including encoded information of a point cloud including a set of points in a three-dimensional space; and determining a type of partitioning used to encode the information of the point cloud by at least one of parsing signals of at least three binary syntaxes or inferring at least one syntax of the at least three binary syntaxes.

Type: Application

Filed: October 30, 2020

Publication date: August 12, 2021

Applicant: TENCENT AMERICA LLC

Inventors: Xiang Zhang, Wen Gao, Shan Liu
NODE-BASED GEOMETRY AND ATTRIBUTE CODING FOR A POINT CLOUD

Publication number: 20210248784

Abstract: A method and apparatus for coding information of a point cloud may be performed by at least one processor and comprises: obtaining the point cloud including a set of points in a three-dimensional space; partitioning the point cloud into a tree structure comprising a plurality of nodes at different depths; encoding geometry information of the nodes; and encoding attribute information of the nodes before the entire point cloud is partitioned.

Type: Application

Filed: October 30, 2020

Publication date: August 12, 2021

Applicant: TENCENT AMERICA LLC

Inventors: Wen GAO, Xiang ZHANG, Shan LIU
PREDICTIVE TREE CODING FOR POINT CLOUD CODING

Publication number: 20210248785

Abstract: A method and device for decoding a point cloud using octree partitioning and a predictive tree include obtaining the point cloud. A bounding box of the point cloud is determined. Octree nodes are generated by partitioning the bounding box using octree partitioning. The predictive tree is generated for points in at least one octree node of the octree nodes. A transform is applied to the predictive tree. The points in the at least one octree node are decoded using the predictive tree.

Type: Application

Filed: November 13, 2020

Publication date: August 12, 2021

Applicant: TENCENT AMERICA LLC

Inventors: Xiang ZHANG, Wen Gao, Shan Liu
METHOD AND DEVICE OF ENCODING AND DECODING BASED ON FREE VIEWPOINT

Publication number: 20210250613

Abstract: The present application provides a method and a device of encoding and decoding based on free viewpoint, and relates to the technical field of video encoding. The method includes: generating a planar splicing image and splice information based on multiple single-viewpoint videos at a server side; generating a planar splicing video based on the planar splicing image; generating camera side information of the planar splicing video based on camera side information existing in the multiple single-viewpoint videos; and encoding the planar splicing video, the splice information and the camera side information of the planar splicing video to generate a planar splicing video bit stream, and decoding a planar splicing video bit stream to acquire a virtual viewpoint according to viewpoint information of a viewer at client side.

Type: Application

Filed: April 8, 2019

Publication date: August 12, 2021

Inventors: Ronggang WANG, Zhenyu WANG, Wen GAO

prev … 6 7 8 9 10 11 12 13 14 … next