Patents Assigned to Tencent America LLC

Signaling schemes in events message tracks

Patent number: 11838341

Abstract: Systems, devices, and methods for obtaining an event message track configured to be processed by a Dynamic Adaptive Streaming over HyperText Transfer Protocol (DASH) client for displaying media content, the event message track corresponds to a plurality of events comprising one or more of Media Presentation Description (MPD) events, inband events, timed metadata, Common Media Application Format (CMAF) events, and DASH events; obtaining a scheme identifier box identifying one or more event schemes that appear in the event message track, wherein each event scheme of the one or more event schemes specifies at least one parameter for processing an event among the plurality of events included in the event message track; and providing the event message track to the DASH client or displaying media content based on the event message track according to the one or more event schemes.

Type: Grant

Filed: March 25, 2022

Date of Patent: December 5, 2023

Assignee: TENCENT AMERICA LLC

Inventor: Iraj Sodagar
Function repository selection mode and signaling for cloud based processing

Patent number: 11838390

Abstract: A method and system for processing media content by at least one processor is provided. The method includes: obtaining a list of function repositories, each function repository storing one or more functions for processing the media content, obtaining a descriptor identifying a location of each of the function repositories identified in the list of function repositories, selecting a function repository, from among the function repositories, according to a criterion, and processing the media content using a function included in the selected function repository.

Type: Grant

Filed: May 3, 2022

Date of Patent: December 5, 2023

Assignee: TENCENT AMERICA LLC

Inventor: Iraj Sodagar
Techniques for signaling audio mixing gain in teleconferencing and telepresence for remote terminals

Patent number: 11838347

Abstract: A method and system for audio mixing in a teleconference using a Session Description Protocol (SDP) and Real-time Transport Protocol (RTP) signaling of audio mixing gain is provided. The method includes receiving an input audio stream from a 360-degree video/audio stream that includes an audio mixing gain which includes audio gain from the input audio stream and audio gain from overlay audio streams, declaring a RTP header extension for the input audio stream using the SDP, the RTP header extension, and signaling the audio mixing gain based on a single attribute of the SDP using the RTP header extension. An element of the RTP header extension may include an element identifier, a length of an extension element, and a magnitude of the audio mixing gain of the RTP header extension. The RTP header extension may use a one-byte or two-byte header extension format which is declared using the SDP.

Type: Grant

Filed: March 11, 2022

Date of Patent: December 5, 2023

Assignee: TENCENT AMERICA LLC

Inventors: Rohit Abhishek, Iraj Sodagar
Method and apparatus for video coding

Patent number: 11838522

Abstract: Aspects of the disclosure provide methods and apparatuses for video processing. In some examples, an apparatus for video processing includes processing circuitry. The processing circuitry converts a picture in a subsampled format in a color space into a non subsampled format in the color space. Then, the processing circuitry clips values of a color component of the picture in the non subsampled format before providing the picture in the non subsampled format as an input to a neural network based filter.

Type: Grant

Filed: August 31, 2021

Date of Patent: December 5, 2023

Assignee: TENCENT AMERICA LLC

Inventors: Cheung Auyeung, Xiang Li, Shan Liu
DYNAMIC MESH COMPRESSION USING INTER AND INTRA PREDICTION

Publication number: 20230388544

Abstract: There is includes a method and apparatus comprising computer code configured to cause a processor or processors to perform obtaining volumetric data of at least one three-dimensional (3D) visual content, obtaining a patch by dividing a plurality of 3D meshes from the volumetric data, the patch including vertices of at least one of the 3D meshes, forming a prediction group comprising a subset of the vertices of the patch, and signaling a prediction mode of the prediction group collectively for the subset of the plurality of vertices of the patch.

Type: Application

Filed: April 19, 2023

Publication date: November 30, 2023

Applicant: TENCENT AMERICA LLC

Inventors: Xiaozhong XU, Jun Tian, Xiang Zhang, Chao Huang, Shan Liu
METHOD AND APPARATUS FOR TEMPORAL INTERPOLATED PREDICTION IN VIDEO BITSTREAM

Publication number: 20230388529

Abstract: A video decoder is provided for the decoding of a video bitstream encoded in a temporal interpolated prediction (TIP) mode. First and second motion vectors pointing to respective reference frames, or reference pictures within those frames, are generated for a block of a current picture. The motion vectors are then refined by application of a decoder-side motion vector refinement (DMVR) process, based on a bilateral matching process, and the refined motion vectors are used to decode the block. The refinement may more specifically involve consideration of candidates for the refined motion vectors, selected by the bilateral matching. The refinement may be applied at both block and sub-block divisions of the current picture.

Type: Application

Filed: November 7, 2022

Publication date: November 30, 2023

Applicant: TENCENT AMERICA LLC

Inventors: Han GAO, Xin Zhao, Liang Zhao, Shan Liu
TEMPORAL PREDICTION BASED VERTEX POSITION COMPRESSION

Publication number: 20230388543

Abstract: A plurality of neighboring vertices of a current vertex in a current frame of a mesh is determined. The current frame corresponds to the mesh at a first time instance. Each of the plurality of neighboring vertices is connected to the current vertex through a respective edge in the mesh. A plurality of neighboring estimation errors of the plurality of neighboring vertices is determined. Each of the plurality of neighboring estimation errors indicates a difference between a reference vertex of a corresponding one of the plurality of neighboring vertices in a reference frame and the corresponding one of the plurality of neighboring vertices in the current frame. The reference frame corresponds to the mesh at a second time instance. A prediction residue of the current vertex is determined based on the plurality of neighboring estimation errors. Prediction information of the current vertex is generated based on the determined prediction residue.

Type: Application

Filed: March 28, 2023

Publication date: November 30, 2023

Applicant: Tencent America LLC

Inventors: Jun TIAN, Chao HUANG, Xiang ZHANG, Xiaozhong XU, Shan LIU
CONDITIONAL FACTORIZATION FOR JOINTLY MODELING CODE-SWITCHED AND MONOLINGUAL ASR

Publication number: 20230386507

Abstract: A method, apparatus, and non-transitory computer-readable medium for automatic speech recognition using conditional factorization for bilingual code-switched and monolingual speech may include receiving an audio observation sequence comprising a plurality of frames, the audio observation sequence including audio in a first language or a second language. The approach may further include mapping the audio observation sequence into a first sequence of hidden representations, the mapping being generated by a first encoder corresponding to the first language and mapping the audio observation sequence into a second sequence of hidden representations, the mapping being generated by a second encoder corresponding to the second language. The approach may further include generating a label-to-frame sequence based on the first sequence of hidden representations and the second sequence of hidden representations, using a joint neural network based model.

Type: Application

Filed: May 31, 2022

Publication date: November 30, 2023

Applicant: TENCENT AMERICA LLC

Inventors: Chunlei ZHANG, Brian YAN, Dong YU
TOOLS FOR CONFORMANCE OF NETWORK-BASED MEDIA PROCESSING (NBMP) DOCUMENTS AND ENTITIES

Publication number: 20230385544

Abstract: Methods, apparatus, and computer readable storage medium for verifying NBMP documents and entities. One method may include invoking an Application Programming Interface (API) corresponding to an API operation supported by an NBMP entity, the API operation being related to at least one of: a create operation; an update operation; a retrieve operation; or a delete operation; receiving a response from the NBMP entity; and determining, based on the response, whether the NBMP entity passes an API test corresponding to the API operation.

Type: Application

Filed: April 17, 2023

Publication date: November 30, 2023

Applicant: Tencent America LLC

Inventor: Iraj SODAGAR
SUBBLOCK LEVEL TEMPORAL MOTION VECTOR PREDICTION WITH MULTIPLE DISPLACEMENT VECTOR PREDICTORS AND AN OFFSET

Publication number: 20230388513

Abstract: Aspects of the disclosure provide a method and an apparatus including processing circuitry that receives a coded video bitstream comprising a current picture that includes a current block. The processing circuitry determines, based on a syntax element in the coded video bitstream, that the current block including a plurality of subblocks is coded in a subblock-based temporal motion vector prediction (SbTMVP) mode. The processing circuitry determines a plurality of displacement vector (DV) predictor (DVP) candidates and receives a base index indicating a DVP in the plurality of DVP candidates and a DV offset of the current block. The processing circuitry determines a DV based on the DVP and the DV offset. The DV indicates a block collocated with the current block in a collocated reference picture. The processing circuitry reconstructs a subblock in the plurality of subblocks based on motion information of a corresponding subblock in the collocated block.

Type: Application

Filed: November 9, 2022

Publication date: November 30, 2023

Applicant: Tencent America LLC

Inventors: Lien-Fei CHEN, Guichun LI, Xin ZHAO, Shan LIU
SUBBLOCK BASED MOTION VECTOR PREDICTOR DISPLACEMENT VECTOR REORDERING USING TEMPLATE MATCHING

Publication number: 20230388485

Abstract: Aspects of the disclosure provide a method and an apparatus for video encoding/decoding. The apparatus includes processing circuitry for: receiving prediction information of a current coding block in a current picture from a coded video bitstream, the prediction information indicating that the current coding block is coded using a subblock-based temporal motion vector prediction (SbTMVP) mode; deriving multiple displacement vector (DV) candidates by applying multiple DV offset candidates to a fixed DV predictor of the current coding block; comparing a template of the current coding block with each of multiple templates, each template of the multiple templates being located at a position specified by a corresponding one of the multiple DV candidates; calculating a cost value associated with each one of the multiple DV offset candidates based on the comparing; and reordering DV offset indices of the multiple DV offset candidates based on their calculated cost values.

Type: Application

Filed: November 10, 2022

Publication date: November 30, 2023

Applicant: Tencent America LLC

Inventors: Han GAO, Lien-Fei CHEN, Guichun LI, Xin ZHAO, Shan LIU
METHOD AND APPARATUS FOR ASYMMETRIC BLENDING OF PREDICTIONS OF PARTITIONED PICTURES

Publication number: 20230388484

Abstract: A decoding device is provided for decoding of a bitstream, and more specifically for predicting a picture area of an input image for decoding. The picture area has been divided into at least first and second parts by a partitioning boundary, and the pixels of each part are predicted according to suitable measures. Blending masks are then applied to generate blended regions for the predicted pixels, modifying the pixels to generate a complete prediction of the picture area based on both parts. The blending mask is based on first and second thresholds, which are defined relative to the partitioning boundary, and which may have different values to produce an asymmetrical blending relative to the boundary. Based on suitable threshold values, the prediction, combination, and decoding of the picture area is more adaptive to differing contents of the parts therein.

Type: Application

Filed: November 8, 2022

Publication date: November 30, 2023

Applicant: Tencent America LLC

Inventors: Han GAO, Xin ZHAO, Lien-fei CHEN, Shan LIU
SIGNALING OF DOWNSAMPLING FILTERS FOR CHROMA FROM LUMA INTRA PREDICTION MODE

Publication number: 20230388540

Abstract: Methods and apparatuses for performing chroma from luma (CfL) intra prediction, including: receiving a current block from a coded video bitstream; obtaining, from the coded video bitstream, a syntax element indicating which of two or more downsampling filters is used for predicting the current block in a CfL intra prediction mode; in response to the syntax element indicating that a first downsampling filter is used for the current block: determining a plurality of filter coefficients according to the first downsampling filter; and downsampling the current block based on the determined plurality of coefficients using a first number of sampling positions; in response to the syntax element indicating that a second downsampling filter is used for the current block: determining the plurality of filter coefficients according to the second downsampling filter; downsampling the current block based on the determined plurality of coefficients using a second number of sampling positions, wherein the second number of sam

Type: Application

Filed: November 9, 2022

Publication date: November 30, 2023

Applicant: TENCENT AMERICA LLC

Inventors: Jing YE, Xin Zhao, Liang Zhao, Shan Liu
METHOD FOR DERIVATION OF PICTURE OUTPUT FOR NON-REFERENCED PICTURE IN CODED VIDEO STREAM

Publication number: 20230388487

Abstract: A method, computer program, and computer system is provided for coding video data. Video data including a current picture and one or more other pictures is received. A first flag corresponding to whether the current picture is referenced by the one or more other pictures in a decoding order is checked. A second flag corresponding to whether the current picture is output; is checked. The video data is decoded based on values corresponding to the first flag and the second flag.

Type: Application

Filed: May 1, 2023

Publication date: November 30, 2023

Applicant: TENCENT AMERICA LLC

Inventors: Byeongdoo CHOI, Shan LIU, Stephan WENGER
TECHNIQUES FOR IMPROVED ZERO-SHOT VOICE CONVERSION WITH A CONDITIONAL DISENTANGLED SEQUENTIAL VARIATIONAL AUTO-ENCODER

Publication number: 20230386479

Abstract: A method, system, apparatus, and computer-readable medium for voice conversion using a conditional disentangled sequential variational auto-encoder (C-DSVAE) is provided. The method, performed by at least one processor, includes receiving input speech segments, encoding the input speech segments via a shared encoder to generate a speaker embedding and a content embedding, and encoding a posterior distribution of the speaker embedding via a speaker encoder and encoding a posterior distribution of the content embedding via a content encoder to obtain encoded results. The method further includes enabling a content bias, reshaping the content embedding using the content bias, and generating a reconstructed speech output based on the encoded results and the reshaped content embedding.

Type: Application

Filed: May 27, 2022

Publication date: November 30, 2023

Applicant: Tencent America LLC

Inventors: Chunlei ZHANG, Jiachen LIAN, Dong YU
Method and apparatus for video coding

Patent number: 11831911

Abstract: Aspects of the disclosure provide methods and apparatuses for video encoding/decoding. In some examples, an apparatus for video decoding includes receiving circuitry and processing circuitry. For example, the processing circuitry receives a bitstream including bits corresponding to residues associated with a prediction of a block in a coding tree unit (CTU) and determines a maximum transform size based on the CTU size. The maximum transform size is used in a partition to form, from the residues, one or more transform blocks that are respectively encoded into the bits. Then, the processing circuitry reconstructs based on the maximum transform size and from the bits the residues associated with the prediction of the block, and then the processing circuitry reconstructs samples of the block based on the reconstructed residues and the prediction of the block.

Type: Grant

Filed: March 30, 2022

Date of Patent: November 28, 2023

Assignee: Tencent America LLC

Inventors: Xin Zhao, Xiang Li, Shan Liu
Method and apparatus for improved implicit transform selection

Patent number: 11831889

Abstract: In a method of video encoding for an encoder, a determination is made to determine whether (i) an implicit transform scheme is enabled, and (ii) at least one of a low-frequency non-separable transform (LFNST) and a matrix-based intra predication mode (MIP) is invalid for a coding unit (CU). In response to the determination that the implicit transform scheme is enabled, and at least one of the LFNST and MIP is invalid, a primary transform type is determined based on a size of the CU. A primary transform is performed for a transform block that is partitioned from the CU in accordance with the determined primary transform type. A coded bitstream that indicates the primary transform type of the CU is subsequently output.

Type: Grant

Filed: October 4, 2021

Date of Patent: November 28, 2023

Assignee: TENCENT AMERICA LLC

Inventors: Xin Zhao, Xiang Li, Shan Liu
Method and apparatus for video coding

Patent number: 11831920

Abstract: Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for video encoding/decoding. An apparatus includes processing circuitry that receives metadata associated with a coded video bitstream. The metadata includes labeling information of one or more objects detected in a first picture that is coded in the coded video bitstream. The processing circuitry decodes the labeling information of the one or more objects in the first picture that is coded in the coded video bitstream. The processing circuitry applies the labeling information to the one or more objects in the first picture.

Type: Grant

Filed: August 27, 2021

Date of Patent: November 28, 2023

Assignee: TENCENT AMERICA LLC

Inventors: Shan Liu, Xiaozhong Xu, Wen Gao
ADJACENT SPATIAL MOTION VECTOR PREDICTOR CANDIDATES IMPROVEMENT

Publication number: 20230379452

Abstract: Method, device, and non-transitory storage medium for motion vector prediction (MVP) list construction for video coding are provided. One or more motion vector (MV) candidates may be retrieved from a reference MV bank, the one or more MV candidates being associated with a current block. A position to insert the one or more MV candidates from the reference MV bank into an MVP list associated with the current block is determined. The one or more MV candidates from the reference MV bank is inserted into the MVP list associated with the current block based on the position.

Type: Application

Filed: September 9, 2022

Publication date: November 23, 2023

Applicant: TENCENT AMERICA LLC

Inventors: Han GAO, Xin ZHAO, Liang ZHAO, Shan LIU
CANDIDATE LIST CONSTRUCTION IN INTRA-INTER BLENDING MODE

Publication number: 20230379456

Abstract: A method of video decoding includes creating a candidate list for a current block in a current picture included in a coded video bitstream. The method further includes determining a coding mode for a candidate block associated with the current block. The method further includes determining whether to add, to the candidate list, a vector associated the candidate block based on the determined coding mode. The method further includes reconstructing the current block using at least one candidate from the candidate list.

Type: Application

Filed: August 5, 2023

Publication date: November 23, 2023

Applicant: TENCENT AMERICA LLC

Inventors: Xiaozhong XU, Xiang LI, Shan LIU

prev … 18 19 20 21 22 23 24 25 26 … next