Patents Assigned to Tencent America LLC
  • Patent number: 11838341
    Abstract: Systems, devices, and methods for obtaining an event message track configured to be processed by a Dynamic Adaptive Streaming over HyperText Transfer Protocol (DASH) client for displaying media content, the event message track corresponding to a plurality of events comprising one or more of Media Presentation Description (MPD) events, inband events, timed metadata, Common Media Application Format (CMAF) events, and DASH events; obtaining a scheme identifier box identifying one or more event schemes that appear in the event message track, wherein each event scheme of the one or more event schemes specifies at least one parameter for processing an event among the plurality of events included in the event message track; and providing the event message track to the DASH client or displaying media content based on the event message track according to the one or more event schemes.
    Type: Grant
    Filed: March 25, 2022
    Date of Patent: December 5, 2023
    Assignee: TENCENT AMERICA LLC
    Inventor: Iraj Sodagar
  • Patent number: 11838390
    Abstract: A method and system for processing media content by at least one processor is provided. The method includes: obtaining a list of function repositories, each function repository storing one or more functions for processing the media content, obtaining a descriptor identifying a location of each of the function repositories identified in the list of function repositories, selecting a function repository, from among the function repositories, according to a criterion, and processing the media content using a function included in the selected function repository.
    Type: Grant
    Filed: May 3, 2022
    Date of Patent: December 5, 2023
    Assignee: TENCENT AMERICA LLC
    Inventor: Iraj Sodagar
  • Patent number: 11838347
    Abstract: A method and system for audio mixing in a teleconference using a Session Description Protocol (SDP) and Real-time Transport Protocol (RTP) signaling of audio mixing gain is provided. The method includes receiving an input audio stream from a 360-degree video/audio stream that includes an audio mixing gain which includes audio gain from the input audio stream and audio gain from overlay audio streams, declaring a RTP header extension for the input audio stream using the SDP, the RTP header extension, and signaling the audio mixing gain based on a single attribute of the SDP using the RTP header extension. An element of the RTP header extension may include an element identifier, a length of an extension element, and a magnitude of the audio mixing gain of the RTP header extension. The RTP header extension may use a one-byte or two-byte header extension format which is declared using the SDP.
    Type: Grant
    Filed: March 11, 2022
    Date of Patent: December 5, 2023
    Assignee: TENCENT AMERICA LLC
    Inventors: Rohit Abhishek, Iraj Sodagar
  • Patent number: 11838522
    Abstract: Aspects of the disclosure provide methods and apparatuses for video processing. In some examples, an apparatus for video processing includes processing circuitry. The processing circuitry converts a picture in a subsampled format in a color space into a non subsampled format in the color space. Then, the processing circuitry clips values of a color component of the picture in the non subsampled format before providing the picture in the non subsampled format as an input to a neural network based filter.
    Type: Grant
    Filed: August 31, 2021
    Date of Patent: December 5, 2023
    Assignee: TENCENT AMERICA LLC
    Inventors: Cheung Auyeung, Xiang Li, Shan Liu
  • Publication number: 20230388544
    Abstract: There is included a method and apparatus comprising computer code configured to cause a processor or processors to perform obtaining volumetric data of at least one three-dimensional (3D) visual content, obtaining a patch by dividing a plurality of 3D meshes from the volumetric data, the patch including vertices of at least one of the 3D meshes, forming a prediction group comprising a subset of the vertices of the patch, and signaling a prediction mode of the prediction group collectively for the subset of the plurality of vertices of the patch.
    Type: Application
    Filed: April 19, 2023
    Publication date: November 30, 2023
    Applicant: TENCENT AMERICA LLC
    Inventors: Xiaozhong XU, Jun Tian, Xiang Zhang, Chao Huang, Shan Liu
  • Publication number: 20230388529
    Abstract: A video decoder is provided for the decoding of a video bitstream encoded in a temporal interpolated prediction (TIP) mode. First and second motion vectors pointing to respective reference frames, or reference pictures within those frames, are generated for a block of a current picture. The motion vectors are then refined by application of a decoder-side motion vector refinement (DMVR) process, based on a bilateral matching process, and the refined motion vectors are used to decode the block. The refinement may more specifically involve consideration of candidates for the refined motion vectors, selected by the bilateral matching. The refinement may be applied at both block and sub-block divisions of the current picture.
    Type: Application
    Filed: November 7, 2022
    Publication date: November 30, 2023
    Applicant: TENCENT AMERICA LLC
    Inventors: Han GAO, Xin Zhao, Liang Zhao, Shan Liu
  • Publication number: 20230388543
    Abstract: A plurality of neighboring vertices of a current vertex in a current frame of a mesh is determined. The current frame corresponds to the mesh at a first time instance. Each of the plurality of neighboring vertices is connected to the current vertex through a respective edge in the mesh. A plurality of neighboring estimation errors of the plurality of neighboring vertices is determined. Each of the plurality of neighboring estimation errors indicates a difference between a reference vertex of a corresponding one of the plurality of neighboring vertices in a reference frame and the corresponding one of the plurality of neighboring vertices in the current frame. The reference frame corresponds to the mesh at a second time instance. A prediction residue of the current vertex is determined based on the plurality of neighboring estimation errors. Prediction information of the current vertex is generated based on the determined prediction residue.
    Type: Application
    Filed: March 28, 2023
    Publication date: November 30, 2023
    Applicant: Tencent America LLC
    Inventors: Jun TIAN, Chao HUANG, Xiang ZHANG, Xiaozhong XU, Shan LIU
  • Publication number: 20230386507
    Abstract: A method, apparatus, and non-transitory computer-readable medium for automatic speech recognition using conditional factorization for bilingual code-switched and monolingual speech may include receiving an audio observation sequence comprising a plurality of frames, the audio observation sequence including audio in a first language or a second language. The approach may further include mapping the audio observation sequence into a first sequence of hidden representations, the mapping being generated by a first encoder corresponding to the first language and mapping the audio observation sequence into a second sequence of hidden representations, the mapping being generated by a second encoder corresponding to the second language. The approach may further include generating a label-to-frame sequence based on the first sequence of hidden representations and the second sequence of hidden representations, using a joint neural network based model.
    Type: Application
    Filed: May 31, 2022
    Publication date: November 30, 2023
    Applicant: TENCENT AMERICA LLC
    Inventors: Chunlei ZHANG, Brian YAN, Dong YU
  • Publication number: 20230385544
    Abstract: Methods, apparatus, and computer readable storage medium for verifying Network-Based Media Processing (NBMP) documents and entities. One method may include invoking an Application Programming Interface (API) corresponding to an API operation supported by an NBMP entity, the API operation being related to at least one of: a create operation; an update operation; a retrieve operation; or a delete operation; receiving a response from the NBMP entity; and determining, based on the response, whether the NBMP entity passes an API test corresponding to the API operation.
    Type: Application
    Filed: April 17, 2023
    Publication date: November 30, 2023
    Applicant: Tencent America LLC
    Inventor: Iraj SODAGAR
  • Publication number: 20230388513
    Abstract: Aspects of the disclosure provide a method and an apparatus including processing circuitry that receives a coded video bitstream comprising a current picture that includes a current block. The processing circuitry determines, based on a syntax element in the coded video bitstream, that the current block including a plurality of subblocks is coded in a subblock-based temporal motion vector prediction (SbTMVP) mode. The processing circuitry determines a plurality of displacement vector (DV) predictor (DVP) candidates and receives a base index indicating a DVP in the plurality of DVP candidates and a DV offset of the current block. The processing circuitry determines a DV based on the DVP and the DV offset. The DV indicates a block collocated with the current block in a collocated reference picture. The processing circuitry reconstructs a subblock in the plurality of subblocks based on motion information of a corresponding subblock in the collocated block.
    Type: Application
    Filed: November 9, 2022
    Publication date: November 30, 2023
    Applicant: Tencent America LLC
    Inventors: Lien-Fei CHEN, Guichun LI, Xin ZHAO, Shan LIU
  • Publication number: 20230388485
    Abstract: Aspects of the disclosure provide a method and an apparatus for video encoding/decoding. The apparatus includes processing circuitry for: receiving prediction information of a current coding block in a current picture from a coded video bitstream, the prediction information indicating that the current coding block is coded using a subblock-based temporal motion vector prediction (SbTMVP) mode; deriving multiple displacement vector (DV) candidates by applying multiple DV offset candidates to a fixed DV predictor of the current coding block; comparing a template of the current coding block with each of multiple templates, each template of the multiple templates being located at a position specified by a corresponding one of the multiple DV candidates; calculating a cost value associated with each one of the multiple DV offset candidates based on the comparing; and reordering DV offset indices of the multiple DV offset candidates based on their calculated cost values.
    Type: Application
    Filed: November 10, 2022
    Publication date: November 30, 2023
    Applicant: Tencent America LLC
    Inventors: Han GAO, Lien-Fei CHEN, Guichun LI, Xin ZHAO, Shan LIU
  • Publication number: 20230388484
    Abstract: A decoding device is provided for decoding of a bitstream, and more specifically for predicting a picture area of an input image for decoding. The picture area has been divided into at least first and second parts by a partitioning boundary, and the pixels of each part are predicted according to suitable measures. Blending masks are then applied to generate blended regions for the predicted pixels, modifying the pixels to generate a complete prediction of the picture area based on both parts. The blending mask is based on first and second thresholds, which are defined relative to the partitioning boundary, and which may have different values to produce an asymmetrical blending relative to the boundary. Based on suitable threshold values, the prediction, combination, and decoding of the picture area is more adaptive to differing contents of the parts therein.
    Type: Application
    Filed: November 8, 2022
    Publication date: November 30, 2023
    Applicant: Tencent America LLC
    Inventors: Han GAO, Xin ZHAO, Lien-fei CHEN, Shan LIU
  • Publication number: 20230388540
    Abstract: Methods and apparatuses for performing chroma from luma (CfL) intra prediction, including: receiving a current block from a coded video bitstream; obtaining, from the coded video bitstream, a syntax element indicating which of two or more downsampling filters is used for predicting the current block in a CfL intra prediction mode; in response to the syntax element indicating that a first downsampling filter is used for the current block: determining a plurality of filter coefficients according to the first downsampling filter; and downsampling the current block based on the determined plurality of coefficients using a first number of sampling positions; in response to the syntax element indicating that a second downsampling filter is used for the current block: determining the plurality of filter coefficients according to the second downsampling filter; downsampling the current block based on the determined plurality of coefficients using a second number of sampling positions, wherein the second number of sam
    Type: Application
    Filed: November 9, 2022
    Publication date: November 30, 2023
    Applicant: TENCENT AMERICA LLC
    Inventors: Jing YE, Xin Zhao, Liang Zhao, Shan Liu
  • Publication number: 20230388487
    Abstract: A method, computer program, and computer system are provided for coding video data. Video data including a current picture and one or more other pictures is received. A first flag corresponding to whether the current picture is referenced by the one or more other pictures in a decoding order is checked. A second flag corresponding to whether the current picture is output is checked. The video data is decoded based on values corresponding to the first flag and the second flag.
    Type: Application
    Filed: May 1, 2023
    Publication date: November 30, 2023
    Applicant: TENCENT AMERICA LLC
    Inventors: Byeongdoo CHOI, Shan LIU, Stephan WENGER
  • Publication number: 20230386479
    Abstract: A method, system, apparatus, and computer-readable medium for voice conversion using a conditional disentangled sequential variational auto-encoder (C-DSVAE) is provided. The method, performed by at least one processor, includes receiving input speech segments, encoding the input speech segments via a shared encoder to generate a speaker embedding and a content embedding, and encoding a posterior distribution of the speaker embedding via a speaker encoder and encoding a posterior distribution of the content embedding via a content encoder to obtain encoded results. The method further includes enabling a content bias, reshaping the content embedding using the content bias, and generating a reconstructed speech output based on the encoded results and the reshaped content embedding.
    Type: Application
    Filed: May 27, 2022
    Publication date: November 30, 2023
    Applicant: Tencent America LLC
    Inventors: Chunlei ZHANG, Jiachen LIAN, Dong YU
  • Patent number: 11831911
    Abstract: Aspects of the disclosure provide methods and apparatuses for video encoding/decoding. In some examples, an apparatus for video decoding includes receiving circuitry and processing circuitry. For example, the processing circuitry receives a bitstream including bits corresponding to residues associated with a prediction of a block in a coding tree unit (CTU) and determines a maximum transform size based on the CTU size. The maximum transform size is used in a partition to form, from the residues, one or more transform blocks that are respectively encoded into the bits. Then, the processing circuitry reconstructs based on the maximum transform size and from the bits the residues associated with the prediction of the block, and then the processing circuitry reconstructs samples of the block based on the reconstructed residues and the prediction of the block.
    Type: Grant
    Filed: March 30, 2022
    Date of Patent: November 28, 2023
    Assignee: Tencent America LLC
    Inventors: Xin Zhao, Xiang Li, Shan Liu
  • Patent number: 11831889
    Abstract: In a method of video encoding for an encoder, a determination is made to determine whether (i) an implicit transform scheme is enabled, and (ii) at least one of a low-frequency non-separable transform (LFNST) and a matrix-based intra prediction mode (MIP) is invalid for a coding unit (CU). In response to the determination that the implicit transform scheme is enabled, and at least one of the LFNST and MIP is invalid, a primary transform type is determined based on a size of the CU. A primary transform is performed for a transform block that is partitioned from the CU in accordance with the determined primary transform type. A coded bitstream that indicates the primary transform type of the CU is subsequently output.
    Type: Grant
    Filed: October 4, 2021
    Date of Patent: November 28, 2023
    Assignee: TENCENT AMERICA LLC
    Inventors: Xin Zhao, Xiang Li, Shan Liu
  • Patent number: 11831920
    Abstract: Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for video encoding/decoding. An apparatus includes processing circuitry that receives metadata associated with a coded video bitstream. The metadata includes labeling information of one or more objects detected in a first picture that is coded in the coded video bitstream. The processing circuitry decodes the labeling information of the one or more objects in the first picture that is coded in the coded video bitstream. The processing circuitry applies the labeling information to the one or more objects in the first picture.
    Type: Grant
    Filed: August 27, 2021
    Date of Patent: November 28, 2023
    Assignee: TENCENT AMERICA LLC
    Inventors: Shan Liu, Xiaozhong Xu, Wen Gao
  • Publication number: 20230379452
    Abstract: Method, device, and non-transitory storage medium for motion vector prediction (MVP) list construction for video coding are provided. One or more motion vector (MV) candidates may be retrieved from a reference MV bank, the one or more MV candidates being associated with a current block. A position to insert the one or more MV candidates from the reference MV bank into an MVP list associated with the current block is determined. The one or more MV candidates from the reference MV bank are inserted into the MVP list associated with the current block based on the position.
    Type: Application
    Filed: September 9, 2022
    Publication date: November 23, 2023
    Applicant: TENCENT AMERICA LLC
    Inventors: Han GAO, Xin ZHAO, Liang ZHAO, Shan LIU
  • Publication number: 20230379456
    Abstract: A method of video decoding includes creating a candidate list for a current block in a current picture included in a coded video bitstream. The method further includes determining a coding mode for a candidate block associated with the current block. The method further includes determining whether to add, to the candidate list, a vector associated with the candidate block based on the determined coding mode. The method further includes reconstructing the current block using at least one candidate from the candidate list.
    Type: Application
    Filed: August 5, 2023
    Publication date: November 23, 2023
    Applicant: TENCENT AMERICA LLC
    Inventors: Xiaozhong XU, Xiang LI, Shan LIU