Abstract: Aspects of the disclosure provide methods and apparatuses for video encoding/decoding. In some examples, an apparatus for video decoding includes processing circuitry. The processing circuitry reconstructs a first sample in a video carried in a coded video bitstream based on a nonlinear-mapping-based filter with a first filter shape configuration. Then, the processing circuitry determines a switch from the first filter shape configuration to a second filter shape configuration, and reconstructs a second sample in the video based on the nonlinear-mapping-based filter with the second filter shape configuration.
Abstract: Aspects of the disclosure provide a method and an apparatus including processing circuitry for video decoding. The processing circuitry decodes, from a coded video bitstream, a first syntax element indicating whether a first component in the coded video bitstream is coded based on a second component in the coded video bitstream. The processing circuitry determines whether to decode one or more second syntax elements for a chroma related coding tool based on the first syntax element. The chroma related coding tool is a luma mapping with chroma scaling coding tool or a cross-component adaptive loop filter. The one or more second syntax elements are decoded when the first syntax element indicates that the first component is coded based on the second component. The one or more second syntax elements are not decoded when the first syntax element indicates that the first component is not coded based on the second component.
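The conditional parsing described in the abstract above can be illustrated with a minimal sketch. It assumes the bitstream is already available as a list of 0/1 flags, and the names `cross_component_flag`, `lmcs_chroma_flag`, and `ccalf_flag` are hypothetical labels chosen for illustration, not syntax element names from the disclosure. Inferring the skipped flags as 0 is likewise an assumption.

```python
from typing import Dict, Iterable


def parse_chroma_tool_flags(bits: Iterable[int]) -> Dict[str, int]:
    """Read a first flag, then read the chroma tool flags only if it is set.

    All field names are hypothetical; the abstract only says that the second
    syntax elements are decoded or skipped depending on the first one.
    """
    reader = iter(bits)
    # First syntax element: is the first component coded based on the second?
    cross_component_flag = next(reader)
    if cross_component_flag:
        # Second syntax elements are present in the bitstream, so decode them.
        lmcs_chroma_flag = next(reader)
        ccalf_flag = next(reader)
    else:
        # Second syntax elements are not decoded; assume they are inferred as 0.
        lmcs_chroma_flag = 0
        ccalf_flag = 0
    return {
        "cross_component_flag": cross_component_flag,
        "lmcs_chroma_flag": lmcs_chroma_flag,
        "ccalf_flag": ccalf_flag,
    }
```

When the first flag is 0, the parser consumes exactly one bit, which is the bit saving the abstract is aiming at.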
Abstract: A method, computer program, and computer system are provided for streaming immersive media. The method includes ingesting content in a two-dimensional format; converting the ingested content to a three-dimensional format based on a neural network; and streaming the converted content to a client end-point.
Abstract: Aspects of the disclosure provide a method and an apparatus including processing circuitry for video decoding. The processing circuitry can decode coded information for a current block in a current picture from a coded video bitstream. The coded information can indicate a string copy mode for the current block. The current block includes a current string. The processing circuitry can determine whether a flipping operation is performed to predict the current string. Based on the flipping operation being determined as performed to predict the current string, the processing circuitry can determine an original reference string based on a string vector of the current string. The processing circuitry can generate a flipped reference string by performing the flipping operation on the original reference string, and reconstruct the current string based on the flipped reference string.
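A minimal sketch of the flipped string copy described in the abstract above, under simplifying assumptions: samples are held in a one-dimensional list in scan order, the string vector is a single negative offset, and the flipping operation is a reversal of sample order. The abstract does not specify the flip type or the sample layout, so these choices and the function name `reconstruct_string` are illustrative only.

```python
from typing import List


def reconstruct_string(
    samples: List[int],
    pos: int,
    length: int,
    string_vector: int,
    flip: bool,
) -> List[int]:
    """Reconstruct samples[pos:pos+length] from an earlier reference string."""
    ref_start = pos + string_vector
    # Assumption: the reference string lies fully in the reconstructed area.
    if ref_start < 0 or ref_start + length > pos:
        raise ValueError("reference string must precede the current string")
    # Original reference string located by the string vector.
    reference = samples[ref_start:ref_start + length]
    if flip:
        # Flipped reference string (assumed here to be a simple reversal).
        reference = reference[::-1]
    samples[pos:pos + length] = reference
    return samples
```

For example, `reconstruct_string([1, 2, 3, 4, 0, 0, 0], pos=4, length=3, string_vector=-4, flip=True)` copies the reference `[1, 2, 3]` in reversed order into the last three positions.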
Abstract: Systems and methods for split rendering of Network Based Media Processing (NBMP) are provided. A method includes deriving a network based media processing (NBMP) workflow, and assigning a plurality of workflow tasks of the NBMP workflow among a media sink, a media source, and at least one cloud element, network element, or edge element. A first subset of the plurality of the workflow tasks is assigned to the media source, a second subset of the plurality of the workflow tasks is assigned to the media sink, and a third subset of the plurality of the workflow tasks is assigned to the at least one cloud element, network element, or edge element. The first subset, the second subset, and the third subset do not overlap with each other.
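The non-overlapping task split in the abstract above can be sketched as a partition of task names by target. The target labels (`media_source`, `media_sink`, `network`) and the function name `split_workflow` are hypothetical; the sketch does not use any actual NBMP API.

```python
from typing import Dict, Set

# Hypothetical target labels; "network" stands for a cloud, network, or edge element.
TARGETS = ("media_source", "media_sink", "network")


def split_workflow(placement: Dict[str, str]) -> Dict[str, Set[str]]:
    """Group workflow tasks into three subsets, one per target.

    `placement` maps each task name to exactly one target, so the resulting
    subsets cannot overlap.
    """
    subsets: Dict[str, Set[str]] = {target: set() for target in TARGETS}
    for task, target in placement.items():
        if target not in subsets:
            raise ValueError(f"unknown target for task {task!r}: {target!r}")
        subsets[target].add(task)
    return subsets
```

Because each task is keyed to a single target, the pairwise intersections of the three subsets are empty by construction.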
Abstract: A method, computer program, and computer system are provided for streaming immersive media. Information corresponding to a characteristic of a client associated with media content is received based on the transfer of one or more tokens. A packaging process is associated with the media content using the one or more tokens. The media content is adapted according to the characteristic of the client based on the associated packaging process.
Abstract: A method of network-based media processing (NBMP) deployment with a framework for live uplink streaming (FLUS) and a fifth-generation media streaming (5GMS) application function (AF) can include: establishing a provisioning session by a 5GMS application provider with the 5GMS AF, in which the 5GMS application provider configures the 5GMS AF for network-based media processing; configuring a FLUS sink by the 5GMS AF for the network-based media processing; selecting the FLUS sink by a FLUS source during a FLUS source discovery process; establishing a FLUS session by the FLUS source with the FLUS sink; and starting uplink streaming by the FLUS source to the FLUS sink, where media content of the uplink streaming is processed with the network-based media processing.
Abstract: Systems, apparatuses, and methods may provide unmanned aerial system communication. A method performed by at least one processor included in an unmanned aerial system (UAS) includes: transmitting, to a UAS Service Supplier (USS) implemented on at least one server, a first registration request to register a first remote identification (RID) corresponding to the UAS with the USS; receiving, from the USS, an indication that the first RID is a duplicate RID that is registered with the USS; determining, based on the first RID, a second RID corresponding to the UAS; and transmitting, to the USS, a second registration request to register the second RID.
Type:
Application
Filed:
September 29, 2021
Publication date:
June 9, 2022
Applicant:
TENCENT AMERICA LLC
Inventors:
Paul Spencer DAWKINS, Stephan Wenger, Shuai Zhao
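The duplicate-RID handling in the UAS abstract above can be sketched with an in-memory stand-in for the USS. Everything here is hypothetical: `InMemoryUss` replaces the real server, and deriving the second RID by appending a suffix is only one possible way of determining it "based on the first RID".

```python
class InMemoryUss:
    """Hypothetical stand-in for a UAS Service Supplier registry."""

    def __init__(self) -> None:
        self._registered = set()

    def register(self, rid: str) -> bool:
        """Return True if the RID was registered, False if it is a duplicate."""
        if rid in self._registered:
            return False
        self._registered.add(rid)
        return True


def derive_second_rid(first_rid: str) -> str:
    # Assumption: the second RID is the first RID plus a fixed suffix.
    return f"{first_rid}-ALT"


def register_with_fallback(uss: InMemoryUss, first_rid: str) -> str:
    """Send a first registration request; on a duplicate, retry with a second RID."""
    if uss.register(first_rid):
        return first_rid
    second_rid = derive_second_rid(first_rid)
    if not uss.register(second_rid):
        raise RuntimeError("second RID was also rejected as a duplicate")
    return second_rid
```

The sketch retries once, mirroring the two registration requests in the abstract; a deployed system would need its own policy for repeated collisions.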
Abstract: Systems and methods for coding and decoding are provided. A method includes encoding a video stream, the coded video stream including a video parameter set (VPS) and video data partitioned into a plurality of layers; and sending the coded video stream to a decoder, wherein the VPS includes, (1) at least one first syntax element that specifies at least one first layer, from among the plurality of layers, to be outputted in an output layer set, and (2) at least one second syntax element that indicates profile-tier-level information of the output layer set.
Type:
Application
Filed:
February 23, 2022
Publication date:
June 9, 2022
Applicant:
TENCENT AMERICA LLC
Inventors:
Byeongdoo CHOI, Shan LIU, Stephan WENGER
Abstract: Aspects of the disclosure provide methods and apparatuses for video encoding/decoding. In some examples, an apparatus for video decoding includes processing circuitry. The processing circuitry determines a string vector for a current string in a current coding tree unit (CTU). The string vector points to a reference string for the current string. The processing circuitry determines that the string vector satisfies a requirement for an allowance of a partial overlapping of the current string and the reference string, and then reconstructs the current string based on the reference string.
Abstract: According to an aspect of the disclosure, a method of point cloud geometry encoding in a point cloud encoder is provided. In the method, geometry coding is performed on a point cloud at a first partition depth. Further, a plurality of largest coding units (LCUs) of the point cloud is determined at a second partition depth. A coding state of an LCU of the plurality of LCUs of the point cloud is set at the second partition depth. The geometry coding is performed on the plurality of LCUs of the point cloud at the second partition depth based on the coding state of the LCU at the second partition depth.
Abstract: A method, computer program, and computer system are provided for synthesizing speech at one or more speeds. A context associated with one or more phonemes corresponding to a speaking voice is encoded, and the one or more phonemes are aligned to one or more target acoustic frames based on the encoded context. One or more mel-spectrogram features are recursively generated from the aligned phonemes and target acoustic frames, and a voice sample corresponding to the speaking voice is synthesized using the generated mel-spectrogram features.
Abstract: There is included a method and apparatus comprising computer code configured to cause a processor or processors to perform obtaining video data comprising data of a plurality of semantically independent source pictures, determining, among the video data, whether references are associated with any of a first access unit (AU) and a second AU according to at least one picture order count (POC) signal value included with the video data, and outputting a first quantity of the references set to the first AU and a second quantity of the references set to the second AU based on the at least one POC signal value.
Type:
Grant
Filed:
October 5, 2020
Date of Patent:
June 7, 2022
Assignee:
TENCENT AMERICA LLC
Inventors:
Byeongdoo Choi, Shan Liu, Stephan Wenger
Abstract: A method of, and a device for, decoding a video stream are provided. The video stream includes at least two coded video sequences that each use a respective first parameter set, the first parameter sets differing from each other in at least one value. Each of the at least two coded video sequences includes at least two coded pictures, and the at least two coded video sequences have different presentation times. The method includes decoding and activating, by a decoder, a single second parameter set pertaining to all video sequences of the video stream before any coded picture of the at least two coded video sequences is decoded.
Abstract: There is included a method and apparatus comprising computer code configured to cause a hardware processor or processors to perform intra prediction among a plurality of reference lines, to set a plurality of intra prediction modes for a zero reference line nearest to a current block of the intra prediction among non-zero reference lines, and to set one or more most probable modes for one of the non-zero reference lines.
Type:
Grant
Filed:
September 1, 2020
Date of Patent:
June 7, 2022
Assignee:
TENCENT AMERICA LLC
Inventors:
Liang Zhao, Xin Zhao, Xiang Li, Shan Liu
Abstract: A method and system for processing media content in Moving Picture Experts Group (MPEG) Network Based Media Processing (NBMP) includes obtaining a list of function repositories, each function repository storing one or more functions for processing the media content, obtaining a descriptor identifying a location of each of the function repositories identified in the list of function repositories, selecting a function repository, from among the function repositories, according to a criterion, and processing the media content using a function included in the selected function repository.
Abstract: A method, apparatus, and computer-readable medium for shortening an advertisement duration using Moving Picture Experts Group (MPEG)-Dynamic Adaptive Streaming over HTTP (DASH), including transmitting, to a client, a media presentation description (MPD) including a remote period including the advertisement duration, and an indication that the advertisement duration is able to be shortened; receiving an update request including a shortened advertisement duration; updating the MPD based on the shortened advertisement duration to generate an updated MPD including the shortened advertisement duration; and transmitting the updated MPD to the client.
Abstract: A method of extracting a sub-bitstream from an encoded video bitstream using at least one processor includes: obtaining an encoded video bitstream, the encoded video bitstream including a plurality of Network Abstraction Layer (NAL) units; obtaining an output layer set list; comparing the NAL units with the output layer set list; and removing NAL units that are not included in the output layer set list.
Type:
Grant
Filed:
October 6, 2020
Date of Patent:
June 7, 2022
Assignee:
TENCENT AMERICA LLC
Inventors:
Byeongdoo Choi, Shan Liu, Stephan Wenger
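The sub-bitstream extraction abstract above can be illustrated with a minimal filter over NAL units. The sketch assumes that the comparison against the output layer set list is made on a per-unit layer identifier; the `NalUnit` type and its `layer_id` field are hypothetical simplifications and do not reproduce a real NAL unit header.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class NalUnit:
    """Hypothetical, simplified NAL unit: a layer identifier plus payload bytes."""

    layer_id: int
    payload: bytes


def extract_sub_bitstream(
    nal_units: Iterable[NalUnit],
    output_layer_set: Iterable[int],
) -> List[NalUnit]:
    """Keep only NAL units whose layer is listed in the output layer set."""
    keep = set(output_layer_set)
    # NAL units not included in the output layer set list are removed.
    return [nal for nal in nal_units if nal.layer_id in keep]
```

Bitstream order is preserved for the units that remain, since the filter only drops entries and never reorders them.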
Abstract: Aspects of the disclosure provide methods, apparatuses, and a non-transitory computer-readable medium for video encoding/decoding. In a method, prediction information of a current block of a coding tree unit (CTU) in a coded bitstream is decoded. The prediction information indicates that the current block is coded in intra block copy (IBC) mode. A reference block is determined for the current block. A number of memory regions that contain samples of the reference block is less than a total number of memory regions for the CTU. The current block is reconstructed based on the reference block.
Abstract: There is included a method and apparatus comprising computer code configured to cause a processor or processors to perform publishing media presentation description data comprising main live program data, and signaling a client device about ad data and in-manifest data where the ad data instructs the client device of an initial end time at which to end a display of an ad by switching a display at the client device from the ad to the main live program data and where the in-manifest data instructs the client device to determine, during a streaming of the ad to the client device, an updated end time, prior to the end time, at which to end the streaming of the ad by switching the streaming at the client device from the ad to the main live program data.