Patents by Inventor Amir Said
Amir Said has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12256092Abstract: A device may perform a first prediction process for a first block of video data to produce a first residual. The device may apply a first transform process to the first residual to generate first transform coefficients for the first block of video data and encode the first transform coefficients. The device may perform a second prediction process for a second block of video data to produce a second residual. The device may determine that a second transform process, which includes the first transform process and at least one of a pre-adjustment operation or a post-adjustment operation, is to be applied to the second residual. The device may apply the first transform process and the pre- or post-adjustment operation to the second residual to generate second transform coefficients for the second block. The coding device may code the first and second transform coefficients.Type: GrantFiled: March 29, 2023Date of Patent: March 18, 2025Assignee: QUALCOMM INCORPORATEDInventors: Amir Said, Hilmi Enes Egilmez, Marta Karczewicz, Vadim Seregin
-
COMPRESSION OF BITSTREAM INDEXES FOR WIDE SCALE PARALLEL ENTROPY CODING IN NEURAL-BASED VIDEO CODECS
Publication number: 20240364890Abstract: Systems and techniques are described herein for processing video data. For example, an encoding device can obtain a sequence of video data and determine a minimum value in the sequence of video data. The encoding device can, based on the minimum value, identify positions in the sequence of video data associated with entry points for individually entropy codable parcels of a parallel entropy codable sequence of video data. The encoding device can generate the parallel entropy codable sequence of video data. The encoding device can further generate an index for the parallel entropy codable sequence of video data, the index identifying the individually entropy codable parcels within the parallel entropy codable sequence of video data.Type: ApplicationFiled: March 18, 2024Publication date: October 31, 2024Inventors: Amir SAID, Hoang Cong Minh LE, Farzad FARHADZADEH -
Publication number: 20240364925Abstract: Systems and techniques are described herein for processing video data. For example, a machine-learning based stereo video coding system can obtain video data including at least a right-view image of a right view of a scene and a left-view image of a left view of the scene. The machine-learning based stereo video coding system can compress the right-view image and the left-view image in parallel to generate a latent representation of the right-view image and the left-view image. The right-view image and the left-view image can be compressed in parallel based on inter-view information between the right-view image and the left-view image, determined using one or more parallel autoencoders.Type: ApplicationFiled: April 15, 2024Publication date: October 31, 2024Inventors: Hoang Cong Minh LE, Qiqi HOU, Farzad FARHADZADEH, Amir SAID, Auke Joris WIGGERS, Guillaume Konrad SAUTIERE, Reza POURREZA
-
Patent number: 12132919Abstract: A processor-implemented method for image compression using an artificial neural network (ANN) includes receiving, at an encoder of the ANN, an image and a spatial segmentation map corresponding to the image. The spatial segmentation map indicates one or more regions of interest. The encoder compresses the image according to a controllable spatial bit allocation. The controllable spatial bit allocation is based on a learned quantization bin size.Type: GrantFiled: November 15, 2022Date of Patent: October 29, 2024Assignee: QUALCOMM IncorporatedInventors: Yang Yang, Hoang Cong Minh Le, Yinhao Zhu, Reza Pourreza, Amir Said, Yizhe Zhang, Taco Sebastiaan Cohen
-
Publication number: 20240305785Abstract: An example computing device may include memory and one or more processors. The one or more processors may be configured to parallel entropy decode encoded video data from a received bitstream to generate entropy decoded data. The one or more processors may be configured to predict a motion vector based on the entropy decoded data. The one or more processors may be configured to decode a motion vector residual from the entropy decoded data. The one or more processors may be configured to add the motion vector residual and motion vector. The one or more processors may be configured to warp previous reconstructed video data with an overlapped block-based warp function using the motion vector to generate predicted current video data. The one or more processors may be configured to sum the predicted current video data with a residual block to generate current reconstructed video data.Type: ApplicationFiled: August 28, 2023Publication date: September 12, 2024Inventors: Ties Jehan Van Rozendaal, Hoang Cong Minh Le, Tushar Singhal, Amir Said, Krishna Buska, Guillaume Konrad Sautiere, Anjuman Raha, Auke Joris Wiggers, Frank Steven Mayer, Liang Zhang, Abhijit Khobare, Muralidhar Reddy Akula
-
Patent number: 12058348Abstract: A video encoder determines scaled transform coefficients, wherein determining the scaled transform coefficients comprises scaling transform coefficients of a block of the video data according to a given quantization step. The video encoder determines scalar quantized coefficients, wherein determining the scalar quantized coefficients comprises applying scalar quantization to the scaled transform coefficients of the block. Additionally, the video encoder applies a neural network that determines a respective set of probabilities for each respective transform coefficient of the block. The respective set of probabilities for the respective transform coefficient includes a respective probability value for each possible adjustment value in a plurality of possible adjustment values. Inputs to the neural network include the scaled transform coefficients and the scalar quantized coefficients.Type: GrantFiled: October 14, 2020Date of Patent: August 6, 2024Assignee: QUALCOMM IncorporatedInventors: Dana Kianfar, Auke Joris Wiggers, Amir Said, Taco Sebastiaan Cohen, Reza Pourreza Shahri
-
Publication number: 20240244246Abstract: A device may perform a first prediction process for a first block of video data to produce a first residual. The device may apply a first transform process to the first residual to generate first transform coefficients for the first block of video data and encode the first transform coefficients. The device may perform a second prediction process for a second block of video data to produce a second residual. The device may determine that a second transform process, which includes the first transform process and at least one of a pre-adjustment operation or a post-adjustment operation, is to be applied to the second residual. The device may apply the first transform process and the pre- or post-adjustment operation to the second residual to generate second transform coefficients for the second block. The coding device may code the first and second transform coefficients.Type: ApplicationFiled: March 29, 2023Publication date: July 18, 2024Inventors: Amir Said, Hilmi Enes Egilmez, Marta Karczewicz, Vadim Seregin
-
Patent number: 12008731Abstract: Certain aspects of the present disclosure provide techniques for compressing content using a neural network. An example method generally includes receiving content for compression. The content is encoded into a first latent code space through an encoder implemented by an artificial neural network trained to generate a latent space representation of the content. A first compressed version of the encoded content is generated using a first quantization bin size of a series of quantization bin sizes. A refined compressed version of the encoded content is generated by scaling the first compressed version of the encoded content into one or more second quantization bin sizes smaller than the first quantization bin size, conditioned at least on a value of the first compressed version of the encoded content. The refined compressed version of the encoded content is output for transmission.Type: GrantFiled: January 24, 2022Date of Patent: June 11, 2024Assignee: QUALCOMM IncorporatedInventors: Yadong Lu, Yang Yang, Yinhao Zhu, Amir Said, Taco Sebastiaan Cohen
-
Publication number: 20240121392Abstract: This disclosure describes entropy coding techniques for media data coded using neural-based techniques. A media coder is configured to determine a probability distribution function parameter for a data element of a data stream coded by a neural-based media compression technique, wherein the probability distribution function parameter is a function of a standard deviation of a probability distribution function of the data stream, determine a code vector based on the probability distribution function parameter, and entropy code the data element using the code vector.Type: ApplicationFiled: December 8, 2023Publication date: April 11, 2024Inventor: Amir Said
-
Patent number: 11943460Abstract: A computer-implemented method for operating an artificial neural network (ANN) includes receiving an input by the ANN. The ANN generates a latent representation of the input. The latent representation is communicated according to a bit rate based on a learned latent scaling parameter. The latent scaling parameter is learned based on a channel index and a tradeoff parameter value that corresponds to a value that balances the bit rate and a distortion.Type: GrantFiled: January 11, 2022Date of Patent: March 26, 2024Assignee: QUALCOMM INCORPORATEDInventors: Yadong Lu, Yang Yang, Yinhao Zhu, Amir Said, Reza Pourreza, Taco Sebastiaan Cohen
-
Patent number: 11876969Abstract: This disclosure describes entropy coding techniques for media data coded using neural-based techniques. A media coder is configured to determine a probability distribution function parameter for a data element of a data stream coded by a neural-based media compression technique, wherein the probability distribution function parameter is a function of a standard deviation of a probability distribution function of the data stream, determine a code vector based on the probability distribution function parameter, and entropy code the data element using the code vector.Type: GrantFiled: July 22, 2022Date of Patent: January 16, 2024Assignee: QUALCOMM IncorporatedInventor: Amir Said
-
Publication number: 20240013441Abstract: Systems and techniques are provided for coding (e.g., encoding and/or decoding) video data using camera motion information. For example, a decoding device can obtain a frame of encoded video data associated with an input frame, the frame of encoded video data including camera information associated with generating the video data and a residual. A camera motion compensated frame can be generated based on a reference frame and the camera information. Optical flow information associated with object motion determined based on at least the input frame and the reference frame can be generated. A motion compensated frame can be generated by warping the camera motion compensated frame based on the optical flow information. A reconstructed input frame can be generated based on the motion compensated frame and the residual.Type: ApplicationFiled: July 11, 2022Publication date: January 11, 2024Inventors: Hoang Cong Minh LE, Reza POURREZA, Amir SAID
-
Patent number: 11825090Abstract: Techniques are described herein for processing video data. For instance, a technique can include encoding one or more frames of video data using a video encoder, the video encoder including at least a quantization process; determining an actual bit rate of the encoded one or more frames; predicting an estimated bit rate using an encoder proxy, the encoder proxy including a statistical model for estimating a bit rate of the encoded one or more frames; determining, using the encoder proxy, a gradient of the estimated bit rate; and training the encoder proxy to predict the estimated bit rate based on the actual bit rate, the estimated bit rate, and the gradient.Type: GrantFiled: September 9, 2022Date of Patent: November 21, 2023Assignee: QUALCOMM IncorporatedInventor: Amir Said
-
Patent number: 11798197Abstract: A method of image compression includes receiving an image. Multiple quantized latent representations are generated to represent features of the image. Each of the quantized latent representations has a different resolution and is generated at staggered timings. Each of the later generated quantized latent representations is conditioned on each of the prior generated quantized latent representations. The multiple quantized latent representations are decoded to reconstruct the image.Type: GrantFiled: March 12, 2021Date of Patent: October 24, 2023Assignee: QUALCOMM IncorporatedInventors: Hoang Cong Minh Le, Reza Pourreza, Yang Yang, Yinhao Zhu, Amir Said, Yizhe Zhang, Taco Sebastiaan Cohen
-
Patent number: 11778234Abstract: An example method of decoding video data includes obtaining, from a coded video bitstream and for a current block of the video data, an indication of an intra-prediction mode that identifies an initial predictive block; filtering, in parallel, samples in a current line of a plurality of lines of the initial predictive block based on filtered values of samples in a preceding line of the plurality of lines and unfiltered values of samples in the current line to generate filtered values for samples for the current line; and reconstructing, using intra prediction, values of samples of the current block based on the filtered values of the samples of the current initial predictive block and residual data for the current block that represents a difference between the filtered values of the samples of the current initial predictive block and the values of samples of the current block.Type: GrantFiled: September 23, 2022Date of Patent: October 3, 2023Assignee: QUALCOMM IncorporatedInventors: Amir Said, Xin Zhao, Wei-Jung Chien, Jianle Chen, Marta Karczewicz
-
Publication number: 20230262267Abstract: This disclosure describes entropy coding techniques for media data coded using neural-based techniques. A media coder is configured to determine a probability distribution function parameter for a data element of a data stream coded by a neural-based media compression technique, wherein the probability distribution function parameter is a logarithmic function of a standard deviation of a probability distribution function of the data stream, determine a code vector based on the probability distribution function parameter, and entropy code the data element using the code vector.Type: ApplicationFiled: February 11, 2022Publication date: August 17, 2023Inventors: Amir Said, Yinhao Zhu
-
Publication number: 20230262222Abstract: This disclosure describes entropy coding techniques for media data coded using neural-based techniques. A media coder is configured to determine a probability distribution function parameter for a data element of a data stream coded by a neural-based media compression technique, wherein the probability distribution function parameter is a function of a standard deviation of a probability distribution function of the data stream, determine a code vector based on the probability distribution function parameter, and entropy code the data element using the code vector.Type: ApplicationFiled: July 22, 2022Publication date: August 17, 2023Inventor: Amir Said
-
Patent number: 11683527Abstract: An example device for coding video data includes a memory configured to store video data; and one or more processors implemented in circuitry and configured to: code a first codeword representing a selected transform scheme of a set of transform candidates of a multiple transform selection (MTS) scheme for a current block of video data, the selected transform scheme being a secondary transform of a set of available secondary transforms to be applied in addition to a primary transform; code a second codeword representing the secondary transform from the set of available secondary transforms; and apply the primary transform and the secondary transform during coding of residual data for the current block. The second codeword may be a value for a low-frequency non-separable transform (LFNST) syntax element.Type: GrantFiled: October 11, 2021Date of Patent: June 20, 2023Assignee: QUALCOMM IncorporatedInventors: Hilmi Enes Egilmez, Vadim Seregin, Amir Said, Marta Karczewicz
-
Patent number: 11677987Abstract: Techniques are described herein for processing video data. For instance, a process can include obtaining encoded video data. The process can include determining an intersection of values between values for a first termination byte of a first parcel of the encoded video data and values of a second termination byte of a second parcel of the encoded video data. The process can further include determining a joint termination byte for the first termination byte of the first parcel and the second termination byte of the second parcel. Values for the joint termination byte are based on the intersection of values. The process can include generating entropy coded data including the joint termination byte for the first parcel and the second parcel. The entropy coded data can be generated using arithmetic coding or binary coding.Type: GrantFiled: April 7, 2021Date of Patent: June 13, 2023Assignee: QUALCOMM IncorporatedInventor: Amir Said
-
Publication number: 20230179776Abstract: An example device applies a primary transform to a current block of video data to create primary transform coefficients. The device determines whether intra sub-partitioning is applied to the current block of video data. The device applies a primary transform to the current block. The device also determines whether a primary transform size for the current block of video data is at least a predetermined size. Based on intra sub-partitioning being applied and the primary transform size being at least the predetermined size, the device applies a secondary transform to primary transform coefficients and codes the current block of video data based on the secondary transform.Type: ApplicationFiled: February 1, 2023Publication date: June 8, 2023Inventors: Hilmi Enes Egilmez, Vadim Seregin, Amir Said, Marta Karczewicz