Patents by Inventor Hilmi Enes EGILMEZ

Hilmi Enes EGILMEZ has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220086463
    Abstract: Systems and techniques are described herein for processing video data using a neural network system. For instance, a process can include generating, by a first convolutional layer of an encoder sub-network of the neural network system, output values associated with a luminance channel of a frame. The process can include generating, by a second convolutional layer of the encoder sub-network, output values associated with at least one chrominance channel of the frame. The process can include generating a combined representation of the frame by combining the output values associated with the luminance channel of the frame and the output values associated with the at least one chrominance channel of the frame. The process can include generating encoded video data based on the combined representation of the frame.
    Type: Application
    Filed: February 23, 2021
    Publication date: March 17, 2022
    Inventors: Muhammed Zeyd COBAN, Ankitesh Kumar SINGH, Hilmi Enes EGILMEZ, Marta KARCZEWICZ
  • Patent number: 11259052
    Abstract: This disclosure describes examples of extending the number of available discrete cosine transform (DCT) and discrete sine transform (DST) for encoding and decoding. A video coder may determine one or more transforms or inverse transforms to apply from a set of transforms or inverse transforms that includes DCT-2 or inverse DCT-2, DST-7 or inverse DST-7, DST-8 or inverse DST-8, DCT-3 or inverse DCT-3, DST-2 or inverse DST-2, DST-3 or inverse DST-3, DCT-4 or inverse DCT-4, DST-4 or inverse DST-4, DST-5 or inverse DST-5, DST-6 or inverse DST-6, and identity transform an inverse identity transform (IDT).
    Type: Grant
    Filed: July 15, 2019
    Date of Patent: February 22, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Yung-Hsuan Chao, Amir Said, Hilmi Enes Egilmez, Vadim Seregin, Marta Karczewicz
  • Patent number: 11240534
    Abstract: An example device for coding video data includes a memory configured to store video data; and one or more processors implemented in circuitry and configured to: code a first codeword representing a selected transform scheme of a set of transform candidates of a multiple transform selection (MTS) scheme for a current block of video data, the selected transform scheme being a secondary transform of a set of available secondary transforms to be applied in addition to a primary transform; code a second codeword representing the secondary transform from the set of available secondary transforms; and apply the primary transform and the secondary transform during coding of residual data for the current block. The second codeword may be a value for a low-frequency non-separable transform (LFNST) syntax element.
    Type: Grant
    Filed: April 2, 2020
    Date of Patent: February 1, 2022
    Assignee: QUALCOMM Incorporated
    Inventors: Hilmi Enes Egilmez, Vadim Seregin, Amir Said, Marta Karczewicz
  • Publication number: 20220030278
    Abstract: An example device for coding video data includes a memory configured to store video data; and one or more processors implemented in circuitry and configured to: code a first codeword representing a selected transform scheme of a set of transform candidates of a multiple transform selection (MTS) scheme for a current block of video data, the selected transform scheme being a secondary transform of a set of available secondary transforms to be applied in addition to a primary transform; code a second codeword representing the secondary transform from the set of available secondary transforms; and apply the primary transform and the secondary transform during coding of residual data for the current block. The second codeword may be a value for a low-frequency non-separable transform (LFNST) syntax element.
    Type: Application
    Filed: October 11, 2021
    Publication date: January 27, 2022
    Inventors: Hilmi Enes Egilmez, Vadim Seregin, Amir Said, Marta Karczewicz
  • Patent number: 11218694
    Abstract: Techniques are described for improving transform coding. For example, an encoded block of video data can be obtained, and a width and/or a height of the block can be determined. The width can be compared to a first threshold and/or the height can be compared to a second threshold. A horizontal transform and a vertical transform can be determined for the block based on comparing the width of the block to the first threshold and/or the height of the block to the second threshold. The horizontal transform and the vertical transform are determined without decoding a syntax element that indicates the horizontal transform and the vertical transform (e.g., the syntax element is not in an encoded video bitstream processed by a decoding device). In some cases, residual data is determined using the horizontal and vertical transforms, and a video block is determined using the residual data and a predictive block.
    Type: Grant
    Filed: September 19, 2019
    Date of Patent: January 4, 2022
    Assignee: QUALCOMM Incorporated
    Inventors: Vadim Seregin, Akshay Gadde, Hilmi Enes Egilmez, Marta Karczewicz, Amir Said
  • Patent number: 11206400
    Abstract: An example method of decoding video data includes receiving encoded data for a current block. The method further includes determining whether single tree partitioning is enabled for the current block and decoding the encoded data to generate luma transform coefficients for the current block and chroma transform coefficients for the current block. The method further includes, in response to determining that the single tree partitioning is enabled for the current block, applying an inverse low-frequency non-separable transform (LFNST) to only the luma transform coefficients for the current block to produce a residual block for the current block. The method further includes reconstructing the current block of the video data using the residual block and a prediction block for the current block.
    Type: Grant
    Filed: September 23, 2020
    Date of Patent: December 21, 2021
    Assignee: QUALCOMM Incorporated
    Inventors: Hilmi Enes Egilmez, Cheng-Teh Hsieh, Vadim Seregin, Marta Karczewicz
  • Patent number: 11184617
    Abstract: An example device for decoding video data includes a memory configured to store the video data and one or more processors coupled to the memory. The one or more processors are configured to reorganize 2-D dequantized coefficients according to a first ordering. The one or more processors are configured to apply an inverse low-frequency non-separable transform (LFNST) to the reorganized 2-D dequantized coefficients to create inverse transformed coefficients. The one or more processors are configured to reorganize the inverse transformed coefficients according to a second ordering, the second ordering being based on an array including values, wherein each value in the array corresponds to a position in a 2-D block and the values in the array denote indices of the 2-D block in a defined order. The one or more processors are configured to decode the video data based on the second ordered inverse transformed coefficients.
    Type: Grant
    Filed: September 18, 2020
    Date of Patent: November 23, 2021
    Assignee: Qualcomm Incorporated
    Inventors: Hilmi Enes Egilmez, Amir Said, Vadim Seregin, Marta Karczewicz
  • Patent number: 11153576
    Abstract: An example device for coding video data includes a memory configured to store the video data and one or more processors implemented in circuitry and communicatively coupled to the memory. The one or more processors are configured to determine whether scaling matrices may be applied to a low-frequency non-separable transform (LFNST) coded block. The one or more processors are also configured to, based on a determination that scaling matrices may not be applied to the LFNST coded block, not applying the scaling matrices to the LFNST coded block. The one or more processors are also configured to code the video data without applying the scaling matrices to the LFNST coded block.
    Type: Grant
    Filed: September 17, 2020
    Date of Patent: October 19, 2021
    Assignee: Qualcomm Incorporated
    Inventors: Adarsh Krishnan Ramasubramonian, Geert Van der Auwera, Vadim Seregin, Hilmi Enes Egilmez, Marta Karczewicz
  • Publication number: 20210321137
    Abstract: An example device for coding video data includes memory configured to store the video data and one or more processors implemented in circuitry and communicatively coupled to the memory. The one or more processors are configured to parse or signal all luma coefficients of a block of the video data from or to an encoded video bitstream. The one or more processors are configured to parse or signal at least one syntax element for the block after all the luma coefficients of the block are parsed or signaled from or to the encoded video bitstream, wherein the at least one syntax element comprises at least one of a low-frequency non-separable transform index for the luma coefficients or a multiple transform selection index. The one or more processors are also configured to code the block in accordance with the at least one syntax element.
    Type: Application
    Filed: March 26, 2021
    Publication date: October 14, 2021
    Inventors: Hilmi Enes Egilmez, Vadim Seregin, Marta Karczewicz
  • Publication number: 20210306678
    Abstract: An example video codec includes memory configured to store the video data and one or more processors implemented in circuitry and communicatively coupled to the memory. The one or more processors are configured to determine that a current mode of coding a current block of the video data is a single tree partitioning mode. Based on the current mode being the single tree partitioning mode, the one or more processors are configured to refrain from determining whether there is a non-DC coefficient for a chroma component of a transform unit (TU) for the current block and refrain from coding a low-frequency non-separable transformation (LFNST) index in response to the refraining of the determination of whether there is the non-DC coefficient. The one or more processors are configured to code the current block in the single partitioning mode with LFNST disabled.
    Type: Application
    Filed: March 24, 2021
    Publication date: September 30, 2021
    Inventors: Hilmi Enes Egilmez, Alican Nalci, Vadim Seregin, Marta Karczewicz
  • Patent number: 11134258
    Abstract: A video decoder configured to determine a residual block by, for a first coefficient of a coefficient group, receiving a first instance of a flag, wherein the first instance of the flag is set to a first value; in response to the first instance of the flag being set to the first value, copying a coefficient value of a neighboring coefficient for the first coefficient; for a second coefficient of the coefficient group, receiving a second instance of the flag, wherein the second instance of the flag is set to a second value; in response to the second instance of the flag being set to the second value, receiving an index; determining, from a list of coefficient values, a value corresponding to the index; and set a value for the second coefficient to the value corresponding to the index.
    Type: Grant
    Filed: February 10, 2020
    Date of Patent: September 28, 2021
    Assignee: QUALCOMM Incorporated
    Inventors: Marta Karczewicz, Hilmi Enes Egilmez
  • Publication number: 20210211685
    Abstract: A video decoder may be configured to receive a block of video data, and determine a value of a multiple transform set index for the block of video data based on a presence of non-zero transform coefficients in the block of video data other than a DC coefficient. The video decoder may then apply a transform to the block of video data based on the determined value of the multiple transform set index.
    Type: Application
    Filed: January 6, 2021
    Publication date: July 8, 2021
    Inventors: Hilmi Enes Egilmez, Vadim Seregin, Muhammed Zeyd Coban, Marta Karczewicz
  • Publication number: 20210203985
    Abstract: A video encoder and video decoder may code video data using intra prediction and a block-based delta pulse code modulation (BDPCM) mode. The BDPCM mode may include a vertical mode and a horizontal mode. The video encoder and video decoder may be configured to align the direction of an intra prediction mode to the direction of a BDPCM mode for both luma and chroma blocks.
    Type: Application
    Filed: December 18, 2020
    Publication date: July 1, 2021
    Inventors: Alican Nalci, Luong Pham Van, Marta Karczewicz, Geert Van der Auwera, Muhammed Zeyd Coban, Hilmi Enes Egilmez
  • Publication number: 20210195242
    Abstract: An example device for decoding video data includes a memory configured to store video data; and one or more processors implemented in circuitry and configured to: determine that a transform block of video data has a size of 8 8 coefficients and that the transform block is transformed using a low-frequency non-separable transform (LFNST); decode at least nine non-zero transform coefficients of the transform block; inverse transform the transform block using an inverse LFNST to reproduce a residual block corresponding to the transform block; and reconstruct a block of the video data using the residual block.
    Type: Application
    Filed: December 17, 2020
    Publication date: June 24, 2021
    Inventors: Hilmi Enes Egilmez, Muhammed Zeyd Coban, Marta Karczewicz, Vadim Seregin
  • Publication number: 20210195222
    Abstract: An example device includes memory configured to store video data and one or more processors implemented in circuitry and coupled to the memory. The one or more processors determine whether a chroma block of the video data is encoded using dual tree partitioning. The one or more processors determine whether transform skip mode for the chroma block is enabled. The one or more processors, based on the chroma block being encoded using dual tree partitioning and transform skip mode being enabled for the chroma block, infer a value of a low-frequency non-separable transform (LFNST) index for the chroma block.
    Type: Application
    Filed: December 21, 2020
    Publication date: June 24, 2021
    Applicant: QUALCOMM Incorporated
    Inventors: Hilmi Enes Egilmez, Alican Nalci, Muhammed Zeyd Coban, Marta Karczewicz
  • Publication number: 20210195192
    Abstract: A video coder may determine, for a transform block of video data, that at least one coefficient group, of the transform block, that comprises a non-zero transform coefficient is outside of a lowest frequency region of the transform block, wherein the at least one coefficient group is one of a plurality of coefficient groups that each comprise transform coefficients. The video coder may determine not to code a syntax element indicative of a multiple transform selection (MTS) for the transform block based at least in part on the determination of that the at least one coefficient group is outside of the lowest frequency region of the transform block. The video coder may code the video data based at least in part on the determination not to code the syntax element indicative of the multiple transform selection for the transform block.
    Type: Application
    Filed: December 17, 2020
    Publication date: June 24, 2021
    Inventors: Muhammed Zeyd Coban, Marta Karczewicz, Hilmi Enes Egilmez, Vadim Seregin
  • Patent number: 11032572
    Abstract: A video decoder is configured to determine a position of a last significant coefficient in a transform block of video data. The video decoder may then determine a value of a low-frequency non-separable transform (LFNST) index for the transform block based on the position of the last significant coefficient relative to a zero-out region of the transform block, wherein the zero-out region of the transform block includes both a first region within an LFNST region of the transform block and a second region of the transform block outside the LFNST region. The video decoder may then inverse transform the transform block in accordance with the value of the LFNST index.
    Type: Grant
    Filed: May 13, 2020
    Date of Patent: June 8, 2021
    Assignee: QUALCOMM Incorporated
    Inventors: Hilmi Enes Egilmez, Vadim Seregin, Muhammed Zeyd Coban, Marta Karczewicz
  • Publication number: 20210127137
    Abstract: A video decoder can be configured to receive, in a syntax structure that applies to a current block, graph-related information; determine a transform matrix based on the received graph-related syntax information; perform an inverse transform based on the determined transform matrix of one or more coefficient values to generate a residual block; and reconstruct the current block of the video data based on the residual block.
    Type: Application
    Filed: October 22, 2020
    Publication date: April 29, 2021
    Inventors: Hilmi Enes Egilmez, Amir Said, Vadim Seregin, Marta Karczewicz
  • Patent number: 10986340
    Abstract: An example device for decoding video data includes a memory configured to store video data; and a processor implemented in circuitry and configured to decode a truncated unary codeword representing a multiple transform (MT) scheme for a current block of the video data to determine the MT scheme; apply the MT scheme to transform coefficients of the current block to produce residual data for the current block of video data; and decode the current block using the residual data. The MT scheme may include a plurality of transforms, such as a horizontal transform and a vertical transform, a primary transform and a second transform, or any combination of separable and/or non-separable transforms. Thus, a single truncated unary codeword may represent the entire MT scheme, that is, each of a plurality of transforms of the MT scheme.
    Type: Grant
    Filed: May 30, 2019
    Date of Patent: April 20, 2021
    Assignee: QUALCOMM Incorporated
    Inventors: Hilmi Enes Egilmez, Yung-Hsuan Chao, Amir Said, Vadim Seregin, Marta Karczewicz, Wei-Jung Chien
  • Publication number: 20210112279
    Abstract: An example method of decoding video data that includes receiving one or more syntax elements of the video data indicative of whether a first type of coding scheme or a second type of coding scheme is applied to residual values of a block of video data coded with transform skip, wherein the residual values are indicative of a difference between the block and a prediction block, and wherein, in transform skip, the residual values are not transformed from a sample domain to a frequency domain. The method includes determining a type of coding scheme to apply to the residual values based on the one or more syntax elements, determining the residual values based on the determined type of coding scheme, and reconstructing the block based on the determined residual values and the prediction block.
    Type: Application
    Filed: October 8, 2020
    Publication date: April 15, 2021
    Inventors: Marta Karczewicz, Muhammed Zeyd Coban, Alican Nalci, Hilmi Enes Egilmez