Patents by Inventor Hadi Amirpour

Hadi Amirpour has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240121400
    Abstract: Techniques for predicting video encoding complexity are described herein. A method for predicting video encoding complexity includes performing video complexity feature extraction on a video segment to extract low-complexity frame-based features, predicting video encoding complexity for the video segment using the low-complexity frame-based features, and outputting a predicted encoding bitrate and a predicted encoding time. An embodiment may include implementing a hybrid model using a CNN, wherein a latent vector from a frame of the video segment is extracted and also may be used to predict video encoding complexity. The predicted encoding bitrates and encoding times may be provided to encoding infrastructure for use in optimizing a schedule of encodings.
    Type: Application
    Filed: September 22, 2023
    Publication date: April 11, 2024
    Applicant: Bitmovin GmbH
    Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
  • Publication number: 20240114183
    Abstract: Techniques for efficient two-pass encoding for live streaming are described herein. A method for efficient two-pass encoding may include extracting low-complexity features of a video segment, predicting an optimized constant rate factor (CRF) for the video segment using the low-complexity features, and encoding the video segment with the optimized CRF at a target bitrate. A system for efficient two-pass encoding may include a feature extraction module configured to extract low-complexity features from a video segment, a neural network configured to predict an optimized CRF as a function of the low-complexity features and a target bitrate, and an encoder configured to encode the video segment using the optimized CRF at the target bitrate.
    Type: Application
    Filed: September 22, 2023
    Publication date: April 4, 2024
    Applicant: Bitmovin GmbH
    Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
  • Publication number: 20240098247
    Abstract: Techniques for content-adaptive encoder preset prediction for adaptive live streaming are described herein. A method for content-adaptive encoder preset prediction for adaptive live streaming includes performing video complexity feature extraction on a video segment to extract complexity features such as an average texture energy, an average temporal energy, and an average lumiscence. These inputs may be provided to an encoding time prediction model, along with a bitrate ladder, a resolution set, a target video encoding speed, and a number of CPU threads for the video segment, to predict an encoding time, and an optimized encoding preset may be selected for the video segment by a preset selection function using the predicted encoding time. The video segment may be encoded according to the optimized encoding preset.
    Type: Application
    Filed: September 12, 2023
    Publication date: March 21, 2024
    Applicant: Bitmovin GmbH
    Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
  • Patent number: 11936864
    Abstract: According to embodiments of the disclosure, fast multi-rate encoding may be performed using machine learning by encoding a lowest quality representation to determine encoding parameters, processing raw data of the video using a neural network to obtain an intermediate output comprising encoding features, augmenting the intermediate output with additional encoding features to form a final tensor, and processing the final tensor with another neural network to obtain a classification output comprising a split or not split decision for an image data block. The classification output may be used to encode a highest quality representation, and then other representations of the video.
    Type: Grant
    Filed: December 2, 2021
    Date of Patent: March 19, 2024
    Assignee: BITMOVIN, INC.
    Inventors: Hadi Amirpour, Ekrem Çetinkaya, Christian Timmerer
  • Patent number: 11924437
    Abstract: The technology described herein relates to variable framerate encoding. A method for variable framerate encoding includes receiving shots, as segmented from a video input, extracting features for each of the shots, the features including at least a spatial energy feature and an average temporal energy, predicting a frame dropping factor for each of the shots based on the spatial energy feature and the average temporal energy, predicting an optimized framerate for each of the shots based on the frame dropping factor, downscaling and encoding each of the shots using the optimized framerate. The encoded shots may then be decoded and upscaled back to their original framerates.
    Type: Grant
    Filed: March 28, 2022
    Date of Patent: March 5, 2024
    Assignee: Bitmovin GmbH
    Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
  • Patent number: 11902580
    Abstract: The technology described herein relates to online per-title encoding. A method for online per-title encoding includes receiving a video input, generating segments of the video input, extracting a spatial feature and a temporal feature, predicting bitrate-resolution pairs based on the spatial feature and the temporal feature, using a discrete cosine transform (DCT)-based energy function, and per-title encoding segments of the video input for the predicted bitrate-resolution pairs. A system for online per-title encoding may include memory for storing a set of bitrates, a set of resolutions, and a machine learning module configured to predict bitrate resolution pairs based on low-complexity spatial and temporal features.
    Type: Grant
    Filed: December 8, 2021
    Date of Patent: February 13, 2024
    Assignee: BITMOVIN, INC.
    Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
  • Publication number: 20230388511
    Abstract: Techniques for implementing perceptually aware per-title encoding may include receiving an input video, a set of resolutions, a maximum target bitrate and a minimum target bitrate, extracting content aware features for each segment of the input video, predicting a perceptually aware bitrate-resolution pair for each segment using a model configured to optimize for a quality metric using constants trained for each of the set of resolutions, generating a target encoding set including a set of perceptually aware bitrate-resolution pairs, and encoding the target encoding set. The content aware features may include a spatial energy feature and an average temporal energy. According to these methods only a subset of bitrates and resolutions, less than a full set of bitrates and resolutions, are encoded to provide high quality video content for streaming.
    Type: Application
    Filed: April 27, 2023
    Publication date: November 30, 2023
    Applicant: Bitmovin GmbH
    Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
  • Publication number: 20230308657
    Abstract: The technology described herein relates to variable framerate encoding. A method for variable framerate encoding includes receiving shots, as segmented from a video input, extracting features for each of the shots, the features including at least a spatial energy feature and an average temporal energy, predicting a frame dropping factor for each of the shots based on the spatial energy feature and the average temporal energy, predicting an optimized framerate for each of the shots based on the frame dropping factor, downscaling and encoding each of the shots using the optimized framerate. The encoded shots may then be decoded and upscaled back to their original framerates.
    Type: Application
    Filed: March 28, 2022
    Publication date: September 28, 2023
    Applicant: Bitmovin GmbH
    Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
  • Publication number: 20230179800
    Abstract: The technology described herein relates to online per-title encoding. A method for online per-title encoding includes receiving a video input, generating segments of the video input, extracting a spatial feature and a temporal feature, predicting bitrate-resolution pairs based on the spatial feature and the temporal feature, using a discrete cosine transform (DCT)-based energy function, and per-title encoding segments of the video input for the predicted bitrate-resolution pairs. A system for online per-title encoding may include memory for storing a set of bitrates, a set of resolutions, and a machine learning module configured to predict bitrate resolution pairs based on low-complexity spatial and temporal features.
    Type: Application
    Filed: December 8, 2021
    Publication date: June 8, 2023
    Applicant: BITMOVIN, INC.
    Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
  • Publication number: 20230118010
    Abstract: A scalable per-title encoding technique may include detecting scene cuts in an input video received by an encoding network or system, generating segments of the input video, performing per-title encoding of a segment of the input video, training a deep neural network (DNN) for each representation of the segment, thereby generating a trained DNN, compressing the trained DNN, thereby generating a compressed trained DNN, and generating an enhanced bitrate ladder including metadata comprising the compressed trained DNN. In some embodiments, the method also may include generating a base layer bitrate ladder for CPU devices, and providing the enhanced bitrate ladder for GPU-available devices.
    Type: Application
    Filed: October 13, 2022
    Publication date: April 20, 2023
    Applicant: BITMOVIN, INC.
    Inventors: Hadi Amirpour, Christian Timmerer
  • Publication number: 20230007298
    Abstract: Techniques relating to per-title encoding using spatial and temporal resolution downscaling is disclosed. A method for per-title encoding includes receiving a video input comprised of video segments, spatially downscaling the video input, temporally downscaling the video input, encoding the video input to generate an encoded video, then temporally and spatially upscaling the encoded video. Spatially downscaling may include reducing a resolution of the video input, and temporally downscaling may include reducing a framerate of the video input. Objective metrics for the upscaled encoded video show improved quality over conventional methods.
    Type: Application
    Filed: June 29, 2022
    Publication date: January 5, 2023
    Applicant: BITMOVIN, INC.
    Inventors: Hadi Amirpour, Christian Timmerer
  • Patent number: 11546401
    Abstract: According to embodiments of the disclosure, information of higher and lower quality encoded video segments is used to limit Rate-Distortion Optimization (RDO) for each Coding Unit Tree (CTU). A method first encodes the highest bit-rate segment and consequently uses it to encode the lowest bit-rate video segment. Block structure and selected reference frame of both highest and lowest bit-rate video segments are used to predict and shorten RDO process for each CTU in middle bit-rates. The method delays just one frame using parallel processing. This approach provides time-complexity reduction compared to the reference software for middle bit-rates while degradation is negligible.
    Type: Grant
    Filed: November 5, 2020
    Date of Patent: January 3, 2023
    Assignee: BITMOVIN, INC.
    Inventors: Hadi Amirpour, Ekrem Çetinkaya, Christian Timmerer
  • Publication number: 20220141476
    Abstract: Disclosed are systems and methods for lightweight transcoding of video. A distributed computing system for lightweight transcoding includes an origin server and an edge node, the origin server having a memory and a processor and configured to receive an input video comprising a bitstream, encode the bitstream into a set of representations corresponding to a full bitrate ladder, generate encoding metadata for the set of representations, and provide a representation and encoding metadata for the set of representations to an edge node, the edge node having a memory and a processor and configured to transcode the bitstream, or segments thereof, into the set of representations, and to serve one or more of the representations to a client.
    Type: Application
    Filed: July 30, 2021
    Publication date: May 5, 2022
    Applicant: BITMOVIN, INC.
    Inventors: Alireza Erfanian, Hadi Amirpour, Christian Timmerer, Hermann Hellwagner
  • Publication number: 20220094928
    Abstract: According to embodiments of the disclosure, fast multi-rate encoding may be performed using machine learning by encoding a lowest quality representation to determine encoding parameters, processing raw data of the video using a neural network to obtain an intermediate output comprising encoding features, augmenting the intermediate output with additional encoding features to form a final tensor, and processing the final tensor with another neural network to obtain a classification output comprising a split or not split decision for an image data block. The classification output may be used to encode a highest quality representation, and then other representations of the video.
    Type: Application
    Filed: December 2, 2021
    Publication date: March 24, 2022
    Applicant: BITMOVIN, INC.
    Inventors: Hadi Amirpour, Ekrem Çetinkaya, Christian Timmerer
  • Publication number: 20210144190
    Abstract: According to embodiments of the disclosure, information of higher and lower quality encoded video segments is used to limit Rate-Distortion Optimization (RDO) for each Coding Unit Tree (CTU). A method first encodes the highest bit-rate segment and consequently uses it to encode the lowest bit-rate video segment. Block structure and selected reference frame of both highest and lowest bit-rate video segments are used to predict and shorten RDO process for each CTU in middle bit-rates. The method delays just one frame using parallel processing. This approach provides time-complexity reduction compared to the reference software for middle bit-rates while degradation is negligible.
    Type: Application
    Filed: November 5, 2020
    Publication date: May 13, 2021
    Applicant: BITMOVIN, INC.
    Inventors: Hadi Amirpour, Ekrem Çetinkaya, Christian Timmerer