Patents by Inventor Vignesh V. Menon

Vignesh V. Menon has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Content-adaptive encoder preset prediction for adaptive live streaming

Patent number: 12355939

Abstract: Techniques for content-adaptive encoder preset prediction for adaptive live streaming are described herein. A method for content-adaptive encoder preset prediction for adaptive live streaming includes performing video complexity feature extraction on a video segment to extract complexity features such as an average texture energy, an average temporal energy, and an average lumiscence. These inputs may be provided to an encoding time prediction model, along with a bitrate ladder, a resolution set, a target video encoding speed, and a number of CPU threads for the video segment, to predict an encoding time, and an optimized encoding preset may be selected for the video segment by a preset selection function using the predicted encoding time. The video segment may be encoded according to the optimized encoding preset.

Type: Grant

Filed: September 12, 2023

Date of Patent: July 8, 2025

Assignee: Bitmovin GmbH

Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
Perceptually-aware online per-title encoding for live video streaming

Patent number: 12166997

Abstract: Techniques for implementing perceptually aware per-title encoding may include receiving an input video, a set of resolutions, a maximum target bitrate and a minimum target bitrate, extracting content aware features for each segment of the input video, predicting a perceptually aware bitrate-resolution pair for each segment using a model configured to optimize for a quality metric using constants trained for each of the set of resolutions, generating a target encoding set including a set of perceptually aware bitrate-resolution pairs, and encoding the target encoding set. The content aware features may include a spatial energy feature and an average temporal energy. According to these methods only a subset of bitrates and resolutions, less than a full set of bitrates and resolutions, are encoded to provide high quality video content for streaming.

Type: Grant

Filed: April 27, 2023

Date of Patent: December 10, 2024

Assignee: BITMOVIN, GMBH

Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
Video Encoding Complexity Predictor

Publication number: 20240121400

Abstract: Techniques for predicting video encoding complexity are described herein. A method for predicting video encoding complexity includes performing video complexity feature extraction on a video segment to extract low-complexity frame-based features, predicting video encoding complexity for the video segment using the low-complexity frame-based features, and outputting a predicted encoding bitrate and a predicted encoding time. An embodiment may include implementing a hybrid model using a CNN, wherein a latent vector from a frame of the video segment is extracted and also may be used to predict video encoding complexity. The predicted encoding bitrates and encoding times may be provided to encoding infrastructure for use in optimizing a schedule of encodings.

Type: Application

Filed: September 22, 2023

Publication date: April 11, 2024

Applicant: Bitmovin GmbH

Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
Efficient Two-Pass Encoding Scheme for Adaptive Live Streaming

Publication number: 20240114183

Abstract: Techniques for efficient two-pass encoding for live streaming are described herein. A method for efficient two-pass encoding may include extracting low-complexity features of a video segment, predicting an optimized constant rate factor (CRF) for the video segment using the low-complexity features, and encoding the video segment with the optimized CRF at a target bitrate. A system for efficient two-pass encoding may include a feature extraction module configured to extract low-complexity features from a video segment, a neural network configured to predict an optimized CRF as a function of the low-complexity features and a target bitrate, and an encoder configured to encode the video segment using the optimized CRF at the target bitrate.

Type: Application

Filed: September 22, 2023

Publication date: April 4, 2024

Applicant: Bitmovin GmbH

Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
Content-adaptive Encoder Preset Prediction for Adaptive Live Streaming

Publication number: 20240098247

Abstract: Techniques for content-adaptive encoder preset prediction for adaptive live streaming are described herein. A method for content-adaptive encoder preset prediction for adaptive live streaming includes performing video complexity feature extraction on a video segment to extract complexity features such as an average texture energy, an average temporal energy, and an average lumiscence. These inputs may be provided to an encoding time prediction model, along with a bitrate ladder, a resolution set, a target video encoding speed, and a number of CPU threads for the video segment, to predict an encoding time, and an optimized encoding preset may be selected for the video segment by a preset selection function using the predicted encoding time. The video segment may be encoded according to the optimized encoding preset.

Type: Application

Filed: September 12, 2023

Publication date: March 21, 2024

Applicant: Bitmovin GmbH

Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
Variable framerate encoding using content-aware framerate prediction for high framerate videos

Patent number: 11924437

Abstract: The technology described herein relates to variable framerate encoding. A method for variable framerate encoding includes receiving shots, as segmented from a video input, extracting features for each of the shots, the features including at least a spatial energy feature and an average temporal energy, predicting a frame dropping factor for each of the shots based on the spatial energy feature and the average temporal energy, predicting an optimized framerate for each of the shots based on the frame dropping factor, downscaling and encoding each of the shots using the optimized framerate. The encoded shots may then be decoded and upscaled back to their original framerates.

Type: Grant

Filed: March 28, 2022

Date of Patent: March 5, 2024

Assignee: Bitmovin GmbH

Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
Low-latency online per-title encoding

Patent number: 11902580

Abstract: The technology described herein relates to online per-title encoding. A method for online per-title encoding includes receiving a video input, generating segments of the video input, extracting a spatial feature and a temporal feature, predicting bitrate-resolution pairs based on the spatial feature and the temporal feature, using a discrete cosine transform (DCT)-based energy function, and per-title encoding segments of the video input for the predicted bitrate-resolution pairs. A system for online per-title encoding may include memory for storing a set of bitrates, a set of resolutions, and a machine learning module configured to predict bitrate resolution pairs based on low-complexity spatial and temporal features.

Type: Grant

Filed: December 8, 2021

Date of Patent: February 13, 2024

Assignee: BITMOVIN, INC.

Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
Perceptually-aware Online Per-title Encoding for Live Video Streaming

Publication number: 20230388511

Abstract: Techniques for implementing perceptually aware per-title encoding may include receiving an input video, a set of resolutions, a maximum target bitrate and a minimum target bitrate, extracting content aware features for each segment of the input video, predicting a perceptually aware bitrate-resolution pair for each segment using a model configured to optimize for a quality metric using constants trained for each of the set of resolutions, generating a target encoding set including a set of perceptually aware bitrate-resolution pairs, and encoding the target encoding set. The content aware features may include a spatial energy feature and an average temporal energy. According to these methods only a subset of bitrates and resolutions, less than a full set of bitrates and resolutions, are encoded to provide high quality video content for streaming.

Type: Application

Filed: April 27, 2023

Publication date: November 30, 2023

Applicant: Bitmovin GmbH

Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
Variable Framerate Encoding Using Content-Aware Framerate Prediction for High Framerate Videos

Publication number: 20230308657

Abstract: The technology described herein relates to variable framerate encoding. A method for variable framerate encoding includes receiving shots, as segmented from a video input, extracting features for each of the shots, the features including at least a spatial energy feature and an average temporal energy, predicting a frame dropping factor for each of the shots based on the spatial energy feature and the average temporal energy, predicting an optimized framerate for each of the shots based on the frame dropping factor, downscaling and encoding each of the shots using the optimized framerate. The encoded shots may then be decoded and upscaled back to their original framerates.

Type: Application

Filed: March 28, 2022

Publication date: September 28, 2023

Applicant: Bitmovin GmbH

Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
Low-Latency Online Per-Title Encoding

Publication number: 20230179800

Abstract: The technology described herein relates to online per-title encoding. A method for online per-title encoding includes receiving a video input, generating segments of the video input, extracting a spatial feature and a temporal feature, predicting bitrate-resolution pairs based on the spatial feature and the temporal feature, using a discrete cosine transform (DCT)-based energy function, and per-title encoding segments of the video input for the predicted bitrate-resolution pairs. A system for online per-title encoding may include memory for storing a set of bitrates, a set of resolutions, and a machine learning module configured to predict bitrate resolution pairs based on low-complexity spatial and temporal features.

Type: Application

Filed: December 8, 2021

Publication date: June 8, 2023

Applicant: BITMOVIN, INC.

Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer