Patents by Inventor Vignesh V. Menon
Vignesh V. Menon has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12166997Abstract: Techniques for implementing perceptually aware per-title encoding may include receiving an input video, a set of resolutions, a maximum target bitrate and a minimum target bitrate, extracting content aware features for each segment of the input video, predicting a perceptually aware bitrate-resolution pair for each segment using a model configured to optimize for a quality metric using constants trained for each of the set of resolutions, generating a target encoding set including a set of perceptually aware bitrate-resolution pairs, and encoding the target encoding set. The content aware features may include a spatial energy feature and an average temporal energy. According to these methods only a subset of bitrates and resolutions, less than a full set of bitrates and resolutions, are encoded to provide high quality video content for streaming.Type: GrantFiled: April 27, 2023Date of Patent: December 10, 2024Assignee: BITMOVIN, GMBHInventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
-
Publication number: 20240121400Abstract: Techniques for predicting video encoding complexity are described herein. A method for predicting video encoding complexity includes performing video complexity feature extraction on a video segment to extract low-complexity frame-based features, predicting video encoding complexity for the video segment using the low-complexity frame-based features, and outputting a predicted encoding bitrate and a predicted encoding time. An embodiment may include implementing a hybrid model using a CNN, wherein a latent vector from a frame of the video segment is extracted and also may be used to predict video encoding complexity. The predicted encoding bitrates and encoding times may be provided to encoding infrastructure for use in optimizing a schedule of encodings.Type: ApplicationFiled: September 22, 2023Publication date: April 11, 2024Applicant: Bitmovin GmbHInventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
-
Publication number: 20240114183Abstract: Techniques for efficient two-pass encoding for live streaming are described herein. A method for efficient two-pass encoding may include extracting low-complexity features of a video segment, predicting an optimized constant rate factor (CRF) for the video segment using the low-complexity features, and encoding the video segment with the optimized CRF at a target bitrate. A system for efficient two-pass encoding may include a feature extraction module configured to extract low-complexity features from a video segment, a neural network configured to predict an optimized CRF as a function of the low-complexity features and a target bitrate, and an encoder configured to encode the video segment using the optimized CRF at the target bitrate.Type: ApplicationFiled: September 22, 2023Publication date: April 4, 2024Applicant: Bitmovin GmbHInventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
-
Publication number: 20240098247Abstract: Techniques for content-adaptive encoder preset prediction for adaptive live streaming are described herein. A method for content-adaptive encoder preset prediction for adaptive live streaming includes performing video complexity feature extraction on a video segment to extract complexity features such as an average texture energy, an average temporal energy, and an average lumiscence. These inputs may be provided to an encoding time prediction model, along with a bitrate ladder, a resolution set, a target video encoding speed, and a number of CPU threads for the video segment, to predict an encoding time, and an optimized encoding preset may be selected for the video segment by a preset selection function using the predicted encoding time. The video segment may be encoded according to the optimized encoding preset.Type: ApplicationFiled: September 12, 2023Publication date: March 21, 2024Applicant: Bitmovin GmbHInventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
-
Patent number: 11924437Abstract: The technology described herein relates to variable framerate encoding. A method for variable framerate encoding includes receiving shots, as segmented from a video input, extracting features for each of the shots, the features including at least a spatial energy feature and an average temporal energy, predicting a frame dropping factor for each of the shots based on the spatial energy feature and the average temporal energy, predicting an optimized framerate for each of the shots based on the frame dropping factor, downscaling and encoding each of the shots using the optimized framerate. The encoded shots may then be decoded and upscaled back to their original framerates.Type: GrantFiled: March 28, 2022Date of Patent: March 5, 2024Assignee: Bitmovin GmbHInventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
-
Patent number: 11902580Abstract: The technology described herein relates to online per-title encoding. A method for online per-title encoding includes receiving a video input, generating segments of the video input, extracting a spatial feature and a temporal feature, predicting bitrate-resolution pairs based on the spatial feature and the temporal feature, using a discrete cosine transform (DCT)-based energy function, and per-title encoding segments of the video input for the predicted bitrate-resolution pairs. A system for online per-title encoding may include memory for storing a set of bitrates, a set of resolutions, and a machine learning module configured to predict bitrate resolution pairs based on low-complexity spatial and temporal features.Type: GrantFiled: December 8, 2021Date of Patent: February 13, 2024Assignee: BITMOVIN, INC.Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
-
Publication number: 20230388511Abstract: Techniques for implementing perceptually aware per-title encoding may include receiving an input video, a set of resolutions, a maximum target bitrate and a minimum target bitrate, extracting content aware features for each segment of the input video, predicting a perceptually aware bitrate-resolution pair for each segment using a model configured to optimize for a quality metric using constants trained for each of the set of resolutions, generating a target encoding set including a set of perceptually aware bitrate-resolution pairs, and encoding the target encoding set. The content aware features may include a spatial energy feature and an average temporal energy. According to these methods only a subset of bitrates and resolutions, less than a full set of bitrates and resolutions, are encoded to provide high quality video content for streaming.Type: ApplicationFiled: April 27, 2023Publication date: November 30, 2023Applicant: Bitmovin GmbHInventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
-
Publication number: 20230308657Abstract: The technology described herein relates to variable framerate encoding. A method for variable framerate encoding includes receiving shots, as segmented from a video input, extracting features for each of the shots, the features including at least a spatial energy feature and an average temporal energy, predicting a frame dropping factor for each of the shots based on the spatial energy feature and the average temporal energy, predicting an optimized framerate for each of the shots based on the frame dropping factor, downscaling and encoding each of the shots using the optimized framerate. The encoded shots may then be decoded and upscaled back to their original framerates.Type: ApplicationFiled: March 28, 2022Publication date: September 28, 2023Applicant: Bitmovin GmbHInventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer
-
Publication number: 20230179800Abstract: The technology described herein relates to online per-title encoding. A method for online per-title encoding includes receiving a video input, generating segments of the video input, extracting a spatial feature and a temporal feature, predicting bitrate-resolution pairs based on the spatial feature and the temporal feature, using a discrete cosine transform (DCT)-based energy function, and per-title encoding segments of the video input for the predicted bitrate-resolution pairs. A system for online per-title encoding may include memory for storing a set of bitrates, a set of resolutions, and a machine learning module configured to predict bitrate resolution pairs based on low-complexity spatial and temporal features.Type: ApplicationFiled: December 8, 2021Publication date: June 8, 2023Applicant: BITMOVIN, INC.Inventors: Vignesh V. Menon, Hadi Amirpour, Christian Timmerer