Patents Assigned to FastVDO LLC

Method and apparatus for applying deep learning techniques in video coding, restoration and video quality analysis (VQA)

Patent number: 12137230

Abstract: Video quality analysis may be used in many multimedia transmission and communication applications, such as encoder optimization, stream selection, and/or video reconstruction. An objective VQA metric that accurately reflects the quality of processed video relative to a source unprocessed video may take into account both spatial measures and temporal, motion-based measures when evaluating the processed video. Temporal measures may include differential motion metrics indicating a difference between a frame difference of a plurality of frames of the processed video relative to that of a corresponding plurality of frames of the source video. In addition, neural networks and deep learning techniques can be used to develop additional improved VQA metrics that take into account both spatial and temporal aspects of the processed and unprocessed videos.

Type: Grant

Filed: April 15, 2022

Date of Patent: November 5, 2024

Assignee: FASTVDO LLC

Inventors: Pankaj N. Topiwala, Madhu Peringassery Krishnan, Wei Dai
Method and apparatus for applying deep learning techniques in video coding, restoration and video quality analysis (VQA)

Patent number: 11310509

Abstract: Video quality analysis may be used in many multimedia transmission and communication applications, such as encoder optimization, stream selection, and/or video reconstruction. An objective VQA metric that accurately reflects the quality of processed video relative to a source unprocessed video may take into account both spatial measures and temporal, motion-based measures when evaluating the processed video. Temporal measures may include differential motion metrics indicating a difference between a frame difference of a plurality of frames of the processed video relative to that of a corresponding plurality of frames of the source video. In addition, neural networks and deep learning techniques can be used to develop additional improved VQA metrics that take into account both spatial and temporal aspects of the processed and unprocessed videos.

Type: Grant

Filed: December 11, 2020

Date of Patent: April 19, 2022

Assignee: FastVDO LLC

Inventors: Pankaj N. Topiwala, Madhu Peringassery Krishnan, Wei Dai
High dynamic range image/video coding

Patent number: 11265559

Abstract: A disclosed configuration includes a system (or a computer implemented method or a non-transitory computer readable medium) for automatically preprocessing higher dynamic range image data into lower dynamic range image data through a data adaptive tuning process. By automatically preprocessing the higher dynamic range image data into the lower dynamic range image data through the data adaptive tuning process, an existing encoding process for encoding the standard dynamic range image data can be applied to the lower dynamic range image data while preserving metadata sufficient to recover image fidelity even in the high dynamic range. In one aspect, the system (or a computer implemented method or a non-transitory computer readable medium) provides for backwards compatibility between high dynamic range video services and existing standard dynamic range services. In one aspect, regrading is applied in a domain that is perceptually more uniform than the domain it is initially presented.

Type: Grant

Filed: December 24, 2020

Date of Patent: March 1, 2022

Assignee: FastVDO LLC

Inventors: Pankaj N. Topiwala, Wei Dai, Madhu P. Krishnan
Adaptive resolution change in video streams

Patent number: 11206404

Abstract: A video coding system in which video images of a video bitstream are rescaled prior to encoding, and again at the decoder upon reception. When encoding a given video frame, the video encoder deduces a level of resampling to apply to a reference frame in order to properly predict blocks in the given video frame or the full given video frame, and carries out one or more predictions by first applying a resampling process on the reference frame data at the deduced level. To decode the given video frame of the bitstream, a video decoder compares a size of the given video frame to sizes of a reference frame to determine a level of resampling for the reference frame data, and carries out predictions to generate predicted data by first applying the determined level of resampling to the reference frame data.

Type: Grant

Filed: July 7, 2020

Date of Patent: December 21, 2021

Assignee: FastVDO LLC

Inventors: Pankaj N. Topiwala, Madhu Peringassery Krishnan, Wei Dai
Method and apparatus for applying deep learning techniques in video coding, restoration and video quality analysis (VQA)

Patent number: 10880551

Abstract: Video quality analysis may be used in many multimedia transmission and communication applications, such as encoder optimization, stream selection, and/or video reconstruction. An objective VQA metric that accurately reflects the quality of processed video relative to a source unprocessed video may take into account both spatial measures and temporal, motion-based measures when evaluating the processed video. Temporal measures may include differential motion metrics indicating a difference between a frame difference of a plurality of frames of the processed video relative to that of a corresponding plurality of frames of the source video. In addition, neural networks and deep learning techniques can be used to develop additional improved VQA metrics that take into account both spatial and temporal aspects of the processed and unprocessed videos.

Type: Grant

Filed: July 10, 2019

Date of Patent: December 29, 2020

Assignee: FastVDO LLC

Inventors: Pankaj N. Topiwala, Madhu Peringassery Krishnan, Wei Dai
High dynamic range image/video coding

Patent number: 10880557

Abstract: A disclosed configuration includes a system (or a computer implemented method or a non-transitory computer readable medium) for automatically preprocessing higher dynamic range image data into lower dynamic range image data through a data adaptive tuning process. By automatically preprocessing the higher dynamic range image data into the lower dynamic range image data through the data adaptive tuning process, an existing encoding process for encoding the standard dynamic range image data can be applied to the lower dynamic range image data while preserving metadata sufficient to recover image fidelity even in the high dynamic range. In one aspect, the system (or a computer implemented method or a non-transitory computer readable medium) provides for backwards compatibility between high dynamic range video services and existing standard dynamic range services. In one aspect, regrading is applied in a domain that is perceptually more uniform than the domain it is initially presented.

Type: Grant

Filed: June 3, 2016

Date of Patent: December 29, 2020

Assignee: FASTVDO LLC

Inventors: Wei Dai, Madhu P. Krishnan, Pankaj N. Topiwala
Enhancements of the AV1 video codec

Patent number: 10834400

Abstract: Encoding and decoding systems disclosed enhance the AV1/VPX codecs in the context of 8-bit SDR video and 10-bit HDR video content, for applications including streaming and high quality coding for content contribution editing. For SDR content, lapped biorthogonal transforms for near lossless applications and used and optimized resampling filter pairs for adaptive resolution coding in streaming applications are used. For HDR content, a data adaptive grading technique in conjunction with the VP9/VP10 encoder may be used. The encoding/decoding system provides substantial value in the coding of HDR content, and provides backward compatibility with SDR.

Type: Grant

Filed: August 21, 2017

Date of Patent: November 10, 2020

Assignee: FastVDO LLC

Inventors: Pankaj N. Topiwala, Wei Dai, Madhu Peringassery Krishnan
Method and system for fully immersive virtual reality

Patent number: 10650590

Abstract: Methods and systems use a video sensor grid over an area, and extensive signal processing, to create a model-based view of reality. Grid-based synchronous capture, point cloud generation and refinement, morphology, polygonal tiling and surface representation, texture mapping, data compression, and system-level components for user-directed signal processing, is used to create, at user demand, a virtualized world, viewable from any location in an area, in any direction of gaze, at any time within an interval of capture. This data stream is transmitted for near-term network-based delivery, and 5G. Finally, that virtualized world, because it is inherently model-based, is integrated with augmentations (or deletions), creating a harmonized and photorealistic mix of real, and synthetic, worlds. This provides a fully immersive, mixed reality world, in which full interactivity, using gestures, is supported.

Type: Grant

Filed: September 7, 2017

Date of Patent: May 12, 2020

Assignee: FASTVDO LLC

Inventors: Pankaj N. Topiwala, Wei Dai, Madhu P. Krishnan
Artificial intelligence based generation and analysis of 3D models

Patent number: 10621779

Abstract: Artificial intelligence based techniques are used for analysis of 3D objects in conjunction with each other. A 3D model of two or more 3D objects is generated. Features of 3D objects are matched to develop a correspondence between the 3D objects. Two 3D objects are geometrically mapped and an object is overlayed on another 3D object to obtain a superimposed object. Match analysis of 3D objects is performed based on machine learning based models to determine how well the objects are spatially matched. The analysis of the objects is used in augmented reality applications.

Type: Grant

Filed: May 25, 2018

Date of Patent: April 14, 2020

Assignee: FastVDO LLC

Inventors: Pankaj N. Topiwala, Madhu Peringassery Krishnan, Wei Dai
Visual language for human computer interfaces

Patent number: 10372226

Abstract: Embodiments of the invention recognize human visual gestures, as captured by image and video sensors, to develop a visual language for a variety of human computer interfaces. One embodiment provides a method for recognizing a hand gesture positioned by a user hand. The method includes steps of capturing a digital color image of a user hand against a background, applying a general parametric model to the digital color image of the user hand to generate a specific parametric template of the user hand, receiving a second digital image of the user hand positioned to represent a hand gesture, detecting a hand contour of the hand gesture based at least in part on the specific parametric template of the user hand, and recognizing the hand gesture based at least in part on the detected hand contour. Other embodiments include recognizing hand gestures, facial gestures or body gestures captured in a video.

Type: Grant

Filed: November 8, 2016

Date of Patent: August 6, 2019

Assignee: FASTVDO LLC

Inventors: Wei Dai, Madhu Peringassery Krishnan, Pankaj Topiwala
Adaptive coding, transmission and efficient display of multimedia (ACTED)

Patent number: 10306238

Abstract: Embodiments of the invention receive videos and feedback data associated with the videos from a client device and adaptively encode the videos based on the feedback data. The encoded videos are suitable to be transmitted over a network and displayed on the client device. Embodiments of an encoding server adaptively changes resolution of a video on the fly or scale the video quality up or down based on the factors described by the feedback data, including network condition for transmitting the encoded video, network delay, encoder and decoder processing capacity and feedback from viewers of the decoded video. Furthermore, the encoding server adaptively encodes the video based on a combination of various factors described by the feedback data.

Type: Grant

Filed: March 2, 2017

Date of Patent: May 28, 2019

Assignee: FASTVDO LLC

Inventors: Pankaj Topiwala, Wei Dai, Madhu Peringassery Krishnan
Device and method for fast block-matching motion estimation in video encoders

Patent number: 10306260

Abstract: A solution is provided to estimate motion vectors of a video. A multistage motion vector prediction engine is configured to estimate multiple best block-matching motion vectors for each block in each video frame of the video. For each stage of the motion vector estimation for a block of a video frame, the prediction engine selects a test vector form a predictor set of test vectors, computes a rate-distortion optimization (RDO) based metric for the selected test vector, and selects a subset of test vectors as individual best matched motion vectors based on the RDO based metric. The selected individual best matched motion vectors are compared and a total best matched motion vector is selected based on the comparison. The prediction engine selects iteratively applies one or more global matching criteria to the selected best matched motion vector to select a best matched motion vector for the block of pixels.

Type: Grant

Filed: June 5, 2017

Date of Patent: May 28, 2019

Assignee: FASTVDO LLC

Inventors: Alexandros Tourapis, Hye-Yeon Cheong, Pankaj Topiwala
Motion-assisted visual language for human computer interfaces

Patent number: 10168794

Abstract: Embodiments of the invention recognize human visual gestures, as captured by image and video sensors, to develop a visual language for a variety of human computer interfaces. One embodiment of the invention provides a computer-implement method for recognizing a visual gesture portrayed by a part of human body such as a human hand, face or body. The method includes steps of receiving the visual signature captured in a video having multiple video frames, determining a gesture recognition type from multiple gesture recognition types including shaped-based gesture, position-based gesture, motion-assisted and mixed gesture that combining two different gesture types. The method further includes steps of selecting a visual gesture recognition process based on the determined gesture type and applying the selected visual gesture recognition process to the multiple video frames capturing the visual gesture to recognize the visual gesture.

Type: Grant

Filed: November 27, 2017

Date of Patent: January 1, 2019

Assignee: FASTVDO LLC

Inventors: Wei Dai, Madhu Peringassery Krishnan, Pankaj Topiwala
Motion-assisted visual language for human computer interfaces

Patent number: 9829984

Abstract: Embodiments of the invention recognize human visual gestures, as captured by image and video sensors, to develop a visual language for a variety of human computer interfaces. One embodiment of the invention provides a computer-implement method for recognizing a visual gesture portrayed by a part of human body such as a human hand, face or body. The method includes steps of receiving the visual signature captured in a video having multiple video frames, determining a gesture recognition type from multiple gesture recognition types including shaped-based gesture, position-based gesture, motion-assisted and mixed gesture that combining two different gesture types. The method further includes steps of selecting a visual gesture recognition process based on the determined gesture type and applying the selected visual gesture recognition process to the multiple video frames capturing the visual gesture to recognize the visual gesture.

Type: Grant

Filed: November 20, 2013

Date of Patent: November 28, 2017

Assignee: FastVDO LLC

Inventors: Wei Dai, Madhu Peringassery Krishnan, Pankaj Topiwala
Device and method for fast block-matching motion estimation in video encoders

Patent number: 9674548

Abstract: A solution is provided to estimate motion vectors of a video. A multistage motion vector prediction engine is configured to estimate multiple best block-matching motion vectors for each block in each video frame of the video. For each stage of the motion vector estimation for a block of a video frame, the prediction engine selects a test vector form a predictor set of test vectors, computes a rate-distortion optimization (RDO) based metric for the selected test vector, and selects a subset of test vectors as individual best matched motion vectors based on the RDO based metric. The selected individual best matched motion vectors are compared and a total best matched motion vector is selected based on the comparison. The prediction engine selects iteratively applies one or more global matching criteria to the selected best matched motion vector to select a best matched motion vector for the block of pixels.

Type: Grant

Filed: October 13, 2014

Date of Patent: June 6, 2017

Assignee: FastVDO LLC

Inventors: Alexandros Tourapis, Hye-Yeon Cheong, Pankaj Topiwala
Adaptive coding, transmission and efficient display of multimedia (acted)

Patent number: 9609336

Abstract: Embodiments of the invention receive videos and feedback data associated with the videos from a client device and adaptively encode the videos based on the feedback data. The encoded videos are suitable to be transmitted over a network and displayed on the client device. Embodiments of an encoding server adaptively changes resolution of a video on the fly or scale the video quality up or down based on the factors described by the feedback data, including network condition for transmitting the encoded video, network delay, encoder and decoder processing capacity and feedback from viewers of the decoded video. Furthermore, the encoding server adaptively encodes the video based on a combination of various factors described by the feedback data.

Type: Grant

Filed: March 26, 2014

Date of Patent: March 28, 2017

Assignee: FastVDO LLC

Inventors: Pankaj Topiwala, Wei Dai, Madhu Peringassery Krishnan
Visual language for human computer interfaces

Patent number: 9524028

Abstract: Embodiments of the invention recognize human visual gestures, as captured by image and video sensors, to develop a visual language for a variety of human computer interfaces. One embodiment provides a method for recognizing a hand gesture positioned by a user hand. The method includes steps of capturing a digital color image of a user hand against a background, applying a general parametric model to the digital color image of the user hand to generate a specific parametric template of the user hand, receiving a second digital image of the user hand positioned to represent a hand gesture, detecting a hand contour of the hand gesture based at least in part on the specific parametric template of the user hand, and recognizing the hand gesture based at least in part on the detected hand contour. Other embodiments include recognizing hand gestures, facial gestures or body gestures captured in a video.

Type: Grant

Filed: March 14, 2013

Date of Patent: December 20, 2016

Assignee: FastVDO LLC

Inventors: Wei Dai, Madhu Peringassery Krishnan, Pankaj Topiwala
Device and method for fast block-matching motion estimation in video encoders

Patent number: 8913660

Abstract: Motion estimation is the science of predicting the current frame in a video sequence from the past frame (or frames), by slicing it into rectangular blocks of pixels, and matching these to past such blocks. The displacement in the spatial position of the block in the current frame with respect to the past frame is called the motion vector. This method of temporally decorrelating the video sequence by finding the best matching blocks from past reference frames—motion estimation—makes up about 80% or more of the computation in a video encoder. In this patent disclosure, we define a method of searching only a very sparse subset of possible displacement positions (or motion vectors) among all possible ones, to see if we can get a good enough match, and terminate early. This sparse subset of motion vectors is preselected using prior knowledge and extensive testing on video sequences.

Type: Grant

Filed: April 14, 2006

Date of Patent: December 16, 2014

Assignee: FastVDO, LLC

Inventors: Alexis Michael Tourapis, Hye-Yeon Cheong, Pankaj N. Topiwala
Motion-Assisted Visual Language For Human Computer Interfaces

Publication number: 20140347263

Abstract: Embodiments of the invention recognize human visual gestures, as captured by image and video sensors, to develop a visual language for a variety of human computer interfaces. One embodiment of the invention provides a computer-implement method for recognizing a visual gesture portrayed by a part of human body such as a human hand, face or body. The method includes steps of receiving the visual signature captured in a video having multiple video frames, determining a gesture recognition type from multiple gesture recognition types including shaped-based gesture, position-based gesture, motion-assisted and mixed gesture that combining two different gesture types. The method further includes steps of selecting a visual gesture recognition process based on the determined gesture type and applying the selected visual gesture recognition process to the multiple video frames capturing the visual gesture to recognize the visual gesture.

Type: Application

Filed: November 20, 2013

Publication date: November 27, 2014

Applicant: FastVDO LLC

Inventors: Wei Dai, Madhu Peringassery Krishnan, Pankaj Topiwala
ADAPTIVE CODING, TRANSMISSION AND EFFICIENT DISPLAY OF MULTIMEDIA (ACTED)

Publication number: 20140307785

Abstract: Embodiments of the invention receive videos and feedback data associated with the videos from a client device and adaptively encode the videos based on the feedback data. The encoded videos are suitable to be transmitted over a network and displayed on the client device. Embodiments of an encoding server adaptively changes resolution of a video on the fly or scale the video quality up or down based on the factors described by the feedback data, including network condition for transmitting the encoded video, network delay, encoder and decoder processing capacity and feedback from viewers of the decoded video. Furthermore, the encoding server adaptively encodes the video based on a combination of various factors described by the feedback data.

Type: Application

Filed: March 26, 2014

Publication date: October 16, 2014

Applicant: FastVDO LLC

Inventors: Pankaj Topiwala, Wei Dai, Madhu Peringassery Krishnan

1 2 next