Systems and Methods for Group of Pictures Encoding

Systems and methods are provided encoding a video. Video data is captured. The video data is stored in a group of pictures format, where the group of pictures format comprises a plurality of GOP files, each GOP file including exactly one group of pictures unit. The video data is encoded into a prespecified format. Encoding includes accessing the GOP files containing the video data in the group of pictures format, and encoding the video data into the prespecified format using the GOP files to generate an encoded video.

Skip to: Description  ·  Claims  · Patent History  ·  Patent History
Description
CROSS REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. Provisional Patent Application No. 61/955,946, filed Mar. 20, 2014, entitled “Systems and Methods for Group of Pictures Encoding,” the entirety of which is herein incorporated by reference.

FIELD

This disclosure is related generally to digital video and more particularly to digital video encoding.

BACKGROUND

The demand for digital video continues to increase, especially in view of the continually falling costs of bandwidth for delivery of such video. While data processing speeds continue to increase, video encoding is still often a time consuming endeavor. Thus, there is a continuing need to streamline video encoding processes.

SUMMARY

Systems and methods are provided encoding a video. Video data is captured. The video data is stored in a group of pictures format, where the group of pictures format comprises a plurality of GOP files, each GOP file including exactly one group of pictures unit. The video data is encoded into a prespecified format. Encoding includes accessing the GOP files containing the video data in the group of pictures format, and encoding the video data into the prespecified format using the GOP files to generate an encoded video.

As another example, a system for encoding a video includes a computer-readable medium encoded with video data stored in a group of pictures format, wherein the group of pictures format comprises a plurality of GOP files, each GOP file including exactly one group of pictures unit. A data processor is configured to encode the video data in the group of pictures format using the GOP files to generate an encoded video.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram depicting example contents of a group of pictures (GOP) file.

FIG. 2 is a block diagram depicting a system for encoding a stream of video.

FIG. 3 is a block diagram depicting a system encoding video data in multiple formats at the same time using GOP files.

FIG. 4 is a diagram depicting an example combination of two video data portions in an encoded video.

FIG. 5 is a diagram depicting example GOP files involved in generating the encoded video of FIG. 4.

FIG. 6 is a diagram depicting a process for reencoding a video based on a transition adjustment.

DETAILED DESCRIPTION

FIG. 1 is a block diagram depicting example contents of a group of pictures (GOP) file. A GOP file 102 is a file containing data associated with a collection of video frames. A frame is one of many still pictures that when displayed in fast sequence produces a video display. Digital video utilizes a variety of video frame types as a form of compression that can facilitate transmission of digital video utilizing reduced bandwidth. Video frames can include I frames (e.g., intra coded picture frames) that contain a picture that is coded independently of all other frames. Thus, an I frame can be rendered on its own, without referencing data from other frames. Other types of video frames require reference to data from other frames for picture rendering. For example, an intermediate video frame may identify only pixels of a picture that have changed since a previous frame or previous I frame. By not including data associated with unchanged pixels, such an intermediate video frame often requires less data than an I frame (especially in portions of video containing limited pixel changes where displayed objects exhibit little motion). Example intermediate frames include P frames (e.g., predictive coded picture frames) that contain motion compensated difference information relative to one previously decoded picture and B frames (e.g., bipredictive coded picture frames) that contain motion compensated difference information relative to one or more previously decoded pictures. In the example of FIG. 1, captured video data 104 includes a first independently coded I frame 106 followed by a plurality of intermediate frames 108 (e.g., P frames and B frames), further followed by a second independently coded I frame 110.

A group of pictures is an independent unit of video data that includes exactly one independently coded picture frame and one or more intermediate frames that directly or indirectly reference the independently coded picture frame. A GOP file 102 contains exactly one group of pictures unit 112. A group of pictures unit includes one independently coded picture frame (e.g., I frame 106) and one or more intermediate frames 108 that directly or indirectly reference the independently coded frame. The GOP file 102 does not include the second independently coded frame 110. Each group of pictures of the video data 104 is compiled into a GOP file 102, where each GOP file 102 is stored in a computer-readable medium 114.

A computer-readable medium containing a collection of GOP files (e.g., an unencoded collection of an I frame and referencing P and B frames) can be utilized to generate encoded video in a variety of contexts. FIG. 2 is a block diagram depicting a system for encoding a stream of video. A computer-readable medium 202 is encoded with video data stored in a group of pictures format, where the group of pictures format comprises a plurality of GOP files 204, 206, 208, each GOP file 204, 206, 208 including exactly one group of pictures unit. A data processor 210 is configured to encode the video data in the group of pictures format using the GOP files 204, 206, 208 to generate and encoded video 212. In one example, the data processor 210 is configured to sequentially access the GOP files 204, 206, 208, process each GOP file, and add data to the encoded video 212 based on that GOP file.

FIG. 3 is a block diagram depicting a system encoding video data in multiple formats at the same time using GOP files. A computer-readable medium 302 contains a plurality of GOP files 304, 306, 308. A data processor(s) 310 is configured to access a first GOP file 304. The data processor 310 then encodes video data from the first GOP file 304 into a first format as part of a first format file 312. Then, or at the same time, the data processor 310 encodes video data from the first GOP file 304 into a second format as a part of a second format file 314. The data processor then accesses a second GOP file 306. The data processor 310 then encodes video data from the second GOP file 306 into the first format as part of the first format file 312. Then, or at the same time, the data processor 310 encodes video data from the second GOP file 306 into the second format as a part of the second format file 314. The system and process described in FIG. 3 can accelerate encoding of the two files 312, 314 into different formats by limiting the memory accesses to each group of pictures to a single access. If the video files in different formats 312, 314 were encoded wholly in serial, the data processor 310 would need to access each group of pictures from the computer-readable medium 302 twice.

FIG. 4 is a diagram depicting an example combination of two video data portions in an encoded video. First video data 402 and second video data 404 are captured and represented on respective timelines. A system receives instructions to generate an encoded video that includes a first portion 406 that includes the first video data 402 and a second portion 408 that includes the second video data. The instructions further command a transition portion. In the example of FIG. 4, the transition between the first portion 406 and the second portion 408 is a cross fade, where during the transition more of the second video data 404 is displayed on the screen and less of the first video data is included as the transition progresses. Example transitions include cross fades, dissolves, peels, slides, hard transitions, and wipes.

FIG. 5 is a diagram depicting example GOP files involved in generating the encoded video of FIG. 4. First video data 102 includes 8 GOP files, and second video data also includes 8 GOP files. In the commanded encoded video 106, GOP files 1.2, 1.3, 1.4, and 1.5 are present containing first video data, while GOP files 2.5, 2.6, and 2.7 are present containing second video data. In the commanded encoded video, GOP files 1.2, 1.3, 2.6, and 2.7 are displayed in full, while GOP files 1.4, 1.5, and 2.5 are associated a subset of GOP files that are associated with the transition. A data processor is configured to receive the identification of the portion of the first video data (i.e., the portions associated with GOP files 1.2, 1.3, 1.4, and 1.5) and the portion of the second video data (i.e., the portions associated with GOP files 2.5, 2.6, and 2.7) to be included in the encoded video 106 and accesses those GOP files. The processor is further configured to use the accessed GOP files to encode the video 506 in a prespecified format, including the transition, as commanded.

FIG. 6 is a diagram depicting a process for reencoding a video based on a transition adjustment. An encoded video 602 includes a portion associated with first video data 604 and a portion associated with second video data 606, as generated in the example of FIG. 5. In the encoded video, a first subset of GOP files (i.e., GOP files 1.4, 1.5, and 2.5) are associated with the transition between the first video data 604 and the second video data 606. A command is received to adjust the transition as shown at 608 to generate a reencoded video. Using traditional methods, the entirety of the first video data 604 and the second video data 606 would need to be accessed to generate the reencoded video 608. In contrast, the use of GOP files in the depicted system eliminates the need for access and reencoding for several of the GOP files. A second subset of GOP files (i.e., GOP files 1.3, 1.4, 2.3, and 2.4) are associated with the transition of the reencoded video.

The system is configured to access GOP files associated with the first subset and the second subset (i.e., GOP files 1.3, 1.4, 1.5, 2.3, 2.4, and 2.5). The reencoded video 608 will change at the portions associated with these GOP files, but will not change at portions associated with GOP files outside of the first subset and the second subset (i.e., GOP files 1.2, 2.6, and 2.7). The system reencodes the encoded video 602 based on the accessed first and second subset GOP files without reaccessing the GOP files outside of the first subset and the second subset. Instead, the system reutilizes the encoded portions of those unchanged periods in the encoded video. In the example of FIG. 6, the system reuses portions of the encoded video associated with GOP files 1.2, 2.6, and 2.7 in the reencoded video without accessing the associated GOP files or reencoding video data associated with those portions. The system reencodes portions associated GOP files 1.3, 1.4, 2.3, 2.4, and 2.5 while removing portions associated with GOP file 1.5 (e.g., remove a portion of the encoded video 602 associated with the transition, remove a portion of the encoded video 602 associated with the changed transition, and using the encoded video, GOP files from the first subset, and GOP files from the second subset to reencode the encoded video).

This application uses examples to illustrate the invention. The patentable scope of the invention includes other examples.

Claims

1. A method of encoding a video, comprising:

capturing video data;
storing the video data in a group of pictures format, wherein the group of pictures format comprises a plurality of GOP files, each GOP file including exactly one group of pictures unit;
encoding the video data into a prespecified format, wherein encoding comprises: accessing the GOP files containing the video data in the group of pictures format; and encoding the video data into the prespecified format using the GOP files to generate an encoded video.

2. The method of claim 1, wherein encoding comprises encoding the video data into two prespecified formats at the same time.

3. The method of claim 2, wherein said encoding comprises:

accessing a first GOP file;
encoding video data from the first GOP file into a first format as part of a first format file;
encoding video data from the first GOP file into a second format as part of a second format file;
accessing a second GOP file after encoding video data from the first GOP file into both of the first format and the second format;
encoding video data from the second GOP file into the first format as part of the first format file;
encoding video data from the second GOP file into the second format as part of the second format file.

4. The method of claim 2, wherein the encoding comprises two transmuxing operations performed at the same time.

5. The method of claim 1, further comprising:

capturing second video data;
storing the second video data in the group of pictures format;
receiving an identification of a portion of the video data and a portion of the second video data to be included in an encoded video;
wherein GOP files associated with the portion of the video data and the portion of the second video data are accessed; and
wherein the accessed GOP files are used to encode the video in the prespecified format to generate the encoded video.

6. The method of claim 5, wherein the encoded video includes the portion of the video data followed by the portion of the second video data.

7. The method of claim 6, wherein the encoded video includes a transition between the portion of the video data and the portion of the second video data, wherein a subset of the accessed GOP files are associated with the transition.

8. The method of claim 7, further comprising:

receiving a command to change the transition;
determining a second subset of the accessed GOP files associated with the changed transition;
accessing the subset of the GOP files;
accessing the second subset of the GOP files;
reencoding the encoded video based on GOP files in the subset and the second subset of the GOP files without reaccessing GOP files outside of the subset and the second subset.

9. The method of claim 8, wherein reencoding includes:

removing a portion of the encoded video associated with the transition;
removing a portion of the encoded video associated with the changed transition;
using the encoded video, GOP files from the subset of GOP files, and GOP files from the second subset of GOP files to reencode the encoded video.

10. The method of claim 1, wherein a GOP file comprises an independently coded frame and one or more dependently coded frames, wherein the dependently coded frames identify changes from the independently coded frame.

11. The method of claim 1, wherein a GOP file includes exactly one I frame and one or more B frames.

12. The method of claim 11, wherein a B frame cannot be generated without reference to a preceding I frame.

13. The method of claim 1, wherein an I frame is an intra coded picture, and wherein a B frame is a bipredictive coded picture.

14. A system for encoding a video, comprising:

a computer-readable medium encoded with video data stored in a group of pictures format, wherein the group of pictures format comprises a plurality of GOP files, each GOP file including exactly one group of pictures unit;
a data processor configured to encode the video data in the group of pictures format using the GOP files to generate an encoded video.

15. The system of claim 14, wherein the data processor is configured to encode the video data into two prespecified formats at the same time.

16. The system of claim 15, wherein the data processor is configured to:

accessing a first GOP file from the computer-readable medium;
encoding video data from the first GOP file into a first format as part of a first format file;
encoding video data from the first GOP file into a second format as part of a second format file;
accessing a second GOP file from the computer-readable medium after encoding video data from the first GOP file into both of the first format and the second format;
encoding video data from the second GOP file into the first format as part of the first format file;
encoding video data from the second GOP file into the second format as part of the second format file; and
storing the first format file and the second format file in the computer-readable medium.

17. The system of claim 14, wherein the computer-readable medium is further encoded with second video data in the group of pictures format, wherein the data processor is configured to access GOP files associated with a portion of the video data and a portion of the second video data, and wherein the data processor is configured to generate the encoded video using the accessed GOP files.

18. The system of claim 17, wherein the encoded video includes the portion of the video data followed by the portion of the second video data, wherein the encoded video includes a transition between the portion of the video data and the portion of the second video data, wherein a subset of the accessed GOP files are associated with the transition.

19. The system of claim 18, wherein upon receiving a command to change the transition, the data processor is configured to:

determine a second subset of the accessed GOP files associated with the changed transition;
access GOP files from the subset of the GOP files from the computer-readable medium;
access GOP files from the second subset of the GOP files from the computer-readable medium;
reencode the encoded video based on GOP files from the subset of the GOP files and the second subset of the GOP files without reaccessing GOP files outside of the subset and the second subset.

20. The system of claim 19, wherein the data processor is configured to reencode the encoded video by:

removing a portion of the encoded video associated with the transition;
removing a portion of the encoded video associated with the changed transition; and
using the encoded video, GOP files from the subset of GOP files, and GOP files from the second subset of GOP files to reencode the encoded video.
Patent History
Publication number: 20150271492
Type: Application
Filed: Mar 2, 2015
Publication Date: Sep 24, 2015
Inventor: Eric Burns (Seattle, WA)
Application Number: 14/634,964
Classifications
International Classification: H04N 19/114 (20060101); H04N 19/136 (20060101); H04N 19/577 (20060101); H04N 19/503 (20060101); H04N 19/593 (20060101); H04N 19/177 (20060101);