Scene Change Detector
A scene change detector provides automatic detection of scene changes by utilizing frame-to-frame differences in chrominance components of video frames analyzed for information content.
This application claims the benefit of U.S. provisional application 61/089,760 filed Aug. 18, 2008 hereby incorporated in its entirety by reference.
BACKGROUND OF THE INVENTIONThe present invention relates to systems for detecting changes in scenes in video and, in particular, to an improved method of automatically detecting changes in scenes in a series of uncompressed video frames.
Video is typically stored as a sequence of time ordered frames, each frame essentially being a still image comprised of picture elements (pixels) providing values of the image over a rectilinear grid. The pixel values may, for example, be encoded in RGB format, where each pixel value is an ordered triplet providing the perceived brightness of a red, blue, or green component, or Y′UV encoded where each pixel value is an ordered triplet, the first describing a monochromatic luminance of the pixel and the latter two providing a vector indicating a chrominance or color value of the pixel. Y′UV encoding takes advantage of a decreased spatial sensitivity of human vision to color to decrease the information depth of the chrominance components with respect to luminance components.
It is often desired to be able to separate a video file into scenes, a scene generally representing an abrupt change in the video image as a result of a change in camera orientation. Scenes constitute the fundamental element used for video editing and therefore are useful as a prelude to such editing. In addition, scenes generally follow a logical division of the information presented by the video and therefore can be used to catalog or index the video, for example, by capturing a single representative frame from each scene.
Automated systems for detecting changes in scenes are often fooled by sudden shifts in the brightness of the image or by changes in a portion of the image in which no scene change has occurred.
SUMMARY OF THE INVENTIONThe present inventors have determined that improved automatic scene detection can be obtained by looking not at the brightness of the video image but rather at changes in its color components. In particular, for uncompressed video in the Y′UV format, the present invention may review a single chrominance component (U or V).
Specifically then, the present invention provides a scene change detector for video receiving a color video signal of time ordered frames of pixels providing luminance and chrominance components. A chrominance extractor extracts a selected chrominance component from the frames to produce chrominance frames without luminance information. Next, an information change detector receives a sequence of chrominance frames to output a scene change signal based on changes between sequential chrominance frames.
It is thus a feature of at least one embodiment of the invention to provide a scene change detector operating on the insight that scene change information is accentuated in chrominance signals. It is a feature at least one embodiment of the invention to provide a scene change detector that may work efficiently as a reduced data set represented by a single chrominance signal.
The information change detector may measure the information content of a difference between sequential frames.
It is thus a feature of at least one embodiment of the invention to analyze frame-to-frame chrominance changes to reduce the data that must be analyzed for information content.
The measure of information content may be entropy.
It is thus a feature of at least one embodiment of the invention to simply quantify information content.
The information change detector provides an output that is a function of a change in changes of sequential chrominance frames.
It is thus a feature of at least one embodiment of the invention to accentuate change sensitivity.
The chrominance signal is the U chrominance signal.
It is thus a feature of at least one embodiment of the invention to select a chrominance component that may be particularly sensitive to scene changes.
The color video signal may be uncompressed.
It is thus a feature of at least one embodiment of the invention to provide a system that may operate on a real-time basis with raw video data.
The information change detector further may further include a comparator comparing a change in a value of a function of information content of a difference between frames to a threshold value set to provide a predetermined number of scene changes per a time interval.
It is thus a feature of at least one embodiment of the invention to provide a single parameter that may be simply adjusted to correct for a variety of different video media content.
The scene change detector may further include a display for displaying the threshold value juxtaposed on a time plot of the function and including a control for varying the threshold value to visually move the threshold value with respect to the plot of the function.
It is thus a feature of at least one embodiment of the invention to provide a simple user interface compatible with the present invention.
These particular features and advantages may apply to only some embodiments falling within the claims and thus do not define the scope of the invention.
Referring now to
The color video signal 12 is received by a chrominance extractor 14 that receives the uncompressed color video signal 12 and extracts the chrominance frames 16. Each chrominance frame 16 consists of a video frame providing only the extractive chrominance signal 13 or 15. In the preferred embodiment, chrominance signal 13 is used; however, chrominance signal 15 may also be used. With uncompressed video in the Y′UV format, the chrominance extractor 14 may simply be an electrical switch connected to the appropriate chrominance signal 13 or 15.
The chrominance frame 16 is then received by a change detector 20 that processes the chrominance frames 16 and outputs a scene change signal 18 indicating whether there has been a scene change.
Referring now to
Specifically, the frame buffer 22 may output a sequential first chrominance frame 24, second chrominance frame 26 and third chrominance frame 28. A first subtractor 30 receives first chrominance frame 24 and second chrominance frame 26 and subtracts the second chrominance frame 26 from first chrominance frame 24 to output a first difference frame 34. A second subtractor 32 receives second chrominance frame 26 and third chrominance frame 28 and subtracts the third chrominance frame 28 from the second chrominance frame 26 and outputs a second difference frame 36. The subtractions are performed on a pixel-by-pixel basis so that the first difference frame 34 and second difference frame 36 have the same image size as the received frames 24-26.
Each difference frame 34 and 36 is received by a corresponding magnitude extractor taking the absolute value of every pixel in the frames 34 and 36 to produce a first absolute difference frame 42 corresponding to the difference frame 34 and a second absolute difference frame 44 corresponding to the difference frame 36.
The absolute difference frames 42 and 44 maybe describe mathematically by the following equations (1) and (2):
Fn−1(x,y)=|Cn−1(x,y)−Cn,(x,y)″for all x,y (1)
Fn(x,y)=|Cn(x,y)−Cn+1(x,y)| for all x,y (2)
where Fn−1(x,y) is the absolute difference frame 42 and Fn−1(x,y) is the absolute difference frame 44, Cn−1(x,y) is chrominance frame 24, Cn(x,y) is chrominance frame 26, Cn+1(x,y) is chrominance frame 28, n is an index indicating the time/position of the chrominance frames in the video sequence, and x and y are the coordinates of the video elements (for example, pixels) within the frames. For example, x may have values 0 to 639 and y may have values 0 to 479 for a frame that is 640 elements in width and 480 elements in height.
The first absolute difference frame 42 is then received by a first entropy processor 50 and the second absolute difference frame 44 is received by a second entropy processor 60. These entropy processors will measure the amount of difference in the information between successive chrominance frames 24, 26, and 28 as will be described.
Referring now to
where Fn are the absolute difference frames 42 and 44, pFn(i) is the probability of a specific element value i in frame Fn where i has the range of values, typically, from 0 to 255, and wherein Hn are entropy outputs 46 and 48 of the entropy processors 50 and 60, respectively.
The entropy outputs 46 and 48, respectively, of the entropy processor 50, 60 are then received by an entropy measurement differencer 80 which determines changes in the entropy in successive frame pairs to output the scene change signal 18.
Referring now to
The probability signal 63 is then received by a companding device 64 that computes and outputs the logarithm of the probability 65. The probability signal 63 is also received by a multiplier that multiplies probability signal 63 with the logarithm of the probability 65 producing as an output the product 68. A summing component 70 receives and sums the products 68 over the range of i to produce a single entropy value. In one embodiment, the summing component 70 would sum 256 separate probability values corresponding to each possible pixel value. The summing component 70 produces the value output Hn described above represented by entropy outputs 46 and 48.
Turning now to
Turning now to
Turning now to
Referring now to
The video scene change detector may utilize uncompressed color video from local storage device 116 via storage controller 118, video playback device 132 via input/output controller 130, video capture device 134 via input/output controller 130, local network storage device 136 on local network computer system 138 via intranet 136 accessed with network access unit 142, and remote server storage device 142 on remote computer system 144 via Internet 146 access with network access unit 142. Although only one local network computer system and one remote computer system 144 are depicted, multiple local computer systems and multiple remote computer systems may be connected to the intranet 140 and the Internet 146, respectively. The network access unit 142, display controller 122, processor 112, memory 114, input/output controller 130, and storage controller 118 are all connected to one another and communicate with one another over bus 148. Although only one bus is depicted, multiple buses may be used in computer 100.
It is specifically intended that the present invention not be limited to the embodiments and illustrations contained herein, but include modified forms of those embodiments including portions of the embodiments and combinations of elements of different embodiments as come within the scope of the following claims. The term “entropy” should be considered synonymous with measuring information content or information complexity and is not to be limited to the descriptions contained herein.
Claims
1. A scene change detector for video comprising:
- an input receiving a color video signal of time ordered frames of pixels providing luminance and chrominance components;
- a chrominance extractor extracting a selected chrominance component from the frames to produce chrominance frames without luminance information; and
- an information change detector receiving a sequence of chrominance frames to output a scene change signal based on changes between sequential chrominance frames.
2. The scene change detector of claim 1 wherein the information change detector provides a measure of the information content of a difference between sequential frames.
3. The scene change detector of claim 2 wherein the measure of information content is entropy.
4. The scene change detector of claim 1 wherein the information change detector provides an output that is a function of a change in changes of sequential chrominance frames.
5. The scene change detector of claim 4 wherein the change in changes of sequential chrominance frames takes a difference between sequential chrominance frames, determines the information content of the difference, and determines the difference in the information content of the difference.
6. The scene change detector of claim 1 wherein the chrominance signal is a U chrominance signal.
7. The scene change detector of claim 1 wherein the color video signal is uncompressed.
8. The scene change detector of claim 1 wherein the information change detector further includes a comparator comparing a change in a value of a function of information content of a difference between frames to a threshold value set to provide a predetermined number of scene changes per a time interval.
9. The scene change detector of claim 8 further including a display for displaying the threshold value juxtaposed on a time plot of the function and including a control for varying the threshold value to visually move the threshold value with respect to the plot of the function.
10. A method of detecting scene changes in a color video signal of time ordered frames of pixels providing luminance and chrominance components, the method comprising the steps of:
- (1) extracting a chrominance component from the frames to produce chrominance frames without luminance information; and
- (2) detecting a change in a measure of information content in sequential chrominance frames to provide an output indicating a change of scene.
11. The method of claim 10 wherein the detecting measures the information content of a difference between sequential frames.
12. The method of claim 11 wherein the measure of information content is entropy.
13. The method of claim 10 wherein the chrominance signal is a U chrominance signal.
14. The method of claim 10 wherein the detecting measures a function of a change in changes of sequential chrominance frames.
15. The method of claim 14 wherein the change in changes of sequential chrominance frames takes a difference between sequential chrominance frames, determines the information content of the difference, and determines the difference in the information content of the difference.
16. The method of claim 10 wherein the color video signal is uncompressed.
17. The method of claim 10 wherein the detecting compares a change in a value of a function of information content of a difference between frames to a threshold value set to provide a predetermined number of scene changes per a time interval.
18. The method of claim 17 further including the step of displaying the threshold value juxtaposed on a time plot of the function and varying the threshold value to visually move the threshold value with respect to the plot of the function.
Type: Application
Filed: Aug 17, 2009
Publication Date: Feb 18, 2010
Inventor: Patrick Seeling (Stevens Point, WI)
Application Number: 12/542,429
International Classification: H04N 5/14 (20060101);