VIDEO FRAME PROCESSING METHOD
A video frame processing method, which comprises: (a) capturing at least one first video flame via a first camera; (b) capturing at least one second video frame via a second camera; and (c) adjusting one candidate second video frame of the second video frames based on one of the first video frame to generate a target single view video frame.
Latest MEDIATEK INC. Patents:
- PROCESS-VOLTAGE SENSOR WITH SMALLER CHIP AREA
- PRE-CHARGE SYSTEM FOR PERFORMING TIME-DIVISION PRE-CHARGE UPON BIT-LINE GROUPS OF MEMORY ARRAY AND ASSOCIATED PRE-CHARGE METHOD
- ALWAYS-ON ARTIFICIAL INTELLIGENCE (AI) SECURITY HARWARE ASSISTED INPUT/OUTPUT SHAPE CHANGING
- Semiconductor package structure
- Semiconductor structure with buried power rail, integrated circuit and method for manufacturing the semiconductor structure
This application is a continuation of U.S. Ser. No. 14/221,260, filed Mar. 20, 2014, which is incorporated herein by reference. U.S. Ser. No. 14/221,260 claims the benefit of priority under 119(e) of U.S. Provisional Application No. 61/803,881, filed Mar. 21, 2013, which is incorporated herein by reference.
BACKGROUNDConventionally, a camera captures a frame via a camera parameter. The camera parameter can have, for example, a frame capturing parameter such as an exposure time or a frame rate. Exposure time (or named shutter speed) is the effective length of time a camera's shutter is open. Exposure time along with the aperture of the lens (also called f-number) determines the amount of light that reaches the film or an image sensor inside the camera. Long exposure time will cause image blur easily. On the contrary, short exposure time will cause in dark or noise easily. The aperture or camera sensor is always small in a camera phone (or a smartphone). When captured video frame resolution increases, the amount of light of each pixel will decrease. Therefore, h is hard to set a balanced camera parameter to generate a fine video frame.
A stereo camera is a type of camera with two or more cameras with a separate image sensor for each camera. A stereo camera is always used to generate a multi-view video frame (ex, a 3D video frame) based on the video frames generated from different cameras. Also, different camera parameters can be applied to different cameras. An electronic device with a stereo camera becomes more popular in recent years (ex. a smart phone with a stereo camera), since the user may hope he can capture a stereo image at any time he wants.
SUMMARYTherefore, one objective of the present application is to provide a video frame processing method to generate a target single view video frame from the video frames captured by a stereo camera.
One embodiment of the present application discloses a video frame processing method, which comprises: (a) capturing at least one first video frame via a first camera; (b)capturing at least one second video frame via a second camera; and (c) adjusting one candidate second video frame of the second video frames based on one of the first video frame to generate a target single view video frame.
Another embodiment of the present application discloses: a video frame processing method, which comprises: (a)capturing at least one first video frame via a first camera utilizing a first camera parameter; (b) capturing at least one second video frame via a second camera utilizing a second camera parameter, wherein the second camera parameter is different from the first camera parameter; and (c) generating a target single view video frame corresponding to a specific time point according to the at least one first video frame, the first camera parameter and the second camera parameter.
In view of above-mentioned embodiments, the target single view video frames can be generated from video frames with different camera parameters. Therefore a better target single view video frame can be acquired.
These and other objectives of the present invention will no doubt become obvious to those of ordinary skill in the art after reading the following detailed description of the preferred embodiment that is illustrated in the various figures and drawings.
In the following embodiment, two cameras are taken for example to explain the concept of the present application. However, please note more than two cameras can be applied to this application.
The target single view video frames TSF_1, TSF 2. TSF_5 are, for example, the video frames to be output to a display for displaying, but not limited. Such target single view video frame can be generated based on only the first video frame, or both the first video frame and the second video flame, which will be described later,
Step 201
Capture first video frames and second video frames by the first camera and the second camera, respectively.
Step 203
Analyze the video quality for one of the first video frames and the video quality for one of the second video frames corresponding to the analyzed first video frame. For example, analyze the video quality for the first video frame FF 2 and the second video frame SF 2. A first video frame and a second video frame which are corresponding to each other are, for example, frames captured at the same or similar time point by the first and the second cameras, respectively. However, the present application is not limited thereto.
Many methods can be applied to analyze the video quality. For example, the blur level/the sharpness level of each entire video frame and/or the region of interest (e.g., one or combination of the face region, the center region, and the auto-focus region) in each video frame can be computed to analyze the video quality. Also, the video frame;can be compared to a reference video frame to analyze the video, quality. Besides, at least one of the following parameters of each entire video frame and/or the region of interest can be computed to analyze the video quality: noise, edge, dynamic range, blocking artifact, mean intensity, color temperature, scene composition, people face and/or animal presence, image content that attracts more or less interest, and Spatial I temporal I Frequency masking.
Step 205
Determine if the video quality of the first video frame is better than a quality threshold. If yes, it means the video quality is good, go to step 207, if not, it means the video quality is bad, go to step 209.
Step 207
Select the first video frame analyzed in the step 203 as the target single view video frame. For example, the first video frame FF 1 is determined to have a video quality better than the quality threshold, therefore the first video frame FF 1 is selected as the target single view video frame TSF 1.
Step 209
Determine if the video quality of the second video frame is better than the quality threshold. If yes, go to step 213, if not go to step 211.
Step 211
Interpolate the target single view video frame from the first video frame and the second video frame analyzed in the step 203. For example, both the first video frame FF 5 and the second video frame SF 5 are determined to have a video quality lower than the quality threshold, therefore the target single view video frame TSF 5 is interpolated from the first video frame FE 5 and the second video frame SF 5.
Step 213
Select the second video frame analyzed in the step 203 and warp it. For example, the first video frame FF 3 is determined to have a video quality lower than the quality threshold and the second video frame SF 3 is determined to have a video quality better than the quality threshold. Therefore, the second video frame SL_3 is warped based on the first video frame FF_3 to generate a warped second video frame. To be specific, the warp operation performed on the second video frame is to eliminate the difference between the second video frame and the corresponding first video frame due to the diligent viewing perspectives.
For the convenience for understanding, the second video frame can be named as a candidate second frame if the second video frame corresponds to the first video frame analyzed in the step 203. For example, if the first video frame FF_4 is analyzed in the step 203, the second video frame SF_4 is named a candidate second video frame, and the first video frame FF 4 is named a candidate first video frame.
Please note, in one embodiment, the step 203 only analyzes the first video frame. In such embodiment, the steps 209, 211 can be removed, such that the step 205 can go to the step 213 if the result of the step 205 is no.
Step 215
Select the adjustment result generated in the step 213 as the target single view video frame. For example, in
The step 213 can be replaced by other steps. In one embodiment, the second video frame is warped, and the target single view video frame is generated via synthesizing the warped second video frame and the first video frame corresponding to the warped second video frame. For example, in
Alternatively, the steps 213, 215 can be replaced by: enhancing the first video image based on the second video frame to generate the target single view video frame if the video quality of the first video frame is bad and the video quality of the second video frame is good. For example, if the first video frame FF 4 has a bad video quality and the second video frame SF_4 has a good video quality, the first video frame FF 4 is enhanced based on the second video frame SF 4 to generate the target single view video frame TSF_4.
Furthermore, the step 211 can be replaced by other steps. In one embodiment, the first video frame analyzed in the step 203 is selected as the target single view video frame, since both video qualities of the first and the second video frames analyzed in the step 203 are worse than the quality threshold. Alternatively, one of the first and the second video frames analyzed in the step 203 having a better video quality will be selected as the target single view video frame. That is, either the first video frame or the second video frame can be selected as the target single view video if both the first video frame or the second video have video qualities worse than the quality threshold,
In the embodiment of
Additionally, in one embodiment the video quality analyzing and determining steps are removed from the flow chart in
In the embodiment of
Step 501
Capture first video frames and second video frames by the camera and the second camera, respectively.
Step 503
Determine if the video frame captured at a specific time point by a major camera exists. In this embodiment, the first camera is set as the major camera. In another embodiment, however, the second camera is set as the major camera. If yes, go to step 505, if not, go to step 507.
Step 505
Select the video frame from the major camera as the target single view video frame. For example, assuming that the specific time point is time points T_1, T_3, or T_5, the first video frames FF_1, FF_2 and FF_3 exist, thus the first video frames FF_1, FF_2 and FF_3 are selected as the target single it video frames TSF_1, TSF_3 and TSF_5, without modification. In one embodiment, some modifications can be applied to the first video frames FF_1, FF_2 and FF_3. For example, the process of brightness adjustment and/or sharpness enhancement can be performed to the first video frames FF_1, FF_2 and FF_3, and then the modified first video frames are selected as the target single view video frames.
Step 507
Select a video frame captured by a camera different from the major camera (e.g., the second camera). For example, assuming that the specific time point is time points T_2 or T_4, the first video frames do not exist, thus the second video frames SF 1, SF 2 are selected.
Step 509
Adjust the video frame selected in the step 507 based on one or ore video frames from the major camera captured prior to or after the video frame selected in the step 507. For examples, assuming that one of the second video frames is selected in the step 507, the selected second video frame is warped and/or interpolated according to one or more first video frames prior to or after the selected second frame. Please note the interpolate operation in this application is not limited to a video frame directly adjacent to the selected video frame. For example, an interpolated video frame corresponding to the time point T_4 may be generated based on the second video frame SF 2 and at least one of the first video frames FF_2 and FF_3. Alternatively, the interpolated video frame corresponding to the time point T_4 may be generated based on the second video frame SF 2 and the first video frame FF 1.
Step 511
Select the adjustment result generated in the step 509 as the target single view video frame. In this embodiment, the second video frame which is warped and/or interpolated is selected as the target single view video frame.
In another embodiment, the step 509 in
In yet another embodiment, the step 505 in
In still another embodiment, all of the first video frames and the second video frames arc warped to the same specific view, and then selected as the target single view video frames for output. For example, in
In the embodiment of
The embodiments of
The above-mentioned embodiments can he summarized as: A video frame processing method, comprising: (a) capturing at least one first video frame via a first camera; (b) capturing at least one second video frame via a second camera, (c) adjusting one candidate second video frame of the second video frames based on one of the first video frame to generate a target single view video frame.
In one embodiment, one of the two cameras which can provide the highest video frame resolution will be selected as a major camera (e.g., the A camera), and the target single view video frames can be generated via following method interleavingly selecting an A video frame without modification and an interpolated B video image as the target single view video frame. In
Step 701
Capture A video frames and B video frames by the A camera and the B camera, respectively.
Step 703
Set the camera with a largest video frame resolution as the major camera. For example, in the embodiment of
Step 705
Determine if the video frame captured at a specific time point by the major camera exists. If yes, go to step 707, if not, go to step 709.
Step 707
Select the video frame from the major camera as the target single view video frame. For example, at time points T_1 and T_3, the A video frames AF_1 and AF_2 exist, thus the A video frames AF 1 and AF_2 are selected as the target single view video frames. In one embodiment, the A video frames AF_1 and AF_2 can be directly selected as the target single view video frames. However, in another embodiment, some modification (ex,sharpness enhancement or brightness adjustment) can be performed to the A video frames first, and then the modified. A video frames can be selected as the target single view video frames.
Step 709
Interpolate a new video frame from the video frame captured at the specific time point by the camera different from the major camera and at least one of the adjacent video frames captured by the A camera and/or the B camera. It is assumed that the. A camera is the major camera, then a new video frame is interpolated from the B video frame captured at the specific time point and at least one of the adjacent video frames captured by the A camera and/or the B camera, wherein the adjacent video frames are the video frames captured at or near the specific time point. Taking the time point T_2 as an example, the adjacent video frames may he one or a combination of the A video frames AF 1 and AF_2, and the B video frames BF 1 and BF 3.
Step 711
Select the interpolated video frame as the target single view video frame. For example, at the time point T_2, no A video frame exist, thus the target single view video frame is generated from interpolating the B video frame BF_2, and at least one of B video frames BF_1, BF_3, and A video frames AF_1, AF 2.
In another embodiment, one of the two cameras which can support the highest video frame rate will be selected as a major camera (e.g., the B camera), and the target single view video frames in
Step 801
Capture A video frames and B video frames by the A camera and the B camera, respectively.
Step 803
Set the camera with a largest frame rate as the major camera. For example, in the embodiment of
Step 805
For each video frame captured by the major camera, interpolate a new video frame from the video frame captured by the major camera and at least one of the corresponding video frames captured by the camera different from the major camera and the adjacent video frames captured by the major camera. It is assumed that the B camera is the major camera, then for each B video frame, interpolate a new video frame from the B video frame and at least one of: the corresponding A video frames and the adjacent B video frames. In which, the corresponding A video frames are video frames captured at or near (e.g., prior to or after) a time point that the B video frame is captured, and the adjacent B video frames are video frames captured prior to or after the B video frame. Taking the B video frame BF 3 in
The video frame resolution of the interpolated video frame is larger than which of the B video frame. For example, each of the interpolated B video frames BFM_1 to BFM_5 has video frame resolution the same as which of the A video frames AF 1 to AF_3, as shown in
Step 807
Select the interpolated video frame as the target single view video frame. For example, in
Step 1001
Capture A video frames and B video frames by the A camera and the B camera respectively.
Step 1003
Set the camera with a lowest video frame rate as the major camera. For example 1 in the embodiment of
Step 1005
Determine if the video frame captured at a specific time point by the major camera exists. If yes 1 go to step 1007, if not, go to step 1009.
Step 1007
Select the video frame from the major camera as the target single view video frame. For example 1 at time points T_1 and T_3 1 the A video frames AF_1 and AF_2 exist 1 thus the A video frames AF 1 and AF_2 are selected as the target single view video frames 1 with or without modification as illustrated in the embodiment of
Step 1009
Interpolate a new video frame from the video frame captured at the specific time point by the camera different from the major camera and at least one of the adjacent video frames captured by the A camera and/or the B camera. It is assumed that the A camera is the major camera 1 then a new video frame is interpolated from the B video frame captured at the specific time point and at least one of the adjacent video frames captured by the A camera and/or the B camera 1 wherein the adjacent video frames are the video frames captured at or near the specific time point. Taking the time point T_2 as an example 1 the adjacent video frames may be one or a combination of the A video frames AF 1 and AF_2, and the B video frames BF 1 and BF 3.
Step 1011
Select the interpolated video frame as the target single view video frame. For example, at the time points T_2, no A video frame exist, thus the target single view video frame is generated from interpolating the B video frame BF_2, and at least one of B video frames BF_1, BF_3, A video frames AF_1, AF 2.
Step 1101
Capture A video frames and B video frames by the A camera and the B camera, respectively.
Step 1103
Set the camera with a largest frame rate as the major camera. For example, in the embodiment of
Step 1105
For each video frame captured by the major camera, interpolate a new video frame from the video frame captured by the major camera and at least one of the corresponding video frames captured by the camera different from the major camera and the adjacent video frames captured by the major camera. It is assumed that the B camera is the major camera, then for each B video frame, interpolate a new video frame from the B video frame and at least one of the corresponding A video frames and the adjacent B video frames. In which, the corresponding A video frames are video frames captured at or near (e.g., prior to or after) a time point that the B video frame is captured, and the adjacent B video frames are, video frames captured prior to or after the B video frame. Taking the B video frame BF 3 in
Step 110
Select the interpolated video frame as the target single view video frame.
In this embodiment, the video frame rate of the target single view video frames is the same as the B video frames. Furthermore, the video quality can be enhanced by synthesizing the B video frames with the A video frames. For example, when the brightness for some dark regions of the B video frame is too low, it may be compensated based on corresponding region of the A video frame.
Please note the A video frames and the B video frames of the embodiments in
It would be appreciated that the brightness of the video frame is affected by factors such as the exposure time and/or the ISO speed. In another embodiment of the present application, the camera parameter of the A camera and the camera parameter of the B camera are the exposure time and/or the ISO speed, and are set differently. In this embodiment, assuming that the A camera is selected as the major camera, then the A video frame can be compensated by the corresponding B video frame. To be specific, for the A video frame corresponding to each specific time point, at least one region to be enhanced in the A video frame is determined according to the camera parameter of the A camera, and at least one region to be referenced in the B video frame, corresponding to the specific time point is determined according to the camera parameter of the B camera. Accordingly, the at least one region to be enhanced in the A video frame can be enhanced based on the at least one region to be referenced in the B video frame.
For example, assuming that some region in the A video image is overexposed while the brightness of the other region is fine due to the camera parameter of the A camera, and some region in the B video image is over dark while the brightness of the other region is fine due to the camera parameter of the B camera. The overexposed region (i.e., the region to be enhanced) in the A video image can be enhanced by the bright parts (i.e., the region to be referenced) in the corresponding B video image.
For another example, assuming that some region in the A video image is over dark while the brightness of the other region is fine due to the camera parameter of the A camera, and some region in the B video image is overexposed while the brightness of the other region is fine due to the camera parameter of the B camera. The over-dark region (i.e., the region to be enhanced) in the A video image can be enhanced by the dark parts (i.e., the region to be referenced) in the corresponding B video image.
For example, in one embodiment of
In view of the embodiments illustrated in
Please note the corresponding relations of the first/second cameras and the A/B cameras are not fixed. For example, in one embodiment the video frame processing method comprises: determining if the, first video frame corresponding to the specific time point exists; if yes, selecting the first video frame as the target single view video frame; and if not, interpolating a target single view video frame from one of the second video frames corresponding to the specific time point and one or more adjacent frames captured by the first camera and the second camera. In such case the first camera indicates the A camera, and the second camera indicates the B camera.
In another embodiment, the video frame processing method comprises: generating the target single view frame according to one candidate first frame, and at least one of: at least first video frame prior to or after the candidate first video frame, and at least second video frame prior to or after the candidate first video frame. In such case the first camera indicates the B camera, and the second camera indicates the A camera.
In view of above-mentioned embodiments, the target single view video frames can be generated from video frames with different camera parameters and video frame capturing parameters. Therefore a better tar get single view video, frame can be acquired,
Those skilled in the art will readily observe that numerous modifications and alterations of the device and method may be made while retaining the teachings of the invention. Accordingly, the above disclosure should be construed as limited only by the metes and bounds of the appended claims.
Claims
1. A video frame processing method, comprising:
- (a) capturing at least one first video frame via a first camera utilizing a first camera parameter;
- (b) capturing at least one second video frame via a second camera utilizing a second camera parameter, wherein the second camera parameter is different from the first camera parameter; and
- (c) generating a target single view video frame corresponding to a specific time point according to the at least one first video frame, the first camera parameter and the second camera parameter.
2. The video frame processing method of claim 1, wherein the step (c) comprises:
- determining if the first video frame corresponding to the specific time point exists;
- if yes, selecting the first video frame as the target single view video frame; and
- if not, interpolating the target single view video frame according to one of the second video frames corresponding to the specific time point and one or more adjacent frames captured by the first camera and/or the second camera.
3. The video frame processing method of claim 2, wherein the first camera parameter and the second camera parameter are video frame resolutions, wherein the first camera parameter is higher than the second camera parameter, and a video frame rate of the first camera is identical to or lower than a video frame rate of the second camera.
4. The video frame processing method of claim 2, wherein the first camera parameter and the second camera parameter are video frame rates, wherein the first camera has a video frame rate lower than which of the first cameras.
5. The video frame processing method of claim 1, wherein the step (c) generates the target single view video frame further according to one of the second video frames.
6. The video frame processing method of claim 5, wherein the step (c) interpolates the target single view video frame from the first video frame corresponding to the specific time point and the one of the second video frames.
7. The video frame processing method of claim 5, wherein the first camera parameter and the second camera parameter are video frame capturing parameters, and the step (c) generates the target single view video frame according to, video &aim capturing parameters for the first video frame corresponding to the specific time point and the one of the second video frames.
8. The video frame processing method of claim 7, wherein the frame capturing parameter comprises at least one of: capturing time, exposure time, depth of field, focus, ISO speed, and white balance.
9. The video frame processing method of claim 1, wherein the step (c) generates the target single view video frame further according to at least one adjacent frames captured by the first camera and or the second camera.
10. The video frame processing method of claim 1, wherein the first camera parameter and the second camera parameter are video frame rates, wherein the first camera has a video frame rate higher than which of the second camera.
11. The video frame processing method of claim 10, wherein the target single view video frame has an image resolution higher than which of the first video frame.
12. The video frame processing method of claim 1, wherein the first camera parameter and the second camera parameter are a exposure time and/or an ISO speed, wherein the step (c) comprises:
- determining at least one region to be enhanced in the first video frame corresponding to the specific time point according to the first camera parameter;
- determining at least one region to be referenced in the second video frame corresponding to the specific time point according to the second camera parameter; and
- enhancing the at least one region to be enhanced in the first video frame based on the at least one region to be referenced in the second video frame.
13. The video frame processing method of claim 1, wherein the target single view video frame further corresponds to a viewing perspective of the first video frame, wherein generating the target video frame comprising:
- selecting the first video frame as the target single view video frame when image quality of the first video frame is better than a quality threshold; and
- generating the target single view video frame by adjusting the first video frame or the second video frame when the image quality of the first video frame is not better than the quality threshold, including: interpolating the first video frame and another video frame captured via the first camera to generate a third video frame, and adjusting the second video frame based the third frame to generate the target single view video frame.
Type: Application
Filed: Jan 18, 2018
Publication Date: May 24, 2018
Applicant: MEDIATEK INC. (Hsin-Chu City)
Inventors: Chi-Cheng JU (Hsinchu City), Ding-Yun Chen (Taipei City), Cheng-Tsai Ho (Taichung City), Chia-Ming Cheng (Hsinchu City), Po-Hao Huang (Kaohsiung City), Yuan-Chung Lee (Tainan City), Chung-Hung Tsai (Chu-Pei City)
Application Number: 15/874,243