ELECTRONIC DEVICE
An electronic device according to the present invention includes at least one memory and at least one processor which function as: an acquisition unit configured to acquire line-of-sight information relating to a line of sight of a user; a display control unit configured to execute control to display a captured image on a display surface; a tracking unit configured to detect an object in the image so as to track the object; and a setting unit configured to set a tracking target of the tracking unit, wherein the setting unit resets the tracking target based on the line-of-sight information and a tracking history of an object that is the tracking target in a case where the object stops being detected.
This application is a continuation of application Ser. No. 17/168,749, filed Feb. 5, 2021, the entire disclosure of which is hereby incorporated by reference.
BACKGROUND OF THE INVENTION
Field of the Invention
The present invention relates to an electronic device capable of executing tracking of an object within an imaging area.
Description of the Related Art
An imaging apparatus can perform object tracking (tracking of an object within an imaging area) to keep a target (tracking target) in focus. In object tracking, it is sometimes necessary to reset the target when the target has stopped being detected. The target may fail to be detected, for example, when the target is hidden behind another object, or when the target moves out of the screen (image display area where the captured image is displayed). Japanese Patent Application Laid-open No. 2018-7272 discloses a method whereby object tracking is continued without the target being reset when the target is detected within a predetermined area within a threshold time from the time point when the target stopped being detected within the predetermined area. This method enables stable object tracking even when the target temporarily stops being detected.
However, it is difficult to determine a single threshold time for waiting for the target to be detected again, because there are various situations in which the target may become temporarily undetectable. For this reason, favorable resetting of a target (tracking target) is sometimes not possible with the method disclosed in Japanese Patent Application Laid-open No. 2018-7272.
SUMMARY OF THE INVENTION
The present invention provides a technique that enables favorable resetting of a tracking target.
An electronic device according to the present invention includes at least one memory and at least one processor which function as: an acquisition unit configured to acquire line-of-sight information relating to a line of sight of a user; a display control unit configured to execute control to display a captured image on a display surface; a tracking unit configured to detect an object in the image so as to track the object; and a setting unit configured to set a tracking target of the tracking unit, wherein the setting unit resets the tracking target based on the line-of-sight information and a tracking history of an object that is the tracking target in a case where the object stops being detected.
Further features of the present invention will become apparent from the following description of exemplary embodiments with reference to the attached drawings.
The present invention will be hereinafter described in detail based on its illustrative embodiments with reference to the accompanying drawings. An object of the present invention is to provide a technique that enables favorable resetting of a tracking target. When, for example, the threshold time for waiting for a target (object) to be detected again in a result of object detection is short, the target may be reset and switched to another object even though the original target would have been detected again shortly afterward. On the other hand, if the threshold time is long, the target may keep failing to be detected and a good photo opportunity may be missed. The present invention features resetting of a tracking target based on information of the user's line of sight and the tracking history of an object when the target stops being detected. The following embodiments shall not limit the present invention. While a plurality of features are described below, not all of them are essential to the present invention. The plurality of features described below may be combined in any way. The same or similar constituent elements in the accompanying drawings are given the same reference numerals, and repetitive description thereof is omitted.
The following embodiments will be described in relation to a case where the present invention is applied to an imaging apparatus (specifically, an interchangeable-lens digital camera). However, the present invention is applicable to any electronic device on which an object-tracking function (a function that executes tracking of an object within an imaging area) can be installed. Such electronic devices include video cameras, computer equipment (personal computers, tablet computers, media players, PDAs, etc.), mobile phones, smartphones, game machines, robots, drones, drive recorders, and so on. These are only examples, and the present invention can be applied to other electronic devices. While the digital camera described below includes the object-tracking function, a line-of-sight detection function, an imaging function, a display function, and so on, the present invention is also applicable to a configuration in which these functions are separately installed on several mutually communicable devices (e.g., a main machine and a remote controller).
[Configuration]
The lens unit 150 includes a communication terminal 6 that makes contact with a communication terminal 10 provided to the main body 100 when attached to the main body 100. Power is supplied from the main body 100 to the lens unit 150 via the communication terminal 10 and communication terminal 6. A lens system control circuit 4 of the lens unit 150 and a system control unit 50 of the main body 100 are mutually communicable via the communication terminal 10 and communication terminal 6.
The lens unit 150 includes a lens assembly 103 that is an imaging optical system composed of a plurality of lenses including a movable lens. The movable lens includes at least a focus lens. Depending on the lens unit 150, one or more of a zoom lens, a blur correction lens, and so on can further be included. An AF drive circuit 3 includes a motor, an actuator, and the like for driving the focus lens. The focus lens is driven by the lens system control circuit 4 controlling the AF drive circuit 3. A diaphragm drive circuit 2 includes a motor, an actuator, and the like for driving a diaphragm 102. The aperture size of the diaphragm 102 is adjusted by the lens system control circuit 4 controlling the diaphragm drive circuit 2.
A mechanical shutter 101 is driven by the system control unit 50 to adjust the exposure time of an image sensor 22. The mechanical shutter 101 is kept fully open when shooting a movie.
The image sensor 22 is a CCD image sensor or a CMOS image sensor, for example. The image sensor 22 includes multiple pixels arranged two-dimensionally, each pixel being provided with one micro lens, one color filter, and one or more opto-electronic conversion units. In this embodiment, each pixel includes a plurality of opto-electronic conversion units, and each pixel is configured to be able to output a signal from each of the opto-electronic conversion units. The pixels configured this way enable generation of image signals for captured images, stereoscopic image pairs, and phase-difference AF from the signals read out from the image sensor 22.
The two opto-electronic conversion units 201a and 201b provided to the pixel share one color filter 252 and one micro lens 251. Light that has passed through a partial region 253a and light that has passed through a partial region 253b of the exit pupil (region 253) enter the opto-electronic conversion unit 201a and the opto-electronic conversion unit 201b, respectively.
Therefore, a stereoscopic image pair is formed by the image composed of the signals read out from the opto-electronic conversion units 201a and the image composed of the signals read out from the opto-electronic conversion units 201b of the pixels included in a given pixel area. The stereoscopic image pair can be used as image signals (A image signal and B image signal) for phase-difference AF. Further, a normal image signal (captured image) can be obtained by adding the signals read out from the opto-electronic conversion units 201a and 201b of each pixel.
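As a minimal sketch only (not part of the disclosed apparatus), the relationship between the two readouts and the image signals described above could be expressed as follows; the array names, shapes, and the simple shift search are assumptions for illustration.

```python
import numpy as np

def build_image_signals(signal_a: np.ndarray, signal_b: np.ndarray):
    """Hypothetical sketch: form the A/B image signals for phase-difference AF and the
    normal image signal from the two opto-electronic conversion units of each pixel."""
    a_image = signal_a               # "A image" signal (units 201a)
    b_image = signal_b               # "B image" signal (units 201b)
    captured = signal_a + signal_b   # normal image signal (sum of both units per pixel)
    return a_image, b_image, captured

def image_shift(a_row: np.ndarray, b_row: np.ndarray, max_shift: int = 8) -> int:
    """Estimate the horizontal shift between the A and B signals of one row by minimizing
    the sum of absolute differences; the shift is related to the amount of defocus."""
    return min(range(-max_shift, max_shift + 1),
               key=lambda s: np.abs(np.roll(b_row, s) - a_row).sum())
```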
In this embodiment, each pixel of the image sensor 22 functions both as the pixel for generating a signal for phase-difference AF (focus detection pixel) and the pixel for generating a normal image signal (imaging pixel). Optionally, some of the pixels of the image sensor 22 may be configured as the focus detection pixels, and the other pixels may be configured as imaging pixels.
While the configuration illustrated in
Referring back to
The image data (RAW image data) output by the A/D converter 23 is processed as required at an image processing unit 24, and stored in a memory 32, via a memory control unit 15. The memory 32 is used as a buffer memory for storing image data or audio data temporarily, or as a video memory for a display unit 28.
The image processing unit 24 applies predetermined image processing to the image data to generate a signal or image data, or acquire and/or generate various pieces of information. The image processing unit 24 may be a dedicated hardware circuit such as an ASIC designed to realize specific functions, for example, or a configuration with a processor such as a DSP executing software to realize specific functions.
The image processing applied here by the image processing unit 24 includes pre-processing, color interpolation, correction, detection, data processing, evaluation value calculation, and so on. Pre-processing includes signal amplification, reference level adjustment, defect pixel correction, and so on. Color interpolation is a process of interpolating the values of color components not contained in the image data, and is also called demosaicing. Correction includes white balance adjustment, correction of the luminance of the image, correction of optical aberrations of the lens unit 150, color calibration, and so on. Detection includes detection and tracking of a characteristic area (e.g., a face area or a human body area), identification of a person, and so on. Data processing includes scaling, encoding, decoding, header information generation, and so on. Evaluation value calculation includes calculation of evaluation values of pairs of image signals for phase-difference AF or for contrast AF, evaluation values used for automatic exposure control, and so on. These are examples of the image processing the image processing unit 24 can carry out, and should not be understood as limiting the image processing carried out by the image processing unit 24. The evaluation value calculation may be performed by the system control unit 50.
A D/A converter 19 generates an analog signal suited to display at the display unit 28 from the image data for display stored in the memory 32, and supplies the generated analog signal to the display unit 28. The display unit 28 includes a liquid crystal display apparatus, for example, and executes display on the basis of the analog signal from the D/A converter 19 on a display surface.
Shooting a movie (imaging control) while continuously displaying the footage (display control) allows the display unit 28 to function as an electronic viewfinder (EVF). The movie displayed to cause the display unit 28 to function as an EVF is called a live view image. The display unit 28 may be provided inside the main body 100 so as to be viewed through an eyepiece, or may be provided on a housing surface of the main body 100 so as to be viewable without an eyepiece. The display unit 28 may be provided both inside the main body 100 and on the housing surface.
The system control unit 50 is a CPU (also called MPU or microprocessor), for example. The system control unit 50 controls the operations of the main body 100 and the lens unit 150 by reading a program stored in a non-volatile memory 56 into a system memory 52 and executing the program to realize the functions of the camera system. The system control unit 50 sends various commands to the lens system control circuit 4 via communication through the communication terminals 10 and 6 to control the operation of the lens unit 150.
The non-volatile memory 56 stores the program executed by the system control unit 50, various setting values of the camera system, image data of a GUI (Graphical User Interface), and so on. The system memory 52 is a main memory the system control unit 50 uses when executing a program. The data (information) stored in the non-volatile memory 56 may be re-writable.
The system control unit 50, as one of the operations it performs, carries out an automatic exposure control (AE) process based on an evaluation value generated by the image processing unit 24 or by the system control unit 50 itself, to determine shooting conditions. The shooting conditions for capturing a still image are the shutter speed, aperture value, and sensitivity, for example. The system control unit 50 determines one or more of the shutter speed, aperture value, and sensitivity in accordance with an AE mode that has been set. The system control unit 50 controls the aperture value (aperture size) of the diaphragm mechanism in the lens unit 150. The system control unit 50 also controls the operation of the mechanical shutter 101.
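As a rough sketch of how an AE mode might determine a remaining parameter (not the disclosed algorithm; the function name, formula choice, and example values are assumptions), an aperture-priority calculation could look like this, using the standard exposure relation EV100 = log2(N^2 / t) - log2(ISO / 100):

```python
import math

def aperture_priority_ae(scene_ev100: float, f_number: float, iso: float = 100.0) -> float:
    """Hypothetical AE sketch: return a shutter time (seconds) for a fixed aperture and
    sensitivity, given the metered scene brightness expressed as EV at ISO 100."""
    # Solve EV100 = log2(N^2 / t) - log2(ISO / 100) for t.
    return (f_number ** 2) / (2.0 ** (scene_ev100 + math.log2(iso / 100.0)))

# Example: an EV 12 scene at f/4 and ISO 400 gives roughly 1/1000 s.
shutter_time = aperture_priority_ae(12.0, 4.0, 400.0)
```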
The system control unit 50 drives the focus lens of the lens unit 150 on the basis of an evaluation value or an amount of defocus generated by the image processing unit 24 or by the system control unit 50 itself, to perform automatic focus detection (AF) that causes the lens assembly 103 to focus on an object within a focus detection area.
A system timer 53 is a built-in clock and used by the system control unit 50.
An operation unit 70 includes a plurality of input devices (buttons, switches, dials, and so on) the user can operate. Some of the input devices of the operation unit 70 have names corresponding to their assigned functions. While a shutter button 61, a mode change switch 60, and a power switch 72 are illustrated separately from the operation unit 70 for convenience, they are included in the operation unit 70. When the display unit 28 is a touch display including a touchscreen, the touchscreen is also included in the operation unit 70. Operations of the input devices included in the operation unit 70 are monitored by the system control unit 50. When the system control unit 50 detects an operation of an input device, the system control unit 50 executes processing in accordance with the detected operation.
The shutter button 61 includes a first shutter switch 62 that turns on and outputs a signal SW1 when half-pressed, and a second shutter switch 64 that turns on and outputs a signal SW2 when fully pressed. When the system control unit 50 detects the signal SW1 (first shutter switch 62 ON), the system control unit executes a preparatory operation for shooting a still image. The preparatory operation includes the AE process and AF process. When the system control unit 50 detects the signal SW2 (second shutter switch 64 ON), the system control unit executes shooting of a still image (imaging and recording operations) in accordance with the shooting condition determined by the AE process.
The operation unit 70 of this embodiment includes a line-of-sight detection unit 701 that detects the line of sight (direction of line of sight) of a user and outputs the detection results (line-of-sight information regarding the user's line of sight). The system control unit 50 can execute various control processes in accordance with the line-of-sight information provided by the line-of-sight detection unit 701. Although the line-of-sight detection unit 701 is not a component directly operated by the user, it is included in the operation unit 70 because the line of sight detected by the line-of-sight detection unit 701 is dealt with as an input.
The display unit 28 displays a live view image, for example, and the user peering in through the eyepiece window can observe the displayed contents of the display unit 28 through an eye lens 701d and a dichroic mirror 701c. A light source 701e can emit infrared light toward the eyepiece window (toward the outside of the main body 100). When the user is peering into the finder, the infrared light emitted by the light source 701e is reflected by the eyeball 501a and returns into the finder. The infrared light entering the finder is reflected toward a light-receiving lens 701b by the dichroic mirror 701c.
The light-receiving lens 701b forms an infrared image of the eyeball on the imaging plane of an image sensor 701a. The image sensor 701a is a two-dimensional imaging device having a filter for infrared imaging. The image sensor 701a for line-of-sight detection may have fewer pixels than the image sensor 22 for shooting. The eyeball image captured by the image sensor 701a is sent to the system control unit 50. The system control unit 50 locates the positions of the retinal reflection of the infrared light and the pupil in the eyeball image, and detects the line-of-sight direction from the positional relationship between them. The system control unit 50 then locates the position in the display unit 28 at which the user is gazing (the point of gaze in the displayed image) based on the detected line-of-sight direction. Alternatively, the positions of the retinal reflection and the pupil in the eyeball image may be located by the image processing unit 24, and the system control unit 50 may obtain their locations from the image processing unit 24.
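The following is only a schematic sketch of the kind of computation described above (locating the reflection and the pupil in the infrared eyeball image and mapping the offset between them to a display position); the blob heuristics, calibration constants, and function name are assumptions, not the disclosed method.

```python
import numpy as np

def estimate_gaze_point(eye_image: np.ndarray,
                        gain_x: float, gain_y: float,
                        offset_x: float, offset_y: float):
    """Hypothetical sketch: approximate the infrared reflection by the brightest blob and
    the pupil by the darkest blob, then map the pupil-to-reflection offset to coordinates
    on the display surface with a calibrated linear model."""
    def centroid(mask: np.ndarray):
        ys, xs = np.nonzero(mask)
        return xs.mean(), ys.mean()

    reflection = centroid(eye_image >= np.percentile(eye_image, 99.5))
    pupil = centroid(eye_image <= np.percentile(eye_image, 0.5))
    dx, dy = pupil[0] - reflection[0], pupil[1] - reflection[1]
    return gain_x * dx + offset_x, gain_y * dy + offset_y
```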
The present invention does not depend on the method of detecting the line of sight or the configuration of the line-of-sight detection unit. The configuration of the line-of-sight detection unit 701 is not limited to the one illustrated in
Referring back to
A power supply unit 30 includes a battery, an AC adapter, and so on. An I/F 18 is an interface for the recording medium 200 such as a memory card, a hard disk, and so on. Data files such as captured images and audio are recorded in the recording medium 200. The data files recorded in the recording medium 200 are read out through the I/F 18, and can be played back via the image processing unit 24 and the system control unit 50.
A communication unit 54 realizes communication with an external device by at least one of wireless communication and wired communication. Images captured by the image sensor 22 (captured images, including live view images), and images recorded in the recording medium 200 can be sent to the external device via the communication unit 54. Image data and various other pieces of information can be received from an external device via the communication unit 54.
An orientation detection unit 55 detects the orientation of the main body 100 relative to the direction of gravity. The orientation detection unit 55 may be an angular velocity sensor or an acceleration sensor, for example. The system control unit 50 can record orientation information corresponding to the orientation detected by the orientation detection unit 55 during shooting in the data file in which the image data obtained by the shooting is stored. The orientation information can be used, for example, for displaying the recorded image in the same orientation as when it was captured.
The main body 100 of this embodiment can carry out various control processes so that a characteristic area detected by the image processing unit 24 is captured as an appropriate image. For example, the main body 100 can carry out automatic focus detection (AF) for causing the characteristic area to come into focus, and automatic exposure control (AE) for giving a correct exposure to the characteristic area. The main body 100 can also carry out automatic white balance for setting a correct white balance for the characteristic area, and automatic flash adjustment for regulating the amount of light to achieve a correct brightness for the characteristic area. The control processes for obtaining an appropriate image of the characteristic area are not limited to these. The image processing unit 24 applies a known method to a live view image, for example, detects areas determined to comply with the definition of a predetermined characteristic as characteristic areas, and outputs information such as the position, size, and credibility of each characteristic area to the system control unit 50. The present invention does not depend on the type of the characteristic area or the method of detecting the characteristic area. Since a known method can be used to detect characteristic areas, the description of the method of detecting characteristic areas is omitted.
Characteristic areas can also be used for detecting object information. When the characteristic area is a face area, for example, whether the red-eye effect is present, whether the eyes are closed, and facial expressions (e.g., a smile) are detected as object information. The object information is not limited to these.
This embodiment allows one characteristic area (main object area) that is to be used for the various control processes or for obtaining object information to be selected, using the line of sight of the user, from a plurality of characteristic areas, which are, for example, multiple image areas of varying sizes and positions. Hereinafter, the user's act of directing the gaze so as to be detected by the line-of-sight detection unit 701 will be referred to as a line-of-sight input.
[Characteristics of Line-of-Sight Input]
A line-of-sight input has the following two characteristics as opposed to an input by operation of a switch or button. First, the user cannot specify the timing of the input. The time at which an operation of a button is detected, for example, can be regarded as the timing of the input instructed by the user. The line-of-sight direction, however, is detected continuously and keeps changing, so it is not possible to determine the line-of-sight direction of which time point is the one the user intended as an instruction.
The second characteristic is the instability of the line-of-sight direction, which arises because the line-of-sight input originates from a living body. Even when the user intends to gaze at a certain point, the eyeballs may move slightly, or the user may shift the gaze unintentionally. Therefore, to locate the point of gaze in the image, it is necessary to statistically process the line-of-sight direction detected over a certain period of time.
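As one possible sketch of such statistical processing (the window length and the use of a component-wise median are assumptions made for illustration, not the disclosed processing), the detected line-of-sight positions could be smoothed over a short history:

```python
from collections import deque

class GazeSmoother:
    """Hypothetical sketch: keep a short history of detected line-of-sight positions and
    report a stable point of gaze, suppressing small involuntary eye movements."""

    def __init__(self, window: int = 30):
        self._samples = deque(maxlen=window)

    def add(self, x: float, y: float) -> tuple[float, float]:
        """Add a new sample and return the component-wise median over the window."""
        self._samples.append((x, y))
        xs = sorted(s[0] for s in self._samples)
        ys = sorted(s[1] for s in self._samples)
        mid = len(xs) // 2
        return xs[mid], ys[mid]
```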
[Issues in Tracking Process]
The tracking process is a process of detecting an object that is set as the target from an image captured by the image sensor 22 (captured image). In this embodiment, the image processing unit 24 can track a preset target (tracking target), and keep the target in focus by detecting the focal point using the signal of the target area. If, however, the target is hidden behind another object, or the target moves out of the image display area, tracking is interrupted, in which case it is necessary to keep waiting for the target to reappear, or to reset the target. The image display area is an area where the captured image is displayed, and is, for example, the entire display surface of the display unit 28, the entire area of a screen (such as a window) presented on the display unit 28, or a partial area of the display surface or the screen.
Japanese Patent Application Laid-open No. 2018-7272 discloses a method whereby object tracking is continued without the target being reset when the target is detected within a predetermined area within a threshold time from the time point when the target stopped being detected within the predetermined area. However, it is difficult to determine a single threshold time, and therefore favorable resetting of the target is sometimes not possible.
Inputting a line of sight when the target has stopped being detected is another possible method of resetting the target. However, it is difficult to reset the target as intended by the user by using a line-of-sight input alone, because of the issues described above, namely the inability to specify the timing of a user instruction and the instability of the line-of-sight direction.
First Embodiment
Hereinafter, a first embodiment of the present invention will be described. To solve the issues in the tracking process described above, in the first embodiment, the timing of resetting the target, when the target has stopped being detected, is controlled on the basis of an input of a line of sight (line-of-sight information) and the tracking result (tracking history) of the target.
In the first embodiment, the image processing unit 24 is able to reset the target. For example, if a person or an animal that is a desirable object as a target is detected in the captured image after the target has stopped being detected, the image processing unit 24 automatically resets the detected object as the target.
In the first embodiment, the target is reset at a timing that is determined based on a line-of-sight input and a tracking result, instead of after a predetermined time has passed since the target stopped being detected. For example, if the gaze stays at the position where the target was lost (where the target stopped being detected), it is determined that the user wishes the target to reappear, and the system waits for the target to reappear (to be detected again) rather than activating the resetting of the target. On the other hand, if the gaze is shifted away from the position where the target was lost, it is determined that the user has given up tracking the lost target and wishes to track another object, and the resetting of the target is activated without waiting for the target to reappear.
By controlling whether or not to reset the target based on the past tracking result and the user's line-of-sight input in this way, control that fits the user's intention more closely than a method using only a line-of-sight input is possible.
A person 301 and a person 302 are present in the captured image displayed on the screen of
In
In
In this way, automatic activation of the resetting of the target after a predetermined time has passed causes switching of the target to the person 302, even if the user wishes to keep tracking the person 301.
In the first embodiment, therefore, the control is carried out as shown in the flowchart of
At step S1, the system control unit 50 sets a target, using the image processing unit 24 or the like. For example, a starting area from which the object tracking is to be started is displayed on the display unit 28, and an object present within the starting area is set as the target at the timing at which the signal SW1 is detected. The control is not limited to this method. The user may select a target using a touchscreen provided in the display unit 28, or using the line-of-sight detection unit 701. Alternatively, the image processing unit 24 may automatically set a detected object such as a person or an animal as the target. The system control unit 50 sets the position of the target as the tracking position, and displays a tracking frame surrounding the target on the display unit 28 on the basis of the tracking position.
At step S2, the system control unit 50 detects the set target in the current frame (captured image), using the image processing unit 24 or the like. For example, the system control unit 50 detects, in the current frame, the position of the target that was detected in the immediately preceding frame, using colors, characteristic points, motion vectors, and so on. During the tracking of the object, the processing of steps S2 to S6 is repeated. The first time step S2 is executed, the current frame is the frame following the frame in which the target was set at step S1.
At step S3, the system control unit 50 determines whether or not a target has been detected at step S2. The system control unit 50 advances the process to step S4 if it determines that the target has been detected, and advances the process to step S5 if it determines that the target has not been detected.
At step S4, the system control unit 50 updates the tracking position to a position where the target was detected at step S2, and updates the display of the tracking frame in the display unit 28. The template for the tracking (detection of the target) such as colors, characteristic points, motion vectors, and so on may be updated at this time.
At step S5, the system control unit 50 performs a target resetting determination process, which is to be described later.
At step S6, the system control unit 50 determines whether or not object tracking is to be ended. The system control unit 50 ends the object tracking (this process flow) if it determines that the object tracking is to be ended, and returns the process to step S2 if it determines that the object tracking is not to be ended (the object tracking is to be continued). Here, the system control unit determines that the object tracking is to be ended when the signal SW1 has stopped being detected. In the case where the object tracking is to be continued during the period in which live view images are shown in the display unit 28, an instruction to end the object tracking may be given by a tracking end button provided to the operation unit 70.
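As an illustrative sketch only, the flow of steps S1 to S6 could be organized as follows; the callable arguments are hypothetical placeholders for the processing performed by the system control unit 50 and the image processing unit 24, not elements of the disclosure.

```python
def run_object_tracking(frames, set_target, detect_target, reset_determination,
                        update_tracking_frame, should_end):
    """Hypothetical sketch of steps S1 to S6 of the object tracking flow."""
    target, tracking_position = set_target()               # S1: set the tracking target
    update_tracking_frame(tracking_position)                # display the tracking frame
    for frame in frames:                                    # repeat S2 to S6 per frame
        position = detect_target(frame, target)             # S2: detect the target
        if position is not None:                            # S3: target detected?
            tracking_position = position                    # S4: update tracking position
            update_tracking_frame(tracking_position)        #     and the tracking frame
        else:
            target, tracking_position = reset_determination(frame, target,
                                                            tracking_position)  # S5
        if should_end():                                    # S6: e.g., SW1 released
            break
    return target, tracking_position
```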
At step S51, the system control unit 50 determines whether or not a line of sight (line-of-sight direction) detected by the line-of-sight detection unit 701 is directed toward the tracking failure area. In other words, the system control unit 50 determines whether or not the line-of-sight position on the display surface of the display unit 28 is located within the tracking failure area. The system control unit 50 advances the process to step S53 if it determines that the line of sight is directed toward the tracking failure area, and advances the process to step S52 if it determines that the line of sight is not directed toward the tracking failure area (that the line of sight has been shifted away from the tracking failure area).
At step S53, the system control unit 50 waits for the target to reappear (to be detected again) instead of resetting the target. Specifically, the system control unit 50 ends the target resetting determination process (step S5) without resetting the target. While the system control unit 50 here keeps waiting for the target to reappear without resetting it as long as the line of sight is directed toward the tracking failure area, this is not necessarily required. For example, the system may stop waiting for the target to reappear and reset the target when the waiting time for the reappearance of the target reaches a threshold (waiting time upper limit), regardless of the line of sight.
At step S52, the system control unit 50 resets the target. In resetting the target, for example, a person or an animal that is desirable as a target is detected, and the detected object is reset as the target. In the case where a plurality of objects (target candidates) are detected, one of the plurality of detected objects that is closest to the current target may be reset as the target. For example, in resetting the target during the tracking of an object that is an animal, if an animal and a person are detected as target candidates, the animal may be preferentially selected and reset as the target. If no target candidates are detected, the object tracking may be ended, and the object tracking may be restarted from step S1 of
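A minimal sketch of the determination of steps S51 to S53, including the candidate selection at step S52, could look like the following; the candidate attributes and function name are assumptions for illustration only.

```python
import math

def target_reset_determination(gaze_in_failure_area, candidates, current_target):
    """Hypothetical sketch: keep waiting while the gaze stays in the tracking failure
    area; otherwise reset the target to the most suitable detected candidate.
    Candidates are assumed to carry 'category' and 'position' attributes."""
    if gaze_in_failure_area:                 # S51: line of sight on the failure area?
        return current_target                # S53: wait for reappearance, do not reset
    if not candidates:                       # nothing to reset to: tracking may end
        return None                          # (or be restarted from step S1)
    # S52: prefer candidates of the same category as the current target
    # (e.g., an animal while tracking an animal), then the one closest to it.
    same_kind = [c for c in candidates if c.category == current_target.category]
    pool = same_kind or candidates
    return min(pool, key=lambda c: math.dist(c.position, current_target.position))
```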
According to the target resetting determination process shown in
In
In such a case, the system control unit 50 determines at step S51 that the line of sight is directed toward the tracking failure area, and waits for the target to reappear at step S53. At this time, the tracking frame 325 is not updated and is kept displayed at the same position as the tracking frame 324. While the system waits for the target to reappear, the tracking frame may be hidden, and only the tracking failure area may be displayed.
In
In
In such a case, the system control unit 50 determines at step S51 that the line of sight is not directed toward the tracking failure area (that the gaze has been shifted away from the tracking failure area), and resets the target at step S52. In
In
Second Embodiment
Hereinafter, a second embodiment of the present invention will be described. In the second embodiment, when the imaging direction is changed due to a panning operation or the like while the system is waiting for the target to reappear, the position of the tracking failure area is changed in accordance with the change in the imaging direction.
In such a case, even though the user is looking at the same object position, the line-of-sight position (horizontal or vertical position) on the display surface of the display unit 28 will change. In
At step S54, the system control unit 50 determines whether or not a panning operation is continued from before the tracked target was lost based on the amount of panning operation until the tracked target was lost (the target became undetectable) and the amount of panning operation after the tracked target was lost. The system control unit 50 advances the process to step S51 if it determines that the panning operation is continued from before the tracked target was lost, and advances the process to step S55 if not. The latter case is when the panning operation was started after the tracked target was lost, rather than being continued from before the tracked target was lost. The amount of panning operation (amount of change in the imaging direction) is calculated on the basis of an orientation detection signal acquired from the orientation detection unit 55, for example. Alternatively, the amount of panning operation may be estimated from an image between frames, using the image processing unit 24.
At step S55, the system control unit 50 corrects (changes) the position of the tracking failure area in accordance with the amount of panning operation after the tracked target was lost. For example, the position of the tracking failure area is shifted in the direction opposite to that of the panning operation (the change in the imaging direction) by the same magnitude as the panning operation.
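As a sketch of steps S54 and S55 (the coordinate convention and parameter names are assumptions; the pan amounts are assumed to already be converted into an on-screen displacement, derived for example from the orientation detection unit 55 or from inter-frame image motion):

```python
def correct_failure_area(failure_area_pos, pan_dx, pan_dy,
                         pan_continued_from_before_loss):
    """Hypothetical sketch of steps S54-S55: if the panning started only after the target
    was lost, shift the tracking failure area opposite to the pan so that it keeps
    covering the same object position on the display surface."""
    x, y = failure_area_pos
    if pan_continued_from_before_loss:   # S54: pan continued from before the loss
        return x, y                       # leave the area where it is (second case)
    return x - pan_dx, y - pan_dy         # S55: shift opposite to the pan (first case)
```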
Panning operation is performed in instances that are roughly classified into the following first and second cases.
In the first case, the camera is unintentionally shaken, or the user intentionally adjusts the imaging direction slightly in preparation for the reappearance of the target. In the first case, the change in the view area is often relatively small.
In the second case, while the target is hidden, the user carries on the panning operation, predicting the movement of the target. For example, a situation where, during shooting of a running person with a telescopic lens, an obstacle such as a pedestrian comes in front of this running person, is imaginable. Another possible situation is where, during shooting of a flying airplane, an obstacle such as a tree comes in front of this airplane. In such situations, the movement of the target (running person or airplane) is readily predictable so that the user can continue the panning operation even though the target is obscured by the obstacle and not visible.
In the first case, it is desirable to correct the position of the tracking failure area in accordance with the amount of panning operation (amount of change in the imaging direction), as described with reference to
In the target resetting determination process of
The processing steps S51 to S53 of
According to the target resetting determination process of
Third Embodiment
Hereinafter, a third embodiment of the present invention will be described. In the third embodiment, the tracking failure area is extended to an area that includes an obstacle in the foreground, in order to handle the case in which the position where the target was obscured by the obstacle differs from the position where the target will reappear.
In
In
To deal with the situation described above, when a second object (non-target) such as an obstacle present in the foreground of the first object is detected in an area including at least part of the tracking failure area where the first object (target) was lost, the tracking failure area is extended to an area that includes the second object. The tracking failure area after the extension is the second object area or an area including the second object and its surroundings.
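A minimal sketch of this extension (the rectangle convention, the overlap test, and the margin parameter are assumptions for illustration) could be:

```python
def extend_failure_area(failure_area, obstacle_areas, margin: int = 0):
    """Hypothetical sketch of the third embodiment: if a foreground object (obstacle)
    overlaps the tracking failure area, extend the area to cover both, optionally with
    a small margin. Rectangles are (left, top, right, bottom) in display coordinates."""
    l, t, r, b = failure_area
    for ol, ot, o_r, ob in obstacle_areas:
        overlaps = not (o_r < l or ol > r or ob < t or ot > b)
        if overlaps:
            l, t = min(l, ol) - margin, min(t, ot) - margin
            r, b = max(r, o_r) + margin, max(b, ob) + margin
    return l, t, r, b
```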
Alternatively, the image processing unit 24 may detect objects and locate the second object. At this time, an amount of defocus may also be obtained, to additionally carry out determination of credibility as to whether or not the detected object is the second object (obstacle obscuring the first object).
When the second object is detected, an extended tracking failure area is determined based on the detection result of the second object, and the tracking failure area is extended. In
According to the target resetting determination process of
Fourth Embodiment
Hereinafter, a fourth embodiment of the present invention will be described. The fourth embodiment allows for more accurate determination of whether or not the user is waiting for the target to reappear in cases where the target has moved out from an edge of the display surface of the display unit 28 (image display area).
The target may move out of the display surface during shooting with a panning operation, for example. In such a case, the user already knows from which of the plurality of edges (upper, lower, left, and right edges) of the display surface the target will reappear. Accordingly, rather than using only the area where the target was lost as the tracking failure area, an area including the edge from which the target went (moved) out, among the plurality of edge parts of the display surface, is used as the tracking failure area. This way, it is possible to determine more accurately whether or not the user wishes the target to reappear. The edge part of the display surface may be an area including the entire edge on the side where the target went out, or an area including only a part of this edge, as long as it is wider than the area where the target was lost.
In
In
Accordingly, in the fourth embodiment, when the tracking failure area is present in an edge part of the display surface, the tracking failure area is extended to include (cover) this edge part.
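A sketch of how such an edge-covering tracking failure area might be formed (the band width, rectangle convention, and function name are assumptions for illustration):

```python
def edge_failure_area(exit_side: str, display_w: int, display_h: int, band: int = 80):
    """Hypothetical sketch of the fourth embodiment: when the target has left the display
    surface, use a band along the edge it went out from as the tracking failure area.
    Rectangles are (left, top, right, bottom); 'band' is an assumed width in pixels."""
    if exit_side == "left":
        return (0, 0, band, display_h)
    if exit_side == "right":
        return (display_w - band, 0, display_w, display_h)
    if exit_side == "top":
        return (0, 0, display_w, band)
    return (0, display_h - band, display_w, display_h)   # "bottom"
```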
This way, it is possible to determine more accurately whether or not the user is waiting for the target to reappear in cases where the target has moved out from an edge of the display surface (image display area).
According to the target resetting determination process of
Fifth Embodiment
Hereinafter, a fifth embodiment of the present invention will be described. The fifth embodiment allows for more favorable resetting of the target when the target becomes undetectable even though the target is present within the captured image.
Object tracking (target detection), which is achieved using colors, characteristic points, motion vectors, and the like, cannot be carried out with complete certainty, whichever method is used. Therefore, the target may sometimes be determined to be undetectable even though the target is present within the captured image. In such a case, the target is assumed to have changed in color or shape, and it is more desirable to reset the target rather than wait for the target to reappear, for example by re-creating a template and detecting the object from an area close to the tracking failure area.
Accordingly, in the fifth embodiment, when the target is determined to be undetectable at step S3 of
This way, object tracking can be carried on without the target being changed when the target becomes undetectable due to a change in the color or shape of the target even though the target is present within the captured image.
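The following is only a hypothetical sketch of this idea, combining it with the defocus condition recited in claim 12; the threshold, margin, and the use of a simple image crop as the new template are assumptions, not the disclosed processing.

```python
import numpy as np

def rebuild_template_if_still_present(defocus_before: float, defocus_after: float,
                                      threshold: float, frame: np.ndarray,
                                      failure_area, margin: int = 32):
    """Hypothetical sketch of the fifth embodiment: if the amount of defocus of the
    tracking failure area barely changes across the moment the target stops being
    detected, assume the target is still there with a changed color or shape, and cut
    a new tracking template from an area close to the failure area instead of waiting."""
    if abs(defocus_after - defocus_before) >= threshold:
        return None       # fall back to the wait/reset logic of the first embodiment
    left, top, right, bottom = failure_area   # (left, top, right, bottom) in pixels
    h, w = frame.shape[:2]
    top, bottom = max(0, top - margin), min(h, bottom + margin)
    left, right = max(0, left - margin), min(w, right + margin)
    return frame[top:bottom, left:right].copy()   # new template image
```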
The present disclosure allows for more favorable resetting of a tracking target.
OTHER EMBODIMENTS
Embodiment(s) of the present invention can also be realized by a computer of a system or apparatus that reads out and executes computer executable instructions (e.g., one or more programs) recorded on a storage medium (which may also be referred to more fully as a ‘non-transitory computer-readable storage medium’) to perform the functions of one or more of the above-described embodiment(s) and/or that includes one or more circuits (e.g., application specific integrated circuit (ASIC)) for performing the functions of one or more of the above-described embodiment(s), and by a method performed by the computer of the system or apparatus by, for example, reading out and executing the computer executable instructions from the storage medium to perform the functions of one or more of the above-described embodiment(s) and/or controlling the one or more circuits to perform the functions of one or more of the above-described embodiment(s). The computer may comprise one or more processors (e.g., central processing unit (CPU), micro processing unit (MPU)) and may include a network of separate computers or separate processors to read out and execute the computer executable instructions. The computer executable instructions may be provided to the computer, for example, from a network or the storage medium. The storage medium may include, for example, one or more of a hard disk, a random-access memory (RAM), a read only memory (ROM), a storage of distributed computing systems, an optical disk (such as a compact disc (CD), digital versatile disc (DVD), or Blu-ray Disc (BD)™), a flash memory device, a memory card, and the like.
The embodiments described above are merely examples. Any configurations obtained by suitably modifying or changing some configurations of the embodiments within the scope of the subject matter of the present invention are also included in the present invention. The present invention also includes other configurations obtained by suitably combining the configurations of the embodiments.
While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. The scope of the following claims is to be accorded the broadest interpretation so as to encompass all such modifications and equivalent structures and functions.
This application claims the benefit of Japanese Patent Application No. 2020-019781, filed on Feb. 7, 2020, which is hereby incorporated by reference herein in its entirety.
Claims
1. An electronic device comprising at least one memory and at least one processor which function as:
- an acquisition unit configured to acquire line-of-sight information relating to a line of sight of a user;
- a tracking unit configured to detect an object in an image so as to track the object; and
- a setting unit configured to set a tracking target of the tracking unit, wherein
- the setting unit resets the tracking target based on the line-of-sight information in a case where the object stops being detected.
2. The electronic device according to claim 1, wherein
- the at least one memory and the at least one processor further function as a display control unit configured to execute control to display an area in which the object is detected by the tracking unit as distinguishable from another area.
3. The electronic device according to claim 2, wherein
- the display control unit is configured to execute control to further display an area in which the object is not detected by the tracking unit as distinguishable from another area.
4. The electronic device according to claim 1, wherein
- the setting unit updates a position of the tracking target in a case where the object is detected.
5. The electronic device according to claim 1, wherein
- the at least one memory and the at least one processor further function as a display control unit configured to execute control to display the image on a display surface, and
- the setting unit resets the tracking target based on the line-of-sight information and a tracking history of the object in a case where the object stops being detected.
6. The electronic device according to claim 5, wherein,
- in a case where the object stops being detected, the setting unit
- waits for the object to be detected again rather than resets the tracking target if a line of sight is directed toward a determination area based on a tracking history of the object in an image display area where the image is displayed, and
- resets the tracking target rather than waits for the object to be detected again if a line of sight is shifted away from the determination area.
7. The electronic device according to claim 6, wherein
- the determination area includes an area where the object stops being detected.
8. The electronic device according to claim 6, wherein
- the setting unit changes a position of the determination area in accordance with a change in an imaging direction of the image.
9. The electronic device according to claim 8, wherein
- the setting unit does not change a position of the determination area in accordance with a change in the imaging direction of the image if the imaging direction is changed continuously from before the object stops being detected.
10. The electronic device according to claim 6, wherein,
- in a case where an object that is not the tracking target is detected in an area including at least part of the determination area, the setting unit extends the determination area to an area including this object.
11. The electronic device according to claim 6, wherein,
- in a case where the object moves out of an image display area where the image is displayed and stops being detected, the setting unit uses an area including one edge on a side where the object moves out among a plurality of edge parts of the image display area as the determination area.
12. The electronic device according to claim 6, wherein,
- in a case where a change in an amount of defocus of the determination area before and after a time when the object stops being detected is smaller than a threshold, the setting unit resets the tracking target rather than waits for the object to be detected again even when a line of sight is directed toward the determination area.
13. The electronic device according to claim 6, wherein,
- in a case where a period of time for which the object is waited for to be detected again reaches a threshold, the setting unit stops waiting for the object to be detected again and resets the tracking target.
14. A control method of an electronic device, comprising:
- acquiring line-of-sight information relating to a line of sight of a user;
- detecting an object in an image so as to track the object; and
- setting a tracking target, wherein
- the tracking target is reset based on the line-of-sight information in a case where the object stops being detected.
15. A non-transitory computer readable medium that stores a program, wherein
- the program causes a computer to execute a control method of an electronic device, comprising:
- acquiring line-of-sight information relating to a line of sight of a user;
- detecting an object in an image so as to track the object; and
- setting a tracking target, and
- the tracking target is reset based on the line-of-sight information in a case where the object stops being detected.