PERSON TRACKING SUPPORT DEVICE
A person tracking support device identifies persons appearing in at least one of videos, and displays a person tracking screen including a first display section that synchronously reproduces monitoring videos selected from the videos and a second display section that displays an appearance section in which the person appears in at least one of the monitoring videos for each persons. When a user performs an operation of designating a time point of the appearance section related to a target person on the person tracking screen, the person tracking support device displays a snapshot of the monitoring videos at the time point on the first display section, and displays person tracking information for the user to recognize a portion in which the target person appears in the snapshot on the person tracking screen.
The present application claims priority under 35 U.S.C. § 119 to Japanese Patent Application No. 2024-081078, filed on May 17, 2024, the contents of which application are incorporated herein by reference in their entirety.
TECHNICAL FIELDThe present disclosure relates to a technique for supporting tracking of a person appearing in a video captured by a camera.
BACKGROUND ARTPatent Literature 1 discloses a crime prevention system that identifies suspicious behavior of a person included in a camera image. The security system monitors the behavior of a person included in a plurality of camera images continuously captured along a time series by a monitoring camera. The crime prevention system detects a person region of each of persons included in a plurality of camera images, and performs tracking processing for identifying the persons included in the plurality of camera images in time series on the basis of person image data included in the person region. Then, when the identification by the tracking processing transitions to failure in the middle, the security system determines that the person of the suspicious behavior is included in the camera image.
In addition, as documents showing the technical level of the present technical field, there are the following Patent Documents 2 and 3.
LIST OF RELATED ART
-
- Patent Document 1: Japanese Patent Application Laid-Open No. 2023-086471
- Patent Document 2: Japanese Patent Application Laid-Open No. 2023-156963
- Patent Document 3: Japanese Patent Application Laid-Open No. 2024-032346
A case where a user tries to track a specific person appearing in a video while checking the video will be considered. In this case, the tracking of the person is normally performed over a plurality of videos captured by a plurality of cameras. Therefore, the user who tracks the person is required to locate the person to be tracked during the video time for each of the plurality of videos. Conventionally, this tracking operation requires a considerable amount of human cost or time cost.
The present disclosure has been made in view of the above problems. An object of the present disclosure is to provide a technique that enables efficiency improvement of a tracking operation when a user tracks a person shown in a video.
One aspect of the present disclosure relates to a person tracking support device.
The person tracking support device includes:
-
- a storage device configured to store data of a plurality of videos captured by a plurality of cameras; and
- processing circuitry.
The processing circuitry is configured to:
-
- identify one or more persons in at least one of the plurality of videos; and
- display a person tracking screen on a display device, the person tracking screen including a first display section that synchronously reproduces and displays one or more monitoring videos selected from the plurality of videos during a video time, and a second display section that displays an appearance section for each of the one or more persons, the appearance section being a section in which the person appears in at least one of the one or more monitoring videos during the video time.
The processing circuitry is further configured to, when a user performs an operation of designating a time point of the appearance section related to a target person among the one or more persons on the person tracking screen:
-
- display a snapshot of the one or more monitoring videos at the time point on the first display section; and
- display person tracking information on the person tracking screen, the person tracking information being information for causing the user to recognize a portion in which the target person appears in the snapshot.
According to the present disclosure, a person tracking screen is displayed, the person tracking screen including a first display unit that synchronously reproduces and displays one or more selected monitoring videos during a video time, and a second display section that displays an appearance section that appears in at least one of the one or more monitoring videos during the video time for each of one or more identified persons. This allows the user to easily determine in which section each person appearing in the video is displayed. Then, when the user performs an operation of designating one time point of the appearance section related to the target person, a snapshot of one or a plurality of monitoring videos at the one time point and the person tracking information are displayed. Thus, the user can easily recognize the portion in which the target person appears at each time point during the appearance section. As described above, according to the present disclosure, it is possible to improve the efficiency of the tracking work by the user.
Hereinafter, embodiments of the present disclosure will be described with reference to the accompanying drawings. In the drawings, the same or corresponding components are denoted by the same reference numerals, and the description thereof will be simplified or omitted.
1. Person Tracking Support DeviceThe person tracking support device 100 displays a “person tracking screen 300” for supporting the user 2 to track the person 10 shown in the video 30 on the display device 210 by using the data of the plurality of acquired videos 30. The user 2 tracks the person 10 shown in the video 30 on the person tracking screen 300. The person tracking screen 300 will be described in detail later. Tracking the person 10 in the video 30 is useful in the field of traffic management, behavior analysis, and the like.
Here, a case where the user 2 tries to track a specific person 10-T (hereinafter, referred to as a “target person 10-T”) will be considered. At this time, the user 2 needs to look at the target person 10-T during the video time for each of the plurality of videos 30. In the example shown in
The processing circuitry 110 executes various kinds of processing. The processing circuitry 110 may be implemented as, for example, a general-purpose processor, a special-purpose processor, a central processing unit (CPU), a graphics processing unit (GPU), an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA), an integrated circuit, a conventional circuit, or a combination of one or more of these. A processor including transistors and other circuits is an example of the processing circuitry 110. The processing circuitry 110 may also be referred to as circuitry. The circuitry may be hardware programmed to implement or perform the functions described in this disclosure.
The storage device 120 stores various kinds of information necessary for execution of processing of the processing circuitry 110. The storage device 120 is configured by a recording medium such as a random access memory (RAM), a read only memory (ROM), a solid state drive (SSD), or a hard disk drive (HDD). The storage device 120 stores a computer program 121 executable by the processing circuitry 110. The computer program 121 is configured by a plurality of instructions describing processing to be executed by the processing circuitry 110. The computer program 121 may be recorded in a computer-readable recording medium. The processing circuitry 110 that executes the computer program 121 and the storage device 120 cooperate with each other to realize the functions of the person tracking support device 100.
The storage device 120 further stores a video data D10. The video data D10 is used to manage the plurality of videos 30 captured by the plurality of cameras 20. The person tracking support device 100 acquires the video 30 captured by each camera 20, and records and manages the acquired video 30 in the video data D10. The person tracking support device 100 may sequentially acquire the video 30 from each camera 20 online. The person tracking support device 100 may acquire the video 30 from the user 2 via the input device 220. Each video 30 managed in the video data D10 includes information on the time of image capture and information on the camera 20 that captured the video 30.
The person identifier F10 performs an identification process of identifying one or more persons 10 appearing in at least one of the plurality of videos 30 managed by the video data D10, and generates person identification information IDF. By executing the identification process, unique identification information (ID) is assigned to each identified person 10. In addition, by executing the identification process, a portion in which each identified person 10 appears in each video 30 is specified. The portion of the video 30 in which the person 10 appears is represented by, for example, a bounding box surrounding the person 10. That is, in this case, the position of the bounding box is specified by the execution of the identification processing. The portion of the video 30 where the person 10 appears can be represented in other forms such as a polygon surrounding the person 10 or the skeleton position of the person 10. In the following description, a case where the position of the bounding box is specified by executing the identification processing will be considered.
The identification processing typically includes tracking processing for detecting the person 10 appearing in the video 30 and tracking the detected person 10 in the video, and human re-identification processing for identifying the same person 10 between different videos 30. The tracking process and the person re-identification process are well-known techniques, and the methods thereof are not particularly limited. For example, the identification process can be implemented by an AI pipeline including a learned machine learning model configured to perform a tracking process and a person re-identification process using the plurality of videos 30 as inputs.
The person identification information IDF is information in which, for each of the plurality of videos 30, the ID of the person 10 shown in the video 30 for each frame is associated with a portion of the video 30 in which the person 10 of the ID is shown.
By referring to the person identification information IDF generated by the person identifier F10 in this way, it is possible to acquire the section in which each identified person 10 is shown in the video 30 and the portion shown in the video 30 for each of the plurality of videos 30. The person identifier F10 may store the generated person identification information IDF in the storage device 120.
Refer to
The person tracking screen 300 displayed on the display device 210 by the person tracking screen display unit F20 will be described in detail below.
2. Person Tracking Screen 2-1. Screen ConfigurationThe first display section 310 synchronously reproduces and displays one or a plurality of videos 30 during the video time. One or more videos 30 displayed on the first display section 310 are selected by the user 2 from among the videos 30 managed in the video data D10. The person tracking screen display unit F20 receives a selection input of the video 30 from the user 2. The selection input may be performed by selecting the camera 20. For example, the user 2 selects a camera 20 from the list of the plurality of cameras 20, the video 30 of which the user 2 wants to check. In this case, the video 30 captured by the selected camera 20 is displayed on the first display section 310. The person tracking screen display unit F20 may further receive an input of designation of a video time from the user 2. For example, the user 2 designates a period based on the imaging date and time. Hereinafter, each of the one or more videos 30 displayed on the first display section 310 is also referred to as a “monitoring video 30”. In the example illustrated in
The first display section 310 displays a button 311 for starting playback of the monitoring video 30 and a bar 312 indicating a playback portion of the monitoring video 30 during the video time. When the user 2 operates the button 311, “synchronous reproduction” of the monitoring video 30 is started. That is, in the first display section, each monitoring video 30 is reproduced while synchronizing the imaging time with each other. When the user 2 operates the button 311 during the reproduction of the monitoring video 30, the reproduction of the monitoring video 30 may be temporarily stopped. The bar 312 moves in accordance with the playback position of the monitoring video 30. The user 2 can recognize the reproduction position of the monitoring video 30 in the video time by checking the position of the bar 312.
A bounding box BB surrounding the identified person 10 is superimposed on each monitoring video 30. In
The first display section 310 displays the camera information CF of the camera 20 that captures the monitoring video 30 in association with the monitoring video 30. The camera information CF is information that can identify the camera 20. In the example shown in
The second display section 320 displays section 322 (hereinafter, referred to as “appearance section 322”) in which one or more identified persons 10 appear in at least one of one or more monitoring videos 30 in the video time for each of one or more identified persons 10. The person tracking screen display unit F20 acquires the appearance section 322 of each person 10 identified in the monitoring video 30 by referring to the person identification information IDF.
In the example illustrated in
On the person tracking screen 300 having the above-described screen configuration, the user 2 can track the target person 10-T as follows.
First, the user 2 performs an operation of designating a point in time of the appearance section 322 related to the target person 10-T. For example, the user 2 operates the pointing device to move the pointer 4 over a point in the appearance section 322. In the example illustrated in
When the user 2 performs an operation of designating a point in time of the appearance section 322, the first display section displays a snapshot of each monitoring video 30 at the designated point in time. That is, each monitoring video 30 stops at the reproduction position corresponding to the designated time point. The bar 312 may move to a playback location corresponding to the specified point in time. A snapshot may also be referred to as a “still image”.
Further, when the user 2 performs an operation of designating a point in time of the appearance section 322, “person tracking information 323” is displayed on the person tracking screen 300. The person tracking information 323 is information for causing the user 2 to recognize a portion in which the target person 10-T appears in the snapshot. In particular, in the present embodiment, the person tracking information 323 is information in which the image IM obtained by cutting out the region of the bounding box BB surrounding the target person 10-T from the snapshot and the camera information CF of the camera 20 that has captured the monitoring video 30 showing the target person 10-T are associated with each other.
The user 2 can recognize a portion in which the target person 10-T appears in the snapshot of the monitoring video 30 as follows from the person tracking information 323 according to the present embodiment. User 2 can specify monitoring video 30 (target monitoring video) showing target person 10-T by collating camera information CF of the person tracking information 323 with camera information CF associated with each monitoring video 30. In this sense, the camera information CF of the person tracking information 323 can be said to be information for specifying the target monitoring video. Then, the user 2 can recognize a portion in which the target person 10-T appears by comparing the image IM of the person tracking information 323 with the image surrounded by each bounding box BB on the target monitoring video. In this way, the user 2 can easily recognize a portion in which the target person 10-T appears in the snapshot of the monitoring video 30 from the person tracking information 323. In order to make it easier to recognize the portion in which the target person 10-T appears, the person tracking screen display unit F20 may match the color of the bounding box BB surrounding each person 10 and the color of the display of the appearance section 322 related to each person 10.
The user 2 can easily recognize a portion in which the target person 10-T appears at each time point during the appearance section 322 by appropriately changing the time point designated in the appearance section 322 related to the target person 10-T. As described above, according to the present embodiment, the user 2 can efficiently track the target person 10-T on the person tracking screen 300.
First, in step S10, the user 2 operates the input device 220 to designate a point in time of the appearance section 322 related to the target person 10-T. The operation input by the user 2 is transmitted to the person tracking support device 100.
In response to the designation of the one time point of the appearance section 322, next, in step S20, the person tracking screen display unit F20 displays a snapshot of the monitoring video 30 at the designated one time point on the first display section 310.
Next, in step S30, the person tracking screen display unit F20 generates the person tracking information 323 of the target person 10-T based on the person identification information IDF. To be more specific, the person tracking screen display unit F20 cuts out a region of a bounding box BB surrounding the target person 10-T from the snapshot of the monitoring video 30. Then, the person tracking screen display unit F20 generates the person tracking information 323 by associating the image IM of the region surrounding the cut-out target person 10-T with the camera information CF obtained by capturing the target monitoring video in which the target person 10-T appears.
Next, in step S40, the person tracking screen display unit F20 displays the generated person tracking information 323 on the person tracking screen 300.
2-3. Display of Virtual Space ImageThe person tracking screen display unit F20 may further display a “virtual space video” on the person tracking screen 300 in order to make the tracing of the target person 10-T by the user 2 more efficient. The virtual space video is a video in which the motion of the person 10 shown in the monitoring video 30 during the video time is represented by an object in the virtual space.
By displaying the virtual space image 40 in this way, the user 2 can check the motion of the target person 10-T from various viewpoints through the virtual space image 40. As a result, the user 2 can track the target person 10-T more efficiently.
3. EffectAs described above, according to the present embodiment, the person tracking screen including the first display section 310 that synchronously reproduces and displays one or more monitoring videos 30 during the video time and the second display section 320 that displays the appearance section 322 projected on at least one of one or more monitoring videos 30 during the video time for each of one or more identified persons 10 is displayed. Thus, the user can easily determine in which section each person 10 shown in the video 30 is shown. Then, when the user 2 performs an operation of designating one time point of the appearance section 322 related to the target person 10-T, a snapshot of one or a plurality of monitoring videos 30 at the one time point and the person tracking information 323 are displayed. Thus, the user can easily recognize a portion in which the target person 10-T appears at each time point during the appearance section 322. As described above, according to the present embodiment, the efficiency of the tracking work by the user 2 can be improved.
Claims
1. A person tracking support device comprising:
- a storage device configured to store data of a plurality of videos captured by a plurality of cameras; and
- processing circuitry, wherein
- the processing circuitry is configured to: identify one or more persons in at least one of the plurality of videos; and display a person tracking screen on a display device, the person tracking screen including a first display section that synchronously reproduces and displays one or more monitoring videos selected from the plurality of videos during a video time, and a second display section that displays an appearance section for each of the one or more persons, the appearance section being a section in which the person appears in at least one of the one or more monitoring videos during the video time, and wherein
- the processing circuitry is configured to, when a user performs an operation of designating a time point of the appearance section related to a target person among the one or more persons on the person tracking screen: display a snapshot of the one or more monitoring videos at the time point on the first display section; and display person tracking information on the person tracking screen, the person tracking information being information for causing the user to recognize a portion in which the target person appears in the snapshot.
2. The person tracking support device according to claim 1, wherein
- the processing circuitry is further configured to display a bounding box surrounding the one or more persons in a superimposed manner on the one or more monitoring videos displayed on the first display section, and wherein
- the person tracking information is information in which an image obtained by cutting out a region of the bounding box surrounding the target person from the snapshot is associated with information for specifying a target monitoring video in which the target person appears among the one or more monitoring videos.
3. The person tracking support device according to claim 2, wherein
- the information for specifying the target monitoring video is information of the camera that has captured the target monitoring video among the plurality of cameras.
4. The person tracking support device according to claim 1, wherein
- the processing circuitry is further configured to: generate a virtual space video in which a motion of the one or more persons during the video time is represented by an object in a virtual space; and display the virtual space video on the person tracking screen so as to be synchronously reproduced with the one or more monitoring videos.
5. A person tracking support device comprising:
- a storage device configured to store data of a plurality of videos captured by a plurality of cameras; and
- processing circuitry, wherein
- the processing circuitry is configured to: identify a person in each of the plurality of videos; and cut out a region surrounding the person from snapshot of the video in which the person appears and generate a person tracking information in which an image obtained by cutting out the region surrounding the person from the snapshot is associated with information of the camera that has captured the video in which the person appears among the plurality of cameras.
Type: Application
Filed: May 7, 2025
Publication Date: Nov 20, 2025
Inventors: Ryuya NISHINO (Tokyo-to), Hsuan-Kung YANG (Tokyo-to), Hitoshi KAMADA (Tokyo-to), Norimasa KOBORI (Tokyo-to)
Application Number: 19/200,667