Image processing apparatus, image processing method, and storage medium
An image processing apparatus is described comprising a first acquisition unit configured to acquire image data from an image capturing apparatus and a second acquisition unit configured to acquire display installation information indicating a configuration of one or more display screens in a display installation to be simulated. A generation unit is provided to generate, based on the display installation information and the acquired image data, a simulation image that simulates display of the image on the display installation. A display control unit is provided to display the simulation image on a display device.
Latest Canon Patents:
This application is a continuation application of U.S. patent application Ser. No. 16/458,841, filed Jul. 1, 2019, which claims the benefit of Japanese Patent Application No. 2018-125941, filed Jul. 2, 2018, all of which is hereby incorporated by reference herein in its entirety.
BACKGROUND OF THE INVENTION Field of the InventionThe present invention relates to a technique for simulating a display installation configured to display an image with a wide field of view.
Description of the Related ArtA conventional system configured to display an image on a display screen arranged so as to cover a field of view of a viewer to thereby provide a high realistic sensation to the viewer is known as an image display installation. Japanese Patent Application Laid-Open No. 2007-318754 discusses a method of displaying an image on a screen that has a large viewing angle and a spherical shape with a depressed surface facing a viewer. In this method, an image to be displayed on the screen is generated by performing mapping processing in which a flat image is pasted to the spherical shape.
When capturing video using a video camera with the intention of displaying the video on such a large display screen, it can be difficult to understand how the final image will appear. In particular, in such a display installation for displaying an image with a wide field of view, an image acquired by executing predetermined geometric transformation or projection transformation on a flat captured image is used as a display image. For this reason, in the conventional technique, how a captured image is to appear when the captured image is displayed on the display installation can be checked only by displaying the image in the environment in which the display installation is situated.
SUMMARY OF THE INVENTIONThe present invention is directed to a technique of presenting how an image is to appear when the image is displayed on a display installation by generating an image of an environment in which the image is displayed on the display installation, viewed from a virtual viewpoint.
According to an aspect of the present invention, an image processing apparatus includes a first acquisition unit configured to acquire image data from an image capturing apparatus, a second acquisition unit configured to acquire display installation information indicating a configuration of one or more display screens in a display installation to be simulated, a generation unit configured to generate, based on the display installation information and the acquired image data, a simulation image that simulates display of the image on the display installation, and a display control unit configured to display the simulation image on a display device.
Further features of the present invention will become apparent from the following description of exemplary embodiments with reference to the attached drawings.
Various applicable exemplary embodiments of the present invention will be described below with reference to the attached drawings. It should be noted that the disclosed exemplary embodiments are not intended to limit the scope of the present invention, and not every combination of features described in the exemplary embodiment is always essential to a technical solution of the present invention.
In a first exemplary embodiment, an imaging assistance system will be described. The image assistance system is configured to be used in conjunction with an image capturing apparatus 101 in the form of a video camera. In other embodiments the image capturing apparatus could be a digital camera for capturing still images. The photographer using the image capturing apparatus to capture still images wears a head-mounted display (HMD) 104 configured to display how a captured image might look when displayed on a display installation. The display installation, which is simulated in the HMD, is the hardware for displaying the captured video to one or more viewers at a display location. The display location might typically be a location, such as an art gallery or cinema, where large immersive displays are commonly found. However, there is no reason why the display location should be limited to this and display in a domestic environment is possible in principle.
However, during the image capturing using an image capturing apparatus 101, it is difficult for a photographer to recognize how an image being captured is to be later displayed on the display installation. Accordingly, in the present exemplary embodiment, the display location is simulated in a virtual space, and a display image in the virtual space is generated based on an orientation direction of a user and then displayed so that a simulation of the display installation is displayed to the user wearing the HMD.
As the image processing apparatus 102, for example, a personal computer (PC) is used. The image processing apparatus 102 acquires a moving image from the image capturing apparatus 101, performs various types of image processing on the moving image to generate a display image, and displays the display image on the second display apparatus 104. The image processing apparatus 102 displays, on the first display apparatus 103, a GUI screen for setting parameters necessary for various types of processing. The image processing apparatus 102 acquires various parameters necessary for image processing based on user operations input via the GUI displayed on the first display apparatus 103.
An input I/F 206 is, for example, a serial bus interface such as a universal serial bus (USB) interface or Institute of Electrical and Electronics Engineers (IEEE) 1394 interface and connects an input device 207 such as a keyboard and mouse. The CPU 201 is capable of reading data from the input device 207 via the input OF 206. A display OF 208 is an interface such as a digital visual interface (DVI) or high-definition multimedia interface (HDMI) (registered trademark) and connects a display device 209 such as a liquid crystal display, projector, and HMD. The CPU 201 transmits data to the display device 209 via the display I/F 208 and displays the data on the display device 209. In the present exemplary embodiment, the display device 209 is a liquid crystal display and a HMD which are the first display apparatus 103 and the second display apparatus 104, respectively, illustrated in
The image acquisition unit 301 acquires a frame image of a moving image from the image capturing apparatus 101 and outputs the acquired frame image as an input image to the first texture generation unit 306 and the second texture generation unit 307.
The ideal viewpoint setting unit 302 sets, as an origin in a virtual space, the position of an ideal viewpoint in viewing an image displayed on the display unit 105 of the display installation. The ideal viewpoint setting unit 302 outputs information indicating the set virtual space to the virtual space setting unit 305 and the first texture generation unit 306.
The image capturing parameter setting unit 303 acquires, as information about the image capturing apparatus 101, the focal length of the lens, the projection method, and the size of the image capturing sensor that are input by the user via the GUI displayed on the first display apparatus 103. The image capturing parameter setting unit 303 sets the acquired image capturing parameter and outputs the set image capturing parameter to the first texture generation unit 306.
The position/orientation acquisition unit 304 acquires, from the HMD, orientation information for calculating the position, direction, and gradient of the head portion of the user wearing the HMD. The position/orientation acquisition unit 304 outputs the acquired orientation information to the rendering unit 308.
The virtual space setting unit 305 sets a virtual space for simulating the actual display installation to be simulated. First, the virtual space setting unit 305 arranges a first screen object corresponding to the display unit (display) 105 based on display installation information indicating the structure of the display unit 105 in the display installation. In the present exemplary embodiment, the first screen object is an object including three flat screens connected together and forming an isosceles trapezoid shape when viewed from above, as illustrated in
The first texture generation unit 306 generates a first texture to be displayed on the first screen object from the frame image of the moving image based on the information indicating the position of the ideal viewpoint and the image capturing parameter, and outputs the first texture to the rendering unit 308. The second texture generation unit 307 generates a second texture to be displayed on the second screen object from the frame image of the moving image, and outputs the second texture to the rendering unit 308.
The rendering unit 308 performs rendering processing based on the various types of input information, generates an image to be displayed on the HMD, and outputs the image to the display image output unit 309. The display image output unit 309 outputs the display image generated by the rendering unit 308 to the HMD and controls the display.
A process to be executed by the image processing apparatus 102 according to the present exemplary embodiment will be described below.
In step S401, the virtual space setting unit 305 sets the first screen object for reproducing, in the virtual space, the display unit (display) 105 of the display installation that is arranged in the display environment to be simulated. The virtual space setting unit 305 acquires display installation information about the size and arrangement of each display of the display unit 105 in response to a user instruction input via the GUI displayed on the first display apparatus 103.
In step S402, the virtual space setting unit 305 sets a floor surface 901 and a background in the virtual space.
In step S403, the ideal viewpoint setting unit 302 sets an ideal viewpoint position for viewing the display unit 105 in the display environment in which the display installation is actually placed. In the present exemplary embodiment, the ideal viewpoint position is set to a position located at a distance Dview from the center of the central display in the display installation as illustrated in
In step S404, the image capturing parameter setting unit 303 sets an image capturing parameter of the connected image capturing apparatus 101. The image capturing parameter setting unit 303 sets the image capturing parameter based on a user instruction input via the GUI displayed on the first display apparatus 103. On the GUI illustrated in
In step S405, the image acquisition unit 301 acquires frame image data representing one frame of the moving image being captured by the image capturing apparatus 101.
In step S406, the position/orientation acquisition unit 304 acquires orientation information about the position, direction, and gradient of the head portion of the user. The position/orientation acquisition unit 304 acquires information about the acceleration sensor and the gyro sensor from the HMD and acquires the pattern of the known infrared LED mounted on the HMD from the infrared camera. Then, the position/orientation acquisition unit 304 derives position/orientation information by estimating a position/orientation using a publicly-known three-dimensional position/orientation estimation algorithm. A coordinate system that indicates the estimated position/orientation matches the coordinate system set in the virtual space illustrated in
In step S407, the virtual space setting unit 305 sets a second screen object 801. The second screen object 801 is an object for displaying an image indicated by the frame image data acquired in step S405. Specifically, the second screen object 801 displays a full angle-of-view image captured by the image capturing apparatus 101. The second screen object 801 is a flat or rectangular object formed of a uniform mesh and is used as auxiliary information in imaging. The aspect ratio of the second screen object 801 is set so as to match the aspect ratio of the frame image acquired in step S405.
In step S408, the display image generation unit 310 generates a display image to be displayed on the HMD based on the information and data set or acquired in the previous steps. Details of the processing will be described later. In step S409, a control unit (not illustrated) performs standby processing based on the frame rate of the HMD. For example, in a case where the frame rate is 60 fps, the control unit adjusts the processing time in such a manner that the total processing time of steps S405 to S409 is about 16 msec.
In step S410, the display image output unit 309 outputs the display image generated in step S408 to the HMD and displays the display image on the HMD. In step S411, the control unit (not illustrated) determines whether to end the display. If the user presses an end button 607 on the GUI illustrated in
Details of display image generation processing will be described below. In the display image generation processing, the first texture to be displayed on the first screen object 701 and the second texture to be displayed on the second screen object 801 are generated. Further, the display image to be displayed on the HMD is generated by performing rendering processing using the previously-set virtual space information and the frame image data about the moving image acquired from the image capturing apparatus 101. The first screen object 701 is a virtual object for simulating an image displayed on the actual display unit 105 in the display environment in which the display installation to be simulated is arranged in the real space. Specifically, the first texture is an image for reproducing how a video image appears on the display unit 105. On the other hand, the second texture matches the frame image data acquired from the image capturing apparatus 101 and is set to the second screen object 801 in order to check the imaging angle of view of the image capturing apparatus 101.
In step S501, the first texture generation unit 306 generates the first texture to be displayed on the first screen object 701. The first texture generation unit 306 sets the virtual camera at the origin in such a manner that the optical axis indicates the positive direction of the Z-axis and the upward direction of the virtual camera indicates the positive direction of the Y-axis. Specifically, the first texture generation unit 306 sets the virtual camera arranged at the ideal viewpoint position. The first texture generation unit 306 generates the first texture by inversely projecting the acquired frame image from the set virtual camera to the first screen object 701 based on the image capturing parameter set in step S404. The generated first texture is set to the first screen object 701 and rendered. The first screen object 701 to which the first texture is set indicates how the image appears when the image is displayed on the display installation arranged in the display environment. For this reason, the user who is viewing the HMD is able to view a captured scene while how the captured scene is displayed on the display installation is geometrically correctly reproduced.
In step S502, the second texture generation unit 307 generates the second texture to be displayed on the second screen object 801. The frame image acquired from the image capturing apparatus 101 is directly set as the second texture. Specifically, the second texture is an image based on which the imaging angle of view of the image capturing apparatus 101 can be checked.
In step S503, the rendering unit 308 generates an image of the virtual space viewed from the virtual viewpoint as a display image to be displayed on the HMD. Specifically, the rendering unit 308 performs rendering processing using the previously-set virtual space information including the first screen object 701, the second screen object 801, and the floor and background, the first texture, and second texture. The rendering unit 308 sets a virtual camera corresponding to the right eye of the HMD and a virtual camera corresponding to the left eye of the HMD as virtual cameras for use in rendering. The rendering unit 308 sets a camera parameter such as the angle of view of the virtual cameras for use in rendering based on the viewing angle of the HMD and the resolution of the display panel. The rendering unit 308 sets the positions and orientations of the virtual cameras so as to match the orientation information about the head portion of the user that is acquired in step S406. At this time, the rendering unit 308 corrects the positions of the two virtual cameras based on the eye width of the right and left eyes in such a manner that the distance between the virtual cameras match the eye width. The rendering unit 308 renders the virtual space using the set virtual cameras as the virtual viewpoint. Further, the rendered image is geometrically corrected based on the distortion of the eye lens of the HMD so that the image is displayed without distortion when viewed through the eye lens of the HMD.
As described above, in the present exemplary embodiment, the first screen object 701 corresponding to the display unit 105 in the display installation to be simulated, the second screen object 801 for checking the angle of view of the image capturing apparatus 101, and the floor and background are set in the virtual space. Then, rendering processing is performed based on the position/orientation of the head portion of the user using the frame image of the moving image acquired from the image capturing apparatus 101 to thereby generate a display image. At this time, the display image is generated stereoscopically and based on the position and orientation of the head portion of the user wearing the HMD, as described in the present exemplary embodiment. The generated display image is displayed on the HMD worn by the user to thereby provide the user with an experience that is as though the user views the video image in the real display environment.
In reality, it is difficult to temporarily provide a display installation at an imaging location and check an image displayed on the display installation at the imaging location. However, the simulation performed as through the user is in the display environment in which the display installation is situated as in the present exemplary embodiment leads to imaging assistance with framing to prevent a missing subject. Further, it also becomes possible to check and conduct camera work and framing during imaging so as to increase a realistic sensation at the time of displaying in the real display environment. Further, an image being captured that is acquired in real time from the image capturing apparatus 101 is directly displayed in the virtual space besides a simulation image displayed on the display installation. This makes it possible to compare and check the video image being captured and how the image displayed on the display installation based on the captured video image appears.
In a second exemplary embodiment, a method of controlling rendering of the second screen object 801 based on a user instruction will be described below. As described above in the first exemplary embodiment, the second screen object 801 is a virtual object set in the virtual space in order to check the angle of view of the image capturing apparatus 101. In the present exemplary embodiment, the second screen object 801 in the display image displayed on the HMD is displayed or hidden based on a user instruction. Each configuration similar to that in the first exemplary embodiment is given the same reference numeral, and detailed description thereof is omitted.
A process executed by the image processing apparatus 102 according to the present exemplary embodiment will be described below.
In step S1201, the virtual space setting unit 305 sets a first screen object for reproducing, in the virtual space, a real display that is situated in the real display environment and is to be simulated. In step S1202, the virtual space setting unit 305 sets the floor surface 901 and a background. In step S1203, the ideal viewpoint setting unit 302 sets an ideal viewpoint position for viewing the real display in the real display environment.
In step S1204, the image capturing parameter setting unit 303 sets an image capturing parameter according to the setting of the connected image capturing apparatus 101. In step S1205, the image acquisition unit 301 acquires one frame of frame image data of a moving image from the image capturing apparatus 101. In step S1206, the position/orientation acquisition unit 304 acquires user orientation information. In step S1207, the first texture generation unit 306 generates a first texture to be displayed on the first screen object 701.
In step S1208, the control unit (not illustrated) determines whether to perform full angle-of-view displaying. This is performed by acquiring the state of a full angle-of-view display ON/OFF checkbox 606 on the GUI illustrated in
In step S1209, the virtual space setting unit 305 sets the second screen object 801. In step S1210, the second texture generation unit 307 generates a second texture to be displayed on the second screen object 801. In step S1211, the rendering unit 308 executes rendering processing using previously-set virtual space information, the first texture, and the second texture and generates a display image to be displayed on the HMD. Details of the processing are similar to those in step S503 described above, but in the case in which the control unit determines that full angle-of-view displaying is not to be performed in step S1208, the second screen object 801 is not rendered with respect to the display image.
In step S1212, the control unit (not illustrated) performs standby processing based on the frame rate of the HMD. For example, in the case in which the frame rate is 60 fps, the control unit adjusts the processing time in such a manner that the total processing time of steps S1205 to S1212 is about 16 msec. In step S1213, the display image output unit 309 outputs the display image generated in step S1211 to the HMD and displays the display image on the HMD.
In step S1214, the control unit (not illustrated) determines whether to end the display. If the user presses the end button 607 on the GUI illustrated in
In a case where the user desires to check how an image being captured is to be displayed on the display installation during imaging, the second screen object is displayed which leads to imaging assistance. On the other hand, in a case where the user desires to check only how a video image appears on the display installation during imaging or check whether a captured video image is a suitable content for the display unit in the display installation, the second screen object is unnecessary. Accordingly, in the present exemplary embodiment, the second screen object 801 in the display image displayed on the HMD is displayed or hidden based on a user instruction. This makes it possible to provide a simulation image of the display installation according to a user demand.
As described above in the present exemplary embodiment, in the case of simulating the display installation in which the plurality of flat displays is arranged so as to be in contact with each other, the first screen object 701 is formed of a plurality of rectangular objects connected together. In order to increase the visibility of a boundary of the rectangular objects, the display image generation unit 310 can, for example, convert pixels in the display image that correspond to the boundaries of the plurality of rectangular objects into a pixel value that is clearly visible, such as black. Alternatively, the virtual space setting unit 305 can correct the coordinates of the vertexes at the time of setting the first screen object in such a manner that a space is formed in the boundaries of the plurality of rectangular objects. As a result of correcting the coordinates of the vertexes of the rectangular objects, the boundary lines between the rectangular objects are emphasized in the rendered display image.
In the present exemplary embodiment, an example of the case in which “180 inches, 3 screens, 120 degrees” is selected as the display installation structure is described. For example, the display image can be changed according to a set parameter in a case where the user changes the display apparatus structure 604 on the GUI. In the present exemplary embodiment, how the display image is changed in the case in which the display apparatus structure 604 is changed will be described below. First, the display image in the case in which “180 inches, 3 screens, 120 degrees” is selected is an image 1501 illustrated in
In the above-described exemplary embodiment, the process is described in which the display image generation unit 310 changes the display image according to the position and orientation of the user wearing the HMD so that the user is provided with an experience that is as though the user views the video image in the real display environment. Alternatively, a process of determining whether to use the position/orientation of the user at the time of rendering can be included. For example, there is a situation in which it is desirable to display an image in such a manner that a first screen object is always at the front regardless of the orientation of the user during image capturing. Accordingly, if it is determined that the position/orientation of the user is not to be used based on a user instruction, the position of the virtual camera at the time of rendering is fixed at the origin, the optical axis direction of the virtual camera is fixed at the positive direction of the Z-axis, and the upward direction of the virtual camera is fixed at the positive direction of the Y-axis. As a tool for selecting whether to use the position/orientation of the user, a position/orientation ON/OFF checkbox can be provided on the GUI illustrated in
In the above-described exemplary embodiments, the example in which the display image generated by rendering the virtual space is displayed on the HMD which is the second display apparatus 104 is described. Alternatively, a display apparatus other than the HMD can be used as the second display apparatus 104. For example, a mobile information terminal such as a smartphone or tablet can be used as the second display apparatus 104. In the case of a smartphone or tablet, the orientation and direction of an information terminal are detectable, so that a display image can be generated by acquiring orientation/direction information from the information terminal and rendering the virtual space based on the direction of the terminal and the generated display image can be output as in the above-described exemplary embodiments. Alternatively, a personal computer or input apparatus (mouse, touch panel display) can be used as the second display apparatus 104. In a case of a normal display, however, it is sometimes impossible to acquire orientation information. In this case, a display image acquired by rendering the virtual space from the virtual viewpoint is displayed on the display and presented to the user. At this time, it is desirable to allow the user to set the virtual viewpoint with respect to the display image through an operation using the mouse or touch panel. A display image rendered based on the viewpoint position designated via the input apparatus can be displayed.
In the above-described exemplary embodiments, software in which the CPU 201 executes a program to thereby realize each configuration illustrated in
Embodiment(s) of the present invention can also be realized by a computer of a system or apparatus that reads out and executes computer executable instructions (e.g., one or more programs) recorded on a storage medium (which may also be referred to more fully as a ‘non-transitory computer-readable storage medium’) to perform the functions of one or more of the above-described embodiment(s) and/or that includes one or more circuits (e.g., application specific integrated circuit (ASIC)) for performing the functions of one or more of the above-described embodiment(s), and by a method performed by the computer of the system or apparatus by, for example, reading out and executing the computer executable instructions from the storage medium to perform the functions of one or more of the above-described embodiment(s) and/or controlling the one or more circuits to perform the functions of one or more of the above-described embodiment(s). The computer may comprise one or more processors (e.g., central processing unit (CPU), micro processing unit (MPU)) and may include a network of separate computers or separate processors to read out and execute the computer executable instructions. The computer executable instructions may be provided to the computer, for example, from a network or the storage medium. The storage medium may include, for example, one or more of a hard disk, a random-access memory (RAM), a read only memory (ROM), a storage of distributed computing systems, an optical disk (such as a compact disc (CD), digital versatile disc (DVD), or Blu-ray Disc (BD)™), a flash memory device, a memory card, and the like.
While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. The scope of the following claims is to be accorded the broadest interpretation so as to encompass all such modifications and equivalent structures and functions.
Claims
1. An image processing apparatus comprising:
- an first acquisition unit configured to acquire image data of a captured image;
- a second acquisition unit configured to acquire display installation information indicating a configuration of one or more display screens in a display installation to be displayed;
- a first generation unit configured to generate, by performing rendering using the acquired image data and acquired display installation information, a display image viewed from a virtual viewpoint corresponding to the display installation in a virtual space, to be displayed on the display installation,
- wherein the display image being at least a part of the captured image; and
- a second generation unit configured to generate, by performing rendering, a display image data corresponding to a full angle-of-view of the acquired captured image in the virtual space for displaying the acquired captured image,
- wherein the generated display image on a display device different from the display installation in order to arrange the acquired captured image and the generated display image on one screen of the display device.
2. The image processing apparatus according to claim 1, wherein the display installation has a curved screen.
3. The image processing apparatus according to claim 1, wherein the display installation includes a combination of a plurality of flat display screens.
4. The image processing apparatus according to claim 1,
- wherein the display image is generated by rendering the captured image data on a first virtual object corresponding to the display installation in a virtual space.
5. The image processing apparatus according to claim 1, wherein the display device is a head-mounted display.
6. The image processing apparatus according to claim 1, wherein the acquired captured image and the generated display image do not overlap on the one screen.
7. An image processing method comprising:
- acquiring image data of a captured image;
- acquiring display installation information indicating a configuration of one or more display screens in a display installation to be displayed;
- generating, by performing rendering using the acquired image data and acquired display installation information, a display image viewed from a virtual viewpoint corresponding to the display installation in a virtual space, to be displayed on the display installation,
- wherein the display image being at least a part of the captured image; and
- generating, by performing rendering, a display image data corresponding to a full angle-of-view of the acquired captured image in the virtual space for displaying the acquired captured image,
- wherein the generated display image on a display device different from the display installation in order to arrange the acquired captured image and the generated display image on one screen of the display device.
8. A non-transitory computer-readable storage medium storing instructions that, when executed by a computer, cause the computer to perform a method comprising:
- acquiring image data of a captured image;
- acquiring display installation information indicating a configuration of one or more display screens in a display installation to be displayed;
- generating, by performing rendering using the acquired image data and acquired display installation information, a display image viewed from a virtual viewpoint corresponding to the display installation in a virtual space, to be displayed on the display installation,
- wherein the display image being at least a part of the captured image; and
- generating, by performing rendering, a display image data corresponding to a full angle-of-view of the acquired captured image in the virtual space for displaying the acquired captured image,
- wherein the generated display image on a display device different from the display installation in order to arrange the acquired captured image and the generated display image on one screen of the display device.
9396588 | July 19, 2016 | Li |
20040157662 | August 12, 2004 | Tsuchiya |
20150248792 | September 3, 2015 | Abovitz |
20200112707 | April 9, 2020 | Kobayashi |
2018097847 | June 2018 | JP |
Type: Grant
Filed: Apr 30, 2021
Date of Patent: Jan 24, 2023
Patent Publication Number: 20210256941
Assignee: Canon Kabushiki Kaisha (Tokyo)
Inventor: Masatoshi Ishii (Koganei)
Primary Examiner: Edward Martello
Application Number: 17/245,914
International Classification: G09G 5/38 (20060101); G06F 3/01 (20060101); G06T 11/00 (20060101); G09G 5/14 (20060101);