POINT-OF-VIEW IMAGE WARP SYSTEMS AND METHODS
An electronic device may include an electronic display to display an image based on processed image data. The electronic device may also include image processing circuitry to generate the processed image data. The image processing circuitry may receive input image data corresponding to an image in a first perspective and warp the input image data from the first perspective to a second perspective, generating warped image data. Additionally, the image processing circuitry may determine one or more occluded regions in the second perspective and determine fill-data corresponding to the occluded regions. The processed image data may be generated by combining the warped image data and the fill-data.
This application is a Continuation of and claims priority to U.S. patent application Ser. No. 17/356,223, filed Jun. 23, 2021, and entitled “Point-of-View Image Warp Systems and Methods”, which is incorporated herein by reference in its entirety for all purposes.
BACKGROUNDThe present disclosure relates generally to image processing and, more particularly, to image warp operations correcting for a change in point-of-view.
This section is intended to introduce the reader to various aspects of art that may be related to various aspects of the present disclosure, which are described and/or claimed below. This discussion is believed to be helpful in providing the reader with background information to facilitate a better understanding of the various aspects of the present disclosure. Accordingly, it should be understood that these statements are to be read in this light, and not as admissions of prior art.
Electronic devices often use one or more electronic displays to present visual information such as text, still images, and/or video by displaying one or more images. For example, such electronic devices may include computers, mobile phones, portable media devices, tablets, televisions, virtual-reality headsets, and vehicle dashboards, among many others. To display an image, an electronic display may control light emission of its display pixels based at least in part on corresponding image data.
An image may be represented by image data defining areas with particular resolutions or distributions of pixel values. However, in some instances, it may be desirable to change the resolutions or distributions of the pixel values to account for different display scenarios. For example, image data may be warped to account for environmental surroundings, display characteristics, a viewer's point-of-view (POV) and other factors that may distort the perceived image to a viewer. Thus, before being displayed, the image data may be processed to warp the image using the desired changes to the amount or distribution of pixel values such that the perceived image is not distorted. However, at least in some instances, when warping for changes in POV, objects may be revealed for which no image data exists, which may lead to image artifacts.
SUMMARYA summary of certain embodiments disclosed herein is set forth below. It should be understood that these aspects are presented merely to provide the reader with a brief summary of these certain embodiments and that these aspects are not intended to limit the scope of this disclosure. Indeed, this disclosure may encompass a variety of aspects that may not be set forth below.
In some scenarios, an image may appear distorted when perceived by a viewer due to differences between the perspective with which the image was captured or rendered and the viewer's point-of-view (POV)/perspective. Furthermore, a viewer's POV (e.g., as determined based on location and/or eye-tracking) relative to the display may alter how the viewer perceives the image. For example, in augmented, virtual, or mixed reality devices, an image may be captured, previously or in real time, and displayed with or without augmentation to a viewer as if the image was reality. However, the image may have been captured with a POV different from that which the viewer would normally experience. For example, objects may appear shorter, taller, wider, smaller, or otherwise out of perspective relative to itself or other objects if the image is left uncorrected. Thus, before being displayed, the image data may be processed to warp the image such that the perceived image has reduced or no distortion.
A POV warp may assist in altering the image data such that, when displayed, the image appears from a perspective consistent with that of the viewer. Such a POV warp may also shift objects' relative positions and generate one or more occluded regions in the image. For example, a foreground object may shift in a POV warp such that a portion of the object or background is visible after the warp that was not previously a part of the input image. As such, in some embodiments, occluded regions may be filled-in by blending, generating new pixel values, or additional image captures (e.g., from additional cameras. Additionally or alternatively, in some embodiments, filling-in the occluded regions may utilize machine learning (e.g., deep learning) to estimate the missing pixel values.
Additionally, in some embodiments, the correction for the POV warp may utilize a pixel grid to map pixel locations in the input image data to the warped locations corrected for the viewer's POV. During generation of the pixel grid, occlusions may be identified as invalid regions having no input image data associated therewith. As the occluded regions will be filled-in with fill-data, the invalid regions may be ignored when warping and fetching image data to increase operational efficiency.
Various aspects of this disclosure may be better understood upon reading the following detailed description and upon reference to the drawings in which:
One or more specific embodiments of the present disclosure will be described below. These described embodiments are only examples of the presently disclosed techniques. Additionally, in an effort to provide a concise description of these embodiments, all features of an actual implementation may not be described in the specification. It should be appreciated that in the development of any such actual implementation, as in any engineering or design project, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related and business-related constraints, which may vary from one implementation to another. Moreover, it should be appreciated that such a development effort might be complex and time consuming, but may nevertheless be a routine undertaking of design, fabrication, and manufacture for those of ordinary skill having the benefit of this disclosure.
When introducing elements of various embodiments of the present disclosure, the articles “a,” “an,” and “the” are intended to mean that there are one or more of the elements. The terms “comprising,” “including,” and “having” are intended to be inclusive and mean that there may be additional elements other than the listed elements. Additionally, it should be understood that references to “one embodiment” or “an embodiment” of the present disclosure are not intended to be interpreted as excluding the existence of additional embodiments that also incorporate the recited features. Furthermore, the phrase A “based on” B is intended to mean that A is at least partially based on B. Moreover, the term “or” is intended to be inclusive (e.g., logical OR) and not exclusive (e.g., logical XOR). In other words, the phrase A “or” B is intended to mean A, B, or both A and B.
Electronic devices often use electronic displays to present visual information. Such electronic devices may include computers, mobile phones, portable media devices, tablets, televisions, virtual-reality headsets, and vehicle dashboards, among many others. An electronic display may take the form of a liquid crystal display (LCD), a light emitting diode (LED) display, an organic light emitting diode (OLED) display, a plasma display, or the like.
To display an image, an electronic display controls the luminance (and, as a consequence, the color) of its display pixels based on corresponding image data received at a particular resolution. For example, an image data source may provide image data as a stream of pixel data, in which data for each pixel indicates a target luminance (e.g., brightness and/or color) of one or more display pixels located at corresponding pixel positions. In some embodiments, image data may indicate luminance per color component, for example, via red component image data, blue component image data, and green component image data, collectively referred to as RGB image data (e.g., RGB, sRGB). Additionally or alternatively, image data may be indicated by a luma channel and one or more chrominance channels (e.g., YCbCr, YUV, etc.), grayscale (e.g., gray level), or other color basis. It should be appreciated that a luma channel, as disclosed herein, may encompass linear, non-linear, and/or gamma-corrected luma values.
In some scenarios, the image to be displayed may, if unaltered, appear distorted when perceived by a viewer. Distortions could be due to environmental effects, properties of the display, the viewer's point-of-view (POV) perspective, image processing warps such as shifts and scaling, and/or other distorting factors. For example, the display may include a screen, opaque or transparent, with curved edges and/or lensing effects that may distort an image if displayed without correction. Furthermore, a viewer's POV (e.g., as determined based on location and/or eye-tracking) relative to the display may alter how the viewer perceives the image. For example, in augmented reality (AR), virtual reality (VR), or mixed reality (MR) devices, an image may be captured, previously or in real time, and displayed with or without augmentation to a viewer as if the image was reality. However, the image may have been captured with a POV different from that which the viewer would normally experience if the image were reality. For example, objects may appear shorter, taller, wider, smaller, or otherwise out of perspective relative to itself or other objects. Thus, before being displayed, the image data may be processed to warp the image such that the perceived image has reduced or no distortion.
A POV warp may cause one or more occluded regions in the image. For example, a foreground object may shift in a POV warp such that a portion of the object or background is visible after the warp that was not previously a part of the input image. As such, in some embodiments, occluded regions may be filled-in by blending, generating new pixel values, additional image captures (e.g., from additional cameras), etc. Additionally or alternatively, in some embodiments, filling-in the occluded regions may utilize machine learning (e.g., deep learning) to estimate the missing pixel values.
In some embodiments, the correction for some types of warps, for example the POV warp may utilize a pixel grid to map pixel locations in the input image data to the warped locations corrected for the viewer's POV. However, warps that use a coarser grid may result in image artifacts, while finer grid warping may be taxing on processing bandwidth and/or take additional time. To avoid image artifacts and to increase efficiency, a hierarchical grid may have a variable grid partition size when performing the warp operation. In some embodiments, the hierarchical grid may have a quad-tree grid structure determined by reiteratively splitting grid partitions into fourths depending on image statistics associated with a corresponding portion of the image data. For example, the hierarchical grid may be determined by starting with 64×64 pixel partitions. Depending on the image statistics (e.g., homogeny, occlusions, perceived depth, edges, or other image features that may warrant finer warp calculations), that particular 64×64 partition of the grid may remain a full partition or the partition may be split into four partitions of 32×32 pixels. Each new partition may be maintained or further split depending on the image statistics until a desired granularity is reached. Furthermore, some partitions may be designated as invalid by being out of an active area of the display or as including an occlusion. As discussed herein invalid regions may be ignored for warp purposes to reduce bandwidth and filled-in separately, if necessary (e.g., in the active area).
With the coordinate mapping to the input image data for each partition, hierarchical interpolation may be used to obtain coordinates for each grid point. For example, coordinates corresponding to a 32×32 pixel partition may be interpolated from the 64×64 partition coordinates. Subsequently, coordinates corresponding to a 16×16 pixel partition may be interpolated from the 32×32 partition coordinates and so on until a coordinate mapping is determined for each grid point. Additionally, at each stage of interpolation, if the coordinates for a partition size is available (e.g., the grid was split into smaller than 64×64 partitions), coordinates for the corresponding pixels may be already calculated as part of the grid and may be directly used instead of interpolated.
By reducing warp calculations in partitions where artifacts are less likely and performing finer warp operations in certain regions based on the image statistics, processing time and/or bandwidth usage may be decreased while image quality is maintained. Moreover, the reduced processing time may provide for real-time or enhanced feedback to a user's change in position/POV. As should be appreciated, the grid sizing is given as an example and is non-limiting. Further, the disclosed techniques regarding the pixel grid may be used for any of multiple different image processing operations including but not limited to warps such as the POV warps, geometric distortion warps, temporal warps, etc. Indeed, other image processing operations may utilize a hierarchical grid structure for increased efficiency and/or reduced processing time. Accordingly, to improve image quality and/or increase efficiency, the present disclosure provides techniques for a general hierarchical grid and for enhanced POV warping.
One embodiment of an electronic device 10 that utilizes the hierarchical grid is shown in
The electronic device 10 may include one or more electronic displays 12, input devices 14, input/output (I/O) ports 16, a processor core complex 18 having one or more processors or processor cores, local memory 20, a main memory storage device 22, a network interface 24, a power source 26, and image processing circuitry 28. The various components described in
The processor core complex 18 may be operably coupled with local memory 20 and the main memory storage device 22. The local memory 20 and/or the main memory storage device 22 may include tangible, non-transitory, computer-readable media that store instructions executable by the processor core complex 18 and/or data to be processed by the processor core complex 18. For example, the local memory 20 may include random access memory (RAM) and the main memory storage device 22 may include read only memory (ROM), rewritable non-volatile memory such as flash memory, hard drives, optical discs, and/or the like.
The processor core complex 18 may execute instructions stored in local memory 20 and/or the main memory storage device 22 to perform operations, such as generating source image data. As such, the processor core complex 18 may include one or more general purpose microprocessors, one or more application specific processors (ASICs), one or more field programmable logic arrays (FPGAs), or any combination thereof.
The network interface 24 may connect the electronic device 10 to a personal area network (PAN), such as a Bluetooth network, a local area network (LAN), such as an 802.11x Wi-Fi network, and/or a wide area network (WAN), such as a 4G, LTE, or 5G cellular network. In this manner, the network interface 24 may enable the electronic device 10 to transmit image data to a network and/or receive image data from the network.
The power source 26 may provide electrical power to operate the processor core complex 18 and/or other components in the electronic device 10. Thus, the power source 26 may include any suitable source of energy, such as a rechargeable lithium polymer (Li-poly) battery and/or an alternating current (AC) power converter.
The I/O ports 16 may enable the electronic device 10 to interface with various other electronic devices. The input devices 14 may enable a user to interact with the electronic device 10. For example, the input devices 14 may include buttons, keyboards, mice, trackpads, and the like. Additionally or alternatively, the electronic display 12 may include touch sensing components that enable user inputs to the electronic device 10 by detecting occurrence and/or position of an object touching its screen (e.g., surface of the electronic display 12).
The electronic display 12 may display a graphical user interface (GUI) of an operating system, an application interface, text, a still image, or video content. To facilitate displaying images, the electronic display 12 may include a display panel with one or more display pixels. Additionally, each display pixel may include one or more sub-pixels, which each control the luminance of a color component (e.g., red, green, or blue). As used herein, a display pixel may refer to a collection of sub-pixels (e.g., red, green, and blue subpixels) or may refer to a single sub-pixel.
As described above, the electronic display 12 may display an image by controlling the luminance of the sub-pixels based at least in part on corresponding image data. In some embodiments, the image data may be received from another electronic device, for example, via the network interface 24 and/or the I/O ports 16. Additionally or alternatively, the image data may be generated by the processor core complex 18 and/or the image processing circuitry 28. Moreover, in some embodiments, the electronic device 10 may include multiple electronic displays 12 and/or may perform image processing (e.g., via the image processing circuitry 28) for one or more external electronic displays 12, such as connected via the network interface 24 and/or the I/O ports 16.
The electronic device 10 may be any suitable electronic device. To help illustrate, one example of a suitable electronic device 10, specifically a handheld device 10A, is shown in
The handheld device 10A may include an enclosure 30 (e.g., housing) to, for example, protect interior components from physical damage and/or shield them from electromagnetic interference. Additionally, the enclosure 30 may surround, at least partially, the electronic display 12. In the depicted embodiment, the electronic display 12 is displaying a graphical user interface (GUI) 32 having an array of icons 34. By way of example, when an icon 34 is selected either by an input device 14 or a touch-sensing component of the electronic display 12, an application program may launch.
Furthermore, input devices 14 may be provided through openings in the enclosure 30. As described above, the input devices 14 may enable a user to interact with the handheld device 10A. For example, the input devices 14 may enable the user to activate or deactivate the handheld device 10A, navigate a user interface to a home screen, navigate a user interface to a user-configurable application screen, activate a voice-recognition feature, provide volume control, and/or toggle between vibrate and ring modes. Moreover, the I/O ports 16 may also open through the enclosure 30. Additionally, the electronic device may include one or more cameras 36 to capture pictures or video. In some embodiments, a camera 36 may be used in conjunction with a virtual reality or augmented reality visualization on the electronic display 12.
Another example of a suitable electronic device 10, specifically a tablet device 10B, is shown in
As described above, the electronic display 12 may display images based at least in part on image data. Before being used to display a corresponding image on the electronic display 12, the image data may be processed, for example, via the image processing circuitry 28. In general, the image processing circuitry 28 may process the image data for display on one or more electronic displays 12. For example, the image processing circuitry 28 may include a display pipeline, memory-to-memory scaler and rotator (MSR) circuitry, warp compensation circuitry, or additional hardware or software means for processing image data. The image data may be processed by the image processing circuitry 28 to reduce or eliminate image artifacts, compensate for one or more different software or hardware related effects, and/or format the image data for display on one or more electronic displays 12. As should be appreciated, the present techniques may be implemented in standalone circuitry, software, and/or firmware, and may be considered a part of, separate from, and/or parallel with a display pipeline or MSR circuitry.
To help illustrate, a portion of the electronic device 10, including image processing circuitry 28, is shown in
The electronic device 10 may also include an image data source 38, a display panel 40, and/or a controller 42 in communication with the image processing circuitry 28. In some embodiments, the display panel 40 of the electronic display 12 may be a liquid crystal display (LCD), a light emitting diode (LED) display, an organic LED (OLED) display, or any other suitable type of display panel 40. In some embodiments, the controller 42 may control operation of the image processing circuitry 28, the image data source 38, and/or the display panel 40. To facilitate controlling operation, the controller 42 may include a controller processor 44 and/or controller memory 46. In some embodiments, the controller processor 44 may be included in the processor core complex 18, the image processing circuitry 28, a timing controller in the electronic display 12, a separate processing module, or any combination thereof and execute instructions stored in the controller memory 46. Additionally, in some embodiments, the controller memory 46 may be included in the local memory 20, the main memory storage device 22, a separate tangible, non-transitory, computer-readable medium, or any combination thereof.
The image processing circuitry 28 may receive source image data 48 corresponding to a desired image to be displayed on the electronic display 12 from the image data source 38. The source image data 48 may indicate target characteristics (e.g., pixel data) corresponding to the desired image using any suitable source format, such as an 8-bit fixed point aRGB format, a 10-bit fixed point aRGB format, a signed 16-bit floating point aRGB format, an 8-bit fixed point YCbCr format, a 10-bit fixed point YCbCr format, a 12-bit fixed point YCbCr format, and/or the like. In some embodiments, the image data source 38 may be included in the processor core complex 18, the image processing circuitry 28, or a combination thereof. Furthermore, the source image data 48 may reside in a linear color space, a gamma-corrected color space, or any other suitable color space. As used herein, pixels or pixel data may refer to a grouping of sub-pixels (e.g., individual color component pixels such as red, green, and blue) or the sub-pixels themselves.
As described above, the image processing circuitry 28 may operate to process source image data 48 received from the image data source 38. The data source 38 may include captured images from cameras 36, images stored in memory, graphics generated by the processor core complex 18, or a combination thereof. The image processing circuitry 28 may include one or more sets of image data processing blocks 50 (e.g., circuitry, modules, or processing stages) such as the warp processing block 52. As should be appreciated, multiple other processing blocks 54 may also be incorporated into the image processing circuitry 28, such as a color management block, a dither block, a rotate block, etc. Furthermore, in some embodiments, multiple warp processing blocks 52 may be used to provide separate warp operations for different applications of the image processing circuitry 28. For example, different warp processing blocks 52 may be used for image data from different image data sources 38 (e.g., captured images, graphically generated images, etc.). The image data processing blocks 50 may receive and process source image data 48 and output display image data 56 in a format (e.g., digital format and/or resolution) interpretable by the display panel 40. Further, the functions (e.g., operations) performed by the image processing circuitry 28 may be divided between various image data processing blocks 50, and while the term “block” is used herein, there may or may not be a logical separation between the image data processing blocks 50.
The warp processing block 52 may facilitate improving perceived image quality by changing the amount (e.g., resolution) or distribution (e.g., shape, relative size, perspective, etc.) of pixel values to account for different display scenarios during one or more warp operations. To help illustrate,
The POV warp sub-block 60 may provide compensation for a viewer's POV (e.g., as determined based on location, eye relief, and/or focus) relative to the display as compared to the capture of the image to be displayed, for example via one or more cameras 36. Moreover, the viewer's POV may be monitored via any suitable method such as eye-tracking. Additionally, a height of the viewer may be determined based on a user input or a relative height difference between the electronic device 10 and an estimated position of the viewer with respect to the electronic device 10.
As non-limiting examples, augmented, virtual, or mixed reality devices, may capture images, previously or in real time, and display them with or without augmentation to a viewer as if the image was physically in view of the viewer. However, the image may have been captured with a perspective different from that which the viewer would normally experience, which may alter how the viewer perceives the image. For example, objects may appear shorter, taller, wider, smaller, or otherwise out of perspective relative to themselves or other objects. Thus, in some embodiments, the POV warp sub-block 60 may warp the image data before it is displayed such that the perceived image has reduced or no distortion.
In some scenarios, a POV warp 68 from a source frame 70 (a.k.a. a source space) to a POV warped frame 72 (a.k.a. a warped space) may cause one or more occluded regions 74 in the image 76, as shown in
In some embodiments, the warp processing block 52 may generate image data to fill-in an occluded region 74 in the POV warped frame 72. For example, the occluded region 74 may be filled-in by blending pixel values surrounding the occluded region 74 or by using image data from additional image captures such as images captured at a different time (e.g., before or after the original capture) from one or more different angles and/or by an additional image capture device (e.g., camera 36) having a different perspective. For example, in some embodiments, cameras 36 in stereo provide perspectives that may be combined to reduce or eliminate each other's occluded regions 74. In such a case, input image data 64 from the second camera 36 may undergo a POV warp 68 to the POV warped frame 72, and the portion corresponding to the occluded region 74 may be used as fill-data. Additionally or alternatively, in some embodiments, the warp processing block 52 may generating new pixel values to estimate and fill-in the pixel values of the occluded region 74 by utilizing painting algorithms and/or machine learning (e.g., deep learning).
Additionally, in some embodiments, sampling or resampling may occur with a window 82 that overlaps with or abuts edges of an occluded region 74. In some scenarios, filtering across occluded region 74 boundaries may generate artifacts 84. As such, in some embodiments, filtering may be reduced or eliminated for areas about the occluded region 74.
In some embodiments, to apply different types of warps, for example the POV warp 68 and/or the warp processing block 52 may utilize a pixel grid to map the pixel locations in the source frame 70 to locations in the POV warped frame 72 corrected for the viewer's POV. Performing the warp on a pixel grid may increase efficiency by creating a mapping of pixel coordinates without fetching or processing all of the input image data. For example, after the POV warp 68 or other warp operation certain image data may be superfluous, and reducing or eliminating fetching and/or processing of unused data may increase efficiency. In general, the pixel grid may include grid points that define mappings to coordinates of the input image data 64 in the source frame 70 (i.e., source space). The mappings may be generated based on the POV warp 68, which may be accomplished in software or hardware. After determining the mappings, portions of the input image data 64, corresponding to the coordinate mappings of the grid points, may be fetched to resolve the pixel grid, generating the image 76 in the POV warped frame 72.
In some scenarios, a warp using a coarser grid may result in one or more image artifacts such as blurring. On the other hand, while a warp using a finer grid may provide increased clarity and reduced image artifacts, finer grids may be taxing on processing bandwidth and/or take additional time to render. As such, in some embodiments, the warp processing block 52 may utilize a hierarchical grid 100, as in
In some embodiments, the hierarchical grid 100 may have a quad-tree grid structure determined by reiteratively splitting or merging grid partitions 102 by fourths depending on image statistics associated with a corresponding portion of the input image data 64. For example, the hierarchical grid 100 may be determined via a top-down split 104 starting with a uniform grid 106 of relatively larger grid partitions 102. The coarse uniform grid 106 may undergo zero, one, or more intermediate split partition stages 108 before arriving at the hierarchical grid 100. Additionally or alternatively, a hierarchical grid 100 may be generated by bottom-up merging 110. Bottom-up merging 110 may begin with a uniform grid 112 of relatively smaller grid partitions 102 that may undergo zero, one, or more intermediate merged partition stages 114 before arriving at the hierarchical grid 100. Moreover, the top-down split 104 and/or bottom-up merge 110 may be accomplished iteratively or in a single pass. Furthermore, a base partition 116 may be set depending on a desired maximum granularity that the coarse uniform grid 106 may start with during a top-down split 104. Similarly, in some embodiments, the bottom-up merge 110 may not merge to include grid partitions 102 larger than the base partition 116. Although discussed herein as having a quad-tree grid structure, the hierarchical grid 100 may use any suitable structure depending on implementation. For example, the structure of the hierarchical grid may bifurcate in halves, fourths, eighths, or any suitable fraction at each hierarchical tier/level.
To help further illustrate,
In addition to modulating the granularity based on the image statistics for areas of interest relative to the input image data 64, the hierarchical grid 100 may also be partitioned based on an active area of the display 12. For example, the input image data 64 may include portions of the image 76 that lie outside the displayed portion (e.g., active area) of the image 76. In some embodiments, grid partitions 102 of the hierarchical grid 100 that are outside of the active area and/or correspond to occluded regions 74 may be designated as invalid region 118 or partition. In some embodiments, invalid region 118 may be ignored (e.g., corresponding input image data 64 not fetched) for warp purposes to reduce bandwidth usage and increase speed and efficiency.
When a warp (e.g., the POV warp 68, a geometric warp, a temporal warp, etc.) is applied to the hierarchical grid 100 by the POV warp sub-block 60 or the hierarchical grid interpolation sub-block 62, at least one characteristic grid point for each grid partition 102 is mapped to a pixel coordinate in the source frame 70 of the input image data 64. The mapping may be calculated in hardware or software based on POV parameters corresponding to the viewer's eye's location, eye relief, eye focus and/or other POV calculations relative to the display 12, image capturing mechanism (e.g., camera 36), or object of interest. As should be appreciated, more than one characteristic grid point may be mapped per grid partition 102 depending on implementation. Moreover, the characteristic grid point may be aligned with an edge, corner, or middle of the grid partition 102 for simplistic reference. For example, the characteristic grid point may be the top left corner grid point of each grid partition 102 and mapped to coordinates (e.g., “x” and “y” coordinates relative to a pixel grid of the input image data 64) in the source frame 70. Furthermore, while at the characteristic grid point mapping for each grid partition 102 is known (e.g., calculated), the additional grid points of the grid partitions 102 may be unknown from the partition mapping. As such, the hierarchical grid interpolation sub-block 62 may hierarchically interpolate between the known characteristic grid point coordinate mappings to determine the coordinates for the remaining grid point mappings.
Furthermore, interpolation may continue until the coordinate mappings are interpolated for a 1×1 grid partition size, completing the mapping for each pixel position in the warped grid to the input image data 64 of the source frame 70. As should be appreciated, any suitable method for interpolation such as bilinear or bicurvature interpolation may be used to interpolate the coordinate mappings. In some embodiments, a more accurate interpolation (e.g., bicurvature interpolation) may be used while determining higher tier interpolations such as the 32×32 grid point interpolations and 16×16 grid point interpolations while lower tier interpolations such as the 1×1 grid interpolations may use more efficient interpolation methods such as bilinear interpolation. As should be appreciated, although stated herein as using “x” and “y” coordinates, any suitable coordinate system may be used to map the input image data 64 to warped image data such as the POV warped frame 72. Furthermore, in some embodiments, the “x” and “y” coordinates interpolations may be treated separately and to allow for parallel processing of the interpolations.
To help further illustrate the hierarchical interpolation,
Using the characteristic grid points 122 of the base partitions 116 and the characteristic grid points 124 for the 32×32 grid partitions 102A, the characteristic grid points 126 for the 16×16 grid partitions 102B may be interpolated. As with the 32×32 grid partitions 102A, if one or more base partitions 116 were split into 16×16 grid partitions 102B or smaller grid partitions 102C, 102D, some of the characteristic grid points 126 for the 16×16 grid partitions 102B may already be known from the warp calculation. The iterated interpolations may continue at each tier of the hierarchical interpolation until mappings for each grid point are determined, either from the warp operation or by interpolation.
Additionally, while depicted as the upper-left-most grid point, as should be appreciated, any relative position within the base partition 116 and grid partitions 102 may be used as characteristic grid points 122, 124, 126. Furthermore, as mentioned above, although the base partition 116 is exampled by a 64×64 grid partition, any suitable size base partition 116 may be utilized with iterated interpolations occurring until each grid point is resolved.
By reducing warp calculations in partitions where artifacts are less likely and performing finer warp operations in certain regions based on the image statistics, processing time and/or bandwidth usage may be decreased while image quality is maintained. Accordingly, the present techniques for a general hierarchical grid and for enhanced POV warping improve image quality and/or increase efficiency provide for real-time or enhanced feedback to a user's change in position/POV.
Although the above referenced flowcharts are shown in a given order, in certain embodiments, process/decision blocks may be reordered, altered, deleted, and/or occur simultaneously. Additionally, the referenced flowcharts are given as illustrative tools and further decision and process blocks may also be added depending on implementation.
The specific embodiments described above have been shown by way of example, and it should be understood that these embodiments may be susceptible to various modifications and alternative forms. It should be further understood that the claims are not intended to be limited to the particular forms disclosed, but rather to cover all modifications, equivalents, and alternatives falling within the spirit and scope of this disclosure.
The techniques presented and claimed herein are referenced and applied to material objects and concrete examples of a practical nature that demonstrably improve the present technical field and, as such, are not abstract, intangible or purely theoretical. Further, if any claims appended to the end of this specification contain one or more elements designated as “means for [perform]ing [a function] . . . ” or “step for [perform]ing [a function] . . . ”, it is intended that such elements are to be interpreted under 35 U.S.C. 112(f). However, for any claims containing elements designated in any other manner, it is intended that such elements are not to be interpreted under 35 U.S.C. 112(f).
Claims
1. An electronic device comprising:
- an electronic display configured to display an image based on warped image data; and
- image processing circuitry configured to: determine image statistics corresponding to input image data, wherein the input image data comprises a first perspective; determine a hierarchical grid comprising a first set of grid points that define mappings from the first perspective to a second perspective; determine whether to split the hierarchical grid into a first plurality of grid partitions based on the image statistics; based on determining to split the hierarchical grid into the first plurality of grid partitions, determining a second set of grid points corresponding to the first plurality of grid partitions based on the first set of grid points; and warp the input image data from the first perspective to the second perspective to generate, at least in part, the warped image data based on the mappings corresponding to the first set of grid points, the second set of grid points, or both.
2. The electronic device of claim 1, wherein the image processing circuitry is configured to determine whether to split the hierarchical grid into the first plurality of grid partitions by determining whether a portion of the hierarchical grid is invalid.
3. The electronic device of claim 2, wherein the portion of the hierarchical grid is invalid if the portion comprises pixel locations outside of an active area of the electronic display, the portion comprises an occlusion, or both.
4. The electronic device of claim 2, wherein, based on determining that the portion of the hierarchical grid is invalid, the image processing circuitry is configured to not fetch a portion of the input image data corresponding to the portion of the hierarchical grid and to not apply warp operations to the portion of the hierarchical grid.
5. The electronic device of claim 1, the image processing circuitry is configured to determine to split the hierarchical grid into the first plurality of grid partitions based on determining that the hierarchical grid comprises one or more occluded regions.
6. The electronic device of claim 5, wherein the one or more occluded regions comprises portions of the image in the second perspective not visible in the first perspective.
7. The electronic device of claim 5, wherein the image processing circuitry is configured to determine fill-data corresponding to the one or more occluded regions and incorporate the fill-data into the warped image data.
8. The electronic device of claim 7, wherein the image processing circuitry is configured to determine the fill-data for an occluded region of the one or more occluded regions by:
- receiving second input image data corresponding to the image in a third perspective;
- warping the second input image data from the third perspective to the second perspective to generate second warped image data; and
- selecting a portion of the second warped image data corresponding to the occluded region as the fill-data.
9. The electronic device of claim 1, wherein the image processing circuitry is configured to warp the input image data by fetching portions of the input image data according to the mappings corresponding to the first set of grid points, the second set of grid points, or both.
10. The electronic device of claim 1, wherein the image statistics are indicative of whether one or more regions of the hierarchical grid comprise edge effects, changes in perceivable depth, homogeneity, an occlusion, or any combination thereof.
11. Image processing circuitry configured to:
- determine a hierarchical grid in a first space indicative of a first perspective based on a warp from a second space, indicative of a second perspective different from the first perspective, to the second space, wherein grid points of the hierarchical grid define mappings to coordinates of input image data in the second space, wherein determining the hierarchical grid comprises iterative top-down splitting of a base partition into a first plurality of partitions that define the hierarchical grid or iterative bottom-up merging of a second plurality of partitions to form the hierarchical grid;
- fetch portions of the input image data based on the mappings to the coordinates of the input image data; and
- generate warped image data in the first space based on the fetched portions of the input image data.
12. The image processing circuitry of claim 11, wherein the image processing circuitry is configured to determine the hierarchical grid by the iterative top-down splitting of the base partition into the first plurality of partitions, wherein a first split of the iterative top-down splitting comprises:
- determining that a region of the base partition comprises an occlusion in the first perspective; and
- splitting the base partition into at least two first-tier partitions, wherein a first first-tier partition of the at least two first-tier partitions comprises the occlusion and a second first-tier partition of the at least two first-tier partitions does not comprise the occlusion.
13. The image processing circuitry of claim 12, wherein a second split of the iterative top-down splitting comprises splitting the first first-tier partition into at least two second-tier partitions, wherein a first second-tier partition of the at least two second-tier partitions comprises the occlusion, a second second-tier partition of the at least two second-tier partitions does not comprise the occlusion, and wherein the first plurality of partitions of the hierarchical grid comprise the second first-tier partition and the second second-tier partition without further subdivision of the second first-tier partition or the second second-tier partition.
14. The image processing circuitry of claim 12, wherein the image processing circuitry is configured to:
- determine fill-data corresponding to the occlusion; and
- generate processed image data in the first space by blending the warped image data and the fill-data.
15. The image processing circuitry of claim 14, wherein the image processing circuitry is configured to determine the fill-data by:
- blending a portion of the warped image data or a portion of the input image data along a perimeter of the occlusion;
- warping second input image data from a third perspective to the first perspective to generate second warped image data and selecting a portion of the second warped image data corresponding to the occlusion as the fill-data;
- estimating the fill-data, via a machine learning algorithm, based on image characteristics surrounding the occlusion; or
- any combination thereof.
16. The image processing circuitry of claim 12, wherein the hierarchical grid comprises a hierarchical quad-tree grid, wherein the at least two first-tier partitions comprise four first-tier partitions.
17. The image processing circuitry of claim 11, wherein the image processing circuitry is configured to generate the warped image data by interpolating pixel values based on relative locations of pixel positions within the hierarchical grid and the mappings.
18. A non-transitory, machine-readable medium comprising instructions, wherein, when executed by one or more processors, the instructions cause the one or more processors to perform operations or to control image processing circuitry to perform the operations, wherein the operations comprise:
- determining image statistics corresponding to input image data, wherein the input image data comprises a first perspective;
- determining a hierarchical grid comprising a first set of grid points that define mappings from the first perspective to a second perspective;
- determining whether to split the hierarchical grid into a first plurality of grid partitions based on the image statistics;
- based on determining to split the hierarchical grid into the first plurality of grid partitions, determining a second set of grid points corresponding to the first plurality of grid partitions based on the first set of grid points; and
- warping the input image data from the first perspective to the second perspective to generate, at least in part, warped image data based on the mappings corresponding to the first set of grid points, the second set of grid points, or both.
19. The non-transitory, machine-readable medium of claim 18, wherein the operations comprise determining to split the hierarchical grid into the first plurality of grid partitions based on determining that the hierarchical grid comprises one or more occluded regions based on the image statistics.
20. The non-transitory, machine-readable medium of claim 19, wherein the operations comprise determining fill-data corresponding to the one or more occluded regions and incorporating the fill-data into the warped image data.
Type: Application
Filed: Apr 10, 2024
Publication Date: Aug 1, 2024
Inventors: Jim C. Chou (San Jose, CA), Jian Zhou (Pleasanton, CA)
Application Number: 18/631,844