Remote sensing imagery accuracy analysis method and apparatus
A method of enhancing a resolution of an image by fusing images includes applying a principal component analysis to a multispectral image to obtain a plurality of principal components, and replacing a first component in the plurality of principal components by a panchromatic image. The method further includes resampling remaining principal components to a resolution of the panchromatic image, and applying an inverse principal analysis to the panchromatic image and the remaining principal components to obtain a fused image of the panchromatic image and the multispectral image.
This application is a divisional application of U.S. patent application Ser. No. 11/279,982, filed Apr. 17, 2006, which claims priority to provisional application Ser. Nos. 60/671,508, 60/671,517 and 60/671,520, all filed on Apr. 15, 2005, the entire contents of each of which are incorporated herein by reference.
STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT
This invention was made with Government support under SBAHQ-03-1-0023 awarded by the U.S. Small Business Administration. The government has certain rights in the invention.
FIELD OF THE INVENTION
The present invention relates generally to image sensing and image treatment methods and systems and more particularly to image accuracy analysis.
BRIEF SUMMARY OF THE INVENTION
An aspect of the present invention is to provide a method of enhancing a resolution of an image by fusing images. The method includes applying a principal component analysis to a multispectral image to obtain a plurality of principal components, and replacing a first component in the plurality of principal components by a panchromatic image. The method further includes resampling remaining principal components to a resolution of the panchromatic image, and applying an inverse principal analysis to the panchromatic image and the remaining principal components to obtain a fused image of the panchromatic image and the multispectral image.
Another aspect of the present invention is to provide a method of pansharpening an image by fusing images. The method includes applying a wavelet-based pansharpening to a plurality of bands in a multispectral image and a panchromatic image to obtain a pansharpened image, and computing quality metrics on the pansharpened image.
A further aspect of the present invention is to provide a method of compressing and decompressing an image. The method includes preprocessing an image, applying a discrete wavelet transform on the preprocessed image to decompose the preprocessed image into a plurality of sub-bands, and applying a quantization to each sub-band in the plurality of sub-bands. The method further includes partitioning the plurality of sub-bands into a plurality of code-blocks, encoding each code-block in the plurality of code-blocks independently to obtain a code-blocks stream, applying a rate control process to the code-blocks stream to obtain a bit-stream; and organizing the bit-stream to obtain a compressed image. The method may further include transforming the compressed image using embedded block decoding to obtain embedded decoded block data, re-composing the embedded decoded block data using an inverse discrete wavelet decomposition process, performing a dequantization by assigning a single quantum value to a range of values to obtain a dequantized data, and performing a decoding process on the dequantized data to substantially reconstruct the image.
Another aspect of the present invention is to provide a method of inserting geolocation into a JP2 file. The method includes inputting a GeoTIFF image file, extracting a GeoTIFF header that contains references to geographic metadata, creating a degenerated GeoTIFF image using the extracted geographic metadata, performing a geographic markup language (GML) conversion, inserting the degenerated GeoTIFF image into a universally unique identifier (UUID) box of the JP2 file, inserting the geographic markup language into an extensible markup language (XML) box of the JP2 file, and compressing the JP2 file using JPEG2000 image compression to obtain a GeoJPEG2000 image file.
Yet another aspect of the present invention is to provide a method for analyzing and estimating horizontal accuracy in imaging systems. The method includes inputting image locations and true locations in x and y directions, calculating a root mean square error in both x and y directions, and computing a horizontal accuracy by using the root mean square error in the x and y directions.
Image quality assessment can play a role in many imaging applications, including in remote sensing applications, where both spatial and spectral variations characterization may be needed. An example of use of image quality metrics is to quantitatively measure a quality of an image that correlates with perceived quality. Image quality metrics can also be used to benchmark different image processing algorithms by comparing objective metrics. Assessment measures can be divided into subjective and objective groups. Subjective measures are obtained according to rating by human observers, where judgment of different viewers may differ. Due to this and many other inherent drawbacks associated with subjective measures, quantitative or objective measures are used. A reliable image quality metric would provide a consistent, accurate and monotonic prediction of a quality of an image.
For example, in remote sensing, multispectral (MS) images are widely used for land cover classification, change detection and many other applications. In many applications, a number of preprocessing steps may be used prior to applying any classification or segmentation algorithm. The preprocessing steps may include resampling, atmospheric correction, sharpening, filtering, etc. In many applications, the effects of the preprocessing can degrade the performance of the segmentation or classification. Hence, assessing a quality of the image may be needed after any such preprocessing step.
Many metrics can be used to evaluate the quality of an image. In an embodiment of the invention, mean square error (MSE) is used to evaluate the quality of an image. MSE refers to the average of sum of squares of the error between two images. The MSE is defined as follows:
$\sigma_{ms}^2 = E\left[\,|u(m,n) - v(m,n)|^2\,\right]$ (1)
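where u(m, n) and v(m, n) are two images of size M×N, indexed by pixel position (m, n), and E denotes the mathematical expectation.
An approximation of the MSE can also be used. The average least square error metric, which is computed as shown in equation (2), is used as an approximation to the MSE:
$\sigma_{ls}^2 = \frac{1}{MN} \sum_{m=1}^{M} \sum_{n=1}^{N} |u(m,n) - v(m,n)|^2$ (2)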
The MSE quantifies the amount of difference in the energy of the two signals. Although the mean square error metric has some limitations when used as a global measure of image quality, it can predict image quality accurately when used as a local measure.
Another metric that may be used to evaluate the quality of an image is the root mean square error (RMSE), which is the square root of the MSE. The RMSE quantifies the average amount of distortion in each pixel of an image. MSE and RMSE give an account of the spectral fidelity of an image.
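The two metrics above can be illustrated with a minimal sketch. It assumes each image band is held as a list of rows of numeric pixel values; the function names mse and rmse are chosen here for illustration only and are not taken from the described toolkit.

```python
import math


def mse(u, v):
    """Average squared pixel difference between two equally sized bands."""
    if len(u) != len(v) or any(len(ru) != len(rv) for ru, rv in zip(u, v)):
        raise ValueError("images must have the same shape")
    total = 0.0
    count = 0
    for row_u, row_v in zip(u, v):
        for a, b in zip(row_u, row_v):
            total += (a - b) ** 2
            count += 1
    return total / count


def rmse(u, v):
    """Square root of the MSE, in the same units as the pixel values."""
    return math.sqrt(mse(u, v))


# Hypothetical 2x2 band before and after a processing step.
original = [[10, 20], [30, 40]]
processed = [[12, 20], [30, 36]]
print(mse(original, processed))   # 5.0
print(rmse(original, processed))
```

Computing the values per band, as in the tables discussed later, amounts to calling these functions once for each band pair.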
Another metric that may be used is the correlation function. The closeness between two images can be quantified in terms of the correlation function. Correlation coefficients range from −1 to +1. A correlation coefficient value of +1 indicates that the two images are highly correlated, i.e., very close to one another. A correlation coefficient of −1 indicates that the two images are exactly opposite to each other. The correlation coefficient is computed as expressed in the following equation:
$r = \frac{\sum_{m}\sum_{n} (A_{mn} - \bar{A})(B_{mn} - \bar{B})}{\sqrt{\left(\sum_{m}\sum_{n} (A_{mn} - \bar{A})^2\right)\left(\sum_{m}\sum_{n} (B_{mn} - \bar{B})^2\right)}}$ (3)
where A and B are the two images between which the correlation is computed, and $\bar{A}$ and $\bar{B}$ are their mean pixel values. Correlation coefficients between each band of the multispectral image before and after a processing step quantify the spectral quality of the image after processing.
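A minimal sketch of the correlation computation follows. It assumes the conventional two-dimensional (Pearson) correlation coefficient and bands stored as lists of rows; the function name correlation is illustrative.

```python
import math


def correlation(a, b):
    """Pearson correlation coefficient between two equally sized bands.

    Raises ZeroDivisionError if either band is constant, since the
    coefficient is undefined in that case.
    """
    xs = [p for row in a for p in row]
    ys = [p for row in b for p in row]
    if len(xs) != len(ys):
        raise ValueError("images must have the same number of pixels")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


band = [[1, 2], [3, 4]]
inverted = [[4, 3], [2, 1]]
print(correlation(band, band))      # identical bands
print(correlation(band, inverted))  # exactly opposite bands
```

The same function can be applied between spectral bands, or between a multispectral band and the panchromatic image once both are on the same grid.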
A further metric that may be used is the mean value of the pixels in a band. The mean value of the pixels in a band of an image is the central value of the distribution of the pixels in that band. A relative shift in the mean value (RM) quantifies the changes in the histogram of the image that may result from processing the image. Changes in the standard deviation of the distribution can also be considered in addition to the shift in mean. The relative shift in the mean value, in percent, can be expressed mathematically as follows:
$RM = \frac{\mu_{\text{processed}} - \mu_{\text{original}}}{\mu_{\text{original}}} \times 100$ (4)
where $\mu_{\text{original}}$ and $\mu_{\text{processed}}$ are the mean pixel values of the band before and after processing.
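As a hedged illustration, the relative shift in mean can be sketched as below. The sketch assumes the shift is the change in band mean relative to the original band mean, expressed in percent, so that a positive value corresponds to a brighter processed band; the function names are illustrative.

```python
def band_mean(band):
    """Mean pixel value of a band stored as a list of rows."""
    pixels = [p for row in band for p in row]
    return sum(pixels) / len(pixels)


def relative_mean_shift(original, processed):
    """Relative shift in the band mean, in percent (assumed definition)."""
    mu_original = band_mean(original)
    mu_processed = band_mean(processed)
    return 100.0 * (mu_processed - mu_original) / mu_original


# Hypothetical band whose pixels all brighten by one count.
before = [[100, 100], [100, 100]]
after = [[101, 101], [101, 101]]
print(relative_mean_shift(before, after))  # 1.0
```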
Another metric that may be of interest for the evaluation of a quality of an image is the entropy. The entropy is defined as the amount of information contained in a signal. The entropy of an image can be evaluated as shown in the following equation:
$H = -\sum_{i=1}^{d} p(d_i) \log_2 p(d_i)$ (5)
where d is the number of gray levels possible and p(di) is the probability of occurrence of a particular gray level di in the image.
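The entropy computation can be sketched as follows, assuming the usual Shannon entropy in bits (logarithm base 2) and estimating each probability from the gray-level histogram of the band. The function name entropy is illustrative.

```python
import math
from collections import Counter


def entropy(band):
    """Shannon entropy, in bits, of the gray-level distribution of a band."""
    pixels = [p for row in band for p in row]
    counts = Counter(pixels)
    n = len(pixels)
    # Gray levels that never occur contribute nothing to the sum.
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


# Two equally likely gray levels carry one bit per pixel.
print(entropy([[0, 0], [255, 255]]))  # 1.0
```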
Yet another metric that can be used is the image noise index (INI). The image noise index (INI) is based on entropy. An image may undergo processing to obtain a processed image. The image may also be reconstructed from the processed image. Hence, a reconstructed image is obtained by going through the reverse process to get the estimate of the original multispectral image from the processed image. Entropy may be defined for each of the images prior to processing (entropy of the original image is x), after processing (the entropy of the processed image is y) and after reconstruction from the processed image (the entropy of the reconstructed image is z).
The value of (y−x) then gives the increased information content of the processed image. This increased information may be useful information, noise or both. The quantity |x−z| is the unwanted information or noise. Hence, the amount of useful information is the difference between these two values:
Signal=(y−x)−|x−z| (6)
The image noise index (INI) is expressed as the ratio between the two values (y−x) and |x−z|, minus 1:
$INI = \frac{y - x}{|x - z|} - 1$ (7)
Therefore, it can be understood that a positive value of INI indicates an improvement in the information content and a negative value of INI indicates degradation of information in the processed image.
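The INI definition stated above, the ratio of (y−x) to |x−z| minus 1, can be sketched directly. The entropies x, y and z are assumed to have been computed beforehand; the function name and the example values are hypothetical.

```python
def image_noise_index(x, y, z):
    """Image noise index from three entropies.

    x: entropy of the original image
    y: entropy of the processed image
    z: entropy of the image reconstructed from the processed image
    """
    noise = abs(x - z)
    if noise == 0:
        raise ZeroDivisionError("INI is undefined when |x - z| is zero")
    return (y - x) / noise - 1.0


# Added information (1.0) exceeds the noise (0.5): positive INI.
print(image_noise_index(4.0, 5.0, 4.5))   # 1.0
# Added information (0.25) is below the noise (0.5): negative INI.
print(image_noise_index(4.0, 4.25, 4.5))  # -0.5
```

Because INI equals the signal term (y−x)−|x−z| divided by |x−z|, its sign matches the sign of the useful information.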
Another metric that may be used is the normalized difference vegetation index (NDVI). NDVI can be used to quantify the strength of a vegetation area. NDVI can be defined as follows:
$NDVI = \frac{NIR - R}{NIR + R}$ (8)
where NIR is a near infrared band pixel value and R is a red band pixel value. The NDVI varies between +1 and −1. A value closer to +1 indicates dense vegetation. An NDVI value very close to zero represents water. NDVI is an important feature that is used to distinguish between many classes. This metric quantifies the variations in the NDVI due to any preprocessing.
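A per-pixel NDVI computation can be sketched as below, using the standard normalized difference of the near infrared and red bands. Returning 0.0 where both bands are zero is a choice made for this sketch, not something stated in the text.

```python
def ndvi(nir, red):
    """Per-pixel NDVI for two equally sized bands stored as lists of rows."""
    result = []
    for row_nir, row_red in zip(nir, red):
        out_row = []
        for n, r in zip(row_nir, row_red):
            denom = n + r
            # Assumption for this sketch: report 0.0 where NIR + R is zero.
            out_row.append((n - r) / denom if denom != 0 else 0.0)
        result.append(out_row)
    return result


# Hypothetical pixels: one with strong NIR response, one with equal bands.
nir_band = [[200, 50]]
red_band = [[50, 50]]
print(ndvi(nir_band, red_band))
```

Comparing the NDVI of a vegetated subset before and after a preprocessing step then shows how much that step has altered the index.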
In an embodiment of the invention, image resampling is performed on an image as an example of a process that can affect the quality of an image. The resampling is applied in order to study its effects on the image quality and to evaluate the image quality using various metrics.
Image resampling is a technique often used in remote sensing applications. Resampling techniques are used to estimate pixel values in between available samples. Many different resampling techniques such as nearest neighbor (NN), linear interpolation and cubic convolution can be used. One objective of resampling techniques is to minimize the residual difference between the actual and predicted pixel values.
In this resampling example, the spectral and spatial quality of the nearest neighbor and cubic convolution methods are compared. In the nearest neighbor method, the intermediate pixel values are replicated from adjacent pixel values. This introduces no spectral distortion, but sharp edges of the image are distorted and spatial artifacts can be observed. The cubic convolution method fits a polynomial of degree three and uses it to compute the intermediate pixel values. This method introduces some spectral distortion, but its spatial quality is better than that of the nearest neighbor method.
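To illustrate why nearest neighbor resampling introduces no spectral distortion, the following sketch upsamples a band by pixel replication. It assumes an integer scale factor and ignores georeferencing, so it is a simplification of resampling a multispectral band onto the panchromatic grid; the function name is illustrative.

```python
def resample_nearest(band, factor):
    """Upsample a band by an integer factor using pixel replication."""
    if factor < 1:
        raise ValueError("factor must be a positive integer")
    out = []
    for row in band:
        # Repeat each pixel horizontally, then repeat the widened row.
        wide = [p for p in row for _ in range(factor)]
        for _ in range(factor):
            out.append(list(wide))
    return out


band = [[1, 2], [3, 4]]
print(resample_nearest(band, 2))
# [[1, 1, 2, 2], [1, 1, 2, 2], [3, 3, 4, 4], [3, 3, 4, 4]]
```

Every output pixel is a copy of an input pixel and each input pixel is copied the same number of times, so the band mean and the histogram shape are unchanged. The blocky replication is also the source of the spatial artifacts at edges.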
The correlation among the spectral bands can also be computed in the original and resampled multispectral images. In most applications, the spectral quality should be preserved after resampling. Hence, the correlations among the spectral bands of the resampled image are expected to match those of the original multispectral bands.
In one example, co-registered subsets of Quickbird multispectral (MS) and panchromatic (PAN) images are considered. The original multispectral image is resampled to the resolution of the panchromatic image. TABLE 1 shows the values of the correlation coefficients computed. The nearest neighbor (NN) method has the same correlation as that of the original multispectral image. This can be explained by the fact that the pixel values are not changed during nearest neighbor (NN) resampling. The cubic convolution method introduces some spectral distortion.
The correlation between the panchromatic image (PAN) and the resampled multispectral bands is also computed. The cubic convolution technique provides higher correlation with the panchromatic image, which indicates that it has good spatial quality compared to the NN resampling technique. TABLE 2 indicates the correlation values computed.
The relative shift in mean of each band is also computed. As expected, the shift for NN resampling is found to be zero. The values computed are shown in TABLE 3.
Image fusion algorithms improve the low spatial resolution of the multispectral images using the spatial information from the corresponding panchromatic image. These image fusion algorithms can be used as a preprocessing step before feature extraction or classification is done on a multispectral image. Image fusion is also called pansharpening. Pansharpening combines spectral information from a multispectral image and spatial information from a panchromatic image into a single fused image. The single image has both high spectral and high spatial resolution. The high spatial and spectral resolutions help to enhance features, provide detailed information about targets or objects in the image and improve classification accuracy. Pansharpening algorithms include intensity-hue-saturation (IHS) sharpening, principal component analysis (PCA) sharpening, Brovey sharpening, multiplicative sharpening and color normalized sharpening. Some of these pansharpening algorithms are available in commercial remote sensing packages like ERDAS, commercialized by Leica Geosystems, and ENVI, commercialized by Research Systems Incorporated.
In an embodiment of the invention, quality metrics are applied to an image fused using the principal component analysis (PCA) method.
In an embodiment of the invention, the MSE and RMSE values are computed between each band of the fused and original multispectral image. The values are shown in TABLE 4.
TABLE 4 shows that the digital number (DN) values in band 4 have changed considerably due to sharpening. The DN values in the other bands are also noticeably distorted. However, the quality of the fused image cannot be judged solely on MSE and/or RMSE, because some of the newly added information might be useful information.
In an embodiment of the invention, the correlation between different band combinations is computed to quantify the spectral and spatial quality of the image. TABLE 5 shows the value of the correlation coefficients computed between the four spectral bands. It is expected that the correlation for the fused bands should be as close as possible to that of the original multispectral bands to ensure preservation of spectral information. In addition, it is expected that the fusion process should not increase the correlation between the spectral bands.
The values in TABLE 5 indicate that there are variations in the spectral information of the fused image. Thus, the fused image does not improve the performance of spectral-based classification. The correlation is also computed between each band of the original multispectral image and the panchromatic image. For this combination, the correlation of the fused image should be higher because the fused image has more spatial information compared to the original multispectral image.
In an embodiment of the invention, correlation values between each band of the MS image and the PAN image are computed for both the original MS image and the fused image. The correlation values are reported in TABLE 6. The correlation values indicate that the fused image has more useful spatial information compared to the original MS image, as there is an increase in the corresponding values. The correlation values are better indicators of the spatial and spectral quality of the image.
In an embodiment of the invention, the relative shift in mean (in %) for each band of the fused image is computed. The relative shift in mean of each band of the fused image helps to visualize the changes in the histogram. A positive shift in mean indicates a shift towards white and a negative shift indicates a shift towards gray. The computed values are shown in TABLE 7. The histogram of band 3 has shifted considerably more than those of the other bands. The histograms are also plotted to aid visual comparison.
In an embodiment of the invention, the image noise index (INI) is computed for each band of the fused image as shown in TABLE 8. The degradation of spectral information in all the bands is indicated by a negative INI value. The information loss or unwanted information is higher in band 4 compared to other bands.
In an embodiment of the invention, NDVI values are computed for a subset of vegetated area.
Therefore, from the above results, it can be seen that PAN dominates the spatial information which is shown from the high correlation of the fused bands with the panchromatic image. This is because the PAN replaces the first principal component, which contains most of the spectral information. In addition, the spectral information is lost compared to the multispectral image, which is reflected in MSE, RMSE, INI and relative shift in mean.
The quality of an image may not be predicted accurately by considering only one of the metrics discussed above. In an embodiment of the invention, a combination of at least two metrics allows the quality of the image to be evaluated with more precision. The combination of metrics selected can vary based on the type of preprocessing and the application of the image.
In another embodiment, a pansharpening and image quality interface (PSIQI) application incorporating a wavelet-based pansharpening is implemented. The PSIQI application is applied on images to sharpen images and/or compute quality metrics for sharpened data. Before inputting image data, i.e. multispectral image data and panchromatic image data, the image data is co-registered. A user is able to specify a location of the multispectral image and corresponding panchromatic image.
The PSIQI application performs a wavelet-based sharpening on the specified image data sets. Image size and number of bits per pixel control the block size. Block processing can be used for efficient memory handling and for increasing the speed. The quality metrics chosen are then computed over the entire data set and stored in a file, for example a text file. The PSIQI application can be used in two modes of operation, a sharpening mode and a quality metric mode. The sharpening mode is used to sharpen the data using a wavelet-based method and compute the quality metrics. The quality metric mode is used only to compute metrics on data sharpened using other methods. A user may select a band to be sharpened. In addition, the application provides tunable sharpening by allowing the user to select different mother wavelets, by enabling initial and final histogram matching steps and by enabling filtering using a selected filter such as a Wiener filter. In the quality metric mode, the tunable options are switched off and the user can only choose bands that exist in the sharpened data.
In an embodiment of the invention, the PSIQI application is used to sharpen a co-registered IKONOS image set in the sharpening mode. Bands 1, 2, and 3 of the data are sharpened using the wavelet-based method, and quality metrics such as the mean square error (MSE), root mean square error (RMSE), correlation metrics, and relative shift in the mean are computed. In an embodiment of the invention, a bi-orthogonal 4.4 wavelet is used as the mother wavelet, for example. In an embodiment of the invention, initial and final histogram matching are applied. In a further embodiment of the invention, a Wiener filter is applied on the sharpened data to remove noise due to sharpening.
As shown in TABLE 9, the average change in the pixel value of band 3 is slightly higher when compared with the other bands. In addition, the mean of band 3 is shifted by 0.13%.
The spectral quality of the sharpened image can be ascertained by comparing the correlation between each band in the image before and after sharpening. The correlation values computed are shown in TABLE 10. These correlation values indicate a slight variation in the spectral information in the sharpened data and an increase in spatial information.
As stated above, image quality evaluation methods are used on image data. The image data can be large. For example, the size of the multi-resolution Quickbird GeoTIFF image used in this study is 380 MB. Therefore, compression of the images may be needed for storage and transmission, to save storage space and bandwidth and to lower transmission times. Image compression can be performed in either a lossy or a lossless fashion. Lossless compression may be desirable in critical situations where any loss in image data and quality may lead to erroneous analysis. However, in various other applications lossy compression may be preferred, as it provides a high compression ratio that results in smaller image sizes. The trade-off with lossy compression is that as the compression ratio increases, more spatial and spectral features of the image can be lost. Hence, it may be worthwhile to analyze the impact of image compression on image quality.
In an embodiment of the invention, a JPEG2000 encoding and decoding process is used to compress and decompress images. JPEG2000 uses a wavelet transformation, as opposed to its predecessor JPEG, which uses the discrete cosine transformation (DCT). Wavelet transform-based image compression algorithms allow images to be retained with less distortion or loss than JPEG, and hence are recognized as a superior method. The JPEG2000 encoding and decoding process includes a JPEG2000 encoding process and a JPEG2000 decoding process. The encoding and decoding processes are divided into several stages as can be seen from
Many formats for the images can be used, including TIFF (Tagged Image File Format), which is used to store and transfer digital satellite imagery, scanned aerial photos, elevation models, scanned maps or the results of many types of geographic analysis. TIFF is the only full-featured raster file format in the public domain capable of supporting compression, tiling, and extension to include geographic metadata. The main strengths of TIFF are that it is a highly flexible and platform-independent format and that it is supported by numerous image-processing applications. Another image format is the GeoTIFF format. A GeoTIFF file is a TIFF 6.0 file and inherits the file structure as described in the corresponding portion of the TIFF specifications. GeoTIFF uses a small set of reserved TIFF tags to store a broad range of georeferencing information, catering to the needs of geographic as well as projected coordinate systems. The geographic data can then be used to position the image in the correct location and geometry on the screen of a geographic information display.
There are two approaches for implementing geo-referencing data in JP2 files, each of which inserts the data into one of two boxes. One approach uses the UUID (Universally Unique Identifier) box, which provides a tool by which vendors can add additional data to a file without risking conflict with other vendors. The other approach uses the XML (eXtensible Markup Language) box, which provides a tool for vendors to add XML formatted information to a JP2 file. Since a UUID box and an XML box can be used to add vendor-specific information, two open standards can be used, each making use of either the UUID box or the XML box. For example, in GeoTIFF, a UUID box termed the GeoTIFF box contains a specified UUID and a degenerated GeoTIFF file. Here, degenerated means a valid GeoTIFF file excluding image information. A UUID represents a 128-bit unique value, and the UUID for the box is static unsigned char geotiff_box[16]={0xb1, 0x4b, 0xf8, 0xbd, 0x08, 0x3d, 0x4b, 0x43, 0xa5, 0xae, 0x8c, 0xd7, 0xd5, 0xa6, 0xce, 0x03}. The UUID box, i.e., the GeoTIFF box, contains a valid GeoTIFF file which holds the geo-referencing information about the file being compressed and a very simple image with the following constraints: image height and width are both 1 pixel, the datatype is 8-bit, the color space is grayscale, and the (single) pixel must have a value of 0 for its (single) sample. The intent of containing a valid GeoTIFF file is that any compliant GeoTIFF reader or writer would be able to read or write the image.
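The layout of such a UUID box can be sketched as follows. The sketch relies on the general JP2 box layout of a 4-byte big-endian length, the 4-byte box type 'uuid', the 16-byte identifier, and then the payload. The function name build_uuid_box is hypothetical, and the payload used here is a placeholder standing in for the bytes of a degenerated GeoTIFF file, which this sketch does not construct.

```python
import struct

# The 16-byte identifier of the GeoTIFF box, as listed in the text.
GEOTIFF_BOX_UUID = bytes([0xb1, 0x4b, 0xf8, 0xbd, 0x08, 0x3d, 0x4b, 0x43,
                          0xa5, 0xae, 0x8c, 0xd7, 0xd5, 0xa6, 0xce, 0x03])


def build_uuid_box(uuid16, payload):
    """Serialize a JP2 UUID box: length, type 'uuid', 16-byte ID, payload."""
    if len(uuid16) != 16:
        raise ValueError("a UUID must be exactly 16 bytes (128 bits)")
    # The length field counts the 8-byte header, the UUID and the payload.
    length = 8 + 16 + len(payload)
    return struct.pack(">I4s", length, b"uuid") + uuid16 + payload


# Placeholder payload; a real box would carry the degenerated GeoTIFF bytes.
placeholder_geotiff = b"\x00" * 10
box = build_uuid_box(GEOTIFF_BOX_UUID, placeholder_geotiff)
print(len(box))  # 34
```

A reader that does not recognize the identifier can skip the box using the length field, which is what lets vendors add data without conflicting with one another.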
Another format that can be used in a JP2 file is Geographic Markup Language (GML). GML is an XML-based encoding standard for geographic information developed by the OpenGIS Consortium (OGC). In this approach, geo-location information coded in GML is stored in a non-proprietary way within a JPEG2000 XML box. For example, the JPEG2000_GeoLocation in GML given below refers to a JP2 file with an EPSG code of 32610 (PCS_WGS84_UTM_zone_10N), origin 631333.108344E, 4279994.858126N, a cell size of X=4 and Y=4, and a rotation of 0.0.
A JPEG2000_GeoLocation XML element containing a RectifiedGrid construct contains the geographic information. The RectifiedGrid includes an ID of “JPEG2000_GeoLocation_1” with a dimension equal to 2. The origin element is also included and is given an id of “JPEG2000_Origin.” The Point specifies the coordinate of the bottom-left corner of the bottom-left cell in the image. The srsName can be an immediate EPSG code. However, if an existing EPSG code is not available, the srsName refers to a full SpatialReferenceSystem element definition within the same JP2 XML box. A pair of offsetVector elements define vertical and horizontal cell “step” vectors, and may include a rotation. A conformant reader is usually set to ignore all other elements within the JPEG2000_GeoLocation element.
In an embodiment of the invention, a toolkit for image compression and metadata insertion is developed using Java version J2SE 1.4.2. Object-oriented interfaces for manipulating different formats of images can be provided by various vendors. Examples of APIs that are used are Java Advanced Imaging (JAI), Java Image I/O, LuraWave.jp2 Java and GeoTIFF-JAI. JAI is a cross-platform, flexible, extensible toolkit for adding advanced image processing capabilities to applications for the Java platform. The Java Image I/O API provides a pluggable architecture for working with images stored in files. It offers substantially more flexibility and power than the previously available APIs for loading and saving images. LuraWave.jp2 JAVA/JNI-SDK for Windows (demo version) is a part of the LuraWave.jp2 image compression software family, is based on Algo Vision LuraTech's implementation of the JPEG2000 image compression standard, and is fully compliant with Part 1 of the JPEG2000 International Standard. GeoTIFF-JAI is a “geotiff” extension to the Java Advanced Imaging component and is an open source interface developed by Niles Ritter.
In an embodiment of the invention, a front end and the code to compute image quality metrics are developed using Matlab 6.5.1, release 13. Matlab provides a Java interface to access classes written in Java and call an object's methods.
Image quality metrics are figures of merit used for the evaluation of imaging systems or processes. The image quality metrics can be broadly classified into two categories, subjective and objective. In objective measures of image quality metrics, some statistical indices are calculated to indicate the reconstructed image quality. The image quality metrics provide some measure of closeness between two digital images by exploiting the differences in the statistical distribution of pixel values. Examples of error metrics used for comparing compression are Mean Square Error and Peak Signal to Noise Ratio (PSNR). The MSE, RMSE and correlation metrics are described in the above paragraphs.
Peak Signal to Noise Ratio (PSNR) estimates the quality of a reconstructed image compared with an original image and is a standard way to measure image fidelity. The ‘signal’ is the original image and the ‘noise’ is the error in the reconstructed image due to compression and decompression. PSNR is a single number that reflects the quality of the reconstructed image and is measured in decibels (dB):
where S is the maximum pixel value and RMSE is the root mean square error of the image. The actual value of PSNR is not meaningful by itself, but comparing the PSNR values of different reconstructed images gives one measure of quality. Because PSNR is inversely related to MSE and RMSE, a low value of MSE or RMSE translates to a high value of PSNR. This implies that a higher value of PSNR is better.
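A minimal sketch of a PSNR computation is given below. It assumes the conventional definition, 20·log10(S/RMSE), which may differ from the exact expression used to produce the tabulated values; the function name psnr is illustrative.

```python
import math


def psnr(max_value, rmse_value):
    """PSNR in decibels, assuming the conventional 20*log10(S / RMSE)."""
    if rmse_value == 0:
        # Perfect reconstruction, as with lossless compression.
        return math.inf
    return 20.0 * math.log10(max_value / rmse_value)


# For 8-bit data (S = 255), halving the RMSE raises PSNR by about 6 dB.
print(psnr(255, 2.0))
print(psnr(255, 1.0))
print(psnr(255, 0.0))  # inf
```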
In an embodiment of the invention, geo-referencing metadata is inserted into a JPEG2000 (jp2) file during compression, for example using the method illustrated in
In an embodiment of the invention, the reversible compressions are performed at different ratios on a test image and the JPEG2000 file is decompressed back to the TIFF file format. The quality metrics are then calculated to compare the original and the reconstructed images. The test image is a 1024×1024 pixel subset of a Quickbird multispectral image of the Memphis, Tenn. area. The image is compressed at various compression ratios and then decompressed using the JPEG2000 method, and the quality metrics of the reconstructed image are computed using the original image as a benchmark.
A visual comparison of the reconstructed images with the original image shows that the reconstructed images lose some fine details as the compression ratio is increased. The original image and the reconstructed image compressed at a ratio of 30 are indistinguishable on visual inspection. However, the differences become more pronounced with each successive increase in compression ratio.
Similarly, when the above images are zoomed such that details are more visible, difference in various pixels can be seen as the encoding ratio increases.
Even though the images appear similar to the naked eye, quality metrics show that the images are being distorted, as can be seen from the MSE and RMSE of the images provided in TABLE 11 and the PSNR provided in TABLE 12. Similarly, the correlation coefficient is provided in TABLE 13.
As expected, the MSE and RMSE are equal to 0 and the PSNR is infinite when lossless compression is performed. Lossless compression reduces the size of the image by around a factor of 2. Therefore, a lossy compression ratio of 2 performs as well as lossless compression. As the encoding ratio increases, the MSE and RMSE values also increase, implying that the distortion in the image increases as the compressed image gets smaller in size, which agrees with theoretical expectations.
Another interesting fact observed is that the fourth band (near Infrared) had the maximum values of MSE and RMSE, which is also understandable as that band contains larger pixel values and therefore is further distorted when compared to the other bands.
Similarly, PSNR values decrease as the compression ratio increases. From a compression ratio of 0.33 to 0.005, the PSNR in band 1 ranges from 197.43 dB to 161.1 dB, which is a range of 36.33 dB. Furthermore, it can be seen that the PSNR value decreases most in the fourth band.
However, an interesting fact observed is that the correlation between the different bands of images does not change by much even when the compression is performed at a ratio of 1:200. As can be seen from
In the following paragraphs, a method and system for analyzing and estimating horizontal accuracy in imaging systems such as mapping and the like is described.
Mechanical limitations of the instrument, sensor position and orientation, curvature of the earth, and unforeseen human errors are some of the sources of inaccuracy that are usually encountered in mapping (e.g., geospatial mapping) or imaging processes. One such spatial discrepancy is the horizontal positional inaccuracy of the remotely acquired image. Due to the aforementioned sources of error, the horizontal positional information of an object obtained from a remotely acquired image may deviate from its true real-world measurement. Although some of the potential causes of spatial errors can be substantially eliminated or reduced, estimation and/or evaluation of horizontal inaccuracies may be needed to assess the reliability of the information retrieved from the image.
The horizontal positional error of an object can be represented by a random variable pair (x, y). The random variables x and y correspond to the error encountered in the X (longitude) and Y (latitude) directions, respectively. The error can be considered as a deviation of the measured values from the true values. The two random variables can be assumed to be independent, with a Gaussian distribution and zero mean. The joint probability density distribution for these random variables (x, y) is given by the following equation:
p(x, y)=[1/(2πσxσy)] exp{−(1/2)[(x/σx)²+(y/σy)²]} (10)
By rearranging equation (10), equation (11) is obtained.
(x/σx)²+(y/σy)²=−2 ln[2πσxσy p(x, y)] (11)
As can be observed from equation (11), for a given value (x, y) the probability density function represents the square of the radius of a circle, assuming that the standard deviations (σx and σy) in both dimensions are equal. The probability for an error random variable pair (x, y) to be contained within a circle of radius R can be defined by the circular error probability function P(R). The circular error probability function can be derived from equation (11). A condensed form for P(R) for the case when σx and σy are equal is given by the following equation:
P(R)=1−exp[−R²/(2σc²)] (12)
where σx=σy=σc and R is the radial distance.
The National Map Accuracy Standard (NMAS) specifies that 90% of well-defined points in an image or map should fall within a certain radial distance R. Therefore, substituting the left hand side of equation (12) with 0.90 yields the horizontal accuracy standard as specified by NMAS which is given by the following equation:
CE90=2.1460σc (13)
where σx=σy=σc.
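The factor 2.1460 in equation (13) can be checked numerically. The sketch below takes P(R) = 1 − exp(−R²/(2σc²)), the usual circular normal form, as its assumption for equation (12), and solves it for R at a given probability. The function name is hypothetical.

```python
import math


def circular_error_radius(sigma_c, probability=0.90):
    """Radius R such that P(R) = probability.

    Assumes P(R) = 1 - exp(-R^2 / (2 * sigma_c^2)), so that
    R = sigma_c * sqrt(-2 * ln(1 - probability)).
    """
    if not 0.0 < probability < 1.0:
        raise ValueError("probability must lie strictly between 0 and 1")
    if sigma_c < 0:
        raise ValueError("sigma_c must be non-negative")
    return sigma_c * math.sqrt(-2.0 * math.log(1.0 - probability))


# With sigma_c = 1 the radius is the multiplier that appears in equation (13).
factor = circular_error_radius(1.0)
print(f"{factor:.4f}")  # 2.1460
```

Under that assumption the 90% radius equals sqrt(2 ln 10)·σc, which rounds to the coefficient stated in equation (13).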
The calculation for σx is shown below:
σx=sqrt[(1/n) Σ(ximage−xrealworld)²] (14)
where ximage and xrealworld are the coordinates of the control points measured from the image and the real world, respectively, and n is the number of such control points. σy is calculated similarly to equation (14):
σy=sqrt[(1/n) Σ(yimage−yrealworld)²] (15)
where yimage and yrealworld are the coordinates of the control points measured from the image and real world respectively, and n is the number of such control points.
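As a sketch of the per-axis calculation described above, the following assumes the root-mean-square form with division by n, the number of control points. The function name and the coordinate values are hypothetical.

```python
import math


def axis_rmse(image_coords, real_coords):
    """Root mean square deviation between image and real-world coordinates.

    Assumes sigma = sqrt((1/n) * sum((image - real)^2)) over n control points.
    """
    n = len(image_coords)
    if n == 0 or n != len(real_coords):
        raise ValueError("need the same non-zero number of control points")
    squared = sum((i - r) ** 2 for i, r in zip(image_coords, real_coords))
    return math.sqrt(squared / n)


# Hypothetical control-point coordinates along one axis, in map units.
x_image = [10.0, 20.0, 30.0, 40.0]
x_real = [9.0, 22.0, 30.0, 38.0]

sigma_x = axis_rmse(x_image, x_real)
print(sigma_x)  # 1.5
```

The same function applies to the Y coordinates to obtain σy.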
For cases where σx and σy are not equal, the error distribution is elliptical rather than truly circular. Even so, it can be shown that a circular Gaussian distribution can still be substituted for an elliptical distribution for certain σmin/σmax ratios, where σmin is the minimum value between σx and σy, and σmax is the maximum value between σx and σy.
For cases where σx and σy are not equal and the σmin/σmax ratio is between 0.6 and 1.0, it can be shown that σc is estimated by a linear combination of σx and σy as given by the following equation:
σc=0.5222σmin+0.4778σmax (16)
where σmin is the minimum value between σx and σy, and σmax is the maximum value between σx and σy. A further approximation of equation (16) is given in equation (17), which is adopted by the NSSDA (Federal Geographic Data Committee 1998) as the United States standard for spatial data:
σc=0.5(σmin+σmax) (17)
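The selection between equations (16) and (17) and the resulting CE90 value can be sketched as follows. Only the 0.6 to 1.0 ratio range is handled, because the statistical data needed for smaller ratios is not reproduced in this text. The function names and the `use_mean` flag are hypothetical.

```python
CE90_FACTOR = 2.1460  # coefficient from equation (13)


def circular_sigma(sigma_x, sigma_y, use_mean=False):
    """Estimate sigma_c when the sigma_min/sigma_max ratio is in [0.6, 1.0].

    use_mean=False applies equation (16); use_mean=True applies the
    simpler average of equation (17).
    """
    s_min, s_max = sorted((sigma_x, sigma_y))
    if s_min < 0 or s_max <= 0:
        raise ValueError("standard deviations must be positive")
    ratio = s_min / s_max
    if ratio < 0.6:
        # Smaller ratios need interpolation from tabulated data (not shown).
        raise ValueError("ratio below 0.6 is outside the range of this sketch")
    if use_mean:
        return 0.5 * (s_min + s_max)
    return 0.5222 * s_min + 0.4778 * s_max


def ce90(sigma_x, sigma_y, use_mean=False):
    """Horizontal accuracy at the 90% level, per equation (13)."""
    return CE90_FACTOR * circular_sigma(sigma_x, sigma_y, use_mean)


print(circular_sigma(0.8, 1.0))            # equation (16)
print(circular_sigma(0.8, 1.0, True))      # equation (17)
print(ce90(1.0, 1.0))
```

When σx and σy are equal, the two coefficients of equation (16) sum to 1, so both estimates reduce to the common value.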
For cases where σx and σy are not equal and the σmin/σmax ratio is between 0.2 and 0.6, σc is estimated using an interpolated value from statistical data that relates
A computer algorithm (CE90 TOOLKIT version 1.0) is developed to allow a user to automate circular error distribution analysis procedures. In an embodiment of the invention, the computer code is written in Matlab. However, it must be appreciated that other computer languages and/or computer mathematical packages may be used.
The coordinates for the ground control points (GCP), which are obtained from the remotely acquired image and measured using the global positioning system (GPS), can be loaded into the toolkit (code) as input files or through data entry forms.
In an embodiment of the invention, the CE90 toolkit includes a graphical user interface (GUI). The graphical user interface is shown in
ratio is also calculated and displayed on the user interface (RMSEmin/RMSEmax in
In an embodiment of the invention, the graphical user interface is configured to interactively tie a GPS point to a point in the imagery. The results may be stored in a simple text file or other file formats. The values are computed based on the points interactively chosen.
The images are automatically stretched to enhance contrast and make the image data easier to visualize. This functionality is added for display purposes only.
In an embodiment of the invention, to perform an error distribution analysis, a user can choose options from the tools pull-down menu. The tools pull-down menu allows the user to select radial plot or vector plot options as shown in
An example case is shown in
ratio is found to be between 0.6 and 1.0 and hence the corresponding strategy is adopted in the estimation of σc. The circle in
In addition, in an embodiment of the invention, the CE90 graphical user interface allows the user to display the offset vector plot, which represents the magnitude and direction of the error random variables (x, y). This is done by choosing the offset plot from the tools pull-down menu. The user can also input an appropriate scale value to make the error magnitude and directions more visible.
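The quantities behind an offset vector plot can be sketched as below. The sketch assumes the offset is the image coordinate minus the GPS coordinate and that direction is measured counterclockwise from the positive X axis; neither convention is stated in the text. The function name and the sample points are hypothetical.

```python
import math


def offset_vectors(image_points, gps_points, scale=1.0):
    """Return (dx, dy, magnitude, direction_deg) for each control point.

    dx and dy are multiplied by `scale` for display only; the magnitude
    and direction describe the unscaled error.
    """
    if len(image_points) != len(gps_points):
        raise ValueError("point lists must have the same length")
    vectors = []
    for (xi, yi), (xg, yg) in zip(image_points, gps_points):
        dx, dy = xi - xg, yi - yg
        magnitude = math.hypot(dx, dy)
        direction = math.degrees(math.atan2(dy, dx))
        vectors.append((dx * scale, dy * scale, magnitude, direction))
    return vectors


# Hypothetical control points: one displaced by (3, 4), one with no error.
image_pts = [(103.0, 204.0), (50.0, 60.0)]
gps_pts = [(100.0, 200.0), (50.0, 60.0)]

for vector in offset_vectors(image_pts, gps_pts, scale=10.0):
    print(vector)
```

Scaling only the plotted components keeps small errors visible without altering the reported error magnitude.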
While various embodiments of the present invention have been described above, it should be understood that they have been presented by way of example, and not limitation. It will be apparent to persons skilled in the relevant art(s) that various changes in form and detail can be made therein without departing from the spirit and scope of the present invention. In fact, after reading the above description, it will be apparent to one skilled in the relevant art(s) how to implement the invention in alternative embodiments. Thus, the present invention should not be limited by any of the above-described exemplary embodiments.
Moreover, the methods and systems of the present invention, like related systems and methods used in the imaging arts, are complex in nature and are often best practiced by empirically determining the appropriate values of the operating parameters, or by conducting computer simulations to arrive at the best design for a given application. Accordingly, all suitable modifications, combinations and equivalents should be considered as falling within the spirit and scope of the invention.
In addition, it should be understood that the figures are presented for example purposes only. The architecture of the present invention is sufficiently flexible and configurable, such that it may be utilized in ways other than that shown in the accompanying figures.
Further, the purpose of the Abstract of the Disclosure is to enable the U.S. Patent and Trademark Office and the public generally, and especially the scientists, engineers and practitioners in the art who are not familiar with patent or legal terms or phraseology, to determine quickly from a cursory inspection the nature and essence of the technical disclosure of the application. The Abstract of the Disclosure is not intended to be limiting as to the scope of the present invention in any way.
Claims
1. A method of evaluating the effects of image manipulation, such as sharpening and compressing an original image, by comparing the original image to the manipulated image results through use of a collection of quality metrics, the method comprising:
- applying a principal component analysis to a multispectral image to obtain a plurality of principal components;
- replacing a first component in the plurality of principal components by a panchromatic image;
- resampling remaining principal components to a resolution of the panchromatic image; and
- applying an inverse principal analysis to the panchromatic image and the remaining principal components to obtain a fused image of the panchromatic image and the multispectral image.
2. The method of claim 1, further comprising computing quality metrics on the fused image.
3. The method of claim 2, wherein the computing of the quality metrics comprises computing a mean square error value between a band of the fused image and a band of the multispectral image.
4. The method of claim 2, wherein the computing of the quality metrics comprises computing a root mean square value between a band of the fused image and a band of the multispectral image.
5. The method of claim 2, wherein the computing of the quality metrics comprises computing a correlation between a first band in the multispectral image and a second band in the multispectral image and between a first band in the fused image and a second band in the fused image.
6. The method of claim 2, wherein the computing of the quality metrics comprises computing a correlation between a band in the multispectral image and the panchromatic image and between a band in the fused image and the panchromatic image.
7. The method of claim 2, wherein the computing of the quality metrics comprises computing a relative shift mean for each band of the fused image.
8. The method of claim 2, wherein the computing of the quality metrics comprises computing histograms of bands in the multispectral image and computing histograms of bands in the fused image.
9. The method of claim 8, further comprising comparing between a histogram of a band in the multispectral image and a histogram of a band in the fused image.
10. The method of claim 2, wherein the computing of the quality metrics comprises computing an image noise index for each band of the fused image.
11. The method of claim 10, wherein a negative value of the image noise index for a band corresponds to a degradation of spectral information for the band.
12. The method of claim 2, wherein the computing of the quality metrics comprises computing a normalized difference vegetation index (NDVI) for the fused image and computing a normalized difference vegetation index for the multispectral image and correlating the normalized difference vegetation index for the fused image and the normalized difference vegetation index for the multispectral image.
13. A method of evaluating the effects of image manipulation, such as sharpening and compressing an original image, by comparing the original image to the manipulated image results through use of a collection of quality metrics, the method comprising:
- applying a wavelet-based pansharpening to a plurality of bands in a multispectral image and a panchromatic image to obtain a pansharpened image; and
- computing quality metrics on the pansharpened image.
14. The method of claim 13, wherein the applying of the wavelet-based pansharpening comprises using a bi-orthogonal mother wavelet.
15. The method of claim 13, further comprising applying filtering on the pansharpened image to remove noise.
16. The method of claim 15, wherein applying the filtering comprises applying a Wiener filter on the pansharpened image.
17. The method of claim 13, wherein the computing of the quality metrics comprises computing a root mean square value for each band of the pansharpened image.
18. The method of claim 13, wherein the computing of the quality metrics comprises computing a correlation between a first band in the multispectral image and a second band in the multispectral image and between a first band in the fused image and a second band in the pansharpened image.
19. The method of claim 13, wherein the computing of the quality metrics comprises computing a correlation between a band in the multispectral image and the panchromatic image and between a band in the fused image and the panchromatic image.
20. The method of claim 13, wherein the computing of the quality metrics comprises computing a relative shift mean for each band of the pansharpened image.
21. A method of evaluating the effects of image manipulation, such as sharpening and compressing an original image, by comparing the original image to the manipulated image results through use of a collection of quality metrics, the method comprising:
- preprocessing an image;
- applying a discrete wavelet transform on the preprocessed image to decompose the preprocessed image into a plurality of sub-bands;
- applying a quantization to each sub-band in the plurality of sub-bands;
- partitioning the plurality of sub-bands into a plurality of code-blocks;
- encoding each code-block in the plurality of code-blocks independently to obtain a code-blocks stream;
- applying a rate control process to the code-blocks stream to obtain a bit-stream; and
- organizing the bit-stream to obtain a compressed image.
22. The method of claim 21, further comprising:
- transforming the compressed image using embedded block decoding to obtain embedded decoded block data;
- re-composing the embedded decoded block data using an inverse discrete wavelet decomposition process;
- performing a dequantization by assigning a single quantum value to a range of values to obtain a dequantized data; and
- performing a decoding process on the dequantized data to substantially reconstruct the image.
23. The method of claim 22, wherein the image has a tagged image file format (TIFF).
24. The method of claim 22, wherein the image has a GeoTIFF format.
25. The method of claim 22, wherein the applying of the discrete wavelet transform on the preprocessed image comprises decomposing each preprocessed image tile in a plurality of preprocessed image tiles into a high and low sub-bands of the preprocessed image tile with a low-pass filter and a high-pass filter.
26. The method of claim 22, wherein the applying of the quantization to each sub-band in the plurality of sub-bands comprises assigning a range of values to a single quantum value in each sub-band.
27. The method of claim 22, wherein the partitioning of the plurality of sub-bands into the plurality of code-blocks comprises partitioning the plurality of sub-bands into the plurality of code-blocks such that the code-blocks from each sub-band have substantially a same size.
28. A method of evaluating the effects of image manipulation, such as sharpening and compressing an original image, by comparing the original image to the manipulated image results through use of a collection of quality metrics, the method comprising:
- inputting a GeoTIFF image file;
- extracting a GeoTIFF header that contains references to geographic metadata;
- creating a degenerated GeoTIFF image using the extracted geographic metadata;
- performing a geographic markup language (GML) conversion;
- inserting the degenerated GeoTIFF image into a universally unique identifier (UUID) box of the JP2 file;
- inserting the geographic markup language into an extensible markup language (XML) box of the JP2 file; and
- compressing the JP2 file using JP2000 image compression to obtain a GeoJPEG2000 image file.
29. The method of claim 28, wherein the compressing using the JP2000 image compression comprises compressing with LuraWave.jp2 image compression code, JP2 Java/JNI-SDK or GeoTIFF-JAI.
30. The method of claim 28, wherein the compressing using the JP2000 image compression comprises using a compression code developed using java.
31. The method of claim 28, further comprising:
- decompressing the GeoJP2000 image file to obtain a decompressed TIFF image file; and
- computing quality metrics to compare the GeoTIFF image and the decompressed TIFF image file.
32. The method of claim 31, wherein the compressing of the JP2 file comprises compressing the JP2 file at a plurality of compression ratios.
33. The method of claim 32, wherein the computing of the quality metrics comprises computing a mean square error value for each band of the decompressed image file at each compression ratio in the plurality of compression ratios.
34. The method of claim 33, wherein as the compression ratio increases the mean square error value increases for each band.
35. The method of claim 32, wherein the computing of the quality metrics comprises computing a root mean square value for each band of the decompressed image file at each compression ratio in the plurality of compression ratios.
36. The method of claim 35, wherein as the compression ratio increases the root mean square error value increases for each band.
37. The method of claim 32, wherein the computing of the quality metrics comprises computing a peak signal to noise ratio (PSNR) for each band of the decompressed image file at each compression ratio in the plurality of compression ratios.
38. The method of claim 37, wherein as the compression ratio increases the peak signal to noise ratio decreases for each band.
39. The method of claim 32, wherein the computing of the quality metrics comprises computing a correlation between a first band and a second band in the decompressed image file at each compression ratio in the plurality of compression ratios.
40. The method of claim 39, wherein as the compression ratio increases the correlation between the first and second bands remains substantially constant.
Type: Application
Filed: Jun 7, 2010
Publication Date: Dec 16, 2010
Inventors: Charles G. O'Hara (Columbus, MS), Anil Cheriyadat (Troy, NY), Suyoung Seo (Starkville, MS), Bijay Shrestha (Starkville, MS), Veeraraghavan Vijayaraj (Knoxville, TN), Nicolas H. Younan (Starkville, MS)
Application Number: 12/802,448
International Classification: G06K 9/00 (20060101); G06K 9/40 (20060101);