Image compression

A method of image compression with non-redundant complex wavelet transforms applied using a triband decomposition. Variant transforms for real and complex inputs allow for elimination of redundancy.

Skip to: Description  ·  Claims  · Patent History  ·  Patent History
Description
CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This application claims prioritv from the following provisional patent application: application Ser. No.: 60/428,422, filed Nov. 22, 2002.

BACKGROUND OF THE INVENTION

[0002] This invention relates to image compression, and more particularly, to image and video compression methods and devices.

[0003] Recently, Digital Still Cameras (DSCs) have become a very popular consumer appliance appealing to a wide variety of users ranging from photo hobbyists, web developers, real estate agents, insurance adjusters, photo-journalists to everyday photography enthusiasts. Recent advances in large resolution CCD arrays coupled with the availability of low-power digital signal processors (DSPs) has led to the development of DSCs that have the resolution and quality offered by traditional film cameras. These DSCs offer several additional advantages compared to traditional film cameras in terms of data storage, manipulation, and transmission. The digital representation of captured images enables the user to easily incorporate the images into any type of electronic media and transmit them over any type of network; see FIG. 10. The ability to instantly view and selectively store captured images provides the flexibility to minimize film waste and instantly determine if the image needs to be captured again. With its digital representation the image can be corrected, altered, or modified after its capture, and stored on memory cards for battery-powered cameras.

[0004] Further, DSCs can be extended to capture video clips (short video sequences) and to compress (sequences of) images with methods such as JPEG or JPEG2000. FIGS. 9a-9b depict functions and blocks of a digital camera system with the image compression block providing JPEG, JPEG2000, and/or other compressions. JPEG provides compression by transforming 8×8 blocks of pixels into the frequency domain with an 8×8 DCT (discrete cosine transform) and then quantizing the DCT coefficient blocks, scanning the 8×8 quantized coefficients into a one-dimensional sequence, and variable length coding (VLC) the sequence.

[0005] In contrast to JPEG, JPEG2000 uses wavelet decomposition with both lossy and lossless compression enables progressive transmission by resolution (which can generate a small image from the code for the full size image), and facilitates scalable video with respect to resolution, bit-rate, color component, or position with transcoding by using Motion JPEG2000. Indeed, FIGS. 11a-11b illustrate JPEG2000 image analysis with a three-level wavelet decomposition indicated in the lower right portion of FIG. 11b. Also, Christopoulos et al, The JPEG2000 Still Image Coding System: an Overview, 46 IEEE Tran.Cons.Elect. 1103 (2000).

[0006] However, the real wavelet transforms used in JPEG2000 suffer from three shortcomings: (i) lack of shift invariance, (ii) lack of directionality, and (iii) lack of explicit phase information. Complex wavelet transforms, in which the real and imaginary parts of the transform coefficients are an approximate Hilbert-transform pair, offer solutions to these three shortcomings. This enables efficient statistical models for the coefficients that are also geometrically meaningful. Indeed, there are distinct relationships between complex coefficient magnitudes and phases, and edge orientations and positions, respectively. These relationships allows development of an effective hidden Markov tree model for the complex wavelet coefficients; for example see Choi et al, Hidden Markov Tree Modeling of Complex Wavelet Transforms, 2000 IEEE ICASSP 133. Unfortunately, the success of geometric modeling in complex wavelet coefficients has been limited to the class of redundant, or over-complete, complex transforms. This redundancy complicates any application to problems such as image/video compression for DSCs and for wireless-linked Internet transmission where parsimonious signal representations are critical.

[0007] To address the redundancy problem, Fernandes et al, A New Directional, Low-Redundancy, Complex-Wavelet Transform, 2001 IEEE ICASSP 3653 provided a low redundancy by projection and negative frequency discard. Subsequently, Fernandes introduced the Non-Redundant Complex Wavelet Transforms (NCWT); see for example, Fernandes et al, A New Framework for Complex Wavelet Transforms, 51 IEEE Trans. Signal Proc. 1825 (2003). But this implementation can be viewed as a combination of a downsampled positive-frequency projection filter with a traditional dual-band real wavelet transform. Therefore, at the finest scale, the complex wavelet transform has resolution 4× lower than the real input signal. These NCWTs do enjoy directionality and explicit phase information because of the approximate Hilbert-transform relationship between real and imaginary parts of their transform coefficients. To date, however, they have been significantly less amenable to geometric modeling than their redundant counterparts.

[0008] T. D. Tran et al, Linear-Phase Perfect Reconstruction Filter Bank: Lattice Structure, Design, and Application in Image Coding, 48 IEEE Trans. Signal Proc. 133 (2000) discloses general methods of filter bank design.

SUMMARY OF THE INVENTION

[0009] The invention provides a separable two-dimensional non-redundant complex wavelet image transform using one-dimensional triband transforms. Preferred embodiments include constituent filter designs.

[0010] This has advantages including reduction in encoding memory use while maintaining complex wavelet properties.

BRIEF DESCRIPTION OF THE DRAWINGS

[0011] FIG. 1 shows a preferred embodiment method.

[0012] FIGS. 2a-2c show constituent filter magnitudes.

[0013] FIGS. 3a-3d are one-dimensional filter structures.

[0014] FIGS. 4a-4b illustrate the preferred embodiment two-dimensional transform in the frequency domain.

[0015] FIGS. 5a-5b are filter structures.

[0016] FIGS. 6a-6c show preferred embodiment scaling function and complex wavelet and corresponding filter frequency dependencies.

[0017] FIG. 7 illustrates two-dimensional complex wavelet basis functions (imaginary parts).

[0018] FIGS. 8a-8c show an input image and corresponding vertical subband complex wavelet coefficients (magnitude and phase).

[0019] FIGS. 9a-9b show digital camera functions and blocks.

[0020] FIG. 10 shows image compression/decompression.

[0021] FIGS. 11a-11b illustrate JPEG2000.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0022] 1. Overview

[0023] The preferred embodiment image compression methods efficiently encode (sequences of) images by successive applications of a one-dimensional, non-redundant, triband (three output subbands) complex wavelet decomposition with some negative frequency discards as illustrated in FIG. 1. FIGS. 3a-3b show the filter banks used in FIG. 1, and FIGS. 2a-2c show the idealized frequency responses of the three subband filters of FIGS. 3a-3b.

[0024] FIGS. 4a-4b show the ideal consequent two-dimensional frequencies involved in the applications of the one-dimensional transforms of FIG. 1; namely, FIG. 4a shows frequencies after the vertical variable filtering step and FIG. 4b shows the frequencies after the two horizontal variable filtering steps.

[0025] Preferred embodiment implementations of the subband filters include a parameterization approach and a lifting approach. In particular, FIGS. 6b-6c show the frequency responses (magnitude and phase) of the lowpass and the positive-frequency highpass subband filters of a preferred embodiment implementation using a parameterization approach. FIG. 6a shows the corresponding preferred embodiment scaling function and complex wavelet.

[0026] FIGS. 9a-9b illustrate a digital camera which has the DSP and/or IMX (image coprocessor) to compute the complex wavelet transform coefficients. The input and encoded output images are stored in the memory.

[0027] The preferred embodiment linear-phase, semi-orthogonal, directional NCWT design uses a triband (downsample by 3) filter bank which permits a natural, direct NCWT implementation using complex wavelet filters and a real scaling filter. At the finest scale, the resulting complex wavelet transform has resolution 3× lower than the real input signal. The preferred embodiment design has properties (directionality, magnitude coherence, and phase coherence) that may make the two-dimensional non-redundant coefficients amenable to geometric modeling.

[0028] 2. Two-Dimensional Non-Redundant Complex Wavelet Transform

[0029] FIG. 1 illustrates the steps of a first preferred embodiment method of two-dimensional, non-redundant complex wavelet transformation as separable and using one-dimensional triband filtering of (the z-transform of) an input image. The (highpass) negative frequencies of the first one-dimensional transform are discarded as illustrated in FIG. 4a, and the (highpass) negative frequencies of the second transform on the lowpass subband are also discarded as illustrated in FIG. 4b; this eliminates redundancy. In effect, the lowpass (scaling function) is a real transform, and the highpass (complex wavelet function) has redundancy due to the complex conjugate symmetry for real input, so negative frequencies can be discarded. Of course, by symmetry, positive frequencies could have been discarded and negative frequencies retained.

[0030] In more detail, presume an input N×M image, x(n,m), with 0≦n<N and 0≦m<M and two-dimensional z-transform X(z1, z2). The first preferred embodiment method (FIG. 1) uses the two one-dimensional triband non-redundant complex wavelet transforms shown as filter banks in FIGS. 3a-3b. The preferred embodiment method applies the transform of FIG. 3a to real-valued inputs: first to X(z1, z2) with respect to the second variable and then to the first stage lowpass output, X0(z1, z), with respect to the first variable; while it applies the transform of FIG. 3b to complex-valued input: the positive frequencies of the highpass output of the first stage, X+(z1, z). H0(z), H+(z), and H−(z), are the z-transforms of the ideal impulse responses for the three subband filters, and FIGS. 2a-2c show the magnitudes for ideal filters; the actual filters will be constructed below. H0(z) is taken to be real-valued while H+(z) and H−(z) are complex conjugates.

[0031] Consider FIG. 1: first apply the one-dimensional transform of FIG. 3a on the vertical (second) variable of X(z1, z2) to obtain a real-valued, (not-yet-downsampled) lowpass subband H0(z2) X(z1, z2) plus a complex-valued, (not-yet-downsampled) highpass positive-frequencies subband H+(z2) X(z1, z2). The frequency-domain support of these subbands is depicted in FIG. 4a where the frequency, &ohgr;, as usual is related to z by z=|z|ej&ohgr; and thus X(ej&ohgr;1, ej&ohgr;2) is the Fourier transform of x(n,m). Redundancy considerations justify the elimination of the subband H−(z2)X(z1, z2) (which downsamples to X−(z1, z)) because H−(z2) is just the complex conjugate of H+(z2) and X(z1, z2) is real-valued. Then downsample (in spatial domain) by a factor of 3 with respect to the second variable to yield z-transforms X0(z1, z) and X+(z1, z). Note that H0(z2)X(z1, z2) has support (ideally) in the range [−&pgr;/3, &pgr;/3] and H+(z2)X(z1, z2) has support (ideally) in the range [&pgr;/3, &pgr;], so aliasing can be avoided in the downsampling by 3. Generally, downsampling by 3 converts z-transform Y(z) into z-transform Y(z1/3)+Y(z1/3 e−j2&pgr;/3)+Y(z1/3 e−j4&pgr;/3), but in practice the filter banks are implemented as polyphase filters as illustrated in FIG. 3c and described in the next section.

[0032] Complete the FIG. 1 method by performing two different one-dimensional transforms along the horizontal (first) variable of the subbands X0(z1, z) and X+(z1, z). Because the subband X0(z1, z) contains real-valued transform coefficients, apply the transform of FIG. 3a, thereby obtaining two output subbands (not-yet-downsampled): real-valued lowpass subband H0(z1) X0(z1, z) plus complex-valued highpass-positive-frequencies subband H+(z1) X0(z1, z). Then again downsample (in spatial domain) by a factor of 3 with respect to the first variable to yield X00(z′, z) and X0+(z′, z). Again, H0(z1)X0(z1, z) has support (ideally) in the range [−&pgr;/3, &pgr;/3] and H+(z1)X0(z1, z) has support (ideally) in the range [&pgr;/3, &pgr;], so aliasing-free downsampling by a factor of 3. This downsampling yields subbands X00(z′, z) and X0+(z′, z). Once again, the subband X0−(z′,z) is discarded because it is redundant due to the real-valued input.

[0033] Finally, apply the FIG. 3b transform to the complex coefficients of the X+(z1, z) subband to obtain three output subbands: H0(z1)X+(z1,z), H+(z1)X+(z1,z), and H−(z1)X+(z1,z). Then again downsample by a factor of 3 to yield X+0(z′, z), X++(z′, z), and X+−(z′, z), respectively. Again, the filterings typically are implemented as polyphase filter banks and the non-ideal frequency responses (e.g., FIGS. 6b-6c) will result in some aliasing.

[0034] The first level of preferred embodiment two-dimensional NCWT is now complete. The five output subbands ideally partition the (prior-to-downsampling) frequency domain as shown in FIG. 4b. This transform has higher directionality than the real wavelet transform, because the latter transform cannot differentiate between features oriented at 45 and −45 degrees. Subsequent levels of the transform are obtained by recursively transforming the lowpass subband X00(z′, z). The two-dimensional NCWT is easily inverted by applying the appropriate one-dimensional synthesis filter banks along the columns and rows of the two-dimensional transform coefficients. The following section describes design of the one-dimensional NCWTs of FIGS. 3a-3b.

[0035] 3. One-Dimensional Filter Banks for Complex Inputs

[0036] FIG. 3b shows the three-band analysis filter bank that performs the first level of a non-redundant complex wavelet decomposition of X(z), the z-transform of x(n), a complex-valued input signal. H0(z) has real filter coefficients while H+(z) and H−(z) have complex filter coefficients such that H+(z)*=H−(z), which signifies that the H+(z) filter coefficients are complex conjugates of the H−(z) filter coefficients. The idealized magnitude responses of these filters are shown in FIGS. 2a-2c. Provided that H0(z) satisfies certain existence conditions, X0(z) represents a scaling-coefficient sequence while X+(z) and X−(z) represent wavelet-coefficient sequences. Because both |H+(&ohgr;)| and |H−(&ohgr;)| have one-sided magnitude responses (FIGS. 2a, 2c), each of the wavelet coefficient sequences X+(z) and X−(z) exhibits a Hilbert-transform relationship between their real and imaginary parts. This property will enable directionality and explicit phase information in the two-dimensional NCWT. The following argument shows that the decomposition provided by FIG. 3b is non-redundant. Let the input signal x(n) consist of N complex numbers. Then due to the decimation in each subband, the subband signals x0(n), x+(n), and x−(n) each has N/3 complex coefficients. Since the input and the transform coefficients each require the same amount of storage space (for N complex numbers), the transform is non-redundant. Implementations of the analysis filter bank in FIG. 3b must address the following five design constraints.

[0037] 1. The frequency responses of the analysis filters must approximate the idealized magnitude responses in FIGS. 2a-2c.

[0038] 2. To ensure that the one-dimensional NCWT for real inputs is nonredundant, require H+(z)*=H−(z).

[0039] 3. To obtain smooth wavelet basis functions, H0(z) must satisfy existence and vanishing-moment conditions.

[0040] 4. For image/video compression applications, the H0(z), H+(z), and H−(z) filter bank should be linear-phase and orthogonal.

[0041] 5. A synthesis filter bank that reconstructs X(z) from the subband signals X0(z), X+(z), and X−(z) must exist.

[0042] Multi-band filter bank design is a difficult problem, and no direct design method satisfies all the above criteria simultaneously. Therefore, some preferred embodiments adopt the a parameterization approach and other preferred embodiments use a lifting approach to design the analysis filter bank. Preliminarily, note that practical filter bank implementation typically uses the polyphase filter approach illustrated in FIG. 3c. The polyphase matrix E(z) would have a first row consisting of the three polyphase components of H0(z), a second row consisting of the three polyphase components of H+(z), and a third row consisting of the three polyphase components of H−(z). In particular, H0(z) is a polynomial in z and the 0th, 1st, and 2d polyphase components are just the terms of H0(z) which have a power of z equal, modulo 3, to 0, −1, and −2, respectively. Thus the three elements of the first row of E(z) are the z-transforms of the three phases of the impulse response h0(n). Similarly for the second and third rows.

[0043] Parameterization Approach

[0044] First, follow Tran et al. (see Background cite) to specify a length-9, 3-band, orthogonal, linear-phase, real-coefficient filter bank. Then exploit the free parameters to impose two vanishing moments on the scaling filter. Let Ê(z) denote the polyphase matrix of the analysis filter bank of real-valued filters in the resulting system.

[0045] Next, define 1 C = [ 1 0 0 0 1 / 2 j / 2 0 1 / 2 - j / 2 ] ⁢   ⁢ and ⁢   ⁢ S = [ 1 0 0 0 cos ⁢   ⁢ θ sin ⁢   ⁢ θ 0 - sin ⁢   ⁢ θ cos ⁢   ⁢ θ ]

[0046] Now, the first, second, and third rows of the polyphase matrix CÊ(z) are to contain the preferred embodiment polyphase components for H0(z), H+(z), and H−(z), respectively. Note that C essentially combines the two real-valued filters, H1(z), and H2(z), from the Tran et al construction into the complex conjugate pair H+(z) and H−(z). These analysis filters satisfy all preceding constraints except for Constraint 1, which is violated because the magnitude responses |H+(&ohgr;)| and |H−(&ohgr;)| differ from the idealized responses in FIGS. 2a, 2c. To satisfy Constraint 1, improve the wavelet-filter magnitude responses by introducing free optimization parameters without violating the other constraints. Therefore, define 2 U ⁡ ( z ) = [ z - 1 0 0 0 z - 1 u ⁡ ( 1 - z - 2 ) 0 0 z - 1 ] ⁢   ⁢ and ⁢   ⁢ V ⁡ ( z ) = [ z - 1 0 0 0 z - 1 0 0 v ⁡ ( 1 - z - 2 ) z - 1 ]

[0047] and generate a new analysis filter bank with polyphase matrix E(z) defined by E(z)=CV(z)U(z)S Ê(z). Observe that the entries in the first rows of C, S, U(z), V(z) guarantee that the scaling filter specified by E(z) is the same (modulo shifts) as the scaling filter specified by Ê(z). Hence, Constraint 3 is still satisfied by the E(z) system. Now, S introduces the free parameter &thgr; into E(z) without affecting Constraint 4 because S is orthogonal and also preserves linear phase. Next, consider the matrices U(z) and V(z). These are left-extension matrices that lengthen the wavelet filters by introducing free parameters u and v into the analysis filter bank while preserving linear phase. However, orthogonality of the wavelet filters is not preserved by these matrices. The zeros in the first rows and first columns of the left-extension matrices ensure that the the scaling filter specified by V(z)U(z)SÊ(z) is orthogonal to its own shifts as well as to shifts of the wavelet filters, although the wavelet filters are not orthogonal to their own shifts. Thus, in addition to semi-orthogonality, the basis associated with the V(z)U(z)SÊ(z) system also has orthogonal scaling functions. Therefore, the V(z)U(z)SÊ(z) filter bank satisfies a weakened form of Constraint 4 in which “orthogonal” is replaced by “semi-orthogonal and H0(z) should be shift-orthogonal.” Note that the scaling filter and two wavelet filters associated with the V(z)U(z)SÊ(z) system have lengths 9, 15, and 21, respectively, because the U(z) and V(z) lengthen the original length-9 Ê(z) system. Finally, the matrix C is introduced to transform the real-coefficient polyphase matrix V(z)U(z)SÊ(z) into E(z), the second and third rows of which specify complex-coefficient filters H+(z) and H−(z) that satisfy Constraint 2. Optimizing over the real-valued, free parameters &thgr;, u, v, yields E(z) with wavelet-filter magnitude responses that have minimum mean-squared error with respect to the idealized responses in FIGS. 2a, 2c. With superscript H denoting the Hermitian conjugate (complex conjugate plus transpose), the polyphase matrix for the synthesis filter bank corresponding to E(z) is given by R(z)=ÊH(z−1)SHU(z−1)V(z−1)CH because ÊH(z), S, and C are paraunitary and U(z)−1=U(z−1), V(z)−1=V(z−1). FIG. 6a depicts the scaling function associated with this H0(z) and the complex wavelet associated with H+(z) (solid line indicating the real part and broken line the imaginary part). FIGS. 6b-6c show the frequency responses (magnitude and phase) for H0(z) and H+(z), respectively.

[0048] Section 4 describes the modifications of the filter bank for the real-valued inputs as in FIG. 3a.

[0049] Lifting Approach

[0050] FIG. 3d shows an alternative filter bank construction using the lifting approach. In particular, the preferred embodiment approach first generates a real-valued filter bank of filters H0(z), H1(z), and H2(z) (analogous to the foregoing Ê(z)) by the lifting method and then forms H+(z) and H−(z) as the complex conjugate pair [H1(z)±jH2(z)]/2. The lifting approach automatically provides a perfect-reconstruction synthesis filter bank that satisfies the third design constraint. The filters Tij(z) in FIG. 3d denote the lifting filters used to create the subband signals X0(z), X1(z), and X2(z). The lifting filters determine the analysis filters as:

H0(z)=1+z−1 T01(z3)+z−2 T02(z3)

H1(z)=z−1+T10(z3) H0(z)+z−2 T12(z3)

H2(z)=z−2+T20(z3) H0(z) T21(z3) H1(z )

[0051] As previously noted, the butterfly of H1(z) and H2(z) at the right edge of FIG. 3d converts to the H+(z) and H−(z) filters.

[0052] FIG. 5a shows the synthesis filter bank corresponding to the analysis filter bank of FIG. 3d.

[0053] To find suitable lifting filters, impose the existence and vanishing wavelet-moment criteria on the foregoing analysis filters and perform numerical optimization to obtain analysis filters with frequency responses approximating those shown in FIGS. 2a-2c. For example, a short filter set obtained using this approach is:

H0(z)=0.5774+0.5774 z−1+0.5774 z−2

H1(z)=−0.4349+0.5651 z−1+−0.1303 z−2

H2(z)=−0.3333−0.3333 z−1+0.6667 z−2

[0054] Besides ensuring the existence of a synthesis filter bank, the lifting approach has two other practical advantages that are exploited in the design. First, the lifting approach corresponds to a lattice decomposition that enables a very efficient implementation. Second, the filter bank enjoys the cardinal interpolation property; this guarantees that no initialization error is incurred while using the filter bank.

[0055] 4. One-Dimensional Non-Redundant Complex Wavelet Transform for Real Input

[0056] Creating the one-dimensional NCWT for real-valued inputs necessitates slight modifications to the filter banks of the previous section. Consider FIG. 3b and assume that the input x(n) is a real-valued signal consisting of N real numbers. Since H0(z) has real coefficients, x0(n) has N=3 real numbers, while x+(n) and x−(n) each have N=3 complex numbers. However, because complex numbers have real and imaginary parts, the transform coefficients as x0(n), x+(n), and x−(n) require storage space for 5N/3 real numbers. Therefore, for real-valued input, this transform is redundant because the transform coefficients require more storage space than the input signal.

[0057] To obtain a non-redundant transform for real-valued input, observe that x−(n) is the complex conjugate of x+(n), because x(n) is real-valued and while H+(z)*=H−(z). Therefore x−(n) contains the same information as x+(n), and may be discarded because it is redundant. FIG. 3a shows the modified analysis filter bank for the one-dimensional NCWT after eliminating (indicated by broken lines) the H−(z) branch in the FIG. 3b filter bank. In this case, if x(n) has N real numbers, then x0(n) and x+(n) have N/3 real numbers and N/3 complex numbers, respectively. The transform is now non-redundant because the input signal and transform coefficients each require the same amount of storage space. To reconstruct X(z), generate X−(z) from X+(z) by setting X−(z)=X+(z)* and then using the synthesis filter bank associated with the one-dimensional NCWT of FIG. 3b and described previously. In practice, reconstruct X(z) without generating X−(z) by use of X0(z) and the real and imaginary parts of x+(n) as input to the synthesis polyphase matrix ÊH(z−1)SHU(z−1)V(z−1).

[0058] For the lifting approach, the analysis filter bank for real input simply discards the X−(z) output (retain only half of the butterfly); and the synthesis filter bank also separates the real and imaginary parts of X+(z) as inputs as illustrated in FIG. 5b.

[0059] 5. Properties

[0060] The preferred embodiment two-dimensional NCWT has properties that may be useful for image/video processing.

[0061] Directionality: FIG. 7 shows the imaginary parts of the two-dimensional complex basis functions for the four directional wavelet subbands. As discussed in Section 2, the two-dimensional NCWT has higher directionality than the real wavelet transform. Specifically, the filters provide distinct basis functions for the 45-degree and the −45-degree subbands, in addition to vertical and horizontal basis functions.

[0062] Magnitude coherence: FIG. 8b shows the magnitude of the first-level vertical subband of the two-dimensional NCWT of the Barbara image (FIG. 8a). The magnitudes successfully identify image regions with strong directional tendency. In addition (and unlike real wavelet coefficients), the magnitudes have a smooth envelope along edges.

[0063] Phase coherence: Also shown in FIG. 8c are the phases of the complex coefficients of the first-level vertical subband of the two-dimensional NCWT of the Barbara image (FIG. 8a). In regions with strong directional tendency (i.e. where coefficient magnitudes are large), the phases typically demonstrate a degree of structure, or coherency.

[0064] The above properties suggest that the preferred embodiment two-dimensional NCWT may be well-suited to image processing and geometric modeling. For example, a zerotree compression algorithm could be developed based on coefficient magnitudes. Such techniques will require a nona-tree structure due to the triband filter bank: each complex coefficient will have 9 children (instead of 4, as with dual-band real wavelet transforms). In addition, the higher decimation provides greater frequency separation between wavelet scales (more than one octave), and so less depth will be needed in the tree.

[0065] 6. Multiple Decomposition Levels

[0066] If more than one level of decomposition is required, such as to create a coefficient tree, store the lowpass subband, X00, coefficients until image (tile component) has been filtered, and then repeat the two-dimensional transform on the lowpass subband. That is, the center square in FIG. 4b corresponds to the input for the next decomposition level.

[0067] 7. Systems

[0068] The preferred embodiment methods are well-suited for environments which require continuous compression and storage of video or sequences of images which contain only partial spatial updates. FIGS. 9a-9b illustrates functional and system blocks of a preferred embodiment digital camera which includes the preferred embodiment JPEG2000 implementations; the encoding steps may be programmed into flash and/or ROM instruction memory for the processors shown in FIG. 9b (RISC, DSP, IMX, VLC).

[0069] 8. Modifications

[0070] The preferred embodiments can be varied while maintaining the features of image wavelet transform based on a separable, three-band, non-redundant wavelet transform.

[0071] For example, the three filters of the one-dimensional filter bank could have various characteristics provided the passbands and stopbands approximate those of FIGS. 2a-2c ideals which limits aliasing connected with the downsampling by 3; differing filter designs provide differing approximations to these ideals with tradeoffs of the number of filter taps (filter length) required; and the filters used with respoect to the first image dimension could differ from the filters used with respect to the second image dimension; the filters may be designed with various optimization criteria (see Tran et al section VI); the number of levels of two-dimensional decomposition can be varied which allows various depth zerotrees; and so forth.

Claims

1. An image transform method, comprising:

(a) first filtering an input two-dimensional real-valued digital image with respect to a first dimension, said first filtering including a first lowpass filtering to yield a lowband and a first highpass filtering to yield a highband;
(b) second filtering said lowband from step (a) with respect to a second dimension, said second filtering including a second lowpass filtering to yield a low-lowband and a second highpass filtering to yield a high-lowband; and
(c) third filtering said highband from step (a) with respect to said second dimension, said third filtering including a third lowpass filtering to yield a low-highband, a third highpass filtering to yield a first high-highband, and a fourth highpass filtering to yield a second high-highband, wherein said third and fourth highpass filterings have impulse response z-transforms as complex conjugates.

2. The method of claim 1, wherein:

(a) said first filtering is polyphase with 3 phases.

3. The method of claim 2, wherein:

(a) said second and third filterings are each polyphase with 3 phases.

4. The method of claim 1, wherein:

(a) said first lowpass filter has a passband approximating the frequency range [−&pgr;/3, &pgr;/3] and said first highpass filter has a passband approximating the frequency range [&pgr;/3, &pgr;]; and
(b) prior to said second filtering, downsampling said lowband by a factor of 3 with respect to said first dimension.

5. The method of claim 4, wherein:

(a) after said second filtering, downsampling each of said low-lowband and said high-lowband by a factor of 3 with respect to said second dimension.

6. The method of claim 1, wherein:

(a) prior to said third filtering, downsampling said highband by a factor of 3 with respect to said first dimension.

7. The method of claim 6, wherein:

(a) after said third filtering, downsampling each of said low-highband, said first high-highband, and said second high-highband by a factor of 3 with respect to said second dimension.

8. The method of claim 1, wherein:

(a) said first lowpass and first highpass filterings corresponds to a real scaling function and a complex wavelet, respectively.

9. The method of claim 1, further comprising:

(a) repeating steps (a)-(c) of claim 1 for an input derived from said low-lowband.

10. A non-redundant, complex-wavelet transformer for two-dimensional images, comprising:

(a) a first one-dimensional polyphase filter bank for input real-valued, two-dimensional digital images, said first filter bank with three phase filters and a lowpass output and a positive-frequency highpass output;
(b) a second one-dimensional polyphase filter bank coupled to the lowpass output of said first filter bank, said second filter bank with three phase filters and a lowpass output and a positive-frequency highpass output; and
(c) a third one-dimensional polyphase filter bank coupled to the highpass output of said first filter bank, said third filter bank with three phase filters and a lowpass output, a positive-frequency highpass output, and a negative-frequency highpass output.

11. The transformer of claim 10, wherein:

(a) said first, second, and third filter banks are implemented as programs on a programmable processor.

12. A digital camera, comprising:

(a) a sensor;
(b) an image pipeline coupled to said sensor, said image pipeline including an image compressor with
(i) a first one-dimensional polyphase filter bank for input real-valued, two-dimensional digital images, said first filter bank with three phase filters and a lowpass output and a positive-frequency highpass output;
(ii) a second one-dimensional polyphase filter bank coupled to the lowpass output of said first filter bank, said second filter bank with three phase filters and a lowpass output and a positive-frequency highpass output; and
(iii) a third one-dimensional polyphase filter bank coupled to the highpass output of said first filter bank, said third filter bank with three phase filters and a lowpass output, a positive-frequency highpass output, and a negative-frequency highpass output.

13. The camera of claim 12, wherein:

(a) said first, second, and third filter banks are implemented as programs on a programmable processor.
Patent History
Publication number: 20040120592
Type: Application
Filed: Nov 21, 2003
Publication Date: Jun 24, 2004
Patent Grant number: 7330597
Inventor: Felix Fernandes (Plano, TX)
Application Number: 10719281