original image’s coefficients. F or instance, the DWT of an input biomedical image f (x, y)
can be shown as:
f
(x, y) −→ DWT −→
F
(k
1
, k
2
, j)
where
F(k
1
, k
2
, j) are the 2-D DWT coefficients at scale j. A shift of the image will result in a
different set of coefficients
f
(x + Δx, y + Δy) −→ DWT −→
F
(k
�
1
, k
�
2
, j)
where k
�
1
�= k
1
+ a
1
·Δx and k
�
2
�= k
2
+ a
2
·Δy for (a
1
, a
2
), (Δx, Δy) ∈ Z, indicating that the two
sets of coefficients are not translated versions of one another.
Shift-variance causes significant chall enges in a feature extraction problem. For example,
Fig. 12. Image (simulated benign lesion).
consider the image of Figure 12 (the ce nter circle can be considered as a circumscribed benign
lesion, or something to that effect). If this circle is translated by a small amount ( which is
equivalent to the lesion being located in different regions of an image), the extracted features
would be different. To illustrate this, the image in Figure 12 is translated by shifts of
(Δx, Δy)
= {(0,0), (0,1), (1,0), (1,1)} and for each translation, the DWT is performed. Then, the mean
and variance of the wavelet coefficients are extracted from the LH band (moments are RST
invariant, so any invariance would be a conseq uence of the transform). The e xtracted features
are shown in Table 2. As shown by these resul ts, images with pathology (tex ture) located
in different regions of the images would result in different feature sets, thus leading to high
misclassification results.
For shift-invariant features, it is necessary to utilize a shift-invariant discrete wavelet
Input shift
(Δx, Δy) Mean μ Variance σ
2
(0,0) -0.050537 97.017
(0,1) -0.051025 100.42
(1,0) 0.057861 96.82
(1,1) 0.058350 98.383
Table 2. Mean μ and variance σ
2
of the DWT co efficients of the LH band for circular
transl ates
(Δx, Δy) of Figure 12.
transform (SIDWT) on the input image f
(x, y)
f (x, y) −→ SIDWT −→
F
(k
1
, k
2
, j)
to compute the wavelet coefficients
F(k
1
, k
2
, j). The representation achieved by such a
transform would be considered shift-invariant if a shift of the input image
(Δx, Δy) ∈ Z results
in output co efficients which are exactly the same as
F
(k
1
, k
2
, j), or a spatially shifted version
of it. This may be shown by
f
(x + Δx, y + Δy) −→ SIDWT −→
F
(k
�
1
, k
�
2
, j)
where k
�
1
= k
1
+ b
1
·Δx and k
�
2
= k
2
+ b
2
·Δy for some (b
1
, b
2
) ∈ Z. If the coefficients are exactly
the same: b
1
= b
2
= 0.
The shift-variant property of the DWT is widel y known and several so lutions have been
proposed. Mallat et. al use an overcomplete, redundant dictionary, which corresponds to
filtering without decimation Mallat (1998) Bradley (2003). From the filtered and fully sampl ed
vers ion of the image, local extrema are used for translation invariance since a shi ft in the input
image results in a corresponding shift of the extrema Mallat (1998) Liang & Parks (1994).
Since there is no decimation, each level of decomposition contains as many samples as the
input image, thus making the algorithm computationally comple x. It also requires significant
memory bandwidth.
Simoncelli et. al propos e an approximate shift-invariant DWT algorithm by relaxing the
critical sampling requirements of the DWT Simoncelli et al. (1992). This algorithm is known as
the power-shiftable DWT since the power in each subband remains constant. As explained in
Bradley (2003), the shift-variant property is also related to aliasing caused by the DWT filters.
The power shiftabl e transform tries to reme dy this proble m by reducing the aliasing of the
mother wavelet in the frequency domain. The modifications to the mother wavelet result in a
loss of orthogonali ty Liang & Parks (1998).
The Matching Pursuit (MP) algorithm can also achieve a shift-invari ant representation,
when the decomposition di ctionary contains a large amount of redundant wavelet basis
functions Mallat & Zhang (1993). However, the MP algorithm is extremely computationally
complex and arriving at a transformed representation causes significant delays Cohen et al.
(1997). Bradley combines features of the DWT pyramidal decomposition with the
`
a trous
algorithm Mallat (1998), which prov ides a trade off between sparsity of the representation and
time-invariance Bradley (2003). Critical sampling is only carried out for a certain number of
subbands and the rest are all fully sampled. This representation only achi eves an approximate
shift-invariant DWT Bradley (2003).
The algorithms discussed either try to minimize the aliasing error by relaxing critical
subsampling and/or add redundancy into the wavelet bas is set. However, these algorithms
either suffer from lack of orthogonality (which is not always an issue for feature extraction),
achieve an approximate shift-invariant representation, are computationally complex or
require significant memory resources. To combat these downfalls, the SIDWT algorithm
proposed by Beylkin, which computes the DWT for all circular shifts in a computationally
efficient manner Beylkin (1992) is utilized. The proposed SIDWT utilizes orthogonal wavelets,
thereby resulting in less redundancy in the representation Liang & Parks (1994), and a more
efficient implementation. Belkyn’s work has also been extended to 2-D signals by L iang et.
al Liang & Parks (1994) Liang & Parks (1998) Liang & Parks (1996) and its performance in a
biomedical image feature extraction ap plicatio n will be investigated.
199
Shift-Invariant DWT for Medical Image Classification