Hindawi Publishing Corporation
EURASIP Journal on Applied Signal Processing
Volume 2006, Article ID 10968, Pages 1–15
DOI 10.1155/ASP/2006/10968
A Complete Image Compression Scheme Based on Overlapped
Block Transform with Post-Processing
C. Kwan,1 B. Li,2 R. Xu,1 X. Li,1 T. Tran,3 and T. Nguyen4
1 Intelligent
Automation, Inc. (IAI), 15400 Calhoun Drive, Suite 400, Rockville, MD 20855, USA
of Computer Science and Engineering, Ira. A. Fulton School of Engineering, Arizona State University, P. O. Box 878809,
Tempe, AZ 85287-8809, USA
3 Department of Electrical and Computer Engineering, The Whiting School of Engineering, The Johns Hopkins University, Baltimore,
MD 21218, USA
4 Department of Electrical and Computer Engineering, Jacobs School of Engineering, University of California, San Diego, La Jolla,
CA 92093-0407, USA
2 Department
Received 29 April 2005; Revised 19 December 2005; Accepted 21 January 2006
Recommended for Publication by Dimitrios Tzovaras
A complete system was built for high-performance image compression based on overlapped block transform. Extensive simulations
and comparative studies were carried out for still image compression including benchmark images (Lena and Barbara), synthetic
aperture radar (SAR) images, and color images. We have achieved consistently better results than three commercial products in
the market (a Summus wavelet codec, a baseline JPEG codec, and a JPEG-2000 codec) for most images that we used in this study.
Included in the system are two post-processing techniques based on morphological and median filters for enhancing the perceptual
quality of the reconstructed images. The proposed system also supports the enhancement of a small region of interest within an
image, which is of interest in various applications such as target recognition and medical diagnosis.
Copyright © 2006 Hindawi Publishing Corporation. All rights reserved.
1.
INTRODUCTION
The importance of image compression may be illustrated by
the following examples. For TV-quality color image that is
512 × 512 with 24-bit color, it takes 6 million bits to represent the image. For 14 × 17 inch radiograph scanned at
70 micrometer with 12-bit gray scale, it takes about 1200
million bits. If one uses a telephone line with 28,800 baud
rate to transmit 1 frame of TV image without compression,
it will take 4 minutes, and it will take 11.5 hours to transmit a frame of radiograph. Commonly used image compression approaches such as JPEG use discrete-cosine-transform
(DCT)-based transform which introduces annoying block
artifacts, especially at high compression ratio, making such
approaches undesirable for applications such as target recognition and medical diagnosis.
The main objective in this research is to achieve high
compression ratios for still images, such as SAR, and color
images, without suffering from the annoying blocking artifacts from a JPEG-like coder (DCT-based) or ringing artifacts from wavelet-based codecs (JPEG-2000, e.g.). We
aim at building a complete codec that can provide similar
perceptual quality as other algorithms but with a higher compression ratio. Additionally, we also want to provide the flexibility in image transmission with embedded bit streams and
the region-of-interest enhancement that is often of interest
in many applications.
The objective was achieved mainly by using the overlapped block transform wavelet coder (OBTWC). OBTWC
transforms a set of overlapped blocks (e.g., 40 × 40 pixels) into 8 × 8 blocks in the frequency domain. By using
a bank of filters with carefully designed coefficients in performing the image transformation, the coder retains the simplicity of block transform and, at the same time, does not
have blocking artifacts in high compression ratios due to the
presence of overlapped block transform. Meanwhile, compared with zero-tree wavelet transform, the OBTWC offers
more flexibility in frequency spectrum partitioning, higher
energy compaction, and parallel processing for fast implementation. OBTWC also maps the transformed image into a
multiresolution representation that resembles the zero-tree
wavelet transform, and thus embedded stream is a reality.
In addition to adopting the OBTWC, we also propose two
post-processing techniques that aim at improving the visual
2
EURASIP Journal on Applied Signal Processing
quality by eliminating some ringing artifacts at very high
compression ratio. Reference [1] summarized the application
of OBTWC to SAR image compression. However, in [1], we
did not give details of our algorithm, the post-processing algorithms, the tool for region-of-interest selection, and compression results of other images.
The rest of the paper is organized as follows. In Section 2,
we review the background and theory of OBTWC. Section 3
summarizes our results. The still image compression results
include benchmark images (Lena and Barbara), SAR images, and color images. Since degradation in high compression ratio images is unavoidable, two post-processing techniques were developed in this research to enhance the perceptual performance of reconstructed images. A novel technique to enhance a small region of an image was also developed here which could be useful for target recognition.
Extensive comparative studies have been carried out with
a wavelet coder from commercial market, a baseline JPEG
coder (DCT-based), and a JPEG-2000 coder (wavelet-based).
Our coder performs consistently better in almost all the images that we used in this study. A computational complexity
analysis is also carried out in this section. Finally, Section 4
concludes the paper with some suggestions for future research.
2.
THEORETICAL BACKGROUND ON THE OBTWC
ALGORITHM
2.1. Background
Popular image compression schemes such as JPEG [2] use
DCT as the core technology. DCT suffers from the blocking
artifacts in high compression ratio, and hence it is not suitable for high compression ratio applications. The development of the lapped orthogonal transform [3–5] and its generalized version GenLOT [6, 7] helps to solve the annoying
blocking artifact problem to a certain extent by borrowing
pixels from the adjacent blocks to produce the transform coefficients of the current block. However, global information
has not been taken to its full advantage in most cases, the
quantization and the entropy coding of the transform coefficients are still done independently from block to block.
Subband coding has been used in JPEG-2000 thanks to
the development of the discrete wavelet transform [8, 9].
Wavelet representations with implicit overlapping and variable-length basis functions produce smoother and more
perceptually pleasant reconstructed images. Moreover, wavelet’s multiresolution characteristics have created an intuitive foundation on which simple, yet sophisticated, methods
of encoding the transform coefficients are developed.
Instead of aiming for exceptional decorrelation between
subbands, current state-of-the-art wavelet coders [10–12]
look for other filter properties that still maintain perceptual
quality at low bit rates, and then exploit the correlation across
the subbands by an elegant combination of scalar quantizers
and bit-plane entropy coders. Global information is taken
into account at every stage. Nevertheless, in frequency domain, the conventional wavelet transform simply provides
an octave-band representation of signals. The conventional
dyadic wavelet transform performs a nonuniform M-band
partition of the frequency spectrum. This may lead to low
energy compaction, especially when applying to mediumto high-frequency signals, or signals with well-localized frequency components. In such cases, M-channel uniform filter
banks may be better alternatives.
From a filter bank viewpoint, the dyadic wavelet transform is simply an octave-band representation for signals; the
discrete dyadic wavelet transform can be obtained by iterating on the lowpass output of a PR (perfect reconstruction)
two-channel filter bank with enough regularity [13–15]. For
a true wavelet decomposition, one iterates on the lowpass
output only, whereas for a wavelet-packet decomposition,
one may iterate on any output.
Progressive image transmission scheme is perfect for the
recent explosion of the World Wide Web. This coding approach first introduced by [10] relies on the fundamental idea that more important information (defined here as
what decreases a certain distortion measure the most) should
be transmitted first. Assume that the distortion measure is
mean-squared error (MSE), the transform is paraunitary,
and transform coefficients ci j are transmitted one by one,
it can be proven that the mean-squared error decreases by
[ci j ]/N, where N is the total number of pixels. Therefore,
larger coefficients should be transmitted first [16]. If one bit
is transmitted at a time, this approach can be generalized to
ranking the coefficients by bit planes and the most significant
bits are transmitted first [10–12]. The most sophisticated
wavelet-based progressive transmission schemes [11, 12] result in an embedded bit stream (i.e., it can be truncated at
any point by the decoder to yield the best corresponding reconstructed image).
Although the wavelet tree provides an elegant hierarchical data structure which facilitates quantization and entropy
coding of the coefficients, the efficiency of the coder heavily depends on the transform’s ability in generating “enough”
zero trees. For nonsmooth images (such as SAR image) that
contain a lot of texture and edges, wavelet-based zero tree
algorithms are not efficient. As will be seen shortly, our proposed OBTWC shown in Figure 1 is a lot better in terms of
achieving higher compression ratio while retaining the same
perceptual image quality.
2.2.
Theory of OBTWC
The theory of lattice structures and design methods for the
two-channel filter banks are well established [13, 17]. It is
shown in [13] that linear-phase and paraunitary properties cannot be simultaneously imposed on two-channel filter banks, unless for the special case of Haar wavelets. However, when more channels are allowed in the systems, both
of the above properties can coexist [13]. For instance, the
DCT (discrete cosine transform) and LOT (lapped orthogonal transform) are two examples where both the analysis and
synthesis filters Hk (z) and Fk (z) are linear-phase FIR filters
and the corresponding filter banks are paraunitary. In this
section, the lattice structure of the M-channel linear-phase
C. Kwan et al.
3
Wavelet transform
H0w1
DC
x[n]
H0 (z)
H1 (z)
H1w1
2
2
M
.
.
.
.
.
.
.
.
.
2
2
M
H1w1
H0w1
HM −1 (z)
.
.
.
.
.
.
M
Embedded
bit-plane
coder Compressed
bit stream
Block transform
Figure 1: Proposed OBTWC.
paraunitary filter bank (OBTWC) is discussed. It is assumed
that the number of channels M is even and the filter length L
is a multiple of M, that is, L = NM.
It is shown in [6] that M/2 filters (in analysis or synthesis)
have symmetric impulse responses and the other M/2 filters
have antisymmetric impulse responses. Under the assumptions on N, M, and on the filter symmetry, the polyphase
transfer matrix H p (z) of a linear-phase paraunitary filter
bank of degree N − 1 can be decomposed as a product of
orthogonal factors and delays [6], that is,
H p (z) = SQTN −1 Λ(z)TN −2 Λ · · · Λ(z)T0 Q,
(1)
where
I 0
Q=
,
0 J
I 0
Λ(z) =
,
0 z−1 I
1 S0 0
S= √
2 0 S1
I J
.
I −J
(2)
Here I and J are the identity and reversed matrices, respectively. S0 and S1 can be any M/2 × M/2 orthogonal matrices
and Ti are M × M orthogonal matrices
I I
Ti =
I −I
Ui 0
0 Vi
I I
= WΦi W,
I −I
(3)
where Ui and Vi are arbitrary orthogonal matrices. The
factorization [17] covers all linear-phase paraunitary filter
banks with an even number of channels. In other words,
given any collection of filters Hk (z) that comprise such a filter
bank, one can obtain the corresponding matrices S, Q, and
Tk (z). The synthesis procedure is given in [6]. The building
blocks in [17] can be rearranged into a modular form where
both the DCT and LOT are special cases [6],
H p (z) = KN −1 (z)KN −2 (z) · · · K1 (z)K0 ,
where Ki (z) = Φi WΛ(z)W.
(4)
The class of OBTWCs, defined in this way, allows us to view
the DCT and LOT as special cases, respectively, for N = 1 and
N = 2. The degrees of freedom reside in the matrices Ui and
Vi which are only restricted to be real M/2 × M/2 orthogonal matrices. Similar to the lattice factorization in (1), the
factorization in (4) is a general factorization that covers all
linear-phase paraunitary filter banks with M even and length
L = MN.
Based on our analysis, there still exists correlation between DC coefficients. To decorrelate the DC band even
more, several levels of wavelet decomposition can be used
depending on the input image size. Besides the obvious increase in the coding efficiency of DC coefficients thanks
to deeper coefficient trees, wavelets provide variably longer
bases for the signal’s DC component, leading to smoother
reconstructed images, that is, blocking artifacts are further
reduced. Regularity objective can be added in the transform
design process to produce M-band wavelets, and a waveletlike iteration can be carried out using uniform-band transforms as well.
The complete proposed coder diagram is depicted in
Figure 1. It is a hybrid combination of block transform and
wavelet transform. The waveform transform is used for the
DC band and overlapped block transforms are used for other
bands. The advantage is the enhanced capability of capturing
and separating the localized signal components in the frequency domain.
2.3.
Determination of block transform coefficients
The filter coefficients in Hi (z) of Figure 1 require very careful
design. We use the following well-known guidelines for filter
coefficients to produce a good perceptual image codec.
(i) The filter coefficients should be smooth and symmetric
(or antisymmetric). Smoothness controls the noise in
a region with constant background. Symmetry allows
the use of symmetric extension to process the image’s
borders.
(ii) They should decay to zero smoothly at both ends. Nonsmoothness at the ends causes discontinuity between
4
EURASIP Journal on Applied Signal Processing
(iii)
(iv)
(v)
(vi)
blocks when the image is compressed. This blocking
artifact is typical in JPEC because the DCT coefficients
are not smooth at the ends.
The bandpass and highpass filters should have no DC
leakage. Higher-frequency bands will be quantized
severely. It is desirable for the lowpass band to contain
all of the DC information. Otherwise, if the bandpass
and highpass responses to ω = 0 are not zero, we see
the checkerboard artifact.
The coefficients should be chosen to maximize coding
gain. The coding gain is an approximate measure of
energy compaction. A higher gain means higher energy compaction.
Their lengths should be reasonably short to avoid excessive ringing and reasonably long to avoid blocking.
In the frequency range |ω| ≤ π/M, the bandpass and
highpass responses should be small. This minimizes the
quantization effect on bandpass and highpass filters.
To satisfy the above properties, we used an optimization technique. The cost function is a weighted linear combination
of coding gain, DC leakage, attenuation around mirror frequencies, and stopband attenuation. It is defined as
Coverall = k1 Ccoding gain + k2 CDC + k3 Cmirror
+ k4 Canalysis stopband + k5 Csynthesis stopband
(5)
Ccoding gain = 10 log M −1
k=0
σxi2 fi 2
1/M ,
(6)
where σx2 is the variance of the input signal, σxi2 is the variance
of the ith subband, and fi 2 is the norm of the ith synthesis
filter.
The DC leakage cost function measures the amount of
DC energy that leaks out to the bandpass and highpass subbands. The main idea is to concentrate all signal energy at
DC into the DC coefficients. This proves to be advantageous
in both signal decorrelation and in the prevention of discontinuities in the reconstructed signals. Low DC leakage can
prevent the annoying checkerboard artifact that usually occurs when high-frequency bands are severely quantized. The
DC cost function is defined as
CDC =
M
−1 L
−1
hi (n).
(7)
i=1 n=0
The mirror frequency cost function is a generalization of
CDC . Frequency attenuation at mirror frequencies is important in the further reduction of blocking artifacts. The corresponding cost function is
Cmirror =
M
−1
i=0
jω 2
Hi e m ,
ωm =
2πm
,
M
1≤m≤
Canalysis stopband =
M
.
2
(8)
M
−1
Csynthesis stopband =
i=0
M
−1
i=0
ω∈Ωstopband
2
Wia e jω Hi e jω dω,
ω∈Ωstopband
2
Wis e jω Fi e jω dω.
(9)
In the analysis bank, the stopband attenuation cost helps
in improving the signal decorrelation and decreasing the
amount of aliasing. In meaningful images, we know a priori that most of the energy is concentrated in low-frequency
region. Hence, high stopband attenuation in this part of the
frequency spectrum becomes extremely desirable. In the synthesis bank, the reverse is true. Synthesis filters covering lowfrequency bands need to have high stopband attenuation
near and/or at ω = π to enhance their smoothness. The biased weighting can be enforced using two simple linear functions Wia (e jω ) and Wis (e jω ).
The optimization of cost function in (5) is performed
by using a nonlinear optimization routine called Simplex in
MATLAB. The results are the optimized filter coefficients.
2.4.
with ki the weighting factors.
The coding gain cost function is defined as
σx2
Stopband attenuation criterion measures the sum of all of the
filters’ energy outside the designated passbands. Mathematically,
Comparison summary between
OBTWC, DCT, and wavelet
Consumers and manufacturers are pushing for higher and
higher number of pixels in digital cameras, camcorders, and
high-definition TVs. All these advancements call for stringent demands for faster and nicer compression codecs. It will
be ideal for a codec to have fast compression and, at the same
time, achieves very satisfactory perceptual quality and signalto-noise ratio. The proposed OBTWC has exactly these qualities.
Table 1 summarizes the comparison between three codecs. It can be seen that the proposed codec has more advantages than DCT and wavelet. It is the balanced quality
between computational speed and performance that makes
the proposed OBTWC stands out among the other codecs.
2.5.
Implementation of a complete coder
The proposed method was implemented by replacing the
transform of an H.263+ codec by the GenLOT transform
(using only the I-frame mode for still image compression),
with appropriate coefficient reordering. The entropy coding
and other parts of the codec are kept the same.
3.
STILL IMAGE COMPRESSION
Although the component technologies of OBTWC for still
image compression were developed before this research, this
is the first time that we applied the software to SAR images,
and color images. Extensive comparative studies with two
commercial products have been carried out in this research.
C. Kwan et al.
5
Table 1: Comparison of different codecs.
DCT
(core technology in standards
such as JPEG, MPEG,
H263, etc.)
Performance metrics
Transmits most important information first
Wavelet
(zero-tree dyadic wavelet
transform and core
technology of JPEG-2000)
OBTWC
(proposed overlapped
block transform
wavelet coder)
Simplicity of block transform
(less memory required)
Encodes the whole frame
(larger on-board memory)
Block artifacts
(lose details in high compression ratio)
Better performance (than DCT)
More computations (than DCT)
Ringing effect
Flexibility in frequency spectrum partitioning
and higher energy compaction
Capture and separate localized signal
components in the frequency domain
Produces smoother and more perceptually
pleasant reconstructed images
Enhances the compression ratio of existing
techniques without sacrificing too much
of the performance/perceptual quality
Texture preservation
(suitable for SAR compression)
Reversible integer GenLOT available whereas the
standard codec does not allow reversible integer
transform (useful for mobile communications)
Parallel processing capability
In terms of military applications, one can directly apply our
still image compression algorithm for image storage and
archiving.
3.1. Benchmark images compression
In this section, we summarize the application of several
progression transmission codecs, including SPIHT (waveletbased method), JPEG, JPEG-2000, and our OBTWC. Benchmark images (Lena and Barbara) were used in this comparative study.
The objective performance criterion we used is called
peak signal-to-noise ratio (PSNR) which is defined as
PSNR = 10 log
2552
(1/M)
M
n=1 on
− rn
2 ,
(10)
where on is the nth pixel in the original image and rn is the
nth pixel in the reconstructed image. This is a popular objective method to measure distortion in image compression
Table 2: Coding results of various progressive coders for Lena.
Lena
Comp. ratio
1:8
1 : 16
1 : 32
1 : 64
1 : 100
1 : 128
Progressive transmission coders
SPIHT (9-7WL) JPEG JPEG-2000 OBTWC
40.41
37.21
34.11
31.10
29.35
28.38
39.91
36.38
32.90
29.67
27.80
26.91
40.32
37.27
34.14
31.00
29.12
28.00
40.43
37.32
34.23
31.16
29.31
28.35
applications. The higher the PSNR is, the better the compression and decompression performance is.
Table 2 summarizes the PSNR of Lena and Figure 2 depicts the PSNRs of different codecs at different compression
ratios. It can be seen that our codec performed consistently
better, except in two cases, than other codecs.
6
EURASIP Journal on Applied Signal Processing
Performance comparison of our
coder with three commercial coders
42
40
38
PSNR
36
34
32
30
28
26
0
20
40
60
80
100
120
140
Compression ratio
SPIHT
JPEG
(a)
JPEG-2000
OBTWC
(b)
Figure 2: PSNRs of various codecs at different compression ratios for Lena.
Table 3: Coding results of various progressive coders for Barbara.
Barbara
Comp. ratio
1:8
1 : 16
1 : 32
1 : 64
1 : 100
1 : 128
Progressive transmission coders
SPIHT (9-7WL) JPEG JPEG 2000 OBTWC
36.41
36.31
37.17
38.08
31.40
31.11
32.29
33.47
27.58
27.28
28.39
29.53
24.86
24.58
25.42
26.37
23.76
23.42
24.06
24.95
23.35
22.68
23.37
24.01
compression ratios were tried. The perceptual differences between the various coders are hard to discern by human eyes.
However, the objective performance index (PSNR) tells a big
difference. The PSNR is summarized in Table 4. We also plotted PSNRs versus compression ratios. As shown in Figure 4,
although our coder has comparable performance as the commercial products, in terms of computational complexity, our
algorithm allows parallel processing and hence is much more
efficient than other codecs.
3.2.2. Army’s SAR image
Similarly, Table 3 and Figure 3 summarize the PSNRs for
Barbara. Again, our proposed codec performed consistently
better than all other codecs.
3.2. SAR image compression
We have compressed four types of SAR images: two types
from the Air Force, one type from the Army, and one type
from NASA. Our algorithm outperforms both wavelet and
JPEG coders. The wavelet coder was developed by Summus,
Inc. We purchased one copy. It was claimed by Summus that
its coder is better than JPEG and other wavelet-based coders.
The baseline JPEG coder is a shareware from the Internet.
The web address is http://www.geocities.com/SiliconValley/
7726/.
3.2.1. Air Force cluttered SAR image
The SAR image (size: 512 × 480, gray scale: 8 bits/pixel) was
supplied by Air Force Wright Patterson Laboratory (Marvin
Soraya). We applied four algorithms to it: our OBTWC algorithm, Summus wavelet coder, JPEG-2000, and JPEG. Three
The SAR image (size: 764 × 764, gray scale: 8 bits/pixel) was
supplied by Army Research Laboratory in Fort Monmouth.
Again, four algorithms were applied and the performance is
summarized in Table 5. The PSNRs were also plotted against
the compression ratios (Figure 5). From Table 5, one can see
that our codec is slightly inferior to JPEG-2000 but much
better than the other two. But from practical implementation
perspective, our codec is much simpler and hence will offer
significant advantage for large images such as high-definition
TV images.
3.2.3. NASA’s SAR image
Spaceborne imaging radar-C/X-band synthetic aperture
radar (SIR-C/X-SAR) is a joint US-German-Italian Project
that uses a highly sophisticated imaging radar to capture images of Earth that are useful to scientists across a great range
of disciplines. The instrument was flown on two flights in
1994. One was on space shuttle Endeavor on mission STS-59
April 9–20, 1994. The second flight was on shuttle Endeavor
on STS-68 September 30–October 11, 1994.
C. Kwan et al.
7
Performance comparison of our
coder with three commercial coders
40
38
36
PSNR
34
32
30
28
26
24
22
0
20
40
60
80
100
120
140
Compression ratio
SPIHT
JPEG
(a)
JPEG-2000
OBTWC
(b)
Figure 3: PSNRs of various codecs at different compression ratios for Barbra.
Table 4: Performance comparison of our codec with 3 commercial
codecs for the Air Force SAR image.
Performance comparison of our
coder with three commercial coders
35
Algorithm\
compression ratio
34
33
8
16
32
PSNR
32
31
OBTWC
Summus
JPEG
JPEG-2000
34.14
30.64
28.42
33.06
29.83
27.78
32.02
29.40
27.61
34.77
31.16
28.95
30
29
Table 5: Performance comparison of our codec with three commercial codecs for an Army SAR image.
28
27
5
10
15
20
25
30
35
Compression ratio
Summus
JPEG
JPEG-2000
OBTWC
Algorithm\
compression ratio
8
16
32
OBTWC
Summus
JPEG
JPEG-2000
38.07
35.05
32.52
36.73
33.90
31.84
36.32
33.57
31.21
39.41
36.02
33.02
Figure 4: PSNR of four compression methods.
The image (size: 945 × 833, color depth: 8 bits/pixel)
shown in Figure 6 was a recently released image from the
SIR-C/X-SAR Project. We applied OBTWC, Summus, JPEG2000, and JPEG codecs to it. The results are summarized in
Table 6. The PSNRs versus compression ratios are plotted besides Table 6. Except the 32 : 1 compression ratio case, our
OBTWC outperforms the other codecs in the other two categories. Even in the 32 : 1 case, the OBTWC is only 0.01 dB
less than the wavelet coder is. The plots in Figure 7 show the
PSNRs of the three codecs. The OBTWC and Summus have
similar performance in this case.
3.3.
Color image compression
We were given four unclassified color images with the size of
344 × 244 and YUV (4 : 4 : 4) from the Wright Patterson Air
Force Laboratory, USA (http://www.wpafb.af.mil). The first
image is picture of 2s1 tank. The second is T62 tank. The
third is Zill31 armored car. The fourth one is Btr60 armored
car. Our OBTWC codec achieved better results in almost
all cases except 2s1 image. Table 7 summarizes the objective
performance of three coders under three different compression ratios. Plots of PSNRs versus the compression ratios are
shown in Figure 8.
8
EURASIP Journal on Applied Signal Processing
Table 6: Compression performance of 4 codecs to NASA SAR image.
Performance comparison of our
coder with three commercial coders
40
Algorithm\
compression ratio
39
38
Summus
JPEG
JPEG-2000
27.44
24.58
22.40
27.25
24.51
22.41
26.07
23.22
21.71
27.87
24.44
22.17
8
16
32
37
PSNR
OBTWC
36
35
34
33
32
31
Performance comparison of our
coder with three commercial coders
28
5
10
15
20
25
30
35
27
Compression ratio
JPEG-2000
OBTWC
Figure 5: PSNRs of four codecs.
26
PSNR
Summus
JPEG
25
24
23
22
21
5
10
15
20
25
30
35
Compression ratio
Summus
JPEG
JPEG-2000
OBTWC
Figure 7: PSNRs of four codecs.
Figure 6: Raw image from NASA.
3.4. Image enhancement of reconstructed images
The ringing effects in reconstructed images with high compression ratios are caused by the long filter lengths in
OBTWC. Although the ringing effect here is less significant
than wavelet coders are, it is still an annoying artifact that affects the visual perception of a reconstructed image. Here we
propose two approaches to minimize the ringing artifacts. It
is worth to mention that image enhancement is performed
at the receiving end, and hence this post-processing will not
affect the transmission speed.
3.4.1. Post-processing using nonlinear morphological filters
The key idea underlying the deringing algorithm is to avoid
filtering the entire image blindly, but instead to identify the
regions contaminated by ringing and apply the nonlinear
smoothing filter only to these regions. As such, the algorithm
is a signal-dependent (spatially varying) technique which requires the extraction of certain parameters from the input
image. The choice of a morphological smoothing operator
was due to its fit to the purpose and also its very low computational complexity.
Edge detection
Since the ringing artifact is known to be associated with step
edges, the algorithm starts with an edge detection process on
the input image. In case of compressed images, the edge detection process is even further complicated because of the
blur (associated with compression) which typically causes
false negatives (undetected edges) and also the ringing artifact ripples which typically cause false positives (false edges).
Consequently, we designed a 3-phase edge detection algorithm in which the following hold.
(1) The first phase is a baseline edge detection algorithm
employing Sobel edge detection operator (5 × 5). The
associated threshold for this baseline algorithm is extracted from the input image by paying attention to the
ringing around the step edges so that to the binary edge
map, only a very little amount of noise due to ringing
ripples penetrates.
(2) In spite of the careful threshold selection of the first
step, most of the time we still end up with some noise
C. Kwan et al.
9
Table 7: Summary of comparative studies for color images.
Images\
PSNR
JPEG
32 : 1
2s1
T62
Zil131
Btr60
31.56
28.45
28.33
30.48
Summus
32 : 1
32 : 1
32.44
29.07
29.15
29.07
OBTWC
32 : 1
31.96
28.70
28.56
31.75
32.18
30.05
30.03
32.63
JPEG
64 : 1
64 : 1
28.78
25.37
25.36
27.93
Summus
64 : 1
JPEG2000
29.67
26.37
26.27
26.37
28.77
25.72
25.47
28.70
OBTWC
64 : 1
100 : 1
29.52
27.15
26.99
29.79
Performance comparison of our
coder with three commercial coders
33
JPEG
100 : 1
Summus
100 : 1
JPEG2000
OBTWC
100 : 1
26.64
23.45
23.44
26.22
28.18
24.97
24.87
24.97
27.15
24.24
23.94
26.87
28.20
25.61
25.42
28.32
Performance comparison of our
coder with three commercial coders
31
32
30
31
29
28
30
PSNR
PSNR
JPEG2000
29
27
26
28
25
27
26
30
24
40
50
60
70
80
90
23
30
100
40
50
Compression ratio
Summus
JPEG
JPEG-2000
OBTWC
Summus
JPEG
80
90
100
90
100
JPEG-2000
OBTWC
(b) T62
Performance comparison of our
coder with three commercial coders
31
33
30
32
29
31
Performance comparison of our
coder with three commercial coders
30
PSNR
28
PSNR
70
Compression ratio
(a) 2s1
27
26
29
28
27
25
26
24
25
23
30
60
40
50
60
70
80
90
100
24
30
40
Compression ratio
Summus
JPEG
JPEG-2000
OBTWC
(c) Btr60
50
60
70
80
Compression ratio
Summus
JPEG
JPEG-2000
OBTWC
(d) Zil131
Figure 8: PSNRs of three codecs for the four color images.
in the binary edge map. To clean this noise, we use a
morphological filter consisting of some pruning and
hit-or-miss operations.
(3) The cleaned edge map typically has significant discontinuities along many of its edge traces. In this case
through a high-level processing, these edge discontinuities are eliminated by edge tracking and linking. As
a result, we have a binary edge map which is much improved as compared to the raw output from the first
step.
10
EURASIP Journal on Applied Signal Processing
Edge mask
Final image generation
The second major step is the generation of the so-called “edge
mask.” This phase is carried out essentially by a binary closing operation (3 × 3) on the output of the edge detection
phase. The edge mask serves the very important purpose of
protecting many genuine image features and high-frequency
details such as edges with narrow pulse-like profiles and texture from being destroyed by the consequent morphological
smoothing operation.
The final phase is the generation of the filter deringing output. For this purpose, we do the following. We keep the regions of the input image covered by the filtering mask intact. However, the regions of the input image exposed by the
filtering mask (i.e., those regions which are filtered in the
fourth phase) are copied from the output of the morphological smoothing filter and pasted on to the input image. This
generates the output of the deringing filter.
We applied the deringing filter to Lena. Figure 9 shows
the results for a compression ratio 100 : 1. It can be seen that
the image after post-processing is much better in terms of
perceptual performance than the reconstructed image in the
middle.
Filtering mask
The third major phase is the generation of the so-called “filtering mask.” This phase is carried out by a dilation operation (3 × 3) on the output of the edge detection phase (to
isotropically mark the regions surrounding the edges where
we know that only these regions are subject to being contaminated with ringing) and then an exclusive-OR operation
between the dilation result and the edge mask (output of the
second phase) which will remove the regions covered by the
edge mask from the filtering mask so that the regions covered
by the edge mask will not be filtered. This sequence of operations generates the so-called raw filtering mask. One major
feature of the algorithm is that it is employing human visual
system (HVS) properties to further process the raw filtering
mask and eliminate from it those regions which because of
their content and also the masking properties of HVS will
not reveal the ringing noise confined to their boundaries. For
example, textured regions which could not be identified because of blur in the edge detection step, and therefore not
protected by the edge mask, will typically be detected during
this phase and consequently removed from the raw filtering
mask. The above-mentioned upper local variance limit attributable to ringing ripples is a signal-dependent quantity as
well as its dependence on the compression level and we handle it in the appropriate way and extract it from the image in
a spatially adaptive way. Once the HVS-based modification
is performed on the raw filtering mask, we have the so-called
final filtering mask or shortly the filtering mask.
Morphological smoothing
The fourth major phase of the algorithm is the morphological smoothing of the image regions lying under the exposed regions of the filtering mask. For this purpose, we use a
simple averaged gray-level morphological opening and closing filter (3 × 3). The opening filter in a sense extracts the
lower bounding envelope of the ringing ripples, and in a dual
manner the closing filter in a sense extracts the upper bounding envelope of the ringing ripples, and in their arithmetical
average the ringing ripples are to a very great extent eliminated. All of these processings are performed through integer
arithmetic and local min/max operations on gray-level data.
Needless to say, the binary morphological operations of the
previous steps are performed by logical shift, and AND/OR
operations on binary data.
3.4.2. Post-processing using median filter
This approach consists of two steps. First, an edge detection algorithm (Canny’s algorithm) is used to determine the
significant edges in a reconstructed image. Second, a median
filter (3 × 3) is then applied to eliminate the ringing. A median filter is a nonlinear filter that chooses the median of 9
elements in a 3 × 3 window. The idea is to eliminate highamplitude noise without blurring the edges. Figure 10 shows
the results. The perceptual performance did improve after
post-processing. The perceptual performance improvement
of median filtering is comparable to morphological filter described in Section 3.4.1 It appears that the median filter is
simpler than the previous approach.
3.5.
New region-of-interest (ROI) enhancement
capability
In progressive image transmission, the most important information is transmitted first. The importance of pixels in a
picture is reflected by the magnitude of its transformed coefficients. Therefore, the key idea here is that if we want to
highlight a region in an image, we need to scale up the coefficients in that particular region. We achieve this goal by
using Visual Basic. An interface of the software is shown in
Figure 11. First, an image is loaded onto the screen. Second,
a mouse is used to draw a box that one wants to highlight.
The coordinates of the box are passed to the image algorithm
so that the appropriate blocks will be highlighted. Third, a
weight factor is selected from the screen. The weighting factor scales all the coefficients in the region of interest.
Figure 12 shows the performance of image compression
with ROI enhancement. The tip of the gun barrel of a tank is
highlighted. It can be seen that the image with ROI enhancement is better than the one without this option.
3.6.
Computational complexity analysis
We have mainly used three methods in this research: DCT,
wavelet, and GenLOT transforms. Since every component in
coding and decoding is the same except in the transformation stage, we performed a complexity analysis of the three
C. Kwan et al.
11
(a) 100 : 1 by OBTWC
(b) 100 : 1 after post-processing
Figure 9: Effects of morphological deringing filter on Lena.
(a) 100 : 1
(b) 100 : 1 after post-processing
Figure 10: Post-processing using median filter.
schemes. Table 8 summarizes the number of computations
by using software for a given N × N image. It is worth mentioning that if no parallel implementation for both DCT and
GenLOT is done, then it can be seen that DCT is the most
efficient one, followed by wavelet and GenLOT. Figure 13
shows the number of computations versus image size N. All
three grow exponentially if no parallel implementation is
used.
However, if one implements the DCT and GenLOT in
a parallel manner by taking advantage of the block transformation characteristics, one can see that the DCT and
GenLOT can be very efficient. As can be seen from Table 9
and Figure 14, DCT and GenLOT algorithms stay almost flat
while the wavelet transform still grows exponentially.
3.7. Summary of the results
From all the experiments presented above, it is found that
the proposed method can compress images with better or
about the same PSNR as the two competing approaches. In
all these examples, the visual quality of the compressed image
from the proposed method is often better than the competing
approaches. For those cases where the proposed method has
slightly lower PSNR than the wavelet coder, there is little difference in visual quality. Also, the proposed post-processing
techniques are found to be effective in removing the ringing
artifacts at extreme compression ratio. In particular, the system supports selective enhancement of an ROI.
4.
CONCLUSIONS AND FURTHER RESEARCH
In this paper, we presented a complete codec for image
compression based on overlapped block transform, which
has been tested extensively on benchmark images (Lena
and Barbara), SAR, and color images. For aggressive image
compression, post-processing is absolutely essential in order to reduce unavoidable coding artifacts. Thus, we also
presented two methods that can enhance the perceptual
quality of decompressed images. Finally, an ROI enhancement method is included in the proposed system, which can
12
EURASIP Journal on Applied Signal Processing
Figure 11: Interface of the region-of-interest program.
ROI that needs to
be emphasized
(a) Original image
(b) 100 : 1 compression with enhanced
tip of gun barrel
(c) 100 : 1 compression without ROI
enhancement
Figure 12: Comparison of images with and without ROI enhancement.
control the compression ratio at certain critical regions of
the images so that target recognition performance can be
preserved. Extensive comparative studies with a commercial
product (Summus—a wavelet-based codec), a JPEG baseline codec, and a JPEG-2000 codec showed that the proposed
method achieved better performance in most cases.
While there exists extensive work on the reduction of
blocking artifacts in a DCT-based scheme, such as the
projection-onto-convex-sets (POCSs) approaches and others
[5, 18–25], they are mostly post-processing techniques that
work on a blocky image. Theoretically, since the information
is already lost, these post-processing techniques cannot really
reconstruct the original image but only improve the visual
Table 8: Software implementation: computational complexity of
DCT, GenLOT, and wavelet for a given N × N image.
Method\complexity
8∗ 8 DCT
8∗ 40 GenLOT
9/7 4-L wavelet
Multiplications
Additions
3.25N 2
40N 2
11.9N 2
7.25N 2
78N 2
18.6N 2
quality of the image by smoothing out the artifacts. The overlapped block transform solves the issue by virtually eliminating the block boundaries in the first place, and thus providing a more attractive way of addressing the blocking artifact
C. Kwan et al.
13
×107
4
8
3.5
7
3
2.5
2
1.5
1
Additions
9
Number of additions
Number of multiplications
×107
Multiplications
4.5
0.5
6
5
4
3
2
1
0
100 200 300 400 500 600 700 800 900 1000 1100
0
100 200 300 400 500 600 700 800 900 1000 1100
N
N
8 × 8 DCT
8 × 40 GenLOT
9/7 4 − L wavelet
8 × 8 DCT
8 × 40 GenLOT
9/7 4 − L wavelet
(b)
(a)
Figure 13: Software implementation. (All three grow exponentially with wavelet the least efficient.)
×104
12
12
10
10
8
6
4
Additions
14
Number of additions
Number of multiplications
×104
Multiplications
14
8
6
4
2
2
0
100 200 300 400 500 600 700 800 900 1000 1100
N
0
100 200 300 400 500 600 700 800 900 1000 1100
N
8 × 8 DCT
8 × 40 GenLOT
9/7 4 − L wavelet ×100
8 × 8 DCT
8 × 40 GenLOT
9/7 4 − L wavelet ×100
(a)
(b)
Figure 14: Parallel hardware implementation. (DCT and GenLOT stay almost flat; wavelet grows exponentially.)
issue. Nevertheless, it would be an interesting future task to
compare such an overlapped transform approach with one
leading deblocking algorithm to examine the performance of
both approaches.
Table 9: Parallel hardware implementation: computational complexity of DCT, GenLOT, and wavelet.
Method\complexity
∗
ACKNOWLEDGMENTS
This research was supported by the Ballistic Missile Defense
Organization (BMDO) under Contract no. F33615-99-C1474 and managed by the Air Force. The encouragement
8 8 DCT
8∗ 40 GenLOT
9/7 4-L wavelet
Multiplications
Additions
3.25 × 64
40 × 64
11.9N 2
7.25 × 64
78 × 64
18.6N 2
from Mr. Marvin Soraya at Wright-Patterson Air Force Laboratory is deeply appreciated.
14
REFERENCES
[1] C. Kwan, B. Li, R. Xu, et al., “SAR image compression using
wavelets,” in Wavelet Applications VIII, vol. 4391 of Proceedings
of SPIE, pp. 349–357, 2001.
[2] W. B. Pennebaker and J. L. Mitchell, JPEG: Still Image Compression Standard, Van Nostrand Reinhold, New York, NY,
USA, 1993.
[3] H. S. Malvar, Signal Processing with Lapped Transforms, Artech
House, Norwood, Mass, USA, 1992.
[4] H. S. Malvar, “Biorthogonal and nonuniform lapped transforms for transform coding with reduced blocking and ringing artifacts,” IEEE Transactions on Signal Processing, vol. 46,
pp. 1043–1053, 1998.
[5] H. C. Reeves and J. S. Lim, “Reduction of blocking effects in
image coding,” Optical Engineering, vol. 23, no. 1, pp. 34–37,
1984.
[6] R. L. de Queiroz, T. Nguyen, and K. R. Rao, “GenLOT: generalized linear-phase lapped orthogonal transform,” IEEE Transactions on Signal Processing, vol. 44, no. 3, pp. 497–507, 1996.
[7] T. Tran and T. Nguyen, “On M-channel linear phase FIR filter
banks and application in image compression,” IEEE Transactions on Signal Processing, vol. 45, no. 9, pp. 2175–2187, 1997.
[8] Z. Xiong, K. Ramchandran, and M. T. Orchard, “Spacefrequency quantization for wavelet image coding,” IEEE Transactions on Image Processing, vol. 6, no. 5, pp. 677–693, 1997.
[9] J. P. Princen, A. W. Johnson, and A. B. Bradley, “Subband/transform coding using filter bank designs based on time
domain aliasing cancellation,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
(ICASSP ’87), pp. 2161–2164, Dallas, Tex, USA, April 1987.
[10] J. M. Shapiro, “Embedded image coding using zerotrees of
wavelet coefficients,” IEEE Transactions on Signal Processing,
vol. 41, no. 12, pp. 3445–3462, 1993.
[11] A. Said and W. A. Pearlman, “A new fast and efficient image
codec on set partitioning in hierarchical trees,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 6, pp.
243–250, 1996.
[12] “Compression with reversible embedded wavelets,” RICOH
Company Ltd. submission to ISO/IEC JTC1/SC29/WG1 for
the JTC1.29.12 work item, 1995. Can be obtained on the
World Wide Web, http://www.crc.ricoh.com/CREW.
[13] P. P. Vaidyanathan, Multirate Systems and Filter Banks,
Prentice-Hall, Englewood Cliffs, NJ, USA, 1993.
[14] G. Strang and T. Nguyen, Wavelets and Filter Banks, WellesleyCambridge Press, Wellesley, Mass, USA, 1996.
[15] M. Vetterli and J. Kovačević, Wavelets and Subband Coding,
Prentice-Hall, Englewood Cliffs, NJ, USA, 1995.
[16] R. A. DeVore, B. Jawerth, and B. J. Lucier, “Image compression through wavelet transform coding,” IEEE Transactions on
Information Theory, vol. 38, no. 2, part II, pp. 719–746, 1992.
[17] I. Daubechies, Ten Lectures on Wavelets, CBMS Conference Series, SIAM, Philadelphia, Pa, USA, 1992.
[18] N. F. Law and W. C. Siu, “Successive structural analysis using
wavelet transform for blocking artifacts suppression,” Signal
Processing, vol. 81, no. 7, pp. 1373–1387, 2001.
[19] A. Zakhor, “Iterative procedures for reduction of blocking effects in transform image coding,” IEEE Transactions on Circuits
and Systems for Video Technology, vol. 2, no. 1, pp. 91–95, 1992.
[20] Y. Yang, N. P. Galatsanos, and A. K. Katsaggelos, “Regularized reconstruction to reduce blocking artifacts of block discrete cosine transform compressed images,” IEEE Transactions
on Circuits and Systems for Video Technology, vol. 3, no. 6, pp.
421–432, 1993.
EURASIP Journal on Applied Signal Processing
[21] Y. Yang and N. P. Galatsanos, “Projection-based spatially adaptive reconstruction of block-transform compressed images,”
IEEE Transactions on Image Processing, vol. 4, no. 7, pp. 896–
908, 1995.
[22] S. D. Kim, J. Yi, H. M. Kim, and J. B. Ra, “A deblocking
filter with two separate mode in block-based video coding,”
IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 2, pp. 156–160, 1999.
[23] H. W. Park and Y. L. Lee, “A post-processing method for reducing quantization effects in low bit-rate moving picture
coding,” IEEE Transactions on Circuits and Systems for Video
Technology, vol. 9, no. 2, pp. 161–171, 1999.
[24] A. W.-C. Liew and H. Yan, “Blocking artifacts suppression
in blockcoded images using overcomplete wavelet representation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 14, no. 4, pp. 450–461, 2004.
[25] C. Weerasinghe, A. W.-C. Liew, and H. Yan, “Artifact reduction in compressed images based on region homogeneity constraints using the projections onto convex sets algorithm,”
IEEE Transactions on Circuits and Systems for Video Technology, vol. 12, no. 10, pp. 891–897, 2002.
C. Kwan received his B.S. degree in electronics with honors from the Chinese University of Hong Kong in 1988 and his M.S.
and Ph.D. degrees in electrical engineering from the University of Texas at Arlington in 1989 and 1993, respectively. From
April 1991 to February 1994, he worked in
the Beam Instrumentation Department of
the Superconducting Super Collider Laboratory (SSC) in Dallas, Tex, where he was
heavily involved in the modeling, simulation, and design of modern digital controllers and signal processing algorithms for the
beam control and synchronization system. He received an Invention Award for his work at SSC. Between March 1994 and June
1995, he joined the Automation and Robotics Research Institute
in Fort Worth, where he applied neural networks and fuzzy logic to
the control of power systems, robots, and motors. Since July 1995,
he has been with Intelligent Automation, Inc. in Rockville, Md. He
has served as the Principal Investigator/Program Manager for more
than 65 different projects, with total funding exceeding 20 million
dollars. Currently, he is the Vice President, leading research and development efforts in signal/image processing and controls. He has
published more than 40 papers in archival journals and has had 100
additional refereed conference papers.
B. Li received a Ph.D. degree in electrical engineering from the University of Maryland,
College Park, in 2000. He is currently an Assistant Professor of computer science and
engineering in the Arizona State University.
He was previously a Senior Researcher with
Sharp Laboratories of America (SLA), Camas, Wash, working on multimedia analysis for consumer applications. He was the
Technical Lead in developing Sharp’s hiimpact technologies. He was also an adjunct faculty member with
the Portland State University from 2003 to 2004. His research interests include pattern recognition, computer vision, statistic methods, and multimedia processing. He is a Senior Member of IEEE.
C. Kwan et al.
R. Xu received the B.S. degree from Jiangsu University in 1982, and
the M.S. degree in 1988 from Xi’an Jiaotong University, China, both
in electrical engineering. From 1982 to 1985 and from 1988 to 1993,
he was teaching at Jiangsu University as an Assistant Professor.
From 1993 to 1994, he was a Visiting Scholar at Lehrstuhl für Allgemeine und Theoretische Elektrotechnik, Universität ErlangenNürnberg, Germany. Since 1994, he has been with Intelligent Automation, Inc. (IAI), USA, where he is currently a Principal Engineer. His research interests include array signal processing, image
processing, fault diagnostics, network security, and control theory
and applications. Over the last 11 years with IAI, he has worked
on many different research projects in the above areas funded by
various US government agencies such as DoD and NASA. He has
also published over 20 journal and conference papers in the related
areas.
X. Li received his B.S. and M.S. degrees in
electrical engineering from Xi’an Jiaotong
University, China, in 1992 and 1995, respectively. He obtained his Ph.D. degree in
electrical engineering from the University
of Cincinnati, Ohio, in 2004. From 1995 to
1999, he was an Assistant Professor at Xi’an
Jiaotong University. He worked as a Visiting Researcher at Siemens Corporate Research (SCR), Princeton, NJ, in 2002, and
Mitsubishi Electronic Research Labs (MERL), Cambridge, Mass, in
2003. Since 2004, he has been with Intelligent Automation, Inc. as a
Research Engineer. His research interests include image/video processing and analysis, optical/electronic imaging, medical imaging,
computer vision, machine learning, pattern recognition, artificial
intelligence, real-time system, and data visualization. He is a Member of the IEEE, SPIE, and Sigma Xi.
T. Tran received the B.S. and M.S. degrees
from the Massachusetts Institute of Technology, Cambridge, in 1993 and 1994, respectively, and the Ph.D. degree from the
University of Wisconsin, Madison, in 1998,
all in electrical engineering. In July of 1998,
he joined the Department of Electrical and
Computer Engineering, The Johns Hopkins
University, Baltimore, Md, where he currently holds the rank of Associate Professor.
His research interests are in the field of digital signal processing,
particularly in multirate systems, filter banks, transforms, wavelets,
and their applications in signal analysis, compression, processing,
and communications. He was the Codirector (with Professor J. L.
Prince) of the 33rd Annual Conference on Information Sciences
and Systems (CISS’99), Baltimore, Md, in March 1999. He received
the NSF CAREER Award in 2001. In the summer of 2002, he was an
ASEE/ONR Summer Faculty Research Fellow at the Naval Air Warfare Center Weapons Division (NAWCWD) at China Lake, Calif.
He currently serves as an Associate Editor of the IEEE Transactions
on Signal Processing as well as IEEE Transactions on Image Processing. He is also a Member of the Signal Processing Theory and
Methods (SPTM) Technical Committee of the IEEE Signal Processing Society.
15
T. Nguyen received the B.S, M.S, and
Ph.D. degrees in electrical engineering
from the California Institute of Technology,
Pasadena, in 1985, 1986, and 1989, respectively. He was with MIT Lincoln Laboratory from June 1989 to July 1994, as a member of the technical staff. During the academic year 1993–1994, he was a Visiting
Lecturer at MIT and an Adjunct Professor at
Northeastern University. From August 1994
to July 1998, he was with the Electrical and Computer Engineering (ECE) Department, University of Wisconsin, Madison. He was
with Boston University from August 1996 to June 2001. He is currently a Professor at the ECE Department, University of California,
San Diego (UCSD). His research interests include video processing
algorithms and their efficient implementation. He is the coauthor
(with Professor Gilbert Strang) of a popular textbook, Wavelets &
Filter Banks. He has over 200 publications. He received the NSF
Career Award in 1995 and is currently the Series Editor (Digital
Signal Processing) for Academic Press. He served as an Associate
Editor for the IEEE Transaction on Signal Processing from 1994 to
1996, for the IEEE Transaction on Circuits and Systems from 1996
to 1997, and for the IEEE Transaction on Image Processing from
2004 to 2005. He is a Fellow of the IEEE.