ing a histogram for each one. Each histogram is asso-
ciated with a local zone in the image providing spatial
information. A variant consists in dividing the im-
age in equal squares and compute one histogram for
each square (M. Mason, 2001). The Multi-Scale His-
togram Intersection Representation (MSHIR) (Gargi
and Kasturi, 1999) is another variant. It is a global to
local representation where the image is divided into
decreasing scale blocks. Another similar approach
consists in recursively dividing the image into regions
until each region has a homogeneous feature distribu-
tion or until the size of each region becomes smaller
than a given threshold value (H. Yamamoto and Take-
mura, 1999).
To reach real-time performance, it is necessary to
reduce the amount of data by quantizing the fea-
ture space before histogram computation. Consider-
ing color histograms, quantization consists in putting
close colors in the same histogram bin. Quantization
can be performed in different color spaces. (M. Ma-
son, 2001) applies a color depth reduction formula
to transform the 24-bit RGB color space to 12-bit,
(Crandall and Luo, 2004) work in CIE LAB color
space and reduce it to 267 standard colors in a first
stage before keeping only 10 basic colors. The CIE
LAB space has the advantage of being perceptually
uniform i.e. the Euclidean distance between two
colors corresponds to the human perception differ-
ence. The calculation of the distance between two
histograms is another way to reach real time com-
putation. In the case of quadratic histogram distance
(J. L. Hafner and Niblack, 1995), the weight matrix
that contains the coefficients denoting the similarity
between histogram bins can be diagonalized offline.
Filling several histogram bins with a unique pixel
is a good method to reduce influence of noise and
of quantization errors in histogram computation (Han
and Ma, 2002). Quadratic distance (J. L. Hafner and
Niblack, 1995) yields the same advantage but use
only the Euclidean distance in histogram similarity
computation.
3 COLOR LOCAL KERNEL
HISTOGRAMS
In the proposed technique, image is segmented into
overlapped local squares with a histogram for each
one to provide accurate spatial information. To re-
duce significantly the amount of data without loosing
important information because of coarse quantization,
the color space is quantized according to the most rep-
resentative colors extracted from the scene. A double
Gaussian kernel, one in the image space and one in
the color space bring robustness against noise. Tech-
nical implementation is described further below.
3.1 Image Partitioning
Histograms must be computed from a group of pix-
els. For maximum spatial accuracy, the image is par-
titioned in n × n square like regions that are over-
lapped with the same gap g for both image axis co-
ordinates. So, excluding the image edges, a pixel be-
longs to N
a
= (n/g)
2
regions.
On one hand, n must be large enough to smooth
both camera vibrations and waving objects in the
scene. On the other hand, too large regions prevent
accurate objects of interest detection. In experimental
results, n is fixed at 12 pixels with a gap g = 3. More
overlapping requires excessive computing resources.
3.2 Color Quantization
Quantization allows saving computer resources by re-
ducing the histogram sizes. Because camera noise
prevents distinguishing between all the 256×256 col-
ors in the U V space, this last is reduced to 40 × 40
colors. Then, a good option is quantizing taking into
account the most representative colors in the scene.
In this way, n
c
colors are selected from the image
reference to be associated to n
c
histogram bins. An
undefined color bin is added for other unselected col-
ors. Thus, all pixels not corresponding to one of the
selected colors is associated with the undefined color
bin.
This approach brings a great improvement in term
of computation time. To represent more than ninety
percent of reduced colors in a cluttered scene, the
color histogram size is set to only 15 color bins.
Moreover, this size is smaller than those reached
by good quantization: 64 with fuzzy histograms in
CBIR application (Han and Ma, 2002) or 1600 bins in
CIELAB (Crandall and Luo, 2004), and much smaller
than those usually reached: 4096 for the tracking al-
gorithm presented in (M. Mason, 2001) or 9796 for
color cooccurrence histograms (Chang and Krumm,
1999).
3.3 Kernels
Instead of associating one pixel with a unique region
and a unique histogram bin, Gaussian kernels are in-
troduced in both image and color space to bring more
flexible fuzzy associations between image and his-
tograms. Gaussian kernels are also chosen because of
its smoothing properties and are easily computed. For
computation efficiency, the kernels are pre-computed
and stored in lookup tables.
3.3.1 Spatial Gaussian Kernel
Pixels S
k
(x
k
, y
k
) in a local area l are weighted in
terms of distance from the area center. Thus, to com-
VISAPP 2006 - IMAGE ANALYSIS
214