plate (like line, circle ...), appears to be a particu-
lar case of the well-known Radon transform. Seg-
man (Segman, 1992; Segman and Zeevi, 1993; Ru-
binstein et al., 1991) has shown that the Fourier trans-
form written with an appropriate kernel could be seen
as a cross correlation function adapted to the general
affine transformations.
In the present work, we extend Segman’s model to
more general group of transformations such as pro-
jective, non-uniform dilatation, and also to any tem-
plates, e.g. non-parametric shapes (planar or 3D ob-
jects). Based on representation theory, we propose a
framework (Ben Youssef, 2004) for studying image
deformations applicable not only in the plane but also
in other domains like the sphere. This framework in-
volves three steps:
• Identify the domain of definition of the signal as a
homogeneous space and the group acting on it.
• Check whether an invariant measure exists for the
acting group.
• Compute the transformation (group action) from
the adapted correlation.
The remaining of the paper is organized as follows.
Sec. 2 presents a short introduction to representation
theory, in particular the generalized correlation and
the convolution theorem is detailed. Sec. 3 deals with
four interesting cases for image processing, i.e. the
planar and vectorial similarity groups, the motion and
the projective groups. Several experiments on real im-
ages illustrate the different cases and confirm the nu-
merical feasibility of the methodology.
2 GROUP BASED CORRELATION
This section is not meant as a formal enumeration of
the assumptions we make. It is a rather intuitive de-
scription of what is required to apply the recipe of a
generalized Fourier transform. We assume that the
reader is familiar with the concept of group. Let us
live with an intuitive definition of a Lie group: its
elements are on smooth manifold and that the group
operation and the inversion are smooth maps (Miller
and Younes, 2001). The real line and the circle are Lie
groups with respect to addition and well known ma-
trix Lie groups are the general linear group of square
invertible matrices, the rotation groups SO(n), and
the Lorentz group.
A group G is acting on a space X when there is a
map G×X −→ X such that the identity element of the
group leaves X as is and a composition of two actions
has the same effect as the action of the composition
of two group operations (associativity). For example,
the isometry group SE(2) acts on the plane R
2
. The
rotation group SO(3) can act on the sphere S
2
. The
set of all gx ∈ X for any g ∈Gis called the orbit of
x. If the group possesses an orbit, that means for any
a, b ∈ X,ga = b for a g ∈G, then the group action
is called transitive. For example, there is always a
rotation mapping one point on the sphere to another.
If a subgroup H of G fixes a point x ∈ X then H
is called the isotropy group. A typical example of
an isotropy group is the subgroup SO(2) of SO(3)
acting on the north-pole of a sphere.
A space X with a transitive Lie group action G is
called homogeneous space. If the isotropy group is
H, it is denoted with G = H. The plane R
2
is the
homogeneous space SE(2) = SO(2). The sphere
S
2
is the homogeneous space SO(3) = SO(2). Im-
ages are usually defined on homogeneous spaces and
their deformations are the group actions. The ques-
tion is now, for which groups we can explicit a cor-
relation function and does a Fourier transform exist?
The answer requires, first, to be able to integrate on
the group and on the homogeneous space and, sec-
ond, to find the Fourier basis analogous to e
ixω
on the
real line (Miller and Younes, 2001).
We consider a simple two dimensional translation
registration problem. The classical correlation :
C(u, v)=
R
2
f(x, y) h(x − u, y − v) dxdy, (1)
is extended in a natural way to include rotations and
dilations of the template object. Essentially, we ro-
tate, and dilate the template object, overlap it with the
image and compute an overlap area (weighted by the
intensity value at each pixel) with the proper normal-
ization.
C
ex
(α, β)=
R
2
f
x
y
h
αx cos β − αy sin β
αx sin β + αy cos β
dx dy.
The function C
ex
has a maximum that will determine
the scale and the orientation of a target. One limita-
tion of this method is its incapacity to detect a ref-
erence target at different scale (Fig 1). Therefore,
we need to determine a correlation type depending on
transformation parameters.
2.1 Generalized Correlation
Function
Let L (X,dµ) denotes the set of functions which have
a mean defined as
X
|f(x)|
2
dµ(x) where µ is an in-
variant measure. The correlation of functions f
1
and
f
2
for all g ∈Gwith respect to group G is given by
(f
1
⊕ f
2
)(g)=
X
f
1
(g
−1
s)f
2
(s)χ(g)dµ(s) (2)
VISAPP 2006 - IMAGE ANALYSIS
318