multi-mosaics enforces the robot to remain at a single
position during image acquisition it is not possible to
gather visual information while moving. Neverthe-
less, this is not a serious drawback. Most of the time
a scene can even better be represented by scanning
it from some key positions within the scene than by
monitoring all the different pathways the robot pur-
suits. Besides, building mosaics while the robot is
moving bears different problems. On the one hand no
closed-form solution for modelling the camera mo-
tion is available since homographies do not hold in
such situations, and on the other hand appropriate mo-
saic images, like, e.g., manifold mosaics (Peleg et al.,
2000), often exhibit (perspective) distortions of the
scene data hampering easy analysis and scene under-
standing. The multi-mosaics are currently primarily
used to extract additional views of an object in object
learning and recognition. However, they could also
be used for extracting 3D data of the scene provided
that the mosaic 3D world positions are given as, e.g.,
proposed in (Teller, 1998).
6 SUMMARY AND
CONCLUSIONS
Active scene analysis and exploration gains increas-
ing importance in computer vision. Since analyz-
ing image sequences of active cameras has proven a
suitable base for extracting useful information from
a scene, interactive and mobile systems are nowa-
days often equipped with active sensing devices. The
visual scene memory based on multi-mosaics pre-
sented in this paper perfectly fits into this framework
as an additional module between active data acqui-
sition on the one hand and its analysis on the other.
The memory is based on a polytopial reference coor-
dinate system. In contrast to spheres and cylinders the
polytopes provide an euclidean reference frame and,
hence, allow the direct application of standard image
analysis techniques. This is important for interactive
systems since they can work on the memorized data as
on the originally acquired input images. Further on,
based on this euclidean mosaic representation and the
chosen data processing strategy, the data within the
memory is easily updated in an online fashion. Incre-
mental parameter estimation and integration heuris-
tics are used in combination with the focus image
plane. The latter masks the underlying polytope struc-
ture of the memory and thus allows efficient data ac-
cess despite present discontinuities on the memory it-
self. Given these techniques the memory works quite
stable in practice, nevertheless, future work has to be
carried out on investigating more robust online para-
meter estimation techniques and mechanisms for au-
tomatically detecting registration errors.
The memory is ideally suited to be used with inter-
active and mobile systems that have to store and after-
wards access image sequences. Especially systems in
human-machine interaction significantly benefit from
the memory as it provides an improved and more ef-
ficient exploition of available visual data and yields a
higher flexibility as it is necessary to act in dynam-
ically changing environments as well as to perform
intuitive communication with human beings.
REFERENCES
Bergen, J., Anandan, P., Hanna, K., and R.Hingorani
(1992). Hierarchical model-based motion estimation.
In ECCV, pages 237–252.
Bishop, G. and McMillan, L. (1995). Plenoptic modeling:
An image-based rendering system. In SIGGRAPH
Computer Graphics Proceedings, pages 39–46. An-
nual Conference Series.
Burt, P. and Anandan, P. (1994). Image stabilization by
registration to a reference mosaic. In Image Under-
standing Workshop, pages 1:425–434, Monterey, CA.
Capel, D. (2004). Image Mosaicing and Super-resolution.
Springer.
Coorg, S. and Teller, S. (2000). Spherical mosaics with
quaternions and dense correlation. International Jour-
nal of Computer Vision, 37(3):259–273.
Davis, J. (1998). Mosaics of scenes with moving objects.
In CVPR, pages (1):97–100, Santa Barbara, USA.
de Agapito, L., Hartley, R., and Hayman, E. (1999). Linear
self-calibration of a rotating and zooming camera. In
IEEE Int. Conference on Computer Vision and Pattern
Recognition, pages 15–21.
Hartley, R. and Zisserman, A. (2000). Multiple View Geom-
etry in Computer Vision. Cambridge University Press.
Ishiguro, H. and Tsuji, S. (1996). Image-based memory
of environment. In Proc. of Int. Conf. on Intelligent
Robots and Systems (IROS ’96), pages 634–639.
Mann, S. and Picard, R. (1996). Video orbits of the pro-
jective group: A new perspective on image mosaicing.
Technical Report 338, MIT Media Laboratory Percep-
tual Computing Section.
M
¨
oller, B. and Posch, S. (2002). Analysis of object interac-
tions in dynamic scenes. In Pattern Recognition, Proc.
of DAGM Symp., LNCS 2449, pages 361–369, Zurich,
Swiss. Springer.
M
¨
oller, B., Posch, S., Haasch, A., Fritsch, J., and Sagerer,
G. (2005). Interactive object learning for robot com-
panions using mosaic images. In Proc. of Int. Conf.
on Intelligent Robots and Systems (IROS), pages 371–
376, Edmonton, Canada.
Peleg, S., Rousso, B., Rav-Acha, A., and Zomet, A.
(2000). Mosaicing on adaptive manifolds. PAMI,
22(10):1144–1154.
VISAPP 2006 - IMAGE ANALYSIS
420