ment of accurate image encoding/decoding could be
achieved and whether the simultaneous learning of
AE and STN weights is applicable. Our answer for
the later is negative, from either the reconstruction
or anomaly detection point of view, but a pre-trained
STN, to normalize the orientation of input patters, can
improve the reconstruction and defect detection, even
if the input patterns produce multiple minima in the
loss (J(x, ω)), when applying the orientation normal-
izing filter mechanism (Section 6). To underlie the
above statements we tested the models for three dif-
ferent datasets. We found that the often used Tensor-
Flow implementation of STNs (STN, 2016) has in-
accurate interpolation (Fig. 4). Our proposed nor-
malization approach, using an accurate transforma-
tion block, could outperform the base AE method in
almost all cases and metrics. In future we are to in-
vestigate the performance of other transformations.
ACKNOWLEDGEMENTS
We acknowledge the financial support of the projects
2018-1.3.1-VKE-2018-00048 under the
´
UNKP-19-
3 New National Excellence Program, 2020-4.1.1-
TKP2020 under the Thematic Excellence Program,
and the Hungarian Research Fund grant OTKA K
135729.
REFERENCES
(2016). TensorFlow implementation of STN. https://github.
com/daviddao/spatial-transformer-tensorflow. Ac-
cessed: 2021-11-04.
Alaverdyan, Z., Jung, J., Bouet, R., and Lartizien, C.
(2020). Regularized siamese neural network for un-
supervised outlier detection on brain multiparametric
magnetic resonance imaging: application to epilepsy
lesion screening. Medical Image Analysis, 60:101618.
Beggel, L., Pfeiffer, M., and Bischl, B. (2019). Robust
anomaly detection in images using adversarial autoen-
coders. arXiv preprint arXiv:1901.06355.
Bergmann, P., Batzner, K., Fauser, M., Sattlegger, D., and
Steger, C. (2021). The MVTec anomaly detection
dataset: a comprehensive real-world dataset for un-
supervised anomaly detection. International Journal
of Computer Vision, 129(4):1038–1059.
Bergmann, P., Fauser, M., Sattlegger, D., and Steger, C.
(2019). MVTec AD–A comprehensive real-world
dataset for unsupervised anomaly detection. In Pro-
ceedings of the IEEE/CVF Conference on Computer
Vision and Pattern Recognition, pages 9592–9600.
Bergmann, P., L
¨
owe, S., Fauser, M., Sattlegger, D., and Ste-
ger, C. (2018). Improving unsupervised defect seg-
mentation by applying structural similarity to autoen-
coders. arXiv preprint arXiv:1807.02011.
Bidart, R. and Wong, A. (2019). Affine variational autoen-
coders: An efficient approach for improving gener-
alization and robustness to distribution shift. arXiv
preprint arXiv:1905.05300.
Chianucci, D. and Savakis, A. (2016). Unsupervised change
detection using spatial transformer networks. In 2016
IEEE Western New York Image and Signal Processing
Workshop (WNYISPW), pages 1–5. IEEE.
Ding, P., Wan, S., Jin, P., and Zou, C. (2020). A rotation
invariance spatial transformation network for remote
sensing image retrieval. In Twelfth International Con-
ference on Digital Image Processing (ICDIP 2020),
volume 11519, page 115191P. International Society
for Optics and Photonics.
Dosovitskiy, A. and Brox, T. (2016). Generating images
with perceptual similarity metrics based on deep net-
works. Advances in neural information processing
systems, 29:658–666.
Jaderberg, M., Simonyan, K., Zisserman, A., et al. (2015).
Spatial transformer networks. Advances in Neural In-
formation Processing Systems, 28:2017–2025.
Lee, M. C., Oktay, O., Schuh, A., Schaap, M., and Glocker,
B. (2019). Image-and-spatial transformer networks
for structure-guided image registration. In Inter-
national Conference on Medical Image Computing
and Computer-Assisted Intervention, pages 337–345.
Springer.
Li, G., Xu, S., Liu, X., Li, L., and Wang, C. (2018). Jer-
sey number recognition with semi-supervised spatial
transformer network. In Proceedings of the IEEE
Conference on Computer Vision and Pattern Recog-
nition Workshops, pages 1783–1790.
Park, D. and Chun, S. Y. (2018). Classification based grasp
detection using spatial transformer network. arXiv
preprint arXiv:1803.01356.
R
´
adli, R. and Cz
´
uni, L. (2021). About the application of
autoencoders for visual defect detection. In 2021 29.
International Conference in Central Europe on Com-
puter Graphics, Visualization and Computer Vision,
pages 181–188. WSCG.
Sakurada, M. and Yairi, T. (2014). Anomaly detection
using autoencoders with nonlinear dimensionality re-
duction. In Proceedings of the MLSDA 2014 2nd
Workshop on Machine Learning for Sensory Data
Analysis, pages 4–11.
Tuluptceva, N., Bakker, B., Fedulova, I., Schulz, H.,
and Dylov, D. V. (2020). Anomaly detection
with deep perceptual autoencoders. arXiv preprint
arXiv:2006.13265.
Wang, Z., Bovik, A. C., Sheikh, H. R., and Simoncelli, E. P.
(2004). Image quality assessment: from error visi-
bility to structural similarity. IEEE Transactions on
Image Processing, 13(4):600–612.
Zhai, J., Zhang, S., Chen, J., and He, Q. (2018). Autoen-
coder and its various variants. In 2018 IEEE Interna-
tional Conference on Systems, Man, and Cybernetics
(SMC), pages 415–419. IEEE.
VISAPP 2022 - 17th International Conference on Computer Vision Theory and Applications
658