above a given threshold, the corresponding class can
be assigned automatically by the system. Otherwise,
the user can be prompted to perform the classification
manually.
6.2 Future Work
Our system supports the user during the product clas-
sification process by using corresponding product im-
ages. To automate this task completely, further re-
search and development work is necessary. The sys-
tem can be combined with object detection and image
segmentation to detect the product in an image and cut
it out before feature extraction. This approach allows
almost any product image type to be used in the sys-
tem for classification. Currently, product images are
compared in their entirety. This can lead to a product
image not being classified correctly, when the product
only makes up a small part of the image.
The system can also be combined with additional
text-based and color-based techniques to further in-
crease classification performance. Despite the high
accuracy of the system, errors can occur during a fully
automated classification process. The user can be
asked for input if the probability of a predicted class
is low and below a given threshold to minimize the
error rate of the system. In this way, the user inter-
actively contributes to the improvement of the system
by generating labeled image data, which can be used
for classification in the future.
ACKNOWLEDGEMENTS
This work was funded by the German Federal Min-
istry of Education and Research (FKZ 01IS20085).
REFERENCES
Allweyer, O., Schorr, C., Krieger, R., and Mohr, A. (2020).
Classification of Products in Retail using Partially Ab-
breviated Product Names Only. Proceedings of the
9th International Conference on Data Science, Tech-
nology and Applications - DATA, pages 67–77.
Bast, S. (2021). Image Matching for Product Image Clas-
sification based on Machine Learning. University of
Applied Science Trier, Institute for Software Systems,
Master thesis.
Chavaltada, C., Pasupa, K., and Hardoon, D. R. (2017). A
Comparative Study of Machine Learning Techniques
in Automatic Product Categorisation. Proceedings
of the 14th International Symposium on Neural Net-
works - ISNN, pages 10–17.
Deng, J., Dong, W., Socher, R., Li, L.-J., and Li, L. K. F.-F.
(2009). ImageNet: a Large-Scale Hierarchical Image
Database. IEEE Conference on Computer Vision and
Pattern Recognition, pages 248–255.
Google (2022). Google Vision API. https://cloud.google.
com/vision. (2022-02-16).
GS1 (2018). Global Product Classification Development
& Implementation Guide. https://www.gs1.org/sites/
default/files/gpc development and implementation 1.
pdf. (2022-02-12).
GS1 (2019). GS1 Product Images Application Guideline for
the Retail Grocery & Foodservice Industries. https://
www.gs1us.org/grocery-image-guide. (2022-02-12).
He, K., Zhang, X., Ren, S., and Sun, J. (2015). Deep
Residual Learning for Image Recognition. CoRR,
abs/1512.03385.
Karami, E., Prasad, S., and Shehata, M. (2017). Image
Matching using SIFT, SURF, BRIEF and ORB: Per-
formance Comparison for Distorted Images. CoRR,
abs/1710.02726.
LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep learn-
ing. Nature, 524:436–444.
Lever, J., Krzywinski, M., and Altman, N. (2017). Principal
Component Analysis. Nature Methods, 14(7):641–
642.
Mitchell, T. M. (1997). Machine Learning. McGraw-Hill
Education Ltd.
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S.,
Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bern-
stein, M., Berg, A. C., and Fei-Fei, L. (2015). Ima-
geNet Large Scale Visual Recognition Challenge. In-
ternational Journal of Computer Vision (IJCV), pages
211–252.
scikit learn (2022). scikit-learn - Machine Learning in
Python. https://scikit-learn.org. version 1.0.2. (2022-
02-12).
Simonyan, K. and Zisserman, A. (2015). Very Deep Con-
volutional Networks for Large-Scale Image Recogni-
tion. 3rd International Conference on Learning Rep-
resentations, ICLR 2015.
Sokolova, M. and Lapalme, G. (2009). A systematic analy-
sis of performance measures for classification tasks.
Information Processing & Management, 45(4):427–
437.
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., and Wojna,
Z. (2016). Rethinking the Inception Architecture for
Computer Vision. 2016 IEEE Conference on Com-
puter Vision and Pattern Recognition (CVPR), pages
2818–2826.
Szeliski, R. (2021). Computer Vision: Algorithms and Ap-
plications. Springer, 2 edition.
Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., and Liu, C.
(2018). A Survey on Deep Transfer Learning. Arti-
ficial Neural Networks and Machine Learning, pages
270–279.
Wei, Y., Tran, S., Xu, S., Kang, B., and Springer, M. (2020).
Deep Learning for Retail Product Recognition: Chal-
lenges and Techniques. Computational Intelligence
and Neuroscience.
DATA 2022 - 11th International Conference on Data Science, Technology and Applications
300