A short comparison with existing metadata systems
indicated the potential of our approach: it offers a
more complete characterization of the data sources
and covers a set of key features reported in the
literature and expanded in this work. Furthermore,
it provides the means for fast and efficient
retrieval of the required information.
Future research steps will include the full
implementation of the proposed mechanism using our
metadata model in the context of structured,
semi-structured and unstructured data. This will allow
us to evaluate our framework in more detail and, in
particular, to compare it further against existing
systems using performance metrics. This, in turn,
will allow us to focus on and improve privacy,
security, and data governance in Data Lakes by using
blockchain technology and smart contracts.
ACKNOWLEDGEMENTS
This paper is part of the outcomes of the CSA
Twinning project DESTINI. This project has received
funding from the European Union’s Horizon 2020
research and innovation programme under grant
agreement No 945357.