Design of Heterogeneous Data Warehouse Architecture for Supply
Chain Management System
Ruiqin Lin¹, Wenan Tan², Pan Liu³ and Lu Zhang¹
¹ College of Resources and Environmental Engineering, Shanghai Polytechnic University, Jinhai Road, Shanghai, China
² College of Computer and Information Engineering, Shanghai Polytechnic University, Jinhai Road, Shanghai, China
³ Information Technology Center, Shenzhen Easttop Supply Chain Management Co., Ltd., Zhouhai Road, Shanghai, China
Keywords: Data Warehouse, Data Quality, Data Integration, Big Data Development.
Abstract: The portal website of the supply chain management system covers multiple systems, which handle tasks
such as basic data entry, bill of lading generation, customs declaration, transportation, expenditure settlement,
and statistical reporting. Each system is spread among several departments, and the data is maintained by different
departments at different stages. Storage formats and semantics differ widely, the data is complex and
varied, and data quality is hard to guarantee during data integration: data is frequently lost,
and storage types are incompatible with one another. As a result, appropriate technical solutions are required
to ensure the supply chain management system's data quality after data integration. This article focuses on the
current state of the supply chain management system data and the issues that it faces. The business
requirements for the unified and standardized storage of supply chain management system data are derived
from the description of the challenges. Finally, it situates the primary issues discussed in this article within
the framework of this company.
1 INTRODUCTION
1.1 Background in the Industry
The supply chain industry's informatization has been
strengthened internally as a result of the rapid
development of Internet technology, and industry
data has shown explosive growth (HUANG, 2021).
Massive data contains enormous value, and how to
mine that value more effectively and quickly has
steadily become the focus of data owners' attention.
The information system's basic data is a valuable
resource that has a significant impact on the
enterprise's economic development and management,
and serves as the foundation for scientific
management and decision-making. Currently, supply
chain enterprises spend a
significant amount of money and time developing
online transaction processing (OLTP) business systems
and office automation systems to record data related
to transaction processing. These data have
considerable commercial value, yet enterprises do not make the
best use of their existing data resources, wasting
time and money while also missing the best
opportunity to make critical business decisions. Most
traditional data warehouses are still in use in the
industry, and the majority of existing supply chain
management systems are built using traditional methods:
expensive large-scale servers are purchased, the data is
partitioned across databases on that basis, and it
is stored on disk arrays. This makes system expansion
and upgrading difficult and expensive, and the
entire system is tightly coupled, making it impossible
to meet the demands of high efficiency,
reliability, and economy. As a result, figuring out
how to turn data into information and knowledge by
technical means has become a major challenge
in improving a company's core
competitiveness. Among these means, ETL (extract,
transform, load) technology is the most important.
1.2 Business Scenario
The data from the supply chain management system
is spread across several departments. Different
departments are in charge of developing, managing,
and maintaining different lines of business, and each
department keeps track of different basic business
data. Basic information is frequently kept and defined