
 
5 CONCLUSION 
In this paper we have interested on schema 
matching, and focused on structural context 
matching for enhanced XML schemas. We began by 
an analysis of problems involved in the matching, 
and we proposed a new solution taking into account 
of heterogeneity of the schema sources. For the 
structural similarity measure, we recovered a matrix 
of terminological similarity coefficients between 
schema nodes based on the similarity of their labels. 
We outlined the limitations of current solutions 
through the study of Cupid and Similarity Flooding 
systems. Then we proposed a structural matching 
technique that considers the context of schemas 
nodes (defined by their roots, intermediates and 
leafs contexts in schema graph). By the way, we 
suggest a simple structural algorithm based on the 
previous ideas and exploit the three types of 
contexts. We refer to the result produced by the 
algorithm as a mapping. The user validates this 
mapping in order to produce a final mapping result 
that serves to generate transformation scripts. 
For future work, we would like to improve the 
matching process, while taking into account the 
optimisation of the process in order to determine a 
set of semantic equivalences between schemas 
(source and target). That will facilitate the 
generation of operators based on the primitive of 
transformations between entities of EXS schemas. 
The second axis to land concerns the efficiency and 
the time of human interaction. The key is then to 
discover how to minimize ser interaction but 
maximizing the impact of the feedback.   
REFERENCES 
Abiteboul, S., Cluet, S., Milo, T., 1997. Correspondence 
and Translation for heterogeneous data. In Proceeding 
of The international Conference on Database Theory 
(ICDT). 351-363. 
Boukottaya, A., Vanoirbeek, C., Paganelli, F., Abou-
Khaled, O., 2004. Automating XML documents 
transformations: a conceptual modelling based 
approach.  In Proceedings of the first Asian-Pacific 
conference on Conceptual modelling. ACM, 81-90. 
Castano, S. and De Antonellis, V., 1999. A schema 
analysis and Reconciliation Tool Environment For 
Heterogeneous Databases. In Proceedings of 
International Database Engineering and Applications 
Symposium. 
Doan, A., Madhavan, J., Domingos, P., Halevey, A., 2001. 
Reconciling schemas of disparate data sources: A 
machine Learning Approach. In Proceedings ACM 
SIGMOD conference. 509-520. 
Drew, P., King, R., McLeod, D., Rusinkiewicz, M., 
Silberschatz, A., 1993. Report of the Workshop on 
Semantic Heterogeneity and Interoperation in 
Multidatabase Systems. In Proceedings ACM 
SIGMOD record, 47-56. 
Fellbum, C., 1998. WordNet: An Electronic Lexical 
Database. MIT press. 
Lamolle, M. and Mellouli, N., 2003. Intégration de bases 
de données hétérogènes via XML.EGC’2003. 
Lamolle, M. and Zerdazi, A., 2005. Intégration de Bases 
de données hétérogènes par une modélisation 
conceptuelle XML, COSI’05. 216-227. 
Li, W.S. and Clifton, C., 1994, Semantic Integration in 
Heterogeneous Databases Using Neural Networks. 
VLDB. 
Li, W.S. and Clifton C., 2000, SemInt: A Tool for 
Identifying Attribute Correspondences in 
Heterogeneous Databases Using Neural Network. Data 
and Knowledge Engineering. 49-84. 
Madhavan, J., Bernstein, P., Rahm, E., 2001. Generic 
schema matching with cupid. VLDB. 
Melnik, S., Garcia-Molina, H., Rahm, E., 2002. Similarity 
Flooding: A versatile Graph Matching and its 
Application to Schema Matching. Data Engineering. 
Miller, A.G., 1995. WordNet: A lexical Database for 
English. ACM. 39-41. 
Miller, A.G., Hass, L., Hernandez, M.A., 2000. Schema 
mapping as query discovery. VLDB. 77-88. 
Rahm, E. and Bernstein, P., 2001 A survey of approaches 
to automatic schema matching. In VLDB Journal. 
334-350. 
XML Schema, W3C Recommendation, 2001. XML-
Schema Primer, W3 Consortium, 2001. Available at 
http://www.w3.org/TR /xmlschema-0. 
Zerdazi, A. and Lamolle, M., 2005. Modélisation des 
schémas XML par adjonction de  métaconnaissances 
sémantiques. ASTI’05. 29-32. 
Zerdazi, A. and Lamolle, M., 2006. Intégration de sources 
hétérogènes par matching semi-automatique de 
schémas XML étendus. INFORSID’2006. 991-1006. 
 
 
 
MATCHING OF ENHANCED XML SCHEMAS WITH A MEASURE OF STRUCTURAL-CONTEXT SIMILARITY
133