(name, number) contribute to increase the number of
irrelevant solutions. It would so be necessary to
minimize their use in the database schemas and to
resort to more precise terms. The quality of the
ontology is also highly important to obtain relevant
rewritings. Ontology WORDNET used for our
experiments is too general and contributes to
sending back too many solutions.
More elaborated solutions exist to deal with this
problem. A solution which we are investigating at
present consists in placing annotations in the OWL
representation at the level of classes or properties.
These annotations will be exploited by the matcher
to take into account semantic features (sense of a
term, meaning of a property). These annotations
could be installed manually by the administrator of
the source or automatically by the system by seeking
the opinion of the users when several rewritings are
possible. To help the matching one can ask the user
to clarify his query if the system detects some
ambiguities.
We think that these improvements could result in
an efficient system.
The system can be extended to deal with other
types of sources (relational, object).
The main advantage of our approach is its
robustness with regard to the evolution of sources.
When a new source is inserted, it is sufficient to
elaborate its OWL representation so that it can be
exploited by the system. When a source evolves, it is
sufficient to reshape its OWL representation.
We are also engaged in another improvement of
our prototype in order to allow the join of results
coming from different sources. In that case a query
is rewritten in several sub-queries, each sub-queries
being relative to a different source. Our matching
algorithm can be easily adapted for this more
general situation. It is necessary to look for sub-
paths in different sources and to impose a join
condition between sub-paths (the terminal node of a
sub-path must be compatible with the start node of
another sub-path).
Such a system can be very useful for different
applications. Incorporated into an intranet system, it
would allow a user to reach the data sources without
knowing their schemas, by being based only on the
domain ontology. In a P2P system, it could be
installed on some peers or on the super-peers to
facilitate access to data by their semantics. The only
obligation for a peer would be to publish its data by
using the OWL representation.
REFERENCES
Bernstein P. A., Melnik, S., Petropoulos M., and Quix C.,
2004. Industrial-strength schema matching. SIGMOD
Record, 33, 4. pp 38-43.
Cui Z., Jones D., O’Brien P., 2001. Issues in Ontology-
based Information Integration. IJCAI, Seattle, August
5 2001.
Garcia-Molina H., Papakonstantinou Y., Quass D.,
Rajaraman A., Sagiv Y., Ullman J., Vassalos V. and
Widom J.,1997). The Tsimmis approach to mediation:
Data models and languages. Journal of Intelligent
Information Systems , Vol. 8, No. 2. pp. 117-132.
Hai Do H., Melnik S., Rahm E., 2002. Comparison of
Schema Matching Evaluations. Web, Web-Services,
and Database Systems. pp 221-237.
Hull R., 1997. Managing semantic heterogeneity in
databases: A theoretical perspective. Proc. of the
Symposium on Principles of Database Systems
(PODS), Tucson, Arizona . pp. 51-61.
JWNL. Java WordNet Library.
http ://sourceforge.net/projects/jwordnet.
Kedad Z., Métais E., 1999. Dealing with Semantic
Heterogeneity During Data Integration. Proc of the
International Entity Relationship Conference. pp. 325-
339.
Mohsenzadeh M., Shams F., Teshnehlab M., 2005.
Comparison of Schema Matching Systems. WEC (2).
pp 141-147.
Lenzerini M., 2005. Logical Foundations for Data
Integration. SOFSEM 2005. pp 38-40.
Missikoff M., Taglino F., 2004. An Ontology-based
Platform for Semantic Interoperability. Handbook on
Ontologies. pp 617-634.
Rahm E ., Bernstein P.A., 2001. A survey of approaches to
automatic schema matching. VLDB Journal 10(4). pp
334-350.
Saxon. SAXON: The XSLT and XQuery Processor.
http://saxon.sourceforge.net/.
Wache H., Vogele T., Visser U., Stuckenschmidt H.,
Schuster G., Neumann H. and Hubner S., 2001.
Ontology-based integration of information - a survey
of existing approaches. In Stuckenschmidt, H., ed.,
IJCAI-01 Workshop: Ontologies and Information
Sharing. pp 108-117.
Wiederhold G., 1992; Mediators in the architecture of
future information systems. IEEE Computer, Vol. 25,
No.3. pp.38-49.
WEBIST 2006 - INTERNET TECHNOLOGY
12