spatial information. With the flexible incorporation
of geospatial information retrieval; will become
precise to the level that the results of user queries
will be immediately useful, without weeding out
irrelevant hits. In order to address geospatial
semantic, one needs computational methods that go
beyond syntax comparisons. In the case of the
Geospatial Semantic Web, three types of geospatial
semantics are distinguished, each requiring different
computational methods (Egenhafer 2002):
• semantics of geospatial entity classes
• semantics of spatial predicates
• semantics of geospatial names
The classification of geographic entities is
geospatial, even when no geometry is involved.
Non-geometric concepts, such as building, road,
and place are geospatial concepts that are used for
describing the semantics of geospatial objects.
Semantic relations are a typical way to describe
knowledge about concepts. We refer to geospatial
entity classes by words or sets of synonym
interrelated by hyponymy and metonymy relations.
3 ROLE OF SEMANTIC
KNOWLEDGE IN CASE
SEARCHES
In order to understand and appreciate, the role and
importance of semantic knowledge and sentence
structure in case-retrieval process, we need to
understand the working of systems that do not use
this information and rely only on the word
knowledge. Based on the inputs given by the user, a
relevant case is retrieved and output to the user. A
set of questions are posed to the user, the answers to
which are compared to the ones listed out under the
relevant cases that are retrieved. A question answer
pair typically behaves as an attribute value pair. The
answers provided by the user to the questions posed
are compared to these values and a match is found.
In our method, similar concepts are playing
similar roles in the sentences. A sentence is
represented using an Interlingua called, Universal
Networking Language (UNL). Information in every
sentence is captured at three levels: the concepts that
are involved, the role they play in the sentence, and
attributes that describe their properties. Universal
Networking Language (UNL), proposed by United
Nations University, represents natural language in
the form the links among them see in figure 1. UNL
represents information sentence by sentence. This
hypergraph is also represented as a set of directed
binary relations, each between two concepts present
in the sentence. Concepts are represented as
character strings called Universal Words (UW).
The knowledge within a document is
represented in three dimensions: Universal Words
(UW): describe concepts that are present in a
document; UNL Relations: describe the relations
between the concepts involved in the sentence and
the roles (e.g. subject or object in case of nouns) that
they play in conveying the meaning of a sentence;
UW attributes: capture and represent properties of
concepts like tense of a verb.
4 MEASURING SENTENCE
SIMILARITY
Similarity between two sentences is measured on
two counts: how similar are the concepts involved in
the two sentences and how similar roles do the
concepts play in the sentence? Since relations
describe the roles that concepts play in the meaning
of the sentence, similar structure sentences will have
similar relations in their respective UNL
representations. Taking UNL representation, we
compare each of the concepts occurring in UNL
representation with the concepts that appear in the
UNL representation for the problem sentence. The
similarity score is computed using the method
proposed by (Resnik, 1999) where similarity of two
concepts is determined by the information that they
share indicated by the most specific concept that
subsumes them both in a concept hierarchy. Resnik
used WordNet for this hierarchy of concepts. For
every concept, its likelihood of occurring in the
document is calculated by counting the number of
instances of itself and the concepts subsumed by it in
the document. Therefore, the more general a
concept, the more number of occurrences it will
have. Probability (or likelihood) of occurrence of a
concept is given as
P(c) =Nc/N
where Nc is the number of times a concept C occurs
in the document and N is the total number of words
in the document. Using the Information Content
Theory, the Information Values associated with each
concept C is negative log of the likelihood of
occurrence of the concept.
IC(c) =-ln (P(c))
We too used WordNet to arrange concepts in
a hierarchy and assign them Information Content
Values in the manner proposed by (Resnik, 1999).
However, in Resnik's method, the sense of the
concepts being matched is not known.
GEOSPATIAL SEMANTIC QUERY BASED ON CASE-BASED REASONING SYSTEM
357