In this paper, an ontology for representing the
knowledge of a law document is proposed. This
model is the integration of ontology Rela-model,
which is useful to represent relational knowledge
domains (Do et al., 2018), and the graph of key
phrases as a conceptual graph (Shi et al., 2018). The
integration model, called Legal Rela-model, has the
foundation including concepts in law domain,
relations between concepts, inference rules of this
domain and relations between key phrases, concepts
in law document and database storing law contents.
This model can represent complex relations between
concepts in a law document to retrieve some required
knowledge to people. Besides, the method for
intellectual retrieval on the law document is
proposed. The improvement of self-attention network
(Vaswani et al., 2017) is presented by language-
oriented semantic analyzing in Vietnamese (Nguyen
et al., 2020a). This technique is used to extract key
phrases of a law document.
Moreover, the proposed method is applied to
construct an intelligent support system for querying
on Vietnamese land law (Nguyen et al., 2020c) with
its knowledge base is organized by ontology Legal
Rela-model. This system can help users to query
some meaning of terminology in land law and some
land-related administrative procedures. It also tested
by major lawyers and got positive feedback from
users.
The next section presents some related work
about methods for organizing the document
repository, especially for law documents. Section 3
proposes a knowledge model for representing the
content of a law document, called Legal Rela-model,
based on the integration of ontology Rela-model and
the conceptual graph of key phrases. Section 4
designs the method for solving problems about
querying knowledge content of the law document by
Vietnamese. Section 5 shows the architecture of a
support system in land resource for querying on
Vietnamese Land Law and its testing results. The last
section concludes and presents some future work.
2 RELATED WORK
The law document is a general rule of conduct,
commonly binding on agencies, organizations and
individuals nationwide or within a certain
administrative unit (Vietnam Ministry of Justice,
2011, Nguyen et al., 2022). With a determined
domain, there are many documents related together
impacting to that domain. Ontology is an effective
approach to organize semantic document repository
(Huynh et al., 2019, Doan et al, 2003). However,
those methods are not suitable to organize law
documents.
LIDO is an ontology for Legal Informatics
Document (Sartor, 2019). This ontology can be
represented the legal actions affecting the document,
the legal temporal events, the structure of the legal
resource, and the semantic structure of the legal
document organization.
Huynh et al. (2019) constructed the integrating
method of an ontology describing domain knowledge,
and a database of document repository. This method
includes a model of domain knowledge for various
information retrieval tasks, called The Classed
Keyphrase based Ontology (CK-ONTO).
Nonetheless, this graph-based measure has not been
used to evaluate the semantic relevance in documents.
Ngo et al. (2021) designed a system for
Vietnamese legal text processing by leveraging the
strength of traditional information retrieval methods
(BM25), pre-trained masked language models
(BERT), and legal domain knowledge. They also
proposed a novel data augmentation method which is
based on legal domain knowledge in the legal textual
entailment. However, the proposed method does not
represent the semantic of the legal document.
The chatbot in (Nguyen et al., 2020c) was
designed to tutor some administrative procedures,
such as how to get a printing license. However, this
system cannot support to query the content in a law
document related to the working domain.
Statistical relational learning (SRL) and graph
neural networks (GNNs) are two powerful
approaches for learning and inference over graphs.
Typically, they are evaluated in terms of simple
metrics such as accuracy over individual node labels.
The study in (Embar and Srinivasan, 2021) proposed
a sampling framework to tractably compute the
values of aggregate graph queries (AGQ). That
method only works on information of social network
and cannot be used for organizing the meaning of a
legal document.
Ontology is a useful method for representing the
knowledge domain and searching on it (Do et al.,
2020). This study presents a method for organizing
the content of a law document and its meaning in each
article by integrated ontology. It is the foundation to
design techniques for querying some meaning of law
terminology and some administrative procedures in
Vietnamese.