DETECTION OF INCOHERENCES IN A TECHNICAL AND NORMATIVE DOCUMENT CORPUS

Susana Martin-Toral, Gregorio I. Sainz-Palmero, Yannis Dimitriadis

2008

Abstract

This paper is focused on the problems and effects generated by the use of a document corpus with mistakes, content incoherences amongst its connected documents and other errors. The problem introduced in this paper is very relevant in any area of human activity when this corpus is used as base element in the relationships between company partners, legal support, etc., and the way in which these incoherences can be detected. These problems can appear in several ways, and the produced effects are different, but a common situation exists in those areas of activity where many linked documents must be generated, managed and updated by different authors. This paper describes some examples of this problem in the case of a technical document corpus used amongst partners, and the solution framework developed for this case. Several types of incoherence have been detected and formulated, connected with problems described in other research areas such as information extraction and retrieval, text mining, document interpretation and others, but all of them have been bounded and introduced from the point of view of document incoherences and their effects, specially in a company context. Finally the computational architecture and methodology uses are described and some initial results of incoherence detection are discussed.

Download


Paper Citation


in Harvard Style

Martin-Toral S., I. Sainz-Palmero G. and Dimitriadis Y. (2008). DETECTION OF INCOHERENCES IN A TECHNICAL AND NORMATIVE DOCUMENT CORPUS . In Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 978-989-8111-37-1, pages 282-287. DOI: 10.5220/0001699102820287

in Bibtex Style

@conference{iceis08,
author={Susana Martin-Toral and Gregorio I. Sainz-Palmero and Yannis Dimitriadis},
title={DETECTION OF INCOHERENCES IN A TECHNICAL AND NORMATIVE DOCUMENT CORPUS},
booktitle={Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 2: ICEIS,},
year={2008},
pages={282-287},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001699102820287},
isbn={978-989-8111-37-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 2: ICEIS,
TI - DETECTION OF INCOHERENCES IN A TECHNICAL AND NORMATIVE DOCUMENT CORPUS
SN - 978-989-8111-37-1
AU - Martin-Toral S.
AU - I. Sainz-Palmero G.
AU - Dimitriadis Y.
PY - 2008
SP - 282
EP - 287
DO - 10.5220/0001699102820287