A XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION

Xingjie Zeng, Fang Li, Dongmo Zhang, Athena I. Vakali

2004

Abstract

Extensible Markup Language (XML) has been widely used as middleware because of its flexibility. Being tied to a fixed domain is one of the bottlenecks of Information Extraction (IE) technologies. In this paper we present an XML-based, domain-adaptable bootstrapping method for pattern acquisition that focuses on minimizing the cost of domain migration. The approach starts from a seed corpus with some seed patterns, extends that corpus through the Internet, and acquires new patterns from the extended corpus. Positive and negative examples classified from the training corpus are used to evaluate the acquired patterns. The results show that our method is a practical approach to pattern acquisition.
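The loop described in the abstract (seed patterns, corpus extension, pattern acquisition, evaluation against positive and negative examples) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no algorithmic detail, so the sketch swaps in a generic DIPRE-style induction step, represents patterns as plain two-group regular expressions rather than XML, and simulates Internet retrieval with an in-memory `pool` of sentences. Every function name, the `threshold` and `rounds` parameters, and the sample sentences are assumptions made for illustration.

```python
import re


def extract_pairs(patterns, sentences):
    """Apply two-group regex patterns and collect the (x, y) pairs they capture."""
    pairs = set()
    for pattern in patterns:
        for sentence in sentences:
            for match in re.finditer(pattern, sentence):
                pairs.add((match.group(1), match.group(2)))
    return pairs


def induce_patterns(pairs, sentences):
    """Assumed induction step: generalize sentences that mention a known pair."""
    candidates = set()
    for x, y in pairs:
        for sentence in sentences:
            if x in sentence and y in sentence:
                start, end = sentence.index(x), sentence.index(y)
                if start < end:
                    # Keep the literal text between the two items as context.
                    middle = sentence[start + len(x):end]
                    candidates.add(r"(\w+)" + re.escape(middle) + r"(\w+)")
    return candidates


def precision(pattern, positives, negatives):
    """Share of matched examples that are positive (0.0 if nothing matches)."""
    rx = re.compile(pattern)
    pos = sum(1 for s in positives if rx.search(s))
    neg = sum(1 for s in negatives if rx.search(s))
    return pos / (pos + neg) if pos + neg else 0.0


def bootstrap(seed_patterns, seed_corpus, pool, positives, negatives,
              threshold=0.5, rounds=2):
    """Grow the pattern set from seeds; `pool` stands in for Internet retrieval."""
    patterns = set(seed_patterns)
    corpus = list(seed_corpus)
    for _ in range(rounds):
        pairs = extract_pairs(patterns, corpus)
        # Extend the corpus with unseen sentences that mention a known item.
        new = [s for s in pool
               if s not in corpus and any(x in s for x, _y in pairs)]
        corpus.extend(new)
        # Keep only candidates that score well on the labelled examples.
        for candidate in induce_patterns(pairs, new):
            if precision(candidate, positives, negatives) >= threshold:
                patterns.add(candidate)
    return patterns


# Hypothetical toy data, not taken from the paper.
seeds = [r"(\w+) was appointed (\w+)"]
learned = bootstrap(
    seed_patterns=seeds,
    seed_corpus=["Smith was appointed chairman"],
    pool=["Smith became chairman last week", "Smith met the chairman"],
    positives=["Jones became director"],
    negatives=["Jones met the director"],
)
for pattern in sorted(learned):
    print(pattern)
```

In this toy run the "became" context survives evaluation because it matches only the positive example, while the "met the" context is discarded because it matches only the negative one. A real system would need a much richer pattern language and scoring function than this sketch assumes.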


Paper Citation


in Harvard Style

Zeng X., Li F., Zhang D. and Vakali A. I. (2004). A XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION. In Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 972-8865-00-7, pages 303-308. DOI: 10.5220/0002607803030308

in Bibtex Style

@conference{iceis04,
author={Xingjie Zeng and Fang Li and Dongmo Zhang and Athena I. Vakali},
title={A XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION},
booktitle={Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS},
year={2004},
pages={303-308},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002607803030308},
isbn={972-8865-00-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS
TI - A XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION
SN - 972-8865-00-7
AU - Zeng X.
AU - Li F.
AU - Zhang D.
AU - Vakali A. I.
PY - 2004
SP - 303
EP - 308
DO - 10.5220/0002607803030308