A XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION
Xingjie Zeng, Fang Li, Dongmo Zhang, Athena I. Vakali
2004
Abstract
Extensible Markup Language (XML) is widely used as middleware because of its flexibility. Restriction to a fixed domain is one of the bottlenecks of Information Extraction (IE) technologies. In this paper we present an XML-based, domain-adaptable bootstrapping method for pattern acquisition that focuses on minimizing the cost of domain migration. The approach starts from a seed corpus with some seed patterns, extends the seed corpus through the Internet, and acquires new patterns from the extended corpus. Positive and negative examples classified from the training corpus are used to evaluate the acquired patterns. The results show that our method is a practical approach to pattern acquisition.
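The loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how patterns are represented, generated, or scored, so everything here is an assumption made for illustration. Patterns are plain regular expressions, `propose_patterns` is a hypothetical candidate generator based on adjacent word pairs, `score_pattern` is a simple precision-style score over the positive and negative examples, and the `threshold` and `max_rounds` parameters are invented. Corpus extension through the Internet and the XML representation are left out; the extended corpus is passed in as a list of sentences.

```python
import re


def score_pattern(pattern, positives, negatives):
    """Share of matched examples that are positive (assumed scoring rule)."""
    rx = re.compile(pattern)
    pos = sum(1 for s in positives if rx.search(s))
    neg = sum(1 for s in negatives if rx.search(s))
    matched = pos + neg
    return pos / matched if matched else 0.0


def propose_patterns(sentence):
    """Hypothetical candidate generator: one regex per adjacent word pair."""
    words = sentence.split()
    return {
        r"\b" + re.escape(a) + r"\s+" + re.escape(b) + r"\b"
        for a, b in zip(words, words[1:])
    }


def bootstrap(seed_patterns, extended_corpus, positives, negatives,
              threshold=0.8, max_rounds=3):
    """Grow the pattern set from the seeds until no new pattern qualifies."""
    accepted = set(seed_patterns)
    for _ in range(max_rounds):
        # Sentences of the extended corpus covered by the current patterns.
        covered = [s for s in extended_corpus
                   if any(re.search(p, s) for p in accepted)]
        # Candidate patterns are drawn only from covered sentences.
        candidates = set()
        for s in covered:
            candidates |= propose_patterns(s)
        # Keep candidates that score well on the labelled examples.
        new = {p for p in candidates - accepted
               if score_pattern(p, positives, negatives) >= threshold}
        if not new:
            break
        accepted |= new
    return accepted


if __name__ == "__main__":
    patterns = bootstrap(
        seed_patterns=[r"\bacquired\b"],
        extended_corpus=["Acme acquired Widget", "Beta sued Gamma"],
        positives=["Acme acquired Foo", "Acme acquired Bar"],
        negatives=["Widget sales fell"],
    )
    for p in sorted(patterns):
        print(p)
```

Scoring each candidate against both example sets is what keeps the pattern set from drifting: a candidate that matches no positive example, or matches as many negatives as positives, falls below the threshold and is never used to cover further sentences.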
Paper Citation
in Harvard Style
Zeng X., Li F., Zhang D. and Vakali A. I. (2004). A XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION. In Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS, ISBN 972-8865-00-7, pages 303-308. DOI: 10.5220/0002607803030308
in Bibtex Style
@conference{iceis04,
author={Xingjie Zeng and Fang Li and Dongmo Zhang and Athena I. Vakali},
title={A XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION},
booktitle={Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS},
year={2004},
pages={303-308},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002607803030308},
isbn={972-8865-00-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the Sixth International Conference on Enterprise Information Systems - Volume 2: ICEIS
TI - A XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION
SN - 972-8865-00-7
AU - Zeng X.
AU - Li F.
AU - Zhang D.
AU - Vakali A. I.
PY - 2004
SP - 303
EP - 308
DO - 10.5220/0002607803030308