SEMANTIC INDEXING OF WEB PAGES VIA PROBABILISTIC METHODS - In Search of Semantics Project

Fabio Clarizia, Francesco Colace, Massimo De Santo, Paolo Napoletano

2009

Abstract

In this paper we address the problem of modeling large collections of data, namely web pages by exploiting jointly traditional information retrieval techniques with probabilistic ones in order to find semantic descriptions for the collections. This novel technique is embedded in a real Web Search Engine in order to provide semantics functionalities, as prediction of words related to a single term query. Experiments on different small domains (web repositories) are presented and discussed.

Download


Paper Citation


in Harvard Style

Clarizia F., Colace F., De Santo M. and Napoletano P. (2009). SEMANTIC INDEXING OF WEB PAGES VIA PROBABILISTIC METHODS - In Search of Semantics Project . In Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 4: ICEIS, ISBN 978-989-8111-87-6, pages 134-140. DOI: 10.5220/0002010401340140

in Bibtex Style

@conference{iceis09,
author={Fabio Clarizia and Francesco Colace and Massimo De Santo and Paolo Napoletano},
title={SEMANTIC INDEXING OF WEB PAGES VIA PROBABILISTIC METHODS - In Search of Semantics Project},
booktitle={Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 4: ICEIS,},
year={2009},
pages={134-140},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002010401340140},
isbn={978-989-8111-87-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 4: ICEIS,
TI - SEMANTIC INDEXING OF WEB PAGES VIA PROBABILISTIC METHODS - In Search of Semantics Project
SN - 978-989-8111-87-6
AU - Clarizia F.
AU - Colace F.
AU - De Santo M.
AU - Napoletano P.
PY - 2009
SP - 134
EP - 140
DO - 10.5220/0002010401340140