This paper proposes a novel query expansion method to improve accuracy of text retrieval systems. Our method makes use of a minimal relevance feedback to expand the initial query with a structured representation composed of weighted pairs of words. Such a structure is obtained from the relevance feedback through a method for pairs of words selection based on the Probabilistic Topic Model. We compared our method with other baseline query expansion schemes and methods. Evaluations performed on TREC-8 demonstrated the effectiveness of the proposed method with respect to the baseline.
Weighted Word Pairs for query expansion
COLACE, Francesco;DE SANTO, Massimo;GRECO, LUCA;
2015-01-01
Abstract
This paper proposes a novel query expansion method to improve accuracy of text retrieval systems. Our method makes use of a minimal relevance feedback to expand the initial query with a structured representation composed of weighted pairs of words. Such a structure is obtained from the relevance feedback through a method for pairs of words selection based on the Probabilistic Topic Model. We compared our method with other baseline query expansion schemes and methods. Evaluations performed on TREC-8 demonstrated the effectiveness of the proposed method with respect to the baseline.File in questo prodotto:
File | Dimensione | Formato | |
---|---|---|---|
132 Colace Definitivo.pdf
non disponibili
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
NON PUBBLICO - Accesso privato/ristretto
Dimensione
1.74 MB
Formato
Adobe PDF
|
1.74 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
132 Colace Pre-Print.pdf
accesso aperto
Descrizione: 0306-4573/Ó 2014 Published by Elsevier Ltd. Link editore: http://dx.doi.org/10.1016/j.ipm.2014.07.004
Tipologia:
Documento in Pre-print (manoscritto inviato all'editore, precedente alla peer review)
Licenza:
Creative commons
Dimensione
2.33 MB
Formato
Adobe PDF
|
2.33 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.