Hostname: page-component-77f85d65b8-pkds5 Total loading time: 0 Render date: 2026-04-18T18:19:03.808Z Has data issue: false hasContentIssue false

Document ranking refinement using a Markov random field model*

Published online by Cambridge University Press:  14 March 2012

ESAÚ VILLATORO
Affiliation:
Department of Computer Science, National Institute of Astrophysics, Optics and Electronics, Luis Enrique Erro 1 Tonantzintla, Puebla, CP 72840, México e-mail: villatoroe@inaoep.mx, antjug@inaoep.mx, mmontesg@inaoep.mx, villasen@inaoep.mx, esucar@inaoep.mx
ANTONIO JUÁREZ
Affiliation:
Department of Computer Science, National Institute of Astrophysics, Optics and Electronics, Luis Enrique Erro 1 Tonantzintla, Puebla, CP 72840, México e-mail: villatoroe@inaoep.mx, antjug@inaoep.mx, mmontesg@inaoep.mx, villasen@inaoep.mx, esucar@inaoep.mx
MANUEL MONTES
Affiliation:
Department of Computer Science, National Institute of Astrophysics, Optics and Electronics, Luis Enrique Erro 1 Tonantzintla, Puebla, CP 72840, México e-mail: villatoroe@inaoep.mx, antjug@inaoep.mx, mmontesg@inaoep.mx, villasen@inaoep.mx, esucar@inaoep.mx
LUIS VILLASEÑOR
Affiliation:
Department of Computer Science, National Institute of Astrophysics, Optics and Electronics, Luis Enrique Erro 1 Tonantzintla, Puebla, CP 72840, México e-mail: villatoroe@inaoep.mx, antjug@inaoep.mx, mmontesg@inaoep.mx, villasen@inaoep.mx, esucar@inaoep.mx
L. ENRIQUE SUCAR
Affiliation:
Department of Computer Science, National Institute of Astrophysics, Optics and Electronics, Luis Enrique Erro 1 Tonantzintla, Puebla, CP 72840, México e-mail: villatoroe@inaoep.mx, antjug@inaoep.mx, mmontesg@inaoep.mx, villasen@inaoep.mx, esucar@inaoep.mx

Abstract

This paper introduces a novel ranking refinement approach based on relevance feedback for the task of document retrieval. We focus on the problem of ranking refinement since recent evaluation results from Information Retrieval (IR) systems indicate that current methods are effective retrieving most of the relevant documents for different sets of queries, but they have severe difficulties to generate a pertinent ranking of them. Motivated by these results, we propose a novel method to re-rank the list of documents returned by an IR system. The proposed method is based on a Markov Random Field (MRF) model that classifies the retrieved documents as relevant or irrelevant. The proposed MRF combines: (i) information provided by the base IR system, (ii) similarities among documents in the retrieved list, and (iii) relevance feedback information. Thus, the problem of ranking refinement is reduced to that of minimising an energy function that represents a trade-off between document relevance and inter-document similarity. Experiments were conducted using resources from four different tasks of the Cross Language Evaluation Forum (CLEF) forum as well as from one task of the Text Retrieval Conference (TREC) forum. The obtained results show the feasibility of the method for re-ranking documents in IR and also depict an improvement in mean average precision compared to a state of the art retrieval machine.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 2012

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable