UCL logo

UCL Discovery

UCL home » Library Services » Electronic resources » UCL Discovery

Portfolio Theory of Information Retrieval

Wang, J; Zhu, JH; (2009) Portfolio Theory of Information Retrieval. In: Sanderson, M and Zhai, CX and Zobel, J and Allan, J and Aslam, JA, (eds.) PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL. (pp. 115 - 122). ASSOC COMPUTING MACHINERY

Full text not available from this repository.

Abstract

This paper studies document ranking under uncertainty. It is tackled in a general situation where the relevance predictions of individual documents have uncertainty, and are dependent between each other. Inspired by the Modern Portfolio Theory, an economic theory dealing with investment in financial markets, we argue that ranking under uncertainty is not just about picking individual relevant documents, but about choosing the right combination of relevant documents. This motivates us to quantify a ranked list of documents on the basis of its expected overall relevance (mean) and its variance; the latter serves as a measure of risk, which was rarely studied for document ranking in the past. Through the analysis of the mean and variance, we show that an optimal rank order is the one that balancing the overall relevance (mean) of the ranked list against its risk level (variance). Based on this principle, we then derive an efficient document ranking algorithm. It generalizes the well-known probability ranking principle (PRP) by considering both the uncertainty of relevance predictions and correlations between retrieved documents. Moreover, the benefit of diversification is mathematically quantified; we show that diversifying documents is an effective way to reduce the risk of document ranking. Experimental results in text retrieval confirm the theoretical insights with improved retrieval performance.

Type:Proceedings paper
Title:Portfolio Theory of Information Retrieval
Event:32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Location:Boston, MA
Dates:2009-07-19 - 2009-07-23
ISBN-13:978-1-60558-483-6
Keywords:Modern portfolio theory, Mean-variance analysis, Probability ranking principle, Ranking under uncertainty, PROBABILITY RANKING PRINCIPLE
UCL classification:UCL > School of BEAMS > Faculty of Engineering Science > Computer Science

Archive Staff Only: edit this record