UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Improving the quality and security of probabilistic search in peer-to-peer information retrieval systems

Richardson, SA; (2014) Improving the quality and security of probabilistic search in peer-to-peer information retrieval systems. Doctoral thesis , UCL (University College London);.

Full text not available from this repository.

Abstract

Commercial web search engines typically use a centralised architecture, where machines are kept at centralised server facilities and are under the control of a central authority. This requires high capital and operating costs, potentially deterring new entrants from offering competing services. A promising alternative is to host the search engine on a peer-to-peer (P2P) network. Volunteers can add their machines as peers, essentially providing computing resources for free. However, search quality and security against malicious behaviour is lower than for a centralised system. This thesis addresses these issues for an unstructured P2P architecture. To improve search quality we develop techniques to accurately estimate the collection global statistics that are required by modern retrieval models, but which may be unavailable to peers. To aid with improving search quality further, we introduce the measure of rank-accuracy to better model human perception of queries, and propose rank-aware document replication policies to increase overall rank-accuracy. These policies assume the query distribution can be inferred from prior queries. For cases where this is not feasible, we propose a rank-aware dynamic replication technique that distributes documents as queries are performed. To improve security, we first develop theoretical models to show how an adversary can use malicious nodes to (i) censor a document, (ii) increase the rank of a document, or (iii) disrupt overall search results. We then develop defences that can detect and evict adversarial nodes. We also study how an adversary may perform these attacks by manipulating the estimates of global statistics at each node, and we develop a defence that is effective when up to 40% of nodes behave maliciously.

Type: Thesis (Doctoral)
Title: Improving the quality and security of probabilistic search in peer-to-peer information retrieval systems
Language: English
UCL classification: UCL > Provost and Vice Provost Offices
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Security and Crime Science
URI: https://discovery.ucl.ac.uk/id/eprint/1457389
Downloads since deposit
3Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item