UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Toxicity in the Decentralized Web and the Potential for Model Sharing

Zia, Haris Bin; Raman, Aravindh; Castro, Ignacio; Anaobi, Ishaku Hassan; Cristofaro, Emiliano De; Sastry, Nishanth; Tyson, Gareth; (2022) Toxicity in the Decentralized Web and the Potential for Model Sharing. In: Proceedings of the 2022 ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS'22). (pp. pp. 15-16). ACM Green open access

[thumbnail of 2204.12709v1.pdf]
Preview
Text
2204.12709v1.pdf - Accepted Version

Download (2MB) | Preview

Abstract

The "Decentralised Web" (DW) is an evolving concept, which encompasses technologies aimed at providing greater transparency and openness on the web. The DW relies on independent servers (aka instances) that mesh together in a peer-to-peer fashion to deliver a range of services (e.g. micro-blogs, image sharing, video streaming). However, toxic content moderation in this decentralised context is challenging. This is because there is no central entity that can define toxicity, nor a large central pool of data that can be used to build universal classifiers. It is therefore unsurprising that there have been several high-profile cases of the DW being misused to coordinate and disseminate harmful material. Using a dataset of 9.9M posts from 117K users on Pleroma (a popular DW microblogging service), we quantify the presence of toxic content. We find that toxic content is prevalent and spreads rapidly between instances. We show that automating per-instance content moderation is challenging due to the lack of sufficient training data available and the effort required in labelling. We therefore propose and evaluate ModPair, a model sharing system that effectively detects toxic content, gaining an average per-instance macro-F1 score 0.89.

Type: Proceedings paper
Title: Toxicity in the Decentralized Web and the Potential for Model Sharing
Event: 2022 ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS'22)
ISBN-13: 978-1-4503-9141-2
Open access status: An open access version is available from UCL Discovery
DOI: 10.1145/3489048.3530968
Publisher version: https://doi.org/10.1145/3489048.3530968
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification: UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL
URI: https://discovery.ucl.ac.uk/id/eprint/10149966
Downloads since deposit
26Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item