Ensuring the Integrity of Wikipedia: A Data Science Approach

Research output: Contribution to journal › Conference article › peer-review

Abstract

In this paper, we present our research on the problem of ensuring the integrity of Wikipedia, the world’s biggest free encyclopedia. Because anyone can edit Wikipedia, many malicious users exploit this openness to make edits that compromise the content quality of pages. Specifically, we present DePP, the state-of-the-art tool that detects which article pages to protect with an accuracy of 93%, and we introduce our research on identifying spam users. We show that we can distinguish spammers from benign users with 80.8% accuracy and 0.88 mean average precision.
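
The abstract reports two kinds of metric: accuracy and mean average precision. As a rough illustration of how such figures are typically computed, here is a minimal sketch in plain Python. The labels and scores are toy values, not data from the paper. The function names and the choice to average over several rankings (for example, cross-validation folds) are assumptions made for illustration and do not describe the authors' evaluation protocol.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def average_precision(y_true, scores):
    """Average of precision@k taken at each rank k where a positive appears."""
    # Rank items from highest to lowest classifier score.
    ranked = sorted(zip(scores, y_true), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(runs):
    """Mean of average precision over several (y_true, scores) rankings."""
    return sum(average_precision(y, s) for y, s in runs) / len(runs)


# Toy example (hypothetical values): 1 = spammer, 0 = benign user.
toy_labels = [1, 0, 1, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.4, 0.2]
toy_preds = [1 if s >= 0.5 else 0 for s in toy_scores]  # assumed 0.5 threshold

toy_accuracy = accuracy(toy_labels, toy_preds)
toy_ap = average_precision(toy_labels, toy_scores)
toy_map = mean_average_precision([(toy_labels, toy_scores)])
```

Accuracy depends on a decision threshold, whereas average precision depends only on how the scores rank the users, which is why the two numbers can differ considerably for the same classifier.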

Original language: English
Pages (from-to): 98-105
Number of pages: 8
Journal: CEUR Workshop Proceedings
Volume: 2037
State: Published - 2017
Event: 25th Italian Symposium on Advanced Database Systems, SEBD 2017 - Squillace Lido, Catanzaro, Italy
Duration: 25 Jun 2017 → 29 Jun 2017
