Ensuring the integrity of wikipedia: A data science approach

Research output: Contribution to conferencePaperpeer-review

Abstract

In this paper, we present our research on the problem of ensuring the integrity of Wikipedia, the world's biggest free encyclopedia. As anyone can edit Wikipedia, many malicious users take advantage of this situation to make edits that compromise pages' content quality. Specifically, we present DePP, the state-of-the-art tool that detects article pages to protect with an accuracy of 93% and we introduce our research on identifying spam users. We show that we are able to classify spammers from benign users with 80.8% of accuracy and 0.88 mean average precision.

Original languageEnglish
Pages98-105
Number of pages8
StatePublished - 2017
Event25th Italian Symposium on Advanced Database Systems, SEBD 2017 - Squillace Lido, Catanzaro, Italy
Duration: 25 Jun 201729 Jun 2017

Conference

Conference25th Italian Symposium on Advanced Database Systems, SEBD 2017
Country/TerritoryItaly
CitySquillace Lido, Catanzaro
Period25/06/1729/06/17

Fingerprint

Dive into the research topics of 'Ensuring the integrity of wikipedia: A data science approach'. Together they form a unique fingerprint.

Cite this