Skip to main navigation Skip to search Skip to main content

DePP: A System for Detecting Pages to Protect in Wikipedia: A system for detecting pages to protect in wikipedia

  • Boise State University

Research output: Chapter in Book/Report/Conference proceedingChapter

5 Scopus citations

Abstract

Wikipedia is based on the idea that anyone can make edits to the website in order to create reliable and crowd-sourced content. Yet with the cover of internet anonymity, some users make changes to the website that do not align with Wikipedia’s intended uses. For this reason, Wikipedia allows for some pages of the website to become protected, where only certain users can make revisions to the page. This allows administrators to protect pages from vandalism, libel, and edit wars. However, with over five million pages on Wikipedia, it is impossible for administrators to monitor all pages and manually enforce page protection. In this paper we consider for the first time the problem of deciding whether a page should be protected or not in a collaborative environment such as Wikipedia. We formulate the problem as a binary classification task and propose a novel set of features to decide which pages to protect based on (i) users page revision behavior and (ii) page categories. We tested our system, called DePP, on a new dataset we built consisting of 13.6K pages (half protected and half unprotected) and 1.9M edits. Experimental results show that DePP reaches 93.24% classification accuracy and significantly improves over baselines.

Original languageAmerican English
Title of host publicationCIKM '16 Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
Pages2081-2084
Number of pages4
ISBN (Electronic)9781450340731
DOIs
StatePublished - 1 Jan 2016
Event25th ACM International Conference on Information and Knowledge Management, CIKM 2016 - Indianapolis, United States
Duration: 24 Oct 201628 Oct 2016

Publication series

Name24-28-October-2016

Conference

Conference25th ACM International Conference on Information and Knowledge Management, CIKM 2016
Country/TerritoryUnited States
CityIndianapolis
Period24/10/1628/10/16

EGS Disciplines

  • Computer Sciences

Fingerprint

Dive into the research topics of 'DePP: A System for Detecting Pages to Protect in Wikipedia: A system for detecting pages to protect in wikipedia'. Together they form a unique fingerprint.

Cite this