Consider looking for related projects for help, or ask at the Teahouse. If you are not currently a project participant and wish to help, you may still participate in the project. This status should be changed if collaborative activity resumes.
The Vandalism Studies project is the part of the Counter-Vandalism Unit designated to conduct research on unconstructive edits on Wikipedia. The project's scope covers all vandalism on Wikipedia. If you'd like to get involved, please add your name to the Members list below!
CHCSPrefect (talk·contribs) I made this account to stop vandalism coming from my school's IP address; something like this is where I belong!
Rushbugled13 (talk·contribs) I wish to help maintain the reliability of Wikipedia as a resource, and vandalism is a large problem with respect to reliability.
These are some preliminary questions that may stimulate future studies. Not all questions may be answerable, so think of it more as a brainstorming section.
Analysis of vandalism
Who is responsible for vandalism? What do vandals want? What are the demographics of the vandal population?
What proportion of vandals are on dynamic IP addresses, and hence very hard to block?
Are IP edits ever responsible for improving a featured article while on the Main Page? (See also essay IPs are human too.)
What motivates people to vandalize articles? How can we minimize the satisfaction they get from doing it? (See: The motivation of a vandal)
Do vandals just choose another article to edit instead if an article is semi-protected? How can we test this?
Why do certain articles attract more vandalism than others?
What types of vandalism are there? What messages are vandals trying to get across? Why do vandals not fully realise that their actions are futile?
What sort of financial gains can be made from using Wikipedia to advertise – are spammers just wasting their time, or can it actually be profitable? Are our anti-spam measures adequate?
What is the overall contribution from schools and universities? Are they worth having? Do universities contribute less vandalism than schools or are all ages equally immature?
How does the rate of vandalism vary throughout the day?
Would there still be problems with vandalism if unregistered editing was blocked? How can we test this hypothesis? Certain categories could be experimentally altered to block unregistered editors, but then vandals could just choose an article that wasn't protected. We would have to block all IP editing, which would certainly be controversial, even just to gather a small sample of data. The blocks would also have to allow newly registered users to edit; during a short trial there would not be time for new editors to create an account and then wait the four days required for autoconfirmation. Perhaps we could use a comparative method by running the experiments on another wiki instead?
Quantitatively, how are levels of vandalism affected (both as a percentage of edits and as a number of edits) when external attention is drawn to an article (e.g. by Slashdot or The Colbert Report)? Do levels of vandalism return to normal (e.g. in elephant) in all cases? How quickly?
How much of vandalism is self-reverted?
How do the levels of reverted edits compare between articles of different quality (e.g. GA vs. start class)?
How often are good faith edits labeled as vandalism, either a) mistakenly and through misinterpretation of policy or b) maliciously?
Are editors any more likely to continue or desist vandalizing if warned by a bot instead of a person?
How often are vandals warned on their talk page after committing an offense?
What are the costs and benefits, and hence overall utility, of warning users? How do users respond to warnings?
Who is responsible for reverting vandalism?
What effects does semi-protection have on the level of vandalism of protected articles?
What strategies can we employ to catch vandalism quickly?
How can we catch most of it at recent changes?
How can we establish a situation where almost every article has someone responsible for maintaining it? Is this even a good idea? (See: Ownership of articles)
How good are editors at reverting vandalism? That is, is it reverted properly, or is it often dealt with poorly, e.g. by removing a whole paragraph whose meaning the vandal has simply altered?
What happens to vandalism levels when edits do not show up in the current version of the article? A trial of something like stable versions, where the vandal cannot vandalize the actual article people see, or something functionally similar, is needed. Perhaps a small section (e.g. all articles in a certain category) could be tested.
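Several of the questions above (how the rate of vandalism varies throughout the day, and what share of edits is reverted) could be operationalized with a small script. The sketch below is a hypothetical illustration on invented edit records; real data could be fetched from the MediaWiki API (e.g. action=query&list=recentchanges), which returns edit timestamps and revert tags.

```python
# Hypothetical sketch: bucket edits by UTC hour of day and compute the
# fraction of each hour's edits that were later reverted.
# The sample records below are invented for illustration only; a real
# study would pull them from the MediaWiki recentchanges API.
from collections import Counter
from datetime import datetime

sample_edits = [
    # (UTC timestamp, was this edit later reverted?)
    ("2024-05-01T03:14:00Z", False),
    ("2024-05-01T15:02:00Z", True),
    ("2024-05-01T15:47:00Z", True),
    ("2024-05-01T15:59:00Z", False),
    ("2024-05-01T21:30:00Z", True),
]

def reverted_share_by_hour(edits):
    """Return {hour: fraction of that hour's edits later reverted}."""
    totals, reverted = Counter(), Counter()
    for ts, was_reverted in edits:
        hour = datetime.fromisoformat(ts.replace("Z", "+00:00")).hour
        totals[hour] += 1
        if was_reverted:
            reverted[hour] += 1
    return {h: reverted[h] / totals[h] for h in totals}

print(reverted_share_by_hour(sample_edits))
```

With a large enough sample this would give an hourly profile of reverted-edit rates; note that reverts are only a rough proxy for vandalism, since good-faith edits are also sometimes reverted (see the question above on mislabeled good-faith edits).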
Wikipedia vandalism studies outside of this project
This list is incomplete; you can help by adding missing items.
Published
Carter, Jacobi (2 June 2010). "ClueBot and Vandalism on Wikipedia" (PDF). Archived from the original (PDF) on 2010-06-02. Retrieved 5 October 2020.
"U of M researchers reveal new findings about Wikipedia authorship and vandalism" (Press release). University of Minnesota – Department of Computer Science and Engineering. 2007-11-06. Archived from the original on 2012-09-20.
Buriol, Luciana S.; Carlos Castillo; Debora Donato; Stefano Leonardi; Stefano Millozzi (2006). "Temporal Analysis of the Wikigraph" (PDF). Sapienza University of Rome.
GroupLens Research (November 4–7, 2007). "Creating, Destroying, and Restoring Value in Wikipedia". Proceedings of the 2007 International ACM Conference on Supporting Group Work (GROUP '07). Sanibel Island, Florida, USA: University of Minnesota – Department of Computer Science and Engineering. p. 259. doi:10.1145/1316624.1316663. ISBN 9781595938459.
MIT Media Lab; IBM Research (April 24–29, 2004). "Studying Cooperation and Conflict between Authors with history flow Visualizations" (PDF). Vienna: Massachusetts Institute of Technology.
Moore, Rick (2007-11-16). "New information on Wikipedia". University of Minnesota.
Smets, Koen; Bart Goethals; Brigitte Verdonk (2008). "Automatic Vandalism Detection in Wikipedia: Towards a Machine Learning Approach" (PDF). University of Antwerp – Department of Mathematics and Computer Science.
Wang, William Yang; McKeown, Kathleen R. (2010). "Got You!: Automatic Vandalism Detection in Wikipedia with Web-based Shallow Syntactic-Semantic Modeling" (PDF). Proceedings of the 23rd International Conference on Computational Linguistics.
Belani, Amit (2009-11-11). "Vandalism Detection in Wikipedia: a Bag-of-Words Classifier Approach". arXiv:1001.0700 [cs.LG].
West, Andrew G.; Sampath Kannan; Insup Lee (2010). "Detecting Wikipedia Vandalism via Spatio-Temporal Analysis of Revision Metadata". pp. 22–28. doi:10.1145/1752046.1752050. ISBN 9781450300599. S2CID 215753727.
Adler, B. Thomas; Luca de Alfaro; Santiago Mola-Velasco; Paolo Rosso; Andrew G. West (2011). "Wikipedia Vandalism Detection: Combining Natural Language, Metadata, and Reputation Features". Computational Linguistics and Intelligent Text Processing. Lecture Notes in Computer Science. Vol. 6609. pp. 277–288. doi:10.1007/978-3-642-19437-5_23. hdl:10251/36621. ISBN 978-3-642-19436-8.
West, Andrew G.; Insup Lee (2011). "Multilingual Vandalism Detection using Language-Independent & Ex Post Facto Evidence". PAN-CLEF '11: Notebook Papers on Uncovering Plagiarism, Authorship, and Social Software Misuse.