Paper

Revisiting Reverts - Accurate Revert Detection in Wikipedia

Slides presented at the HT2012 conference in Milwaukee

Revisiting Reverts - Slides

Revert Scripts

Download script files archive

Note: To run the Python script implementing the old SIRD method, use Start_Blankers_HasRolledBackTo.py. To run the script implementing the new DIFF method, use Start_Blankers_HasPartlyReverted.py. This version runs on one separate XML dump per article, not on the compressed complete dump. The paths for the input XML files and the output result files (one per article) can be set in each of the "Start_" scripts. The scripts have been tested on articles from the en. and simple. Wikipedias only.
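For readers unfamiliar with hash-based identity revert detection, the general idea can be sketched as follows. This is a minimal illustration, not the contents of Start_Blankers_HasRolledBackTo.py: the function name detect_identity_reverts, the use of MD5, and the plain list-of-strings input are all assumptions made for this example.

```python
import hashlib


def detect_identity_reverts(revision_texts):
    """Find revisions whose full text is identical to an earlier revision.

    revision_texts: revision contents of one article in chronological order.
    Returns a list of (reverting_index, restored_index, reverted_indices).
    Hypothetical helper for illustration; not taken from the released scripts.
    """
    last_seen = {}  # content hash -> index of the most recent revision with it
    reverts = []
    for i, text in enumerate(revision_texts):
        digest = hashlib.md5(text.encode("utf-8")).hexdigest()
        # A match with a non-adjacent earlier revision means everything
        # in between was undone by revision i.
        if digest in last_seen and last_seen[digest] < i - 1:
            j = last_seen[digest]
            reverts.append((i, j, list(range(j + 1, i))))
        last_seen[digest] = i
    return reverts
```

Called on three revisions where the third restores the first, e.g. detect_identity_reverts(["a", "a b", "a"]), the function reports revision 2 as reverting revision 1 by restoring revision 0. An approach like this only finds exact restorations of an earlier state, so it cannot by itself detect partial reverts.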

For the parallelization of our scripts we built upon the Wikimedia Utilities by Aaron Halfaker [1].

Survey results

Survey results for sample A
Survey results for sample B

Note: Each survey .csv file contains 20 assessment steps, one per row. The first variable is the ID of the question; the second is the method the edit pair was detected with (h = hash/SIRD only, p = DIFF method only, b = both methods). The next four variables count how often this assessment step received the answers "Full revert", "Partial revert", "No revert", or "No clue" across all participants. The remaining variables are the raw answers of each individual participant, with 1 = Full revert, 2 = Partial revert, 3 = No revert, 4 = No clue.
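The row layout described above could be read along these lines. This is a sketch under assumptions: it assumes a comma delimiter, no header row, that the four count columns appear in the order Full/Partial/No revert/No clue, and that empty cells mean a participant gave no answer. The function name parse_survey_row and the sample row are invented for illustration and are not real survey data.

```python
import csv
import io
from collections import Counter

# Codes as described in the note; the label strings are our own wording.
METHODS = {"h": "hash/SIRD only", "p": "DIFF only", "b": "both methods"}
ANSWER_CODES = (1, 2, 3, 4)  # Full revert, Partial revert, No revert, No clue


def parse_survey_row(row):
    """Split one row into ID, method, reported counts, and raw answers."""
    question_id, method = row[0], row[1]
    reported = [int(v) for v in row[2:6]]              # four summary counts
    raw = [int(v) for v in row[6:] if v.strip()]       # per-participant answers
    tally = Counter(raw)
    recomputed = [tally[code] for code in ANSWER_CODES]
    return {
        "id": question_id,
        "method": METHODS.get(method, method),
        "reported_counts": reported,
        "recomputed_counts": recomputed,
        "consistent": reported == recomputed,
    }


# Invented sample row (not real data): four participants answered 1, 1, 2, 4.
sample = io.StringIO("q1,b,2,1,0,1,1,1,2,4\n")
for row in csv.reader(sample):
    print(parse_survey_row(row))
```

Recomputing the counts from the raw answers is a cheap sanity check that the columns were interpreted in the right order; if "consistent" is False for most rows, one of the assumptions above probably does not hold for the actual files.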

For questions or feedback, feel free to contact me at fabian d.o.t. floeck a.t kit.edu

Copyright (c) 2012, Fabian Flöck
Permission is granted to any person obtaining a copy of this software the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice must be included in all copies or substantial portions of the Software. Please reference the paper "Revisiting Reverts - Accurate Revert Detection in Wikipedia" when referring to or using the software.

Footnotes:
[1] Under the MIT license:
Copyright (c) 2011 Aaron Halfaker, http://halfaker.info/

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.