Authors:
Daxenberger, Johannes; Gurevych, Iryna

Title:
A corpus-based study of edit categories in featured and non-featured Wikipedia articles

Source:
In: Kay, Martin; Boitet, Christian (eds.): Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012). Mumbai: The COLING 2012 Organizing Committee (2012), pp. 711-726

Full-text URL:
http://aclweb.org/anthology-new/C/C12/C12-1044.pdf

Language:
English

Document type:
4. Contributions to edited volumes; conference volume/conference paper/proceedings

Keywords:
Evaluation, Internet, Encyclopedia, Collaboration, Online, Publishing, Quality, Writing


Abstract (original):
In this paper, we present a study of the collaborative writing process in Wikipedia. Our work is based on a corpus of 1,995 edits obtained from 891 article revisions in the English Wikipedia. We propose a 21-category classification scheme for edits based on Faigley and Witte's (1981) model. Example edit categories include spelling error corrections and vandalism. In a manual multi-label annotation study with 3 annotators, we obtain an inter-annotator agreement of α = 0.67. We further analyze the distribution of edit categories for distinct stages in the revision history of 10 featured and 10 non-featured articles. Our results show that the information content in featured articles tends to become more stable after their promotion. On the opposite, this is not true for non-featured articles. We make the resulting corpus and the annotation guidelines freely available at http://www.ukp.tu-darmstadt.de/data/wiki-edits/


DIPF department:
Informationszentrum Bildung (Information Center for Education)

Notes:

last modified Nov 11, 2016