Ferschke, Oliver; Gurevych, Iryna; Rittberger, Marc:

FlawFinder: A modular system for predicting quality flaws in Wikipedia
Notebook for PAN at CLEF 2012

In: Forner, Pamela; Karlgren, Jussi; Womser-Hacker, Christa (Eds.): CLEF 2012 Labs and Workshop, Notebook Papers. Mattarello : Grafiche Futura s.r.l. (2012), 101

4. Contributions to edited volumes; conference volume/conference paper/proceedings

Computer program, Computer-assisted method, Evaluation, Error, Information, Classification, Measurement, Reference work, Online, Quality, Quality assurance

With over 23 million articles in 285 languages, Wikipedia is the largest free knowledge base on the web. Due to its open nature, everybody is allowed to access and edit the contents of this huge encyclopedia. As a downside of this open access policy, quality assessment of the content becomes a critical issue and is hardly manageable without computational assistance. In this paper, we present FlawFinder, a modular system for automatically predicting quality flaws in unseen Wikipedia articles. It competed in the inaugural edition of the Quality Flaw Prediction Task at the PAN Challenge 2012 and achieved the best precision of all systems and second place in terms of recall and F1-score.

