Logo: Deutsches Institut für Internationale Pädagogische Forschung

Publications

Publikationendatenbank

show results

Autor:
Panchenko, Alexander; Faralli, Stefano; Ruppert, Eugen; Remus, Steffen; Naets, Hubert; Fairon, Cédrick; Ponzetto, Simone Paolo; Biemann, Chris:

Titel:
TAXI
A taxonomy induction method based on lexico-syntactic patterns, substrings and focused crawling

Quelle:
In: Association for Computational Linguistics (Hrsg.): Proceedings of the 10th International Workshop on Semantic Evaluation co-located with NAACL 2016 Stroudsburg, PA : Association for Computational Linguistics (2016) , 1320-1327

URL des Volltextes:
http://www.aclweb.org/anthology/S16-1206

Sprache:
Englisch

Dokumenttyp:
4. Beiträge in Sammelwerken; Tagungsband/Konferenzbeitrag/Proceedings

Schlagwörter:
Computerlinguistik, Taxonomie, Methode, Sprache, Englisch, Niederländisch, Französisch, Italienisch, Text, Begriff, Struktur, Evaluation


Abstract(original):
We present a system for taxonomy construction that reached the first place in all sub-tasks of the SemEval 2016 challenge on Taxonomy Extraction Evaluation. Our simple yet effective approach harvests hypernyms with substring inclusion and Hearst-style lexico-syntactic patterns from domain-specific texts obtained via language model based focused crawling. Extracted taxonomies are evaluated on English, Dutch, French and Italian for three domains each (Food, Environment and Science). Evaluations against a gold standard and by human judgment show that our method outperforms more complex and knowledge-rich approaches on most domains and languages. Furthermore, to adapt the method to a new domain or language, only a small amount of manual labour is needed. (DIPF/Orig.)


DIPF-Abteilung:
Informationszentrum Bildung

Notizen:

last modified Nov 11, 2016