-
-
Autor*innen: Panchenko, Alexander; Faralli, Stefano; Ruppert, Eugen; Remus, Steffen; Naets, Hubert; Fairon, Cédrick; Ponzetto, Simone Paolo; Biemann, Chris
Titel: TAXI. A taxonomy induction method based on lexico-syntactic patterns, substrings and focused crawling
Aus: Association for Computational Linguistics (Hrsg.): Proceedings of the 10th International Workshop on Semantic Evaluation co-located with NAACL 2016, Stroudsburg; PA: Association for Computational Linguistics, 2016 , S. 1320-1327
URL: http://www.aclweb.org/anthology/S16-1206
Dokumenttyp: 4. Beiträge in Sammelbänden; Tagungsband/Konferenzbeitrag/Proceedings
Sprache: Englisch
Schlagwörter: Computerlinguistik; Taxonomie; Methode; Sprache; Englisch; Niederländisch; Französisch; Italienisch; Text; Begriff; Struktur; Evaluation
Abstract: We present a system for taxonomy construction that reached the first place in all sub-tasks of the SemEval 2016 challenge on Taxonomy Extraction Evaluation. Our simple yet effective approach harvests hypernyms with substring inclusion and Hearst-style lexico-syntactic patterns from domain-specific texts obtained via language model based focused crawling. Extracted taxonomies are evaluated on English, Dutch, French and Italian for three domains each (Food, Environment and Science). Evaluations against a gold standard and by human judgment show that our method outperforms more complex and knowledge-rich approaches on most domains and languages. Furthermore, to adapt the method to a new domain or language, only a small amount of manual labour is needed. (DIPF/Orig.)
DIPF-Abteilung: Informationszentrum Bildung