Publikationendatenbank
show results
Autor:
Panchenko, Alexander;
Faralli, Stefano;
Ruppert, Eugen;
Remus, Steffen;
Naets, Hubert;
Fairon, Cédrick;
Ponzetto, Simone Paolo;
Biemann, Chris:
Titel:
TAXI
A taxonomy induction method based on lexico-syntactic patterns, substrings and focused crawling
Quelle:
In: Association for Computational Linguistics (Hrsg.): Proceedings of the 10th International Workshop on Semantic Evaluation co-located with NAACL 2016
Stroudsburg, PA :
Association for Computational Linguistics
(2016)
, 1320-1327
URL des Volltextes:
http://www.aclweb.org/anthology/S16-1206
Sprache:
Englisch
Dokumenttyp:
4. Beiträge in Sammelwerken; Tagungsband/Konferenzbeitrag/Proceedings
Schlagwörter:
Computerlinguistik,
Taxonomie,
Methode,
Sprache,
Englisch,
Niederländisch,
Französisch,
Italienisch,
Text,
Begriff,
Struktur,
Evaluation
Abstract(original):
We present a system for taxonomy construction that reached the first place in all sub-tasks of the SemEval 2016 challenge on Taxonomy Extraction Evaluation. Our simple yet effective approach harvests hypernyms with substring inclusion and Hearst-style lexico-syntactic patterns from domain-specific texts obtained via language model based focused crawling. Extracted taxonomies are evaluated on English, Dutch, French and Italian for three domains each (Food, Environment and Science). Evaluations against a gold standard and by human judgment show that our method outperforms more complex and knowledge-rich approaches on most domains and languages. Furthermore, to adapt the method to a new domain or language, only a small amount of manual labour is needed. (DIPF/Orig.)
DIPF-Abteilung:
Informationszentrum Bildung
Notizen: