Search results from the DIPF publications database
Your query:
(Keywords: "Textanalyse" [text analysis])
54 items found
WebAnno. A flexible, web-based annotation tool for CLARIN
Authors:
Eckart de Castilho, Richard; Biemann, Chris; Gurevych, Iryna; Muhie Yimam, Seid
In:
CAC2014 (ed.): Proceedings of the CLARIN Annual Conference (CAC2014), Soesterberg; Netherlands: CLARIN ERIC, 2014, pp. 1-3
URL:
http://www.clarin.eu/sites/default/files/cac2014_submission_6_0.pdf
Document type:
4. Contributions to edited volumes; conference volume/conference paper/proceedings
Language:
English
Keywords:
Automation; Computational linguistics; Computer-assisted method; Software; Text analysis
Abstract:
We present WebAnno, a web-based annotation tool suitable for a wide range of text annotation tasks. The development of the tool was driven by the requirements of the CLARIN community, and the tool interacts with the CLARIN infrastructure. The ability to host multiple annotation projects in parallel - yet isolated from each other - on a single installation of WebAnno makes it particularly attractive for research centers. The ability to fully configure projects via a web interface also enables non-technical staff to create and administer annotation projects. Further, it supports distributed teams of annotators, who are able to work remotely without having to install the software locally. (DIPF/Orig.)
DIPF department:
Informationszentrum Bildung
Automatic annotation suggestions and custom annotation layers in WebAnno
Authors:
Muhie Yimam, Seid; Eckart de Castilho, Richard; Gurevych, Iryna; Biemann, Chris
In:
Bontcheva, Kalina; Jingbo, Zhu (eds.): Proceedings of COLING 2014: System demonstrations, Stroudsburg; PA: Association for Computational Linguistics, 2014, pp. 91-96
URL:
http://www.aclweb.org/anthology/P/P14/P14-5016.pdf
Document type:
4. Contributions to edited volumes; conference volume/conference paper/proceedings
Language:
English
Keywords:
Automation; Computational linguistics; Computer program; Case study; Indexing; Content indexing; Text; Text analysis; Tool
Abstract:
In this paper, we present a flexible approach to the efficient and exhaustive manual annotation of text documents. For this purpose, we extend WebAnno (Yimam et al., 2013), an open-source web-based annotation tool. While it was previously limited to specific annotation layers, our extension allows adding and configuring an arbitrary number of layers through a web-based UI. These layers can be annotated separately or simultaneously, and support most types of linguistic annotations such as spans, semantic classes, dependency relations, lexical chains, and morphology. Further, we tightly integrate a generic machine learning component for automatic annotation suggestions of span annotations. In two case studies, we show that automatic annotation suggestions, combined with our split-pane UI concept, significantly reduce annotation time. (DIPF/Orig.)
DIPF department:
Informationszentrum Bildung
GermEval-2014. Nested named entity recognition with neural networks
Authors:
Reimers, Nils; Eckle-Kohler, Judith; Schnober, Carsten; Kim, Jungi; Gurevych, Iryna
In:
Faaß, Gertrud; Ruppenhofer, Josef (eds.): Workshop Proceedings of the 12th edition of the KONVENS Conference, Hildesheim: Universitätsverlag Hildesheim, 2014, pp. 117-120
URL:
http://www.uni-hildesheim.de/konvens2014/data/konvens2014-workshop-proceedings.pdf
Document type:
4. Contributions to edited volumes; conference volume/conference paper/proceedings
Language:
English
Keywords:
Automation; Computational linguistics; Data; Evaluation; Information; Model; Network; Language analysis; Text analysis; Knowledge
Abstract:
Collobert et al. (2011) showed that deep neural network architectures achieve state-of-the-art performance in many fundamental NLP tasks, including Named Entity Recognition (NER). However, results were only reported for English. This paper reports on experiments for German Named Entity Recognition, using the data from the GermEval 2014 shared task on NER. Our system achieves an F1-measure of 75.09% according to the official metric. (DIPF/Orig.)
DIPF department:
Informationszentrum Bildung
Identifying argumentative discourse structures in persuasive essays
Authors:
Stab, Christian; Gurevych, Iryna
In:
Moschitti, Alessandro; Pang, Bo; Daelemans, Walter (eds.): Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), Stroudsburg; PA: Association for Computational Linguistics, 2014, pp. 46-56
URL:
http://aclweb.org/anthology/D/D14/D14-1006.pdf
Document type:
4. Contributions to edited volumes; conference volume/conference paper/proceedings
Language:
English
Keywords:
Argumentation; Essay; Computational linguistics; Classification; Structure; Text analysis
Abstract:
In this paper, we present a novel approach for identifying argumentative discourse structures in persuasive essays. The structure of argumentation consists of several components (i.e. claims and premises) that are connected with argumentative relations. We consider this task in two consecutive steps. First, we identify the components of arguments using multiclass classification. Second, we classify a pair of argument components as either support or non-support for identifying the structure of argumentative discourse. For both tasks, we evaluate several classifiers and propose novel feature sets including structural, lexical, syntactic and contextual features. In our experiments, we obtain a macro F1-score of 0.726 for identifying argument components and 0.722 for argumentative relations. (DIPF/Orig.)
DIPF department:
Informationszentrum Bildung
The people's web meets NLP. Collaboratively Constructed Language Resources
Editors:
Gurevych, Iryna; Kim, Jungi
Publication details:
Dordrecht: Springer, 2013 (Theory and applications of natural language processing)
DOI:
10.1007/978-3-642-35085-6
URL:
https://link.springer.com/book/10.1007/978-3-642-35085-6
Document type:
2. Editorship; edited volume (no special category)
Language:
English
Keywords:
Automation; Computational linguistics; Computer game; Data mining; Research; Community; Indexing; Cooperation; Multilingualism; Methodology; Reference work; Ontology; Writing; Semantic Web; Social software; Language analysis; Language; Text analysis; Text processing; Knowledge; World Wide Web 2.0
Abstract (English):
The application of collective intelligence in the domain of language yielded collaboratively constructed language resources (CCLR) that can be used in a variety of ways. For example, Wikipedia, Wiktionary, and other language resources constructed through crowdsourcing such as Games with a Purpose and Mechanical Turk have been used in many ways in NLP. Researchers started using such resources to substitute for or supplement conventional lexical semantic resources such as WordNet or linguistically annotated corpora in different NLP tasks. Another research direction is to utilize NLP techniques to enhance the collaboration process and its outcome. Overall, the emergence of CCLRs has generated new challenges for the research field that are to be addressed in the present book. As the research field of CCLRs matures, it has become necessary to summarize a set of results to advance and focus the further research effort.
DIPF department:
Informationszentrum Bildung
Dijkstra-WSA: A graph-based approach to word sense alignment
Authors:
Matuschek, Michael; Gurevych, Iryna
In:
Transactions of the Association for Computational Linguistics (TACL), 1 (2013), pp. 151-164
URL:
http://www.transacl.org/wp-content/uploads/2013/05/paper151.pdf
Document type:
3a. Contributions to peer-reviewed journals; article (no special category)
Language:
English
Keywords:
Algorithm; Computational linguistics; Computer-assisted method; Evaluation; Method; Semantics; Text analysis
Abstract (English):
In this paper, we present Dijkstra-WSA, a novel graph-based algorithm for word sense alignment. We evaluate it on four different pairs of lexical-semantic resources with different characteristics (WordNet-OmegaWiki, WordNet-Wiktionary, GermaNet-Wiktionary and WordNet-Wikipedia) and show that it achieves competitive performance on 3 out of 4 datasets. Dijkstra-WSA outperforms the state of the art on every dataset if it is combined with a back-off based on gloss similarity. We also demonstrate that Dijkstra-WSA is not only flexibly applicable to different resources but also highly parameterizable to optimize for precision or recall.
DIPF department:
Informationszentrum Bildung
Fingerprint matrices. Uncovering the dynamics of social networks in prose literature
Authors:
Oelke, Daniela; Kokkinakis, Dimitrios; Keim, Daniel A.
In:
Computer Graphics Forum, 32 (2013) 3, pp. 371-380
DOI:
10.1111/cgf.12124
URL:
https://onlinelibrary.wiley.com/doi/full/10.1111/cgf.12124
Document type:
3a. Contributions to peer-reviewed journals; article (no special category)
Language:
English
Keywords:
Analysis; Computer-assisted method; Literature; Social relationship; Social network; Text analysis; Visualization
Abstract (English):
In prose literature, complex dynamics of interpersonal relationships can often be observed between the different characters. Traditionally, node-link diagrams are used to depict the social network of a novel. However, static graphs can only visualize the overall social network structure but not the development of the networks over the course of the story, while dynamic graphs have the serious problem that there are many sudden changes between different portions of the overall social network. In this paper, we explore means to show the relationships between the characters of a plot and at the same time their development over the course of a novel. Based on a careful exploration of the design space, we suggest a new visualization technique called Fingerprint Matrices. A case study exemplifies the usage of Fingerprint Matrices and shows that they are an effective means to analyze prose literature with respect to the development of relationships between the different characters.
DIPF department:
Informationszentrum Bildung
Die soziale Konstitution des Unterrichts in pädagogischen Praktiken und die Potentiale qualitativer Unterrichtsforschung. Rekonstruktionen des Zeigens und Adressierens
Authors:
Reh, Sabine; Rabenstein, Kerstin
In:
Zeitschrift für Pädagogik, 59 (2013) 3, pp. 291-307
Document type:
3a. Contributions to peer-reviewed journals; article (no special category)
Language:
German
Keywords:
Assessment; German language teaching; Empirical research; Ethnography; Interaction; Teacher; Poetry; Pedagogical action; Qualitative research; Student; Lower secondary education; Subject (philosophy); Text analysis; Teaching; Classroom observation; Classroom research; Lesson design; Teaching method; Behavior
Abstract:
The potential of qualitative classroom research has so far been defined in terms of working out the case-specific or situation-specific character of pedagogical practice. Beyond this, however, the sequential procedure used in reconstructing the emergence of meaning makes it possible to bring into view the connection (the relationality) between two dimensions of pedagogical action that are usually kept analytically separate: the relation to the subject matter and the relation to the person. Engaging with the holistic procedure of quantitative classroom research, namely high-inference rating methods, and guided by a search for points of contact, the article argues that the sequential reconstruction of meaning is able to capture the particularity of pedagogical situations precisely through this relationality. This is made plausible using several conversational sequences from lessons in which poems and interpretations of poems are constituted as the "subject matter" and, at the same time, students are addressed in a very specific way as "hermeneutic subjects". The article also aims to argue for a productive dialogue between quantitative and qualitative classroom research under conditions of mutual acceptance of differences.
DIPF department:
Bibliothek für Bildungsgeschichtliche Forschung
Automatically classifying edit categories in wikipedia revisions
Authors:
Daxenberger, Johannes; Gurevych, Iryna
In:
Yarowsky, David; Baldwin, Timothy; Korhonen, Anna; Livescu, Karen; Bethard, Steven (eds.): Conference on Empirical Methods in Natural Language Processing (EMNLP 2013), Stroudsburg; PA: Association for Computational Linguistics, 2013, pp. 578-589
URL:
https://www.ukp.tu-darmstadt.de/fileadmin/user_upload/Group_UKP/publikationen/2013/EMNLP2013_DaxenbergerGurevych.pdf
Document type:
4. Contributions to edited volumes; conference volume/conference paper/proceedings
Language:
English
Keywords:
Automation; Computational linguistics; Computer-assisted method; Evaluation; Correction; Reference work; Quality; Taxonomy; Text analysis; World Wide Web 2.0
Abstract:
In this paper, we analyze a novel set of features for the task of automatic edit category classification. Edit category classification assigns categories such as spelling error correction, paraphrase or vandalism to edits in a document. Our features are based on differences between two versions of a document including metadata, textual and language properties and markup. In a supervised machine learning experiment, we achieve a micro-averaged F1 score of .62 on a corpus of edits from the English Wikipedia. In this corpus, each edit has been multi-labeled according to a 21-category taxonomy. A model trained on the same data achieves state-of-the-art performance on the related task of fluency edit classification. We apply pattern mining to automatically labeled edits in the revision histories of different Wikipedia articles. Our results suggest that high-quality articles show a higher degree of homogeneity with respect to their collaboration patterns as compared to random articles.
DIPF department:
Informationszentrum Bildung
SemEval-2013 Task 5: Evaluating phrasal semantics
Authors:
Korkontzelos, Ioannis; Zesch, Torsten; Zanzotto, Fabio Massimo; Biemann, Chris
In:
Association for Computational Linguistics (ed.): Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013): Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 2, Atlanta; GA: Association for Computational Linguistics, 2013, pp. 39-47
URL:
https://aclanthology.org/S13-2007/
Document type:
4. Contributions to edited volumes; conference volume/conference paper/proceedings
Language:
English
Keywords:
Computational linguistics; Computer program; Computer-assisted method; Evaluation; Multilingualism; Semantics; Syntax; Text analysis
Abstract (English):
This paper describes the SemEval-2013 Task 5: "Evaluating Phrasal Semantics". Its first subtask is about computing the semantic similarity of words and compositionality of phrases in a given context. The paper discusses the importance and background of these subtasks and their structure. It then introduces the systems that participated and discusses evaluation results.
DIPF department:
Informationszentrum Bildung