Please use this identifier to cite or link to this resource:
https://doi.org/10.21256/zhaw-4974
Publication type: | Conference: Paper |
Type of review: | Peer review (publication) |
Title: | German compound splitting using the compound productivity of morphemes |
Authors: | Sugisaki, Kyoko; Tuggener, Don |
DOI: | 10.21256/zhaw-4974 |
Proceedings: | 14th Conference on Natural Language Processing - KONVENS 2018 |
Editors of the parent work: | Barbaresi, Adrien; Biber, Hanno; Neubarth, Friedrich; Osswald, Rainer |
Page(s): | 141 |
Pages to: | 147 |
Conference details: | 14th Conference on Natural Language Processing (KONVENS 2018), Vienna, Austria, 19-21 September 2018 |
Publication date: | 2018 |
Publisher / editing institution: | Austrian Academy of Sciences Press |
Other identifiers: | 0xc1aa5576 0x003a2438 |
Language: | English |
Keywords: | Compound splitting |
Subject (DDC): | 410.285: Computational linguistics |
Abstract: | In this work, we present a novel compound splitting method for German by capturing the compound productivity of morphemes. We use a giga web corpus to create a lexicon and decompose noun compounds by computing the probabilities of compound elements as bound and free morphemes. Furthermore, we provide a uniform evaluation of several unsupervised approaches and morphological analysers for the task. Our method achieved a high F1 score of 0.92, which is comparable to state-of-the-art methods. |
URI: | https://digitalcollection.zhaw.ch/handle/11475/14372 |
Full-text version: | Published version |
License (according to publishing contract): | License according to publishing contract |
Department: | School of Engineering |
Organisational unit: | Institut für Informatik (InIT) |
Appears in collections: | Publikationen School of Engineering |
Files in this resource:
File | Description | Size | Format | |
---|---|---|---|---|
2018_Sugisaki_German_compound_splitting_using_the_compound.pdf | | 177.39 kB | Adobe PDF | View/Open |
Sugisaki, K., & Tuggener, D. (2018). German compound splitting using the compound productivity of morphemes [Conference paper]. In A. Barbaresi, H. Biber, F. Neubarth, & R. Osswald (Eds.), 14th Conference on Natural Language Processing - KONVENS 2018 (pp. 141–147). Austrian Academy of Sciences Press. https://doi.org/10.21256/zhaw-4974
Sugisaki, K. and Tuggener, D. (2018) ‘German compound splitting using the compound productivity of morphemes’, in A. Barbaresi et al. (eds) 14th Conference on Natural Language Processing - KONVENS 2018. Austrian Academy of Sciences Press, pp. 141–147. Available at: https://doi.org/10.21256/zhaw-4974.
K. Sugisaki and D. Tuggener, “German compound splitting using the compound productivity of morphemes,” in 14th Conference on Natural Language Processing - KONVENS 2018, 2018, pp. 141–147. doi: 10.21256/zhaw-4974.
SUGISAKI, Kyoko and Don TUGGENER, 2018. German compound splitting using the compound productivity of morphemes. In: Adrien BARBARESI, Hanno BIBER, Friedrich NEUBARTH and Rainer OSSWALD (eds.), 14th Conference on Natural Language Processing - KONVENS 2018. Conference paper. Austrian Academy of Sciences Press. 2018. pp. 141–147
Sugisaki, Kyoko, and Don Tuggener. 2018. “German Compound Splitting Using the Compound Productivity of Morphemes.” Conference paper. In 14th Conference on Natural Language Processing - KONVENS 2018, edited by Adrien Barbaresi, Hanno Biber, Friedrich Neubarth, and Rainer Osswald, 141–47. Austrian Academy of Sciences Press. https://doi.org/10.21256/zhaw-4974.
Sugisaki, Kyoko, and Don Tuggener. “German Compound Splitting Using the Compound Productivity of Morphemes.” 14th Conference on Natural Language Processing - KONVENS 2018, edited by Adrien Barbaresi et al., Austrian Academy of Sciences Press, 2018, pp. 141–47, https://doi.org/10.21256/zhaw-4974.
All resources in this repository are protected by copyright unless otherwise indicated.