Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen: https://doi.org/10.21256/zhaw-4974
Publikationstyp: Konferenz: Paper
Art der Begutachtung: Peer review (Publikation)
Titel: German compound splitting using the compound productivity of morphemes
Autor/-in: Sugisaki, Kyoko
Tuggener, Don
DOI: 10.21256/zhaw-4974
Tagungsband: 14th Conference on Natural Language Processing - KONVENS 2018
Herausgeber/-in des übergeordneten Werkes: Barbaresi, Adrien
Biber, Hanno
Neubarth, Friedrich
Osswald, Rainer
Seite(n): 141
Seiten bis: 147
Angaben zur Konferenz: 14th Conference on Natural Language Processing (KONVENS 2018), Vienna, Austria, 19-21 September 2018
Erscheinungsdatum: 2018
Verlag / Hrsg. Institution: Austrian Academy of Sciences Press
Andere Identifier: 0xc1aa5576 0x003a2438
Sprache: Englisch
Schlagwörter: Compound splitting
Fachgebiet (DDC): 410.285: Computerlinguistik
Zusammenfassung: In this work, we present a novel compound splitting method for German by capturing the compound productivity of morphemes. We use a giga web corpus to create a lexicon and decompose noun compounds by computing the probabilities of compound elements as bound and free morphemes. Furthermore, we provide a uniformed evaluation of several unsupervised approaches and morphological analysers for the task. Our method achieved a high F1 score of 0.92, which was a comparable result to state-of-the-art methods.
URI: https://digitalcollection.zhaw.ch/handle/11475/14372
Volltext Version: Publizierte Version
Lizenz (gemäss Verlagsvertrag): Lizenz gemäss Verlagsvertrag
Departement: School of Engineering
Organisationseinheit: Institut für Informatik (InIT)
Enthalten in den Sammlungen:Publikationen School of Engineering

Dateien zu dieser Ressource:
Datei Beschreibung GrößeFormat 
2018_Sugisaki_German_compound_splitting_using_the_compound.pdf177.39 kBAdobe PDFMiniaturbild
Öffnen/Anzeigen
Zur Langanzeige
Sugisaki, K., & Tuggener, D. (2018). German compound splitting using the compound productivity of morphemes [Conference paper]. In A. Barbaresi, H. Biber, F. Neubarth, & R. Osswald (Eds.), 14th Conference on Natural Language Processing - KONVENS 2018 (pp. 141–147). Austrian Academy of Sciences Press. https://doi.org/10.21256/zhaw-4974
Sugisaki, K. and Tuggener, D. (2018) ‘German compound splitting using the compound productivity of morphemes’, in A. Barbaresi et al. (eds) 14th Conference on Natural Language Processing - KONVENS 2018. Austrian Academy of Sciences Press, pp. 141–147. Available at: https://doi.org/10.21256/zhaw-4974.
K. Sugisaki and D. Tuggener, “German compound splitting using the compound productivity of morphemes,” in 14th Conference on Natural Language Processing - KONVENS 2018, 2018, pp. 141–147. doi: 10.21256/zhaw-4974.
SUGISAKI, Kyoko und Don TUGGENER, 2018. German compound splitting using the compound productivity of morphemes. In: Adrien BARBARESI, Hanno BIBER, Friedrich NEUBARTH und Rainer OSSWALD (Hrsg.), 14th Conference on Natural Language Processing - KONVENS 2018. Conference paper. Austrian Academy of Sciences Press. 2018. S. 141–147
Sugisaki, Kyoko, and Don Tuggener. 2018. “German Compound Splitting Using the Compound Productivity of Morphemes.” Conference paper. In 14th Conference on Natural Language Processing - KONVENS 2018, edited by Adrien Barbaresi, Hanno Biber, Friedrich Neubarth, and Rainer Osswald, 141–47. Austrian Academy of Sciences Press. https://doi.org/10.21256/zhaw-4974.
Sugisaki, Kyoko, and Don Tuggener. “German Compound Splitting Using the Compound Productivity of Morphemes.” 14th Conference on Natural Language Processing - KONVENS 2018, edited by Adrien Barbaresi et al., Austrian Academy of Sciences Press, 2018, pp. 141–47, https://doi.org/10.21256/zhaw-4974.


Alle Ressourcen in diesem Repository sind urheberrechtlich geschützt, soweit nicht anderweitig angezeigt.