Please use this identifier to cite or link to this item: https://doi.org/10.21256/zhaw-4850
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBenites de Azevedo e Souza, Fernando-
dc.contributor.authorGrubenmann, Ralf-
dc.contributor.authorvon Däniken, Pius-
dc.contributor.authorvon Grünigen, Dirk-
dc.contributor.authorDeriu, Jan Milan-
dc.contributor.authorCieliebak, Mark-
dc.date.accessioned2018-09-27T15:05:10Z-
dc.date.available2018-09-27T15:05:10Z-
dc.date.issued2018-
dc.identifier.urihttp://www.aclweb.org/anthology/W18-3925de_CH
dc.identifier.urihttps://digitalcollection.zhaw.ch/handle/11475/11222-
dc.description.abstractWe describe our approaches used in the German Dialect Identification (GDI) task at the VarDial Evaluation Campaign 2018. The goal was to identify to which out of four dialects spoken in German speaking part of Switzerland a sentence belonged to. We adopted two different metaclassifier approaches and used some data mining insights to improve the preprocessing and the meta-classifier parameters. Especially, we focused on using different feature extraction methods and how to combine them, since they influenced the performance very differently of the system. Our system achieved second place out of 8 teams, with a macro averaged F-1 of 64.6%. We also participated on the surprise dialect task with a multi-label approach.de_CH
dc.language.isoende_CH
dc.publisherVarDialde_CH
dc.rightshttp://creativecommons.org/licenses/by/4.0/de_CH
dc.subjectDialect recognitionde_CH
dc.subjectText classificationde_CH
dc.subjectShared taskde_CH
dc.subjectSwiss germande_CH
dc.subject.ddc410.285: Computerlinguistikde_CH
dc.subject.ddc430: Deutschde_CH
dc.titleTwist Bytes : German dialect identification with data mining optimizationde_CH
dc.typeKonferenz: Paperde_CH
dcterms.typeTextde_CH
zhaw.departementSchool of Engineeringde_CH
zhaw.organisationalunitInstitut für Informatik (InIT)de_CH
dc.identifier.doi10.21256/zhaw-4850-
zhaw.conference.details27th International Conference on Computational Linguistics (COLING 2018), Santa Fe, August 20-26, 2018de_CH
zhaw.funding.euNode_CH
zhaw.originated.zhawYesde_CH
zhaw.pages.end227de_CH
zhaw.pages.start218de_CH
zhaw.publication.statuspublishedVersionde_CH
zhaw.publication.reviewPeer review (Publikation)de_CH
zhaw.title.proceedingsProceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018)de_CH
zhaw.webfeedSoftware Systemsde_CH
zhaw.webfeedNatural Language Processingde_CH
Appears in collections:Publikationen School of Engineering

Files in This Item:
File Description SizeFormat 
2018_Benites_Twist_Bytes_German_Dialect_Identification_with_data_mining_optimization.pdf250.32 kBAdobe PDFThumbnail
View/Open
Show simple item record
Benites de Azevedo e Souza, F., Grubenmann, R., von Däniken, P., von Grünigen, D., Deriu, J. M., & Cieliebak, M. (2018). Twist Bytes : German dialect identification with data mining optimization [Conference paper]. Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), 218–227. https://doi.org/10.21256/zhaw-4850
Benites de Azevedo e Souza, F. et al. (2018) ‘Twist Bytes : German dialect identification with data mining optimization’, in Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018). VarDial, pp. 218–227. Available at: https://doi.org/10.21256/zhaw-4850.
F. Benites de Azevedo e Souza, R. Grubenmann, P. von Däniken, D. von Grünigen, J. M. Deriu, and M. Cieliebak, “Twist Bytes : German dialect identification with data mining optimization,” in Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), 2018, pp. 218–227. doi: 10.21256/zhaw-4850.
BENITES DE AZEVEDO E SOUZA, Fernando, Ralf GRUBENMANN, Pius VON DÄNIKEN, Dirk VON GRÜNIGEN, Jan Milan DERIU und Mark CIELIEBAK, 2018. Twist Bytes : German dialect identification with data mining optimization. In: Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018) [online]. Conference paper. VarDial. 2018. S. 218–227. Verfügbar unter: http://www.aclweb.org/anthology/W18-3925
Benites de Azevedo e Souza, Fernando, Ralf Grubenmann, Pius von Däniken, Dirk von Grünigen, Jan Milan Deriu, and Mark Cieliebak. 2018. “Twist Bytes : German Dialect Identification with Data Mining Optimization.” Conference paper. In Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), 218–27. VarDial. https://doi.org/10.21256/zhaw-4850.
Benites de Azevedo e Souza, Fernando, et al. “Twist Bytes : German Dialect Identification with Data Mining Optimization.” Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), VarDial, 2018, pp. 218–27, https://doi.org/10.21256/zhaw-4850.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.