Please use this identifier to cite or link to this item:
https://doi.org/10.21256/zhaw-4850
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Benites de Azevedo e Souza, Fernando | - |
dc.contributor.author | Grubenmann, Ralf | - |
dc.contributor.author | von Däniken, Pius | - |
dc.contributor.author | von Grünigen, Dirk | - |
dc.contributor.author | Deriu, Jan Milan | - |
dc.contributor.author | Cieliebak, Mark | - |
dc.date.accessioned | 2018-09-27T15:05:10Z | - |
dc.date.available | 2018-09-27T15:05:10Z | - |
dc.date.issued | 2018 | - |
dc.identifier.uri | http://www.aclweb.org/anthology/W18-3925 | de_CH |
dc.identifier.uri | https://digitalcollection.zhaw.ch/handle/11475/11222 | - |
dc.description.abstract | We describe our approaches used in the German Dialect Identification (GDI) task at the VarDial Evaluation Campaign 2018. The goal was to identify to which out of four dialects spoken in German speaking part of Switzerland a sentence belonged to. We adopted two different metaclassifier approaches and used some data mining insights to improve the preprocessing and the meta-classifier parameters. Especially, we focused on using different feature extraction methods and how to combine them, since they influenced the performance very differently of the system. Our system achieved second place out of 8 teams, with a macro averaged F-1 of 64.6%. We also participated on the surprise dialect task with a multi-label approach. | de_CH |
dc.language.iso | en | de_CH |
dc.publisher | VarDial | de_CH |
dc.rights | http://creativecommons.org/licenses/by/4.0/ | de_CH |
dc.subject | Dialect recognition | de_CH |
dc.subject | Text classification | de_CH |
dc.subject | Shared task | de_CH |
dc.subject | Swiss german | de_CH |
dc.subject.ddc | 410.285: Computerlinguistik | de_CH |
dc.subject.ddc | 430: Deutsch | de_CH |
dc.title | Twist Bytes : German dialect identification with data mining optimization | de_CH |
dc.type | Konferenz: Paper | de_CH |
dcterms.type | Text | de_CH |
zhaw.departement | School of Engineering | de_CH |
zhaw.organisationalunit | Institut für Informatik (InIT) | de_CH |
dc.identifier.doi | 10.21256/zhaw-4850 | - |
zhaw.conference.details | 27th International Conference on Computational Linguistics (COLING 2018), Santa Fe, August 20-26, 2018 | de_CH |
zhaw.funding.eu | No | de_CH |
zhaw.originated.zhaw | Yes | de_CH |
zhaw.pages.end | 227 | de_CH |
zhaw.pages.start | 218 | de_CH |
zhaw.publication.status | publishedVersion | de_CH |
zhaw.publication.review | Peer review (Publikation) | de_CH |
zhaw.title.proceedings | Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018) | de_CH |
zhaw.webfeed | Software Systems | de_CH |
zhaw.webfeed | Natural Language Processing | de_CH |
Appears in collections: | Publikationen School of Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
2018_Benites_Twist_Bytes_German_Dialect_Identification_with_data_mining_optimization.pdf | 250.32 kB | Adobe PDF | View/Open |
Show simple item record
Benites de Azevedo e Souza, F., Grubenmann, R., von Däniken, P., von Grünigen, D., Deriu, J. M., & Cieliebak, M. (2018). Twist Bytes : German dialect identification with data mining optimization [Conference paper]. Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), 218–227. https://doi.org/10.21256/zhaw-4850
Benites de Azevedo e Souza, F. et al. (2018) ‘Twist Bytes : German dialect identification with data mining optimization’, in Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018). VarDial, pp. 218–227. Available at: https://doi.org/10.21256/zhaw-4850.
F. Benites de Azevedo e Souza, R. Grubenmann, P. von Däniken, D. von Grünigen, J. M. Deriu, and M. Cieliebak, “Twist Bytes : German dialect identification with data mining optimization,” in Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), 2018, pp. 218–227. doi: 10.21256/zhaw-4850.
BENITES DE AZEVEDO E SOUZA, Fernando, Ralf GRUBENMANN, Pius VON DÄNIKEN, Dirk VON GRÜNIGEN, Jan Milan DERIU und Mark CIELIEBAK, 2018. Twist Bytes : German dialect identification with data mining optimization. In: Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018) [online]. Conference paper. VarDial. 2018. S. 218–227. Verfügbar unter: http://www.aclweb.org/anthology/W18-3925
Benites de Azevedo e Souza, Fernando, Ralf Grubenmann, Pius von Däniken, Dirk von Grünigen, Jan Milan Deriu, and Mark Cieliebak. 2018. “Twist Bytes : German Dialect Identification with Data Mining Optimization.” Conference paper. In Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), 218–27. VarDial. https://doi.org/10.21256/zhaw-4850.
Benites de Azevedo e Souza, Fernando, et al. “Twist Bytes : German Dialect Identification with Data Mining Optimization.” Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), VarDial, 2018, pp. 218–27, https://doi.org/10.21256/zhaw-4850.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.