Please use this identifier to cite or link to this item:
https://doi.org/10.21256/zhaw-21550
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Büchi, Matthias | - |
dc.contributor.author | Ulasik, Malgorzata Anna | - |
dc.contributor.author | Hürlimann, Manuela | - |
dc.contributor.author | Benites de Azevedo e Souza, Fernando | - |
dc.contributor.author | von Däniken, Pius | - |
dc.contributor.author | Cieliebak, Mark | - |
dc.date.accessioned | 2021-02-04T13:11:33Z | - |
dc.date.available | 2021-02-04T13:11:33Z | - |
dc.date.issued | 2020-06 | - |
dc.identifier.issn | 1613-0073 | de_CH |
dc.identifier.uri | https://digitalcollection.zhaw.ch/handle/11475/21550 | - |
dc.description.abstract | This paper presents the contribution of ZHAW-InIT to Task 4 ”Low-Resource STT” at GermEval 2020. The goal of the task is to develop a system for translating Swiss German dialect speech into Standard German text in the domain of parliamentary debates. Our approach is based on Jasper, a CNN Acoustic Model, which we fine-tune on the task data. We enhance the base system with an extended Language Model containing in-domain data and speed perturbation and run further experiments with post-processing. Our submission achieved first place with a final Word Error Rate of 40.29%. | de_CH |
dc.language.iso | en | de_CH |
dc.publisher | CEUR Workshop Proceedings | de_CH |
dc.rights | http://creativecommons.org/licenses/by/4.0/ | de_CH |
dc.subject.ddc | 410.285: Computerlinguistik | de_CH |
dc.title | ZHAW-InIT at GermEval 2020 task 4 : low-resource speech-to-text | de_CH |
dc.type | Konferenz: Paper | de_CH |
dcterms.type | Text | de_CH |
zhaw.departement | School of Engineering | de_CH |
zhaw.organisationalunit | Institut für Informatik (InIT) | de_CH |
dc.identifier.doi | 10.21256/zhaw-21550 | - |
zhaw.conference.details | 5th SwissText & 16th KONVENS Joint Conference, Zurich (online), 24-25 June 2020 | de_CH |
zhaw.funding.eu | No | de_CH |
zhaw.originated.zhaw | Yes | de_CH |
zhaw.parentwork.editor | Ebling, Sarah | - |
zhaw.parentwork.editor | Tuggener, Don | - |
zhaw.parentwork.editor | Hürlimann, Manuela | - |
zhaw.parentwork.editor | Cieliebak, Mark | - |
zhaw.parentwork.editor | Volk, Martin | - |
zhaw.publication.status | publishedVersion | de_CH |
zhaw.publication.review | Peer review (Abstract) | de_CH |
zhaw.title.proceedings | Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS) | de_CH |
zhaw.webfeed | Datalab | de_CH |
zhaw.webfeed | Software Systems | de_CH |
zhaw.webfeed | Natural Language Processing | de_CH |
zhaw.author.additional | No | de_CH |
zhaw.display.portrait | Yes | de_CH |
Appears in collections: | Publikationen School of Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
2020_Buechi_etal_ZHAW-InIT-at-GermEval-2020.pdf | 458.44 kB | Adobe PDF | View/Open |
Show simple item record
Büchi, M., Ulasik, M. A., Hürlimann, M., Benites de Azevedo e Souza, F., von Däniken, P., & Cieliebak, M. (2020). ZHAW-InIT at GermEval 2020 task 4 : low-resource speech-to-text [Conference paper]. In S. Ebling, D. Tuggener, M. Hürlimann, M. Cieliebak, & M. Volk (Eds.), Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS). CEUR Workshop Proceedings. https://doi.org/10.21256/zhaw-21550
Büchi, M. et al. (2020) ‘ZHAW-InIT at GermEval 2020 task 4 : low-resource speech-to-text’, in S. Ebling et al. (eds) Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS). CEUR Workshop Proceedings. Available at: https://doi.org/10.21256/zhaw-21550.
M. Büchi, M. A. Ulasik, M. Hürlimann, F. Benites de Azevedo e Souza, P. von Däniken, and M. Cieliebak, “ZHAW-InIT at GermEval 2020 task 4 : low-resource speech-to-text,” in Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), Jun. 2020. doi: 10.21256/zhaw-21550.
BÜCHI, Matthias, Malgorzata Anna ULASIK, Manuela HÜRLIMANN, Fernando BENITES DE AZEVEDO E SOUZA, Pius VON DÄNIKEN und Mark CIELIEBAK, 2020. ZHAW-InIT at GermEval 2020 task 4 : low-resource speech-to-text. In: Sarah EBLING, Don TUGGENER, Manuela HÜRLIMANN, Mark CIELIEBAK und Martin VOLK (Hrsg.), Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS). Conference paper. CEUR Workshop Proceedings. Juni 2020
Büchi, Matthias, Malgorzata Anna Ulasik, Manuela Hürlimann, Fernando Benites de Azevedo e Souza, Pius von Däniken, and Mark Cieliebak. 2020. “ZHAW-InIT at GermEval 2020 Task 4 : Low-Resource Speech-to-Text.” Conference paper. In Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), edited by Sarah Ebling, Don Tuggener, Manuela Hürlimann, Mark Cieliebak, and Martin Volk. CEUR Workshop Proceedings. https://doi.org/10.21256/zhaw-21550.
Büchi, Matthias, et al. “ZHAW-InIT at GermEval 2020 Task 4 : Low-Resource Speech-to-Text.” Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), edited by Sarah Ebling et al., CEUR Workshop Proceedings, 2020, https://doi.org/10.21256/zhaw-21550.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.