Please use this identifier to cite or link to this item: https://doi.org/10.21256/zhaw-30250
Full metadata record
DC Element | Value | Language
dc.contributor.author | Bollinger, Tobias | -
dc.contributor.author | Deriu, Jan Milan | -
dc.contributor.author | Vogel, Manfred | -
dc.date.accessioned | 2024-03-15T15:52:14Z | -
dc.date.available | 2024-03-15T15:52:14Z | -
dc.date.issued | 2023-06 | -
dc.identifier.uri | https://digitalcollection.zhaw.ch/handle/11475/30250 | -
dc.description.abstract | In this work, we studied the synthesis of Swiss German speech using different Text-to-Speech (TTS) models. We evaluated the TTS models on three corpora and found that VITS models performed best, so we used them for further testing. We also introduce a new method to evaluate TTS models by letting the discriminator of a trained vocoder GAN model predict whether a given waveform is human or synthesized. In summary, our best model delivers speech synthesis for different Swiss German dialects with previously unachieved quality. | de_CH
dc.language.iso | en | de_CH
dc.publisher | arXiv | de_CH
dc.rights | http://creativecommons.org/licenses/by/4.0/ | de_CH
dc.subject | Speech synthesis | de_CH
dc.subject | Text to speech | de_CH
dc.subject.ddc | 410.285: Computational linguistics | de_CH
dc.subject.ddc | 430: German | de_CH
dc.title | Text-to-speech pipeline for Swiss German : a comparison | de_CH
dc.type | Conference: Paper | de_CH
dcterms.type | Text | de_CH
zhaw.departement | School of Engineering | de_CH
zhaw.organisationalunit | Centre for Artificial Intelligence (CAI) | de_CH
dc.identifier.doi | 10.48550/arXiv.2305.19750 | de_CH
dc.identifier.doi | 10.21256/zhaw-30250 | -
zhaw.conference.details | 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023 | de_CH
zhaw.funding.eu | No | de_CH
zhaw.originated.zhaw | Yes | de_CH
zhaw.publication.status | publishedVersion | de_CH
zhaw.publication.review | Peer review (publication) | de_CH
zhaw.funding.snf | 200729 | de_CH
zhaw.webfeed | Natural Language Processing | de_CH
zhaw.funding.zhaw | End-to-End Low-Resource Speech Translation for Swiss German Dialects | de_CH
zhaw.author.additional | No | de_CH
zhaw.display.portrait | Yes | de_CH
Appears in collections: Publikationen School of Engineering

Files in this item:
File | Description | Size | Format
2023_Bollinger-etal_Text-to-speech-pipeline-for-Swiss-German.pdf | | 1.03 MB | Adobe PDF
Bollinger, T., Deriu, J. M., & Vogel, M. (2023, June). Text-to-speech pipeline for Swiss German : a comparison. 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. https://doi.org/10.48550/arXiv.2305.19750
Bollinger, T., Deriu, J.M. and Vogel, M. (2023) ‘Text-to-speech pipeline for Swiss German : a comparison’, in 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. arXiv. Available at: https://doi.org/10.48550/arXiv.2305.19750.
T. Bollinger, J. M. Deriu, and M. Vogel, “Text-to-speech pipeline for Swiss German : a comparison,” in 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023, Jun. 2023. doi: 10.48550/arXiv.2305.19750.
BOLLINGER, Tobias, Jan Milan DERIU and Manfred VOGEL, 2023. Text-to-speech pipeline for Swiss German : a comparison. In: 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. Conference paper. arXiv. June 2023.
Bollinger, Tobias, Jan Milan Deriu, and Manfred Vogel. 2023. “Text-to-Speech Pipeline for Swiss German : A Comparison.” Conference paper. In 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. arXiv. https://doi.org/10.48550/arXiv.2305.19750.
Bollinger, Tobias, et al. “Text-to-Speech Pipeline for Swiss German : A Comparison.” 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023, arXiv, 2023, https://doi.org/10.48550/arXiv.2305.19750.


All items in this repository are protected by copyright, unless otherwise indicated.