Please use this identifier to cite or link to this item: https://doi.org/10.21256/zhaw-30386
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHelland, Solveig-
dc.contributor.authorGavagnin, Elena-
dc.contributor.authorde Spindler, Alexandre-
dc.date.accessioned2024-03-27T12:40:43Z-
dc.date.available2024-03-27T12:40:43Z-
dc.date.issued2023-
dc.identifier.urihttps://aclanthology.org/2023.swisstext-1.7de_CH
dc.identifier.urihttps://digitalcollection.zhaw.ch/handle/11475/30386-
dc.description.abstractThe growing capabilities of transformer models pave the way for solving increasingly complex NLP tasks. A key to supporting applicationspecific requirements is the ability to fine-tune. However, compiling a fine-tuning dataset tailored to complex tasks is tedious and results in large datasets, limiting the ability to control transformer output. We present an approach in which complex tasks are divided into simpler subtasks. Multiple transformer models are fine-tuned to one subtask each, and lined up to accomplish the complex task. This simplifies the compilation of fine-tuning datasets and increases overall controllability. Using the example of reducing gender bias as a complex task, we demonstrate our approach and show that it performs better than using a single model.de_CH
dc.language.isoende_CH
dc.publisherAssociation for Computational Linguisticsde_CH
dc.rightshttp://creativecommons.org/licenses/by/4.0/de_CH
dc.subject.ddc410.285: Computerlinguistikde_CH
dc.titleDivide et impera : multi-transformer architectures for complex NLP-tasksde_CH
dc.typeKonferenz: Paperde_CH
dcterms.typeTextde_CH
zhaw.departementSchool of Management and Lawde_CH
zhaw.organisationalunitInstitut für Wirtschaftsinformatik (IWI)de_CH
dc.identifier.doi10.21256/zhaw-30386-
zhaw.conference.details8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023de_CH
zhaw.funding.euNode_CH
zhaw.originated.zhawYesde_CH
zhaw.pages.end75de_CH
zhaw.pages.start70de_CH
zhaw.publication.statuspublishedVersionde_CH
zhaw.publication.reviewPeer review (Publikation)de_CH
zhaw.title.proceedingsProceedings of the 8th edition of the Swiss Text Analytics Conferencede_CH
zhaw.author.additionalNode_CH
zhaw.display.portraitYesde_CH
Appears in collections:Publikationen School of Management and Law

Files in This Item:
File Description SizeFormat 
2023_Helland-etal_Multi-transformer-architectures-for-complex-NLP-tasks.pdf157.83 kBAdobe PDFThumbnail
View/Open
Show simple item record
Helland, S., Gavagnin, E., & de Spindler, A. (2023). Divide et impera : multi-transformer architectures for complex NLP-tasks [Conference paper]. Proceedings of the 8th Edition of the Swiss Text Analytics Conference, 70–75. https://doi.org/10.21256/zhaw-30386
Helland, S., Gavagnin, E. and de Spindler, A. (2023) ‘Divide et impera : multi-transformer architectures for complex NLP-tasks’, in Proceedings of the 8th edition of the Swiss Text Analytics Conference. Association for Computational Linguistics, pp. 70–75. Available at: https://doi.org/10.21256/zhaw-30386.
S. Helland, E. Gavagnin, and A. de Spindler, “Divide et impera : multi-transformer architectures for complex NLP-tasks,” in Proceedings of the 8th edition of the Swiss Text Analytics Conference, 2023, pp. 70–75. doi: 10.21256/zhaw-30386.
HELLAND, Solveig, Elena GAVAGNIN und Alexandre DE SPINDLER, 2023. Divide et impera : multi-transformer architectures for complex NLP-tasks. In: Proceedings of the 8th edition of the Swiss Text Analytics Conference [online]. Conference paper. Association for Computational Linguistics. 2023. S. 70–75. Verfügbar unter: https://aclanthology.org/2023.swisstext-1.7
Helland, Solveig, Elena Gavagnin, and Alexandre de Spindler. 2023. “Divide et Impera : Multi-Transformer Architectures for Complex NLP-Tasks.” Conference paper. In Proceedings of the 8th Edition of the Swiss Text Analytics Conference, 70–75. Association for Computational Linguistics. https://doi.org/10.21256/zhaw-30386.
Helland, Solveig, et al. “Divide et Impera : Multi-Transformer Architectures for Complex NLP-Tasks.” Proceedings of the 8th Edition of the Swiss Text Analytics Conference, Association for Computational Linguistics, 2023, pp. 70–75, https://doi.org/10.21256/zhaw-30386.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.