Please use this identifier to cite or link to this item: https://doi.org/10.21256/zhaw-19637
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBrunner, Ursin-
dc.contributor.authorStockinger, Kurt-
dc.date.accessioned2020-03-05T12:54:42Z-
dc.date.available2020-03-05T12:54:42Z-
dc.date.issued2020-03-
dc.identifier.isbn978-3-89318-083-7de_CH
dc.identifier.urihttps://digitalcollection.zhaw.ch/handle/11475/19637-
dc.description.abstractTransformer architectures have proven to be very effective and provide state-of-the-art results in many natural language tasks. The attention-based architecture in combination with pre-training on large amounts of text lead to the recent breakthrough and a variety of slightly different implementations. In this paper we analyze how well four of the most recent attention-based transformer architectures (BERT, XLNet, RoBERTa and DistilBERT) perform on the task of entity matching - a crucial part of data integration. Entity matching (EM) is the task of finding data instances that refer to the same real-world entity. It is a challenging task if the data instances consist of long textual data or if the data instances are "dirty" due to misplaced values. To evaluate the capability of transformer architectures and transfer-learning on the task of EM, we empirically compare the four approaches on inherently difficult data sets. We show that transformer architectures outperform classical deep learning methods in EM by an average margin of 27.5%.de_CH
dc.language.isoende_CH
dc.publisherOpenProceedingsde_CH
dc.rightshttp://creativecommons.org/licenses/by-nc-nd/4.0/de_CH
dc.subjectEntity matchingde_CH
dc.subjectData integrationde_CH
dc.subjectMachine learningde_CH
dc.subjectNeural networksde_CH
dc.subjectTransformersde_CH
dc.subjectBERTde_CH
dc.subject.ddc006: Spezielle Computerverfahrende_CH
dc.titleEntity matching with transformer architectures - a step forward in data integrationde_CH
dc.typeKonferenz: Paperde_CH
dcterms.typeTextde_CH
zhaw.departementSchool of Engineeringde_CH
zhaw.organisationalunitInstitut für Informatik (InIT)de_CH
dc.identifier.doi10.5441/002/edbt.2020.58de_CH
dc.identifier.doi10.21256/zhaw-19637-
zhaw.conference.details23rd International Conference on Extending Database Technology, Copenhagen, 30 March - 2 April 2020de_CH
zhaw.funding.euNode_CH
zhaw.originated.zhawYesde_CH
zhaw.pages.end473de_CH
zhaw.pages.start463de_CH
zhaw.publication.statuspublishedVersionde_CH
zhaw.publication.reviewPeer review (Publikation)de_CH
zhaw.title.proceedingsProceedings of EDBT 2020de_CH
zhaw.webfeedDatalabde_CH
zhaw.webfeedInformation Engineeringde_CH
zhaw.author.additionalNode_CH
Appears in collections:Publikationen School of Engineering

Files in This Item:
File Description SizeFormat 
Entity_Machting_with_Transformers_edbt_2020__Camera_Ready.pdfEntity Machting with Transformers EDBT 20201.12 MBAdobe PDFThumbnail
View/Open
Show simple item record
Brunner, U., & Stockinger, K. (2020). Entity matching with transformer architectures - a step forward in data integration [Conference paper]. Proceedings of EDBT 2020, 463–473. https://doi.org/10.5441/002/edbt.2020.58
Brunner, U. and Stockinger, K. (2020) ‘Entity matching with transformer architectures - a step forward in data integration’, in Proceedings of EDBT 2020. OpenProceedings, pp. 463–473. Available at: https://doi.org/10.5441/002/edbt.2020.58.
U. Brunner and K. Stockinger, “Entity matching with transformer architectures - a step forward in data integration,” in Proceedings of EDBT 2020, Mar. 2020, pp. 463–473. doi: 10.5441/002/edbt.2020.58.
BRUNNER, Ursin und Kurt STOCKINGER, 2020. Entity matching with transformer architectures - a step forward in data integration. In: Proceedings of EDBT 2020. Conference paper. OpenProceedings. März 2020. S. 463–473. ISBN 978-3-89318-083-7
Brunner, Ursin, and Kurt Stockinger. 2020. “Entity Matching with Transformer Architectures - a Step Forward in Data Integration.” Conference paper. In Proceedings of EDBT 2020, 463–73. OpenProceedings. https://doi.org/10.5441/002/edbt.2020.58.
Brunner, Ursin, and Kurt Stockinger. “Entity Matching with Transformer Architectures - a Step Forward in Data Integration.” Proceedings of EDBT 2020, OpenProceedings, 2020, pp. 463–73, https://doi.org/10.5441/002/edbt.2020.58.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.