Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen: https://doi.org/10.21256/zhaw-27771
Publikationstyp: Konferenz: Paper
Art der Begutachtung: Peer review (Publikation)
Titel: VersaMatch : ontology matching with weak supervision
Autor/-in: Fürst, Jonathan
Fadel Argerich, Mauricio
Cheng, Bin
et. al: No
DOI: 10.14778/3583140.3583148
10.21256/zhaw-27771
Tagungsband: Proceedings of the VLDB Endowment
Herausgeber/-in des übergeordneten Werkes: Koutrika, Georgia
Yang, Jun
Band(Heft): 16
Heft: 6
Seite(n): 1305
Seiten bis: 1318
Angaben zur Konferenz: 49th Conference on Very Large Data Bases (VLDB), Vancouver, Canada, 28 August - 1 September 2023
Erscheinungsdatum: 2023
Verlag / Hrsg. Institution: Association for Computing Machinery
ISSN: 2150-8097
Sprache: Englisch
Schlagwörter: Ontology matching; Weak supervision; Machine learning; Data integration
Fachgebiet (DDC): 006: Spezielle Computerverfahren
Zusammenfassung: Ontology matching is crucial to data integration for across-silo data sharing and has been mainly addressed with heuristic and machine learning (ML) methods. While heuristic methods are often inflexible and hard to extend to new domains, ML methods rely on substantial and hard to obtain amounts of labeled training data. To overcome these limitations, we propose VersaMatch, a flexible, weakly-supervised ontology matching system. VersaMatch employs various weak supervision sources, such as heuristic rules, pattern matching, and external knowledge bases, to produce labels from a large amount of unlabeled data for training a discriminative ML model. For prediction, VersaMatch develops a novel ensemble model combining the weak supervision sources with the discriminative model to support generalization while retaining a high precision. Our ensemble method boosts end model performance by 4 points compared to a traditional weak-supervision baseline. In addition, compared to state-of-the-art ontology matchers, VersaMatch achieves an overall 4-point performance improvement in F1 score across 26 ontology combinations from different domains. For recently released, in-the-wild datasets, VersaMatch beats the next best matchers by 9 points in F1. Furthermore, its core weak-supervision logic can easily be improved by adding more knowledge sources and collecting more unlabeled data for training.
URI: https://digitalcollection.zhaw.ch/handle/11475/27771
Zugehörige Forschungsdaten: https://github.com/nec-research/VersaMatch
Volltext Version: Publizierte Version
Lizenz (gemäss Verlagsvertrag): CC BY-NC-ND 4.0: Namensnennung - Nicht kommerziell - Keine Bearbeitungen 4.0 International
Departement: School of Engineering
Organisationseinheit: Institut für Informatik (InIT)
Enthalten in den Sammlungen:Publikationen School of Engineering

Dateien zu dieser Ressource:
Datei Beschreibung GrößeFormat 
2023_Fuerst-etal_VersaMatch-ontology-matching-with-weak-supervision.pdf781.26 kBAdobe PDFMiniaturbild
Öffnen/Anzeigen
Zur Langanzeige
Fürst, J., Fadel Argerich, M., & Cheng, B. (2023). VersaMatch : ontology matching with weak supervision [Conference paper]. In G. Koutrika & J. Yang (Eds.), Proceedings of the VLDB Endowment (Vol. 16, Issue 6, pp. 1305–1318). Association for Computing Machinery. https://doi.org/10.14778/3583140.3583148
Fürst, J., Fadel Argerich, M. and Cheng, B. (2023) ‘VersaMatch : ontology matching with weak supervision’, in G. Koutrika and J. Yang (eds) Proceedings of the VLDB Endowment. Association for Computing Machinery, pp. 1305–1318. Available at: https://doi.org/10.14778/3583140.3583148.
J. Fürst, M. Fadel Argerich, and B. Cheng, “VersaMatch : ontology matching with weak supervision,” in Proceedings of the VLDB Endowment, 2023, vol. 16, no. 6, pp. 1305–1318. doi: 10.14778/3583140.3583148.
FÜRST, Jonathan, Mauricio FADEL ARGERICH und Bin CHENG, 2023. VersaMatch : ontology matching with weak supervision. In: Georgia KOUTRIKA und Jun YANG (Hrsg.), Proceedings of the VLDB Endowment. Conference paper. Association for Computing Machinery. 2023. S. 1305–1318
Fürst, Jonathan, Mauricio Fadel Argerich, and Bin Cheng. 2023. “VersaMatch : Ontology Matching with Weak Supervision.” Conference paper. In Proceedings of the VLDB Endowment, edited by Georgia Koutrika and Jun Yang, 16:1305–18. Association for Computing Machinery. https://doi.org/10.14778/3583140.3583148.
Fürst, Jonathan, et al. “VersaMatch : Ontology Matching with Weak Supervision.” Proceedings of the VLDB Endowment, edited by Georgia Koutrika and Jun Yang, vol. 16, no. 6, Association for Computing Machinery, 2023, pp. 1305–18, https://doi.org/10.14778/3583140.3583148.


Alle Ressourcen in diesem Repository sind urheberrechtlich geschützt, soweit nicht anderweitig angezeigt.