DoQA : accessing domain-specific FAQs via conversational QA

Campos, Jon Ander; Otegi, Arantxa; Soroa, Aitor; Deriu, Jan Milan; Cieliebak, Mark; Agirre, Eneko

doi:10.18653/v1/2020.acl-main.652

Please use this identifier to cite or link to this item: https://doi.org/10.21256/zhaw-20320

Full metadata record

DC Field	Value	Language
dc.contributor.author	Campos, Jon Ander	-
dc.contributor.author	Otegi, Arantxa	-
dc.contributor.author	Soroa, Aitor	-
dc.contributor.author	Deriu, Jan Milan	-
dc.contributor.author	Cieliebak, Mark	-
dc.contributor.author	Agirre, Eneko	-
dc.date.accessioned	2020-08-05T15:22:33Z	-
dc.date.available	2020-08-05T15:22:33Z	-
dc.date.issued	2020	-
dc.identifier.uri	https://digitalcollection.zhaw.ch/handle/11475/20320	-
dc.description.abstract	The goal of this work is to build conversational Question Answering (QA) interfaces for the large body of domain-specific information available in FAQ sites. We present DoQA, a dataset with 2,437 dialogues and 10,917 QA pairs. The dialogues are collected from three Stack Exchange sites using the Wizard of Oz method with crowdsourcing. Compared to previous work, DoQA comprises well-defined information needs, leading to more coherent and natural conversations with fewer factoid questions, and it is multi-domain. In addition, we introduce a more realistic information retrieval (IR) scenario where the system needs to find the answer in any of the FAQ documents. The results of a strong existing system show that, thanks to transfer learning from a Wikipedia QA dataset and fine-tuning on a single FAQ domain, it is possible to build high-quality conversational QA systems for FAQs without in-domain training data. The good results carry over into the more challenging IR scenario. In both cases, there is still ample room for improvement, as indicated by the higher human upper bound.	de_CH
dc.language.iso	en	de_CH
dc.publisher	Association for Computational Linguistics	de_CH
dc.rights	http://creativecommons.org/licenses/by/4.0/	de_CH
dc.subject	Question answering	de_CH
dc.subject	Deep learning	de_CH
dc.subject	Natural language processing	de_CH
dc.subject.ddc	006: Special computer methods	de_CH
dc.subject.ddc	400: Language and linguistics	de_CH
dc.title	DoQA : accessing domain-specific FAQs via conversational QA	de_CH
dc.type	Conference: Paper	de_CH
dcterms.type	Text	de_CH
zhaw.departement	School of Engineering	de_CH
zhaw.organisationalunit	Institut für Informatik (InIT)	de_CH
dc.identifier.doi	10.18653/v1/2020.acl-main.652	de_CH
dc.identifier.doi	10.21256/zhaw-20320	-
zhaw.conference.details	58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), online, 5-10 July 2020	de_CH
zhaw.funding.eu	No	de_CH
zhaw.originated.zhaw	Yes	de_CH
zhaw.pages.end	7314	de_CH
zhaw.pages.start	7302	de_CH
zhaw.publication.status	publishedVersion	de_CH
zhaw.publication.review	Peer review (publication)	de_CH
zhaw.title.proceedings	Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics	de_CH
zhaw.webfeed	Software Systems	de_CH
zhaw.webfeed	Natural Language Processing	de_CH
zhaw.funding.zhaw	LIHLITH - Learning to Interact with Humans by Lifelong Interaction with Humans	de_CH
zhaw.author.additional	No	de_CH
zhaw.display.portrait	Yes	de_CH
Appears in collections:	Publications School of Engineering

Files in This Item:

File	Description	Size	Format
2020_Campos-etal_DoQA_ACL.pdf		590.12 kB	Adobe PDF

Campos, J. A., Otegi, A., Soroa, A., Deriu, J. M., Cieliebak, M., & Agirre, E. (2020). DoQA : accessing domain-specific FAQs via conversational QA [Conference paper]. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 7302–7314. https://doi.org/10.18653/v1/2020.acl-main.652

Campos, J.A. et al. (2020) ‘DoQA : accessing domain-specific FAQs via conversational QA’, in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 7302–7314. Available at: https://doi.org/10.18653/v1/2020.acl-main.652.

J. A. Campos, A. Otegi, A. Soroa, J. M. Deriu, M. Cieliebak, and E. Agirre, “DoQA : accessing domain-specific FAQs via conversational QA,” in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020, pp. 7302–7314. doi: 10.18653/v1/2020.acl-main.652.

CAMPOS, Jon Ander, Arantxa OTEGI, Aitor SOROA, Jan Milan DERIU, Mark CIELIEBAK and Eneko AGIRRE, 2020. DoQA : accessing domain-specific FAQs via conversational QA. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Conference paper. Association for Computational Linguistics. 2020. pp. 7302–7314

Campos, Jon Ander, Arantxa Otegi, Aitor Soroa, Jan Milan Deriu, Mark Cieliebak, and Eneko Agirre. 2020. “DoQA : Accessing Domain-Specific FAQs via Conversational QA.” Conference paper. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 7302–14. Association for Computational Linguistics. https://doi.org/10.18653/v1/2020.acl-main.652.

Campos, Jon Ander, et al. “DoQA : Accessing Domain-Specific FAQs via Conversational QA.” Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, 2020, pp. 7302–14, https://doi.org/10.18653/v1/2020.acl-main.652.