Title: Corpus PaGeS : a multifunctional resource for language learning, translation and cross-linguistic research
Authors : Doval, Irene
Fernández Lanza, Santiago
Jiménez Juliá, Tomás
Liste Lamas, Elsa
Lübke, Barbara
Published in : Parallel corpora for contrastive and translation studies : new resources and applications
Pages : 103
Pages to: 121
Editors of the parent work: Doval, Irene
Sánchez Nieto, M. Teresa
Publisher / Ed. Institution : John Benjamins
Publisher / Ed. Institution: Amsterdam
Issue Date: 2019
License (according to publishing contract) : Licence according to publishing contract
Type of review: Editorial review
Language : English
Subject (DDC) : 400: Language, linguistics
418.02: Translating and interpreting
Abstract: This chapter presents the bilingual parallel corpus PaGeS, compiled by the research group SpatiAlEs from the University of Santiago de Compostela. PaGeS currently amounts to nearly 20 million tokens and consists of texts originally written in German and in Spanish and their correspondent translations into the other language, as well as a small portion of German and Spanish translations from third languages. The present contribution introduces the main characteristics of the PaGeS corpus, focusing on its design and compilation. It first explains the criteria for the selection of the texts and the details of text pre-processing, automatic alignment and manual review. It then addresses the search and display features describing the server architecture and indexing process. Finally, the intended development of the PaGeS corpus is briefly discussed.
Departement: Applied Linguistics
Organisational Unit: Institute of Language Competence (ILC)
Publication type: Book Part
ISBN: 9789027202345
URI: https://digitalcollection.zhaw.ch/handle/11475/15667
Appears in Collections:Publikationen Angewandte Linguistik

Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.