Compilation, recycling and standardization in a Collaborative Corpus of Linguistics: methodological approaches
DOI:
https://doi.org/10.17851/2237-2083.29.3.2041-2078
Keywords:
Linguistics, Corpus Linguistics, Collaborative corpus, Domain Tree, Terminography
Abstract
This text describes the experience of building a collaborative corpus, developed over ten years (2010-2020) with students in several undergraduate and graduate classes and with students working on undergraduate research projects at Universidade Federal de Uberlândia (already partially described by Fromm, 2013, and Fromm and Yamamoto, 2013). The work begins with the corpus construction methodology (including its history) as a step towards a specific type of research (terminographic), proceeds through an analysis that identifies and solves compilation problems, and ends with the adaptation and standardization of the corpus for reuse. The result is a robust, well-balanced, bilingual (English/Portuguese) corpus that can be used in numerous other studies in the area of Linguistics.