Compilation, recycling and standardization in a Collaborative Corpus of Linguistics

methodological approaches

Authors

  • Guilherme Fromm Universidade Federal de Uberlândia
  • Márcio Issamu Yamamoto Universidade Federal de Jataí

DOI:

https://doi.org/10.17851/2237-2083.29.3.2041-2078

Keywords:

Linguistics, Corpus Linguistics, Collaborative corpus, Domain Tree, Terminography

Abstract

This text describes a work experience with a collaborative corpus, developed during the last 10 years (2010-2020) with students in several undergraduate and graduate classes and students working on scientific undergraduate research projects at Universidade Federal de Uberlândia (already partially described by Fromm, 2013, and Fromm and Yamamoto, 2013). The work starts from the corpus elaboration methodology (including its history) as a step towards a specific type of research (terminographic), goes through an analysis to point out and solve compilation problems and ends in its adequacy and standardization for reuse. The result is a robust, well-balanced, bilingual (English/Portuguese) corpus that can be used in numerous other studies in the area of Linguistics.

Downloads

Download data is not yet available.

Published

2024-10-06

How to Cite

FROMM, G.; YAMAMOTO, M. I. Compilation, recycling and standardization in a Collaborative Corpus of Linguistics: methodological approaches. Revista de Estudos da Linguagem, [S. l.], v. 29, n. 3, p. 2041–2078, 2024. DOI: 10.17851/2237-2083.29.3.2041-2078. Disponível em: https://periodicos.ufmg.br/index.php/relin/article/view/54465. Acesso em: 24 nov. 2024.

Issue

Section

Número Atemático 29:3