Corpora of academic texts | CLARIN ERIC
Corpora of academic texts contain scholarly writing: research papers, essays, and abstracts published in academic journals, conference proceedings, and edited volumes; theses written by students at the undergraduate and graduate levels; and scientific monographs.
The CLARIN ERIC infrastructure gives access to 22 corpora of academic texts, 2 of which are multilingual and 20 monolingual. The available corpora contain scholarly texts in the following 11 languages: Czech, English, Estonian, Finnish, French, German, Greek, Russian, Slovenian, Spanish, and Swedish. More than 15 different scholarly disciplines are represented, with the most prominent being linguistics, computer science, economics, and medicine. The majority of the corpora are richly tagged and available under public licences.
We first provide overviews of the corpora that are already part of the CLARIN infrastructure and then list those that have not yet been integrated.
Tue Mar 3 08:21:59 2020
https://www.clarin.eu/resource-families/corpora-academic-texts