Now showing items 1-8 of 8
CLARIN-IT: State of Affairs, Challenges and Opportunities
This paper gives an overview on the Italian national CLARIN consortium as it currently stands two years after its creation at the end of 2015. It thus discusses the current state of affairs of the consortium on several ...
A Generic Data Workflow for Building Annotated Text Corpora
(Peter Lang, 2015)
We present an abstract and generic workflow, and detail how it has been implemented to build and annotate learner corpora. This workflow has been developed through an interdisciplinary collaboration between linguists, who ...
The MERLIN corpus: Learner language and the CEFR
(European Language Resources Association (ELRA), 2014)
The MERLIN corpus is a written learner corpus for Czech, German, and Italian that has been designed to illustrate the Common European Framework of Reference for Languages (CEFR) with authentic learner data. The corpus ...
MERLIN: An Online Trilingual Learner Corpus Empirically Grounding the European Reference Levels in Authentic Learner Data
Since its publication in 2001, the Common European Framework of Reference for Languages (CEFR) has gained a leading role as an instrument of reference for language teaching and certification. Nonetheless, there is a growing ...
An extended version of the KoKo German L1 Learner corpus
This paper describes an ex- tended version of the KoKo corpus (ver- sion KoKo4, Dec 2015), a corpus of written German L1 learner texts from three different German-speaking regions in three different countries. The KoKo ...
A Trilingual Learner Corpus illustrating European Reference Levels
Since its publication in 2001, the Common European Framework of Reference for Languages (CEFR) has gained a leading role as an instrument of reference for language teaching and certification and for the development of ...