Search
Now showing items 1-6 of 6
-
High-Accuracy Phrase Translation Acquisition Through Battle-Royale Selection
(RANLP 2011 Organising Committee / ACL, 2013)In this paper, we report on an unsupervised greedy-style process for acquiring phrase translations from sentence-aligned parallel corpora. Thanks to innovative selection strategies, this process can acquire multiple ... -
Correcting OCR errors for German in Fraktur font
(2014)In this paper, we present ongoing experiments for correcting OCR errors on German newspapers in Fraktur font. Our approach borrows from techniques for spelling correction in context using a probabilistic edit-operation ... -
enetCollect: A New European Network for combining Language Learning with Crowdsourcing Techniques
(2018)We present enetCollect, a large European COST action network set upwith the aim of promoting a research trend combining the well-established domain of Language Learning with recent and successful crowdsourcing approaches. ... -
Open Corpus Interface for Italian Language Learning
(libreriauniversitaria.it, 2013)In this article, we present the multi-faceted interface to the open PAISà corpus of Italian. Created within the project PAISà (Piattaforma per l’Apprendimento dell’Italiano Su corpora Annotati) [1], the corpus is designed ... -
A new Cost Action of interest to lexicographers: European Network for Combining Language Learning with Crowdsourcing Techniques (enetCollect)
(2017)We present enetCollect, a newly funded COST Action that aims at enhancing the production of language learning material (such as lesson content) and the creation and extension of language-related datasets (such as lexicographic ... -
StirWaC: compiling a diverse corpus based on texts from the web for South Tyrolean German
(2013)In this paper, we report on the creation of a web corpus for the variety of German spoken in South Tyrol. We hence provide an example for the compilation of a corpus for a language variety that has neighboring varieties ...