Logo image
Correcting OCR errors for German in Fraktur font
Conference proceeding   Peer reviewed

Correcting OCR errors for German in Fraktur font

Proceedings of the First Italian Conference on Computational Linguistics (CLiC-it 2014) http://www.fileli.unipi.it/projects/clic/proceedings/Proceedings-CLICit-2014.pdf, pp.186-190
First Italian Conference on Computational Linguistics CLiC-it 2014 (Pisa, 09/12/2014 - 10/12/2014)
2014
Handle:
https://hdl.handle.net/10863/8878

Abstract

In this paper, we present ongoing experiments for correcting OCR errors on German newspapers in Fraktur font. Our approach borrows from techniques for spelling correction in context using a probabilistic edit-operation error model and lexical resources. We highlight conditions in which high error reduction rates can be obtained and where the approach currently stands with real data.
url
http://www.fileli.unipi.it/projects/clic/en/proceedings.htmlView
url
http://clic2014.fileli.unipi.it/proceedings/vol1/CLICIT2014136.pdfView

Details

Metrics

32 Record Views