Verena Lyding, Egon W. Stemle, C Borghetti, M Brunello, S Castagnoli, F Dell'Orletta, H Dittmann, A Lenci and V Pirrelli
Proceedings of the 9th Web as Corpus Workshop (WaC-9), pp. 36-43
9th Web as Corpus Workshop (WaC-9) (Gothenburg, 26/04/2014)
2014
Handle:
https://hdl.handle.net/10863/8855
Abstract
PAISÀ is a Creative Commons licensed, large web corpus of contemporary Italian. We describe the design, harvesting, and processing steps involved in its creation.