The PAISA' Corpus of Italian Web Texts
LENCI, ALESSANDRO;
2014-01-01
Abstract
PAISA' is a Creative Commons licensed, large web corpus of contemporary Italian. We describe the design, harvesting, and processing steps involved in its creation.

Files in this item:
File | Size | Format |
---|---|---|
W14-0406.pdf (open access; type: post-print document; license: Creative Commons) | 150.07 kB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.