The paper illustrates the design and development of a textual corpus repre- sentative of the historical variants of Ital- ian during the Great War, which was en- riched with linguistic (lemmatization and pos-tagging) and meta-linguistic annota- tion. The corpus, after a manual revision of the linguistic annotation, was used for specializing existing NLP tools to process historical texts with promising results.

Italian in the Trenches: Linguistic Annotation and Analysis of Texts of the Great War

Irene De Felice
Primo
;
Alessandro Lenci
Penultimo
;
2018-01-01

Abstract

The paper illustrates the design and development of a textual corpus repre- sentative of the historical variants of Ital- ian during the Great War, which was en- riched with linguistic (lemmatization and pos-tagging) and meta-linguistic annota- tion. The corpus, after a manual revision of the linguistic annotation, was used for specializing existing NLP tools to process historical texts with promising results.
2018
978-88-31978-41-5
File in questo prodotto:
File Dimensione Formato  
DeFelice_etal_clic-it_2018.pdf

accesso aperto

Descrizione: Articolo principale
Tipologia: Versione finale editoriale
Licenza: Creative commons
Dimensione 149.08 kB
Formato Adobe PDF
149.08 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11568/953558
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact