The chapter presents and discusses the creation of the spoken corpus LIPS (Lexicon of Spoken Italian by Foreigners). The corpus is based on proficiency exams in Italian as L2 taken at the University for Foreigners of Siena (Università per Stranieri di Siena), and created mainly to enable studies in L2 Italian vocabulary acquisition. The corpus is also organized in a way that facilitates comparisons with the equivalent native speaker corpora. Methodological choices related to transcription norms, lemmatization and part-of-speeh tagging are discussed and motivated. Methods of analysis of native and non-native speakers' lexical richness are discussed as well.
LIPS: Creating a Learner Corpus
Francesca Gallina
2017-01-01
Abstract
The chapter presents and discusses the creation of the spoken corpus LIPS (Lexicon of Spoken Italian by Foreigners). The corpus is based on proficiency exams in Italian as L2 taken at the University for Foreigners of Siena (Università per Stranieri di Siena), and created mainly to enable studies in L2 Italian vocabulary acquisition. The corpus is also organized in a way that facilitates comparisons with the equivalent native speaker corpora. Methodological choices related to transcription norms, lemmatization and part-of-speeh tagging are discussed and motivated. Methods of analysis of native and non-native speakers' lexical richness are discussed as well.File | Dimensione | Formato | |
---|---|---|---|
Gallina_2017_LIPS creating a Learner Corpus.pdf
solo utenti autorizzati
Tipologia:
Versione finale editoriale
Licenza:
NON PUBBLICO - Accesso privato/ristretto
Dimensione
7.69 MB
Formato
Adobe PDF
|
7.69 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.