Accesso libero

Lemmatization of the DIA1900 Diachronic Corpus

,  e   
25 dic 2023
INFORMAZIONI SU QUESTO ARTICOLO

Cita
Scarica la copertina

This paper focuses on the process of lemmatization of the upcoming Czech diachronic corpus of the second half of the 19th century, DIA1900. The article describes different approaches to the corpus lemmatization of synchronic written, spoken and diachronic corpora within the Czech National Corpus project, including single- and multilevel lemmatization and available tools used to link the variants.

Lingua:
Inglese
Frequenza di pubblicazione:
2 volte all'anno
Argomenti della rivista:
Linguistica e semiotica, Strutture teoretiche e discipline, Linguistica, altro