Journal Details
License
Format
Journal
eISSN
1338-4287
ISSN
0021-5597
First Published
05 Mar 2010
Publication timeframe
2 times per year
Languages
English
Access type
Open Access

Annotations in the Corpus of Texts of Students Learning Slovak as a Foreign Language (ERRKORP)

Published Online: 21 Dec 2019
Volume & Issue: Volume 70 (2019) - Issue 2 (December 2019)
Page range: 345 - 357
Abstract

The article presents the forthcoming acquisition corpus of written texts by students learning Slovak as a foreign language and focuses on the annotation of the texts, which covers information about each text as well as social and linguistic details about its author. The article also discusses the tags that identify individual errors in the texts and the concept behind the design of the tagset.


