From the National Corpus of Polish to the Polish Corpus Infrastructure

The National Corpus of Polish emerged as a cumulative result of many years of work on large reference corpora by computer scientists and linguists in Poland. While its impact on research in linguistics, humanities and language technology is unquestionable and highly significant, the construction of the national corpus was halted in 2011. In the paper we call for activating the research community and funding institutions around the construction of a corpus infrastructure with the national corpus at its heart. It is claimed that on the verge of an artificial intelligence revolution the envisaged Polish Corpus Infrastructure would provide reliable language data, combine available resources and allow easy integration of new ones.

Sprache:: Englisch

Zeitrahmen der Veröffentlichung:: 2 Hefte pro Jahr
Fachgebiete der Zeitschrift:: Linguistik und Semiotik, Theorien und Fachgebiete, Linguistik, andere

Zeitschrift RSS Feed

From the National Corpus of Polish to the Polish Corpus Infrastructure

Maciej Ogrodniczuk

Rafał L. Górski

Marek Łaziński

Piotr Pęzik

Online veröffentlicht: 21. Dez. 2019

Seitenbereich: 315 - 323

DOI: https://doi.org/10.2478/jazcas-2019-0061

Schlüsselwörtercorpus linguistics, corpus lexicography, dialect corpora

© 2019 Maciej Ogrodniczuk et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Schlüsselwörter
corpus linguistics, corpus lexicography, dialect corpora