From the National Corpus of Polish to the Polish Corpus Infrastructure
, , und
21. Dez. 2019
Über diesen Artikel
Online veröffentlicht: 21. Dez. 2019
Seitenbereich: 315 - 323
DOI: https://doi.org/10.2478/jazcas-2019-0061
Schlüsselwörter
© 2019 Maciej Ogrodniczuk et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
The National Corpus of Polish emerged as a cumulative result of many years of work on large reference corpora by computer scientists and linguists in Poland. While its impact on research in linguistics, humanities and language technology is unquestionable and highly significant, the construction of the national corpus was halted in 2011. In the paper we call for activating the research community and funding institutions around the construction of a corpus infrastructure with the national corpus at its heart. It is claimed that on the verge of an artificial intelligence revolution the envisaged Polish Corpus Infrastructure would provide reliable language data, combine available resources and allow easy integration of new ones.