Corpus of Slovak Legislative Documents

Radovan Garabík

Open Access

Published Online: Mar 27, 2023

Journal of Linguistics/Jazykovedný časopis

Volume 73 (2022): Issue 2 (September 2022)

About this article

Page range: 175 - 189

DOI: https://doi.org/10.2478/jazcas-2023-0004

Keywords
corpus, Slovak language, body of law, legislation

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

The article describes the construction of a corpus of Slovak legislative documents. By analyzing several statistical properties of the source metadata and documents, we improve corpus quality efficiently. We describe the methods used to clean up small variations in the metadata and to perform length-based discrimination of documents, and we examine the effectiveness of several deduplication strategies. The corpus is part of a comparable corpus of legislative documents in seven languages, created in the Multilingual Resources for CEF.AT in the Legal Domain (MARCELL) project.
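As a rough illustration of the kind of cleanup the abstract mentions, the following minimal sketch combines length-based discrimination with one simple deduplication strategy (exact matching after text normalization). It is not the article's actual pipeline: the function names, the `min_tokens` threshold, and the choice of SHA-256 hashing over normalized text are all assumptions made for the example.

```python
import hashlib
import re
import unicodedata


def normalize(text: str) -> str:
    """Normalize Unicode form, lowercase, and collapse whitespace."""
    text = unicodedata.normalize("NFC", text).lower()
    return re.sub(r"\s+", " ", text).strip()


def filter_and_deduplicate(docs, min_tokens=5):
    """Drop very short documents, then keep the first copy of each duplicate.

    `docs` is an iterable of (doc_id, text) pairs. `min_tokens` is an
    arbitrary illustrative threshold, not a value taken from the article.
    """
    seen = set()
    kept = []
    for doc_id, text in docs:
        norm = normalize(text)
        if len(norm.split()) < min_tokens:
            continue  # length-based discrimination: too short to keep
        digest = hashlib.sha256(norm.encode("utf-8")).hexdigest()
        if digest in seen:
            continue  # exact duplicate after normalization
        seen.add(digest)
        kept.append((doc_id, text))
    return kept
```

Exact hashing only catches documents that become identical after normalization; near-duplicates that differ by a few words would need a different strategy, such as shingle-based similarity.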

eISSN: 1338-4287
Language: English

Publication timeframe: 2 times per year
Journal Subjects: Linguistics and Semiotics, Theoretical Frameworks and Disciplines, Linguistics, other

© 2022 Radovan Garabík, published by Sciendo
