Published Online: Jul 24, 2020
Page range: 23 - 27
Received: Jun 01, 2019
Accepted: Jan 01, 2020
DOI: https://doi.org/10.2478/lf-2019-0002
© 2020 Dan Faltýnek et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.
In this article, we examine the similarity between epigenetic marks in DNA and hapax legomena in language; a grammar description is designed on the basis of these hapaxes. We build on the hapax analysis of Czech provided by Novotná (2013) but avoid random selection of the corpus. For this reason, we analyze a corpus of 12 authentic books by 12 authors, each of whom elaborated the theme “What’s new in…” for their own field of science, as assigned by the publisher Nová beseda. By analyzing a middle-sized corpus, we expected results similar to those obtained from a large-scale national corpus (see Novotná 2013). We classified hapaxes into categories different from those of Novotná, yet the results show similar productive categories of language. This kind of language potentiality seems to be analogous to epigenetic processes in biology, which are briefly introduced.
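As a rough illustration of what identifying hapax legomena involves, the following minimal Python sketch collects the word forms that occur exactly once across a set of texts. It is not the procedure used in the article: the function name `find_hapaxes`, the regular-expression tokenization, the lowercasing, and the two sample sentences are all assumptions made for illustration, and the abstract does not state whether hapaxes were counted over word forms or lemmas.

```python
import re
from collections import Counter


def find_hapaxes(texts):
    """Return, in sorted order, the word forms occurring exactly once in all texts combined.

    Hypothetical helper for illustration only; the tokenization is an assumption.
    """
    counts = Counter()
    for text in texts:
        # \w+ matches runs of Unicode word characters (letters, digits, underscore),
        # so Czech letters with diacritics stay inside their tokens.
        counts.update(re.findall(r"\w+", text.lower()))
    return sorted(word for word, n in counts.items() if n == 1)


# Invented sample sentences, not taken from the corpus described above.
sample = ["Nový gen a nový jazyk.", "Jazyk a epigenetika."]
print(find_hapaxes(sample))  # ['epigenetika', 'gen']
```

Counting over the whole corpus rather than per book matters here: a word that appears once in each of two books is not a hapax of the corpus. A lemmatizer could be inserted before counting if hapaxes are to be defined over lemmas instead of surface forms.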