Volume 71 (2020): Issue 2 (December 2020)
Journal Details
Format: Journal
eISSN: 1338-4287
ISSN: 0021-5597
First Published: 05 Mar 2010
Publication timeframe: 2 times per year
Languages: English
Access type: Open Access

It will certainly be found that some words are literally repeated: Horecký’s hypersyntax

Published Online: 31 Dec 2020
Volume & Issue: Volume 71 (2020) - Issue 2 (December 2020)
Page range: 185 - 196
Abstract

The article reflects on the linguistic work of Ján Horecký in connection with hypersyntax and text linguistics. In his work Základy jazykovedy (Outline of linguistics), Ján Horecký remarks (1974, p. 90) that one of the principles of text construction is the literal repetition of certain words. We discuss this assumption of Horecký’s and describe its consequences for the langue/parole opposition and for the concepts of textual isotopy and textual cohesion. The main task of the article is to examine Horecký’s assumption. For this purpose, we present an authorship attribution analysis of literary works by two Slovak authors: Svetozár Hurban Vajanský and Martin Kukučín. We focus on the low‐frequency lexicon, i.e. hapax legomena, which are supposed to be independent of authorial style (e.g. Binongo, 2003) and should reflect random circumstances of communication (de Saussure, 1996, p. 50; Bloomfield, 1933, p. 170). This means that if the structure of the text were affected by the repetition of certain words, the low‐frequency layer of the lexicon should contain evidence of this repetition with a low degree of dependence on the content and style of the literary work (Baayen, 1996). The analysis and its presentation are based on separate processing of hapax legomena and their n‐grams, cosine dissimilarity and multidimensional scaling (Torgerson, 1952). Contrary to the general notion of text structure, we conclude that the authorial texts are based on the repetition of certain word forms and word‐form combinations (as shown by n‐gram analysis), even at the level of low‐frequency words.
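The pipeline named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It rests on several assumptions that the abstract does not confirm: word forms are obtained by lowercasing and splitting on non-word characters, a hapax legomenon is a word form occurring exactly once within a single text, hapax n‐grams are runs of consecutive hapaxes in their order of appearance, and cosine dissimilarity is taken as one minus cosine similarity of the resulting profiles. All function names are hypothetical, and the multidimensional scaling step is left out.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word forms (a simplifying assumption)."""
    return re.findall(r"\w+", text.lower())


def hapax_sequence(text):
    """Return the word forms that occur exactly once in the text, in order of appearance."""
    tokens = tokenize(text)
    counts = Counter(tokens)
    return [t for t in tokens if counts[t] == 1]


def ngrams(items, n):
    """Return all runs of n consecutive items as tuples."""
    return [tuple(items[i:i + n]) for i in range(len(items) - n + 1)]


def hapax_profile(text, n=1):
    """Count hapaxes (n == 1) or n-grams over the hapax sequence (n > 1)."""
    hapaxes = hapax_sequence(text)
    units = hapaxes if n == 1 else ngrams(hapaxes, n)
    return Counter(units)


def cosine_dissimilarity(a, b):
    """Return 1 - cosine similarity of two profiles; 1.0 if either is empty."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


# Toy texts, invented purely for illustration (not data from the study).
text_a = "the old mill stood by the river and the miller slept"
text_b = "the old mill burned and the river froze"

unigram_distance = cosine_dissimilarity(hapax_profile(text_a), hapax_profile(text_b))
bigram_distance = cosine_dissimilarity(hapax_profile(text_a, 2), hapax_profile(text_b, 2))
```

A matrix of such pairwise dissimilarities between texts is the kind of input that multidimensional scaling would then project into two dimensions for visual comparison of authors.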

References

BAAYEN, Harald – VAN HALTEREN, Hans – TWEEDIE, Fiona: Outside the cave of shadows: using syntactic annotation to enhance authorship attribution. In: Literary and Linguistic Computing, 1996, Vol. 11, No. 3, pp. 121 – 132. DOI: 10.1093/llc/11.3.121

BARTHES, Roland: Smrt autora. In: Aluze, 2006, Vol. 10, No. 3, pp. 75 – 77.

BINONGO, José Nilo G.: Who wrote the 15th book of Oz? An application of multivariate analysis to authorship attribution. In: Chance, 2003, Vol. 16, No. 2, pp. 9 – 17. DOI: 10.1080/09332480.2003.10554843

BLOOMFIELD, Leonard: Language. New York: Henry Holt & Co 1933. 566 pp.

BOUKHALED, Mohamed Amine – GANASCIA, Jean‐Gabriel: Using Function Words for Authorship Attribution: Bag‐of‐Words vs. Sequential Rules. In: The 11th International Workshop on Natural Language Processing and Cognitive Science, Oct 2014, Venice, Italy. De Gruyter, Natural Language Processing and Cognitive Science Proceedings 2014, 2015, pp. 115 – 122. DOI: 10.1515/9781501501289.115

BURROWS, John: Questions of Authorship: Attribution and Beyond. In: Computers and the Humanities, 2003, Vol. 37, No. 1, pp. 5 – 32. DOI: 10.1023/A:1021814530952

ČERMÁK, František: Jazyk a jazykověda. Přehled a slovníky. Praha: Karolinum 2011. 382 pp.

ČERNÝ, Jiří: Úvod do obecné jazykovědy. Olomouc: Rubico 1998. 248 pp.

DE DEYNE, Simon – KENETT, Yoed N. – ANAKI, David – FAUST, Miriam – NAVARRO, Daniel: Large‐scale network representations of semantics in the mental lexicon. In: Frontiers of cognitive psychology. Big Data in Cognitive Science. Ed. N. M. Jones. Routledge/Taylor & Francis Group 2016, pp. 174 – 202.

DOLNÍK, Juraj: Všeobecná jazykoveda: opis a vysvetľovanie jazyka. Bratislava: Veda, vydavateľstvo SAV 2009. 376 pp.

DOLNÍK, Juraj: Jazykové znalosti a ovládanie jazyka. In: Jazykovedný časopis, 2018a, Vol. 69, No. 1, pp. 77 – 89. DOI: 10.2478/jazcas-2018-0013

DOLNÍK, Juraj: Lexikálny význam v recepcii. In: Jazykovedný časopis, 2018b, Vol. 69, No. 3, pp. 302 – 315. DOI: 10.2478/jazcas-2019-0012

FALTÝNEK, Dan: Má tedy Kroužek sedmou publikaci. In: Bohemica Olomucensia, 2013, Vol. 4, pp. 318 – 328.

FALTÝNEK, Dan – MATLACH, Vladimír: Hapax Remains: authorial features of textual cohesion in authorship attribution. Preprint, 2020. DOI: 10.13140/RG.2.2.19865.24164

FALTÝNEK, Dan – MATLACH, Vladimír – OWSIANKOVÁ, Hana: Hapax legomena jako indikátor autorského stylu a formální znak koheze textu. Preprint, 2020. DOI: 10.13140/RG.2.2.16509.79847

FORSTER, Kenneth I. – CHAMBERS, Susan M.: Lexical access and naming time. In: Journal of Verbal Learning & Verbal Behavior, 1973, Vol. 12, No. 6, pp. 627 – 635.

GREIMAS, Algirdas Julien: On Meaning: Selected Writings in Semiotic Theory. Minneapolis: University of Minnesota Press 1987. 251 pp.

HELBIG, Gerhard: Vývoj jazykovědy po roce 1970. Transl. J. Nekvapil – J. Holšánková. Praha: Academia 1991. 303 pp.

HIRSCHOVÁ, Milada: Úvod do teorie textu. Olomouc: Univerzita Palackého v Olomouci 1989. 243 pp.

HORECKÝ, Ján: Úvod do matematickej jazykovedy. Bratislava: Univerzita Komenského 1969. 92 pp.

HORECKÝ, Ján: A Model of Slovak Conjugation. In: Prague Studies in Mathematical Linguistics, 1970, No. 3, pp. 259 – 271.

HORECKÝ, Ján: Základy jazykovedy. Bratislava: Slovenské pedagogické nakladateľstvo 1978. 176 pp.

HORECKÝ, Ján: Jazykové vedomie. In: Jazykovedný časopis, 1991, Vol. 42, No. 2, pp. 81 – 88.

JUOLA, Patrick: Authorship attribution. In: Foundations and Trends in Information Retrieval, 2008, Vol. 1, No. 3, pp. 233 – 334. DOI: 10.1561/1500000005

JUOLA, Patrick – BAAYEN, Harald: A controlled‐corpus experiment in authorship identification by cross‐entropy. In: Literary and Linguistic Computing, 2003, Vol. 20, Suppl. Issue, pp. 159 – 167.

KELIH, Emmerich – ANTIĆ, Gordana – GRZYBEK, Peter – STADLOBER, Ernst: Classification of author and/or genre? The impact of word length. In: Classification: The Ubiquitous Challenge. Eds. C. Weihs – W. Gaul. Heidelberg: Springer, 2005, pp. 498 – 505.

KOPPEL, Moshe – SCHLER, Jonathan – ARGAMON, Shlomo: Computational methods in authorship attribution. In: Journal of the American Society for Information Science and Technology, 2009, Vol. 60, No. 1, pp. 9 – 26. DOI: 10.1002/asi.20961

KOŘENSKÝ, Jan: Za Jánem Horeckým. In: Slovo a slovesnost, 2007, Vol. 68, No. 1, p. 78.

ORGOŇOVÁ, Oľga – BOHUNICKÁ, Alena: Interakčná štylistika. Bratislava: Univerzita Komenského 2018. 208 pp.

PALEK, Bohumil: Základy obecné jazykovědy. Praha: SPN 1989. 288 pp.

TĚŠITELOVÁ, Marie: Profesor Ján Horecký šedesátníkem. In: Slovo a slovesnost, 1980, Vol. 41, No. 2, pp. 157 – 160.

TORGERSON, Warren S.: Multidimensional scaling: I. Theory and method. In: Psychometrika, 1952, Vol. 17, No. 4, pp. 401 – 419.

ZHAO, Ying – ZOBEL, Justin: Effective and scalable authorship attribution using function words. In: Proceedings of 2nd Asian Information Retrieval Symposium, 2005, pp. 174 – 189. DOI: 10.1007/11562382_14

ZUMRÍK, Miroslav: Ján Horecký’s Approach to Language and Thinking. In: Jazykovedný časopis, 2017, Vol. 68, No. 2, pp. 426 – 431. DOI: 10.1515/jazcas-2017-0052
