Journal & Issues

Volume 47 (2023): Issue 1 (May 2023)

Volume 46 (2022): Issue 1 (August 2022)

Volume 45 (2021): Issue 1 (May 2021)

Volume 44 (2020): Issue 1 (March 2020)

Volume 43 (2019): Issue 1 (March 2019)

Volume 42 (2018): Issue 1 (March 2018)

Volume 41 (2017): Issue 1 (March 2017)

Volume 40 (2016): Issue 1 (March 2016)

Volume 39 (2015): Issue 1 (March 2015)

Volume 38 (2014): Issue 1 (April 2014)

Journal Details
Format
Journal
eISSN
1502-5462
First Published
28 Apr 2014
Publication timeframe
1 time per year
Languages
English

Search

Volume 42 (2018): Issue 1 (March 2018)

Journal Details
Format
Journal
eISSN
1502-5462
First Published
28 Apr 2014
Publication timeframe
1 time per year
Languages
English

Search

14 Articles
Open Access

A tribute to Matti Rissanen

Published Online: 11 Apr 2018
Page range: 5 - 8

Abstract

Open Access

Three problems connected with the use of diachronic corpora

Published Online: 11 Apr 2018
Page range: 9 - 12

Abstract

Abstract

In compiling and testing the diachronic part of the Helsinki Corpus of English Texts, our project group has come across three problems which arise from the use of computer corpora in studies of syntax and vocabulary. While these problems are mainly associated with work on diachronic corpora, they may be universal enough to deserve somewhat more general consideration. They could be called “The philologist’s dilemma”, “God’s truth fallacy”, and “The mystery of vanishing reliability”. The first could be described as pedagogical, the second methodological and the third pragmatic.

Open Access

Is there a correlation between form and function? A syntactic and functional investigation of the introductory it pattern in student writing

Published Online: 11 Apr 2018
Page range: 13 - 40

Abstract

Abstract

The introductory it pattern, as in ‘It is important to note that information was added’, is a tool used by academic writers for a range of different rhetorical and information-structural purposes. It is thus an important pattern for students to learn. Since previous research on student writing has indicated that there seems to be a correlation between form and function of the pattern, the present study sets out to investigate this more systematically in non-native-speaker and nativespeaker student writing in two disciplines (linguistics and literature). In doing so, the study adds to and extends previous research looking into factors such as NS status and discipline. It uses data from three corpora: ALEC, BAWE and MICUSP. The results show that there is indeed a correlation between form and function, as the most common syntactic types of the pattern each display a preferred function and vice versa. While very few differences across NS status were found, there were certain discipline-specific disparities. The findings, which could be useful for teaching students about the use of the introductory it pattern, also have implications for the automatized functional tagging of parsed corpora.

Open Access

Revisiting the ENL-ESL-EFL continuum: A multifactorial approach to grammatical aspect in spoken Englishes

Published Online: 11 Apr 2018
Page range: 41 - 78

Abstract

Abstract

This study focuses on the progressive vs. non-progressive alternation to revisit the debate on the ENL-ESL-EFL continuum (i.e. whether native (ENL) and nonnative (ESL/EFL) Englishes are dichotomous types of English or form a gradient continuum). While progressive marking is traditionally studied independently of its unmarked counterpart, we examine (i) how the grammatical contexts of both constructions systematically affect speakers’ constructional choices in ENL (American, British), ESL (Indian, Nigerian and Singaporean) and EFL (Finnish, French and Polish learner Englishes) and (ii) what light speakers’ varying constructional choices bring to the continuum debate. Methodologically, we use a clustering technique to group together individual varieties of English (i.e. to identify similarities and differences between those varieties) based on linguistic contextual features such as AKTIONSART, ANIMACY, SEMANTIC DOMAIN (of aspect-bearing lexical verb), TENSE, MODALITY and VOICE to assess the validity of the ENL-ESL-EFL classification for our data. Then, we conduct a logistic regression analysis (based on lemmas observed in both progressive and non-progressive constructions) to explore how grammatical contexts influence speakers’ constructional choices differently across English types. While, overall, our cluster analysis supports the ENL-ESL-EFL classification as a useful theoretical framework to explore cross-variety variation, the regression shows that, when we start digging into the specific linguistic contexts of (non-)progressive constructions, this classification does not systematically transpire in the data in a uniform manner. Ultimately, by including more than one statistical technique into their exploration of the continuum, scholars could avoid potential methodological biases.

Open Access

Charting orthographical reliability in a corpus of English historical letters

Published Online: 11 Apr 2018
Page range: 79 - 96

Abstract

Abstract

Research into orthography in the history of English is not a simple venture. The history of English spelling is primarily based on printed texts, which fail to capture the range of variation inherent in the language; many manuscript phenomena are simply not found in printed texts. Manuscript-based corpora would be the ideal research data, but as this is resource-intensive, linguists use editions that have been produced by non-linguists. Many editions claim to retain original spellings, but in practice text is always normalized at the graph level and possibly more so. This does not preclude using such a corpus for orthographical research, but there has been no systematic way to determine the philological reliability of an edited text. In this paper we present a typological methodology we are developing for the evaluation of orthographical quality of edition-based corpora, with the aim of making the best use of bad data in the context of editions and manuscript practices. As a case study, we apply this methodology to the Early Modern and Late Modern English sections of the Corpus of Early English Correspondence.

Open Access

The interface between cultures and corpora: Tracing reflections and manifestations

Published Online: 11 Apr 2018
Page range: 97 - 132

Abstract

Open Access

Evaluating stance-annotated sentences from the Brexit Blog Corpus: A quantitative linguistic analysis

Published Online: 11 Apr 2018
Page range: 133 - 166

Abstract

Abstract

This paper offers a formally driven quantitative analysis of stance-annotated sentences in the Brexit Blog Corpus (BBC). Our goal is to identify features that determine the formal profiles of six stance categories (contrariety, hypotheticality, necessity, prediction, source of knowledge and uncertainty) in a subset of the BBC. The study has two parts: firstly, it examines a large number of formal linguistic features, such as punctuation, words and grammatical categories that occur in the sentences in order to describe the specific characteristics of each category, and secondly, it compares characteristics in the entire data set in order to determine stance similarities in the data set. We show that among the six stance categories in the corpus, contrariety and necessity are the most discriminative ones, with the former using longer sentences, more conjunctions, more repetitions and shorter forms than the sentences expressing other stances. necessity has longer lexical forms but shorter sentences, which are syntactically more complex. We show that stance in our data set is expressed in sentences with around 21 words per sentence. The sentences consist mainly of alphabetical characters forming a varied vocabulary without special forms, such as digits or special characters.

Open Access

From broadcast archive to language corpus: Designing and investigating a sociohistorical corpus from Desert Island Discs

Published Online: 11 Apr 2018
Page range: 167 - 190

Abstract

Abstract

The aims of this paper are twofold: i) to present the motivation and design of a sociohistorical corpus derived from the popular BBC Radio show, Desert Island Discs (DID); and ii) to illustrate the potential of the DID corpus (DIDC) with a case study. In an era of ever-increasing digital resources and scholarly interest in recent language change, there remains an enormous disparity between available written and spoken corpora. We describe how a corpus derived from DID contributes to redressing the balance. Treating DID as an example of a specialized register, namely, a ‘biographical chat show’, we review its attendant situational characteristics, and explain the affordances and design features of a sociolinguistic corpus sampling of the show. Finally, to illustrate the potential of DIDC for linguistic exploration of recent change, we conduct a case study on two pronouns with generic, impersonal reference, namely you and one.

Open Access

Bridging the gap between university and upper secondary school English studies: The ULE project

Published Online: 11 Apr 2018
Page range: 191 - 212

Abstract

Open Access

Kate Beeching. Pragmatic markers in British English. Meaning in social interaction. Cambridge: Cambridge University Press. 2016. 255 pp. ISBN: 9781139507110.

Published Online: 11 Apr 2018
Page range: 213 - 218

Abstract

Open Access

Alexandra D’Arcy. Discourse-pragmatic variation in context – eight hundred years of LIKE. Amsterdam & Philadelphia: John Benjamins. 2017. 235 pp. DOI 10.1075/slcs.187. ISBN 978 90 272 5952 3 (HB), ISBN 978 90 272 6531 9 (EBOOK).

Published Online: 11 Apr 2018
Page range: 219 - 224

Abstract

Open Access

Marianne Hundt, Sandra Mollin and Simone E. Pfenninger (eds.). The changing English language: Psycholinguistic perspectives (Studies in English Language). Cambridge: Cambridge University Press, 2017. 410 pp. ISBN 978-1-107-08686-9.

Published Online: 11 Apr 2018
Page range: 225 - 233

Abstract

Open Access

Merja Kytö and Päivi Pahta (eds.). The Cambridge handbook of English historical linguistics. Cambridge: Cambridge University Press. 2016. ISBN: 978-1-107-03935-3. xxv + 624 pp.

Published Online: 11 Apr 2018
Page range: 234 - 239

Abstract

Open Access

María José López-Couso, Belén Méndez-Naya, Paloma Núñez-Pertejo and Ignacio Palacios-Martínez (eds.). Corpus linguistics on the move. Exploring and understanding English through corpora. Leiden: Brill/Rodopi. 2016. 368 pp. ISBN13 9-789-004-308077.

Published Online: 11 Apr 2018
Page range: 240 - 245

Abstract

14 Articles
Open Access

A tribute to Matti Rissanen

Published Online: 11 Apr 2018
Page range: 5 - 8

Abstract

Open Access

Three problems connected with the use of diachronic corpora

Published Online: 11 Apr 2018
Page range: 9 - 12

Abstract

Abstract

In compiling and testing the diachronic part of the Helsinki Corpus of English Texts, our project group has come across three problems which arise from the use of computer corpora in studies of syntax and vocabulary. While these problems are mainly associated with work on diachronic corpora, they may be universal enough to deserve somewhat more general consideration. They could be called “The philologist’s dilemma”, “God’s truth fallacy”, and “The mystery of vanishing reliability”. The first could be described as pedagogical, the second methodological and the third pragmatic.

Open Access

Is there a correlation between form and function? A syntactic and functional investigation of the introductory it pattern in student writing

Published Online: 11 Apr 2018
Page range: 13 - 40

Abstract

Abstract

The introductory it pattern, as in ‘It is important to note that information was added’, is a tool used by academic writers for a range of different rhetorical and information-structural purposes. It is thus an important pattern for students to learn. Since previous research on student writing has indicated that there seems to be a correlation between form and function of the pattern, the present study sets out to investigate this more systematically in non-native-speaker and nativespeaker student writing in two disciplines (linguistics and literature). In doing so, the study adds to and extends previous research looking into factors such as NS status and discipline. It uses data from three corpora: ALEC, BAWE and MICUSP. The results show that there is indeed a correlation between form and function, as the most common syntactic types of the pattern each display a preferred function and vice versa. While very few differences across NS status were found, there were certain discipline-specific disparities. The findings, which could be useful for teaching students about the use of the introductory it pattern, also have implications for the automatized functional tagging of parsed corpora.

Open Access

Revisiting the ENL-ESL-EFL continuum: A multifactorial approach to grammatical aspect in spoken Englishes

Published Online: 11 Apr 2018
Page range: 41 - 78

Abstract

Abstract

This study focuses on the progressive vs. non-progressive alternation to revisit the debate on the ENL-ESL-EFL continuum (i.e. whether native (ENL) and nonnative (ESL/EFL) Englishes are dichotomous types of English or form a gradient continuum). While progressive marking is traditionally studied independently of its unmarked counterpart, we examine (i) how the grammatical contexts of both constructions systematically affect speakers’ constructional choices in ENL (American, British), ESL (Indian, Nigerian and Singaporean) and EFL (Finnish, French and Polish learner Englishes) and (ii) what light speakers’ varying constructional choices bring to the continuum debate. Methodologically, we use a clustering technique to group together individual varieties of English (i.e. to identify similarities and differences between those varieties) based on linguistic contextual features such as AKTIONSART, ANIMACY, SEMANTIC DOMAIN (of aspect-bearing lexical verb), TENSE, MODALITY and VOICE to assess the validity of the ENL-ESL-EFL classification for our data. Then, we conduct a logistic regression analysis (based on lemmas observed in both progressive and non-progressive constructions) to explore how grammatical contexts influence speakers’ constructional choices differently across English types. While, overall, our cluster analysis supports the ENL-ESL-EFL classification as a useful theoretical framework to explore cross-variety variation, the regression shows that, when we start digging into the specific linguistic contexts of (non-)progressive constructions, this classification does not systematically transpire in the data in a uniform manner. Ultimately, by including more than one statistical technique into their exploration of the continuum, scholars could avoid potential methodological biases.

Open Access

Charting orthographical reliability in a corpus of English historical letters

Published Online: 11 Apr 2018
Page range: 79 - 96

Abstract

Abstract

Research into orthography in the history of English is not a simple venture. The history of English spelling is primarily based on printed texts, which fail to capture the range of variation inherent in the language; many manuscript phenomena are simply not found in printed texts. Manuscript-based corpora would be the ideal research data, but as this is resource-intensive, linguists use editions that have been produced by non-linguists. Many editions claim to retain original spellings, but in practice text is always normalized at the graph level and possibly more so. This does not preclude using such a corpus for orthographical research, but there has been no systematic way to determine the philological reliability of an edited text. In this paper we present a typological methodology we are developing for the evaluation of orthographical quality of edition-based corpora, with the aim of making the best use of bad data in the context of editions and manuscript practices. As a case study, we apply this methodology to the Early Modern and Late Modern English sections of the Corpus of Early English Correspondence.

Open Access

The interface between cultures and corpora: Tracing reflections and manifestations

Published Online: 11 Apr 2018
Page range: 97 - 132

Abstract

Open Access

Evaluating stance-annotated sentences from the Brexit Blog Corpus: A quantitative linguistic analysis

Published Online: 11 Apr 2018
Page range: 133 - 166

Abstract

Abstract

This paper offers a formally driven quantitative analysis of stance-annotated sentences in the Brexit Blog Corpus (BBC). Our goal is to identify features that determine the formal profiles of six stance categories (contrariety, hypotheticality, necessity, prediction, source of knowledge and uncertainty) in a subset of the BBC. The study has two parts: firstly, it examines a large number of formal linguistic features, such as punctuation, words and grammatical categories that occur in the sentences in order to describe the specific characteristics of each category, and secondly, it compares characteristics in the entire data set in order to determine stance similarities in the data set. We show that among the six stance categories in the corpus, contrariety and necessity are the most discriminative ones, with the former using longer sentences, more conjunctions, more repetitions and shorter forms than the sentences expressing other stances. necessity has longer lexical forms but shorter sentences, which are syntactically more complex. We show that stance in our data set is expressed in sentences with around 21 words per sentence. The sentences consist mainly of alphabetical characters forming a varied vocabulary without special forms, such as digits or special characters.

Open Access

From broadcast archive to language corpus: Designing and investigating a sociohistorical corpus from Desert Island Discs

Published Online: 11 Apr 2018
Page range: 167 - 190

Abstract

Abstract

The aims of this paper are twofold: i) to present the motivation and design of a sociohistorical corpus derived from the popular BBC Radio show, Desert Island Discs (DID); and ii) to illustrate the potential of the DID corpus (DIDC) with a case study. In an era of ever-increasing digital resources and scholarly interest in recent language change, there remains an enormous disparity between available written and spoken corpora. We describe how a corpus derived from DID contributes to redressing the balance. Treating DID as an example of a specialized register, namely, a ‘biographical chat show’, we review its attendant situational characteristics, and explain the affordances and design features of a sociolinguistic corpus sampling of the show. Finally, to illustrate the potential of DIDC for linguistic exploration of recent change, we conduct a case study on two pronouns with generic, impersonal reference, namely you and one.

Open Access

Bridging the gap between university and upper secondary school English studies: The ULE project

Published Online: 11 Apr 2018
Page range: 191 - 212

Abstract

Open Access

Kate Beeching. Pragmatic markers in British English. Meaning in social interaction. Cambridge: Cambridge University Press. 2016. 255 pp. ISBN: 9781139507110.

Published Online: 11 Apr 2018
Page range: 213 - 218

Abstract

Open Access

Alexandra D’Arcy. Discourse-pragmatic variation in context – eight hundred years of LIKE. Amsterdam & Philadelphia: John Benjamins. 2017. 235 pp. DOI 10.1075/slcs.187. ISBN 978 90 272 5952 3 (HB), ISBN 978 90 272 6531 9 (EBOOK).

Published Online: 11 Apr 2018
Page range: 219 - 224

Abstract

Open Access

Marianne Hundt, Sandra Mollin and Simone E. Pfenninger (eds.). The changing English language: Psycholinguistic perspectives (Studies in English Language). Cambridge: Cambridge University Press, 2017. 410 pp. ISBN 978-1-107-08686-9.

Published Online: 11 Apr 2018
Page range: 225 - 233

Abstract

Open Access

Merja Kytö and Päivi Pahta (eds.). The Cambridge handbook of English historical linguistics. Cambridge: Cambridge University Press. 2016. ISBN: 978-1-107-03935-3. xxv + 624 pp.

Published Online: 11 Apr 2018
Page range: 234 - 239

Abstract

Open Access

María José López-Couso, Belén Méndez-Naya, Paloma Núñez-Pertejo and Ignacio Palacios-Martínez (eds.). Corpus linguistics on the move. Exploring and understanding English through corpora. Leiden: Brill/Rodopi. 2016. 368 pp. ISBN13 9-789-004-308077.

Published Online: 11 Apr 2018
Page range: 240 - 245

Abstract