Error Analysis and Instructional Strategy Adjustment in a Corpus of English Language Learners
Published online: 19 Mar 2025
Received: 27 Oct 2024
Accepted: 21 Feb 2025
DOI: https://doi.org/10.2478/amns-2025-0513
© 2025 Fang Wei, published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Corpus linguistics is the use of computers to analyze and compile large amounts of naturally occurring language [1]. Over the past decades, corpus linguistics has underpinned empirical studies of language change and use, from which more generalizable and valid linguistic conclusions have been drawn. These studies fall into two main types: corpus-based and corpus-driven. Corpus-based research uses information from a corpus to validate, exemplify, and illustrate known theories and to establish links between data and those theories [2–4]. Corpus-driven research, on the other hand, takes the corpus itself as the starting point, inducing recurrent patterns and frequency distributions from it and deriving the relevant linguistic theories [5–7]. These two approaches are currently the main directions of corpus research.
In the process of language learning, word collocation plays an indispensable role. However, differences between the Chinese and English languages and ways of thinking, together with inappropriate vocabulary teaching methods, have left English learners with weak collocation ability [8–9]. Therefore, cultivating students' awareness of word collocation in teaching, and summarizing the collocational behavior of words with an authentic and reliable corpus, can help students improve their collocation ability [10–12]. A corpus allows language teachers and learners to search and query continuous texts of tens of millions of words, and these texts are real, living linguistic evidence [13–15]. In each line of a corpus concordance, the keyword appears in the center, flanked on the left and right by the words that make up its context; observing and analyzing these words reveals the collocational behavior of the keyword [16–17]. Using the retrieved concordance evidence, the collocational behavior of keywords can be analyzed for error checking, on the basis of which students are able to acquire more comprehensible language [18–19].
In this paper, the classical backpropagation (BP) algorithm is used to train a recurrent neural network, and to address the long-distance dependency problem of recurrent neural networks, an RNN variant with a gating structure, the long short-term memory (LSTM) network, is adopted. The Seq2Seq framework is built from its two components, an encoder and a decoder, and applied to modeling the text generation task. Learner corpora at home and abroad are listed and categorized, and an English grammar error correction model based on Seq2Seq is established according to the definition and evaluation criteria of English grammar error correction. Globally computed Soft Attention is combined with Seq2Seq to handle the error correction problem; batch normalization (BN) is introduced into the model to normalize the input activation parameters of every neuron in each layer of the network; text is transformed into vectors that the computer can process; and the comprehensibility of corpus semantics is improved through the feedback filtering mechanism of an n-gram language model. The error correction performance on the corpus is then analyzed, and based on the analysis results, countermeasures for adjusting English grammar teaching are proposed.
A Recurrent Neural Network (RNN) is a class of neural networks that takes sequence data as input and recurses in the direction of the sequence's evolution, with all nodes connected in a chain [20]. Owing to this recursive nature, an RNN can represent a sequence of arbitrary length as a fixed-length vector while attending to the structured properties of the input.
An RNN mainly involves the following formulas:

$$h_t = f(W_{xh} x_t + W_{hh} h_{t-1} + b_h) \tag{1}$$

$$y_t = g(W_{hy} h_t + b_y) \tag{2}$$

Equation (1) contains the parameter matrices $W_{xh}$ and $W_{hh}$, which combine the current input $x_t$ with the hidden state $h_{t-1}$ of the previous time step to produce the new hidden state $h_t$; Equation (2) maps the hidden state to the output $y_t$.

In the training process of recurrent neural networks, the classical BP backpropagation algorithm is usually used, unrolled through time: starting from the last time step $T$, the error is propagated backward and the parameters at every earlier time step are updated.
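As a concrete illustration (a minimal sketch, not the implementation used in this paper), the forward pass of Eqs. (1)–(2) can be written in a few lines of NumPy; the dimensions and random weights are assumptions for demonstration only:

```python
import numpy as np

def rnn_step(x_t, h_prev, W_xh, W_hh, W_hy, b_h, b_y):
    """One RNN time step following Eqs. (1)-(2): tanh hidden update, linear output."""
    h_t = np.tanh(W_xh @ x_t + W_hh @ h_prev + b_h)  # Eq. (1)
    y_t = W_hy @ h_t + b_y                           # Eq. (2)
    return h_t, y_t

# Toy dimensions (hypothetical): 4-dim input, 3-dim hidden state, 2-dim output.
rng = np.random.default_rng(0)
W_xh, W_hh = rng.normal(size=(3, 4)), rng.normal(size=(3, 3))
W_hy = rng.normal(size=(2, 3))
b_h, b_y = np.zeros(3), np.zeros(2)

h = np.zeros(3)                    # initial hidden state
for x in rng.normal(size=(5, 4)):  # a length-5 input sequence
    h, y = rnn_step(x, h, W_xh, W_hh, W_hy, b_h, b_y)
```

Because the same weight matrices are reused at every time step, the sequence can be of arbitrary length while the hidden state $h$ stays a fixed-length vector.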
The biggest difference from the RNN hidden layer unit is that there is an extra memory cell $C_t$, whose content is regulated by three gates: the forget gate, the input gate, and the output gate.

In the formula (3) of the forget gate, the output $f_t$ determines how much of the previous cell state $C_{t-1}$ is retained:

$$f_t = \sigma(W_f \cdot [h_{t-1}, x_t] + b_f) \tag{3}$$

The input gates are computed as follows; the tanh function in Eq. (4) produces a new candidate vector $\tilde{C}_t$, while the sigmoid gate $i_t$ decides how much of it is written into the cell state (Eq. (5)):

$$i_t = \sigma(W_i \cdot [h_{t-1}, x_t] + b_i), \quad \tilde{C}_t = \tanh(W_C \cdot [h_{t-1}, x_t] + b_C) \tag{4}$$

$$C_t = f_t \odot C_{t-1} + i_t \odot \tilde{C}_t \tag{5}$$

The final output gate $o_t$ determines how much of the cell state is exposed as the hidden state $h_t$:

$$o_t = \sigma(W_o \cdot [h_{t-1}, x_t] + b_o), \quad h_t = o_t \odot \tanh(C_t) \tag{6}$$
LSTM solves the RNN long-range dependency problem well with its three gating structures and achieves better results. However, model complexity also increases significantly: the model has more parameters, needs more training data to fit, and its training time grows considerably compared with the plain RNN.
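The gating computations of Eqs. (3)–(6) can be made concrete with a short sketch; the stacked weight layout and toy dimensions here are assumptions, not the paper's implementation:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_cell(x_t, h_prev, c_prev, W, b):
    """One LSTM step per Eqs. (3)-(6). W stacks the four gate matrices
    applied to the concatenation [h_prev, x_t]; b stacks the biases."""
    z = W @ np.concatenate([h_prev, x_t]) + b
    H = h_prev.size
    f = sigmoid(z[0:H])            # forget gate,  Eq. (3)
    i = sigmoid(z[H:2*H])          # input gate,   Eq. (4)
    c_tilde = np.tanh(z[2*H:3*H])  # candidate,    Eq. (4)
    o = sigmoid(z[3*H:4*H])        # output gate,  Eq. (6)
    c = f * c_prev + i * c_tilde   # cell update,  Eq. (5)
    h = o * np.tanh(c)             # hidden state, Eq. (6)
    return h, c

H, D = 3, 4                         # hypothetical hidden and input sizes
rng = np.random.default_rng(0)
W, b = rng.normal(size=(4 * H, H + D)), np.zeros(4 * H)
h, c = np.zeros(H), np.zeros(H)
h, c = lstm_cell(rng.normal(size=D), h, c, W, b)
```

Compared with the single-matrix RNN step above, the four gates roughly quadruple the parameter count, which is exactly why the LSTM needs more training data and time.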
The GRU unit contains the following main calculation steps:

$$z_t = \sigma(W_z \cdot [h_{t-1}, x_t]) \tag{7}$$

$$r_t = \sigma(W_r \cdot [h_{t-1}, x_t]) \tag{8}$$

$$\tilde{h}_t = \tanh(W \cdot [r_t \odot h_{t-1}, x_t]) \tag{9}$$

$$h_t = (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t \tag{10}$$

Equations (7) and (8) first compute the update gate $z_t$ and the reset gate $r_t$ from the previous hidden state $h_{t-1}$ and the current input $x_t$. Equation (9) indicates that the hidden state $h_{t-1}$, scaled by the reset gate, is combined with $x_t$ through a tanh to produce the candidate state $\tilde{h}_t$, and Equation (10) interpolates between $h_{t-1}$ and $\tilde{h}_t$ under the control of the update gate to give the new hidden state $h_t$.
The Sequence to Sequence (Seq2Seq) framework is commonly used for modeling text generation tasks and has achieved great success in areas such as machine translation, speech recognition, text summarization, and question answering systems [21]. The Seq2Seq framework usually consists of two components, an encoder and a decoder, whose input and output are both sequences. The encoder converts the input sequence into a fixed-length intermediate vector, and the decoder then converts the intermediate vector into the result sequence. A brief structure diagram of the Seq2Seq framework is shown below in Figure 1. The most important feature of this framework is that the output sequence is of variable length: the input sequence $\{x_1, x_2, \ldots, x_n\}$ and the output sequence $\{y_1, y_2, \ldots, y_m\}$ need not have the same length.

Figure 1. Seq2Seq framework structure
The grammatical error correction task can also be regarded as a generation task: the input on the encoder side is a sentence containing grammatical errors, and the output on the decoder side is the corrected sentence predicted by the model. Because words may be inserted or deleted during correction, the original sentence and the target sentence may differ in length. For example, in the sentence "I always study English in morning.", the article "the" is missing from "in morning"; with the Seq2Seq framework, when decoding reaches the word "in", the model can generate "the" at the next decoding position. By predicting sentences of indefinite length in this way, the grammatical errors contained in the original sentence can be corrected in the course of generating the new sentence.
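To make this encoder-decoder pipeline concrete, here is a minimal greedy-decoding sketch in PyTorch; the vocabulary size, the special-token ids, and the absence of a training loop are all illustrative assumptions, not the paper's configuration:

```python
import torch
import torch.nn as nn

# Illustrative sizes and special-token ids (assumptions).
VOCAB, EMB, HID, BOS, EOS = 1000, 64, 128, 1, 2

class Seq2SeqCorrector(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, EMB)
        self.encoder = nn.GRU(EMB, HID, batch_first=True)
        self.decoder = nn.GRU(EMB, HID, batch_first=True)
        self.out = nn.Linear(HID, VOCAB)

    def forward(self, src_ids, max_len=20):
        # The encoder compresses the erroneous sentence into a context vector.
        _, hidden = self.encoder(self.embed(src_ids))
        # Greedy decoding: the output length need not equal the input length,
        # so a missing word such as "the" can be inserted during generation.
        token = torch.full((src_ids.size(0), 1), BOS)
        result = []
        for _ in range(max_len):
            step, hidden = self.decoder(self.embed(token), hidden)
            token = self.out(step).argmax(dim=-1)
            result.append(token)
            if (token == EOS).all():  # stop once every sentence is finished
                break
        return torch.cat(result, dim=1)

corrected_ids = Seq2SeqCorrector()(torch.randint(3, VOCAB, (1, 7)))  # untrained demo
```

The decode loop makes the variable-length property explicit: generation continues until the model emits an end-of-sentence token, independently of the source length.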
NUCLE
NUCLE is a learner corpus built by the Natural Language Processing group at the National University of Singapore (NUS). The corpus contains 1,397 English essays written by NUS undergraduates on topics in science and technology, healthcare, and economics, totaling more than one million words. All compositions are manually annotated for grammatical errors by professional English teachers.
FCE
FCE is a public subset of the Cambridge Learner Corpus, consisting of 1,244 English examination responses written by ESL learners in the form of short essays, letters, and descriptions. All texts are manually annotated for grammatical errors, and all errors are categorized by type. The FCE corpus is a publicly available training corpus commonly used for grammatical error correction; the corpus itself is divided into training, development, and test sets, although in practice it is usually used only in the training phase.
Lang-8
The Lang-8 English learner corpus is a subset of the multilingual Lang-8 corpus, extracted and preprocessed in 2012. It contains 100,000 English texts written by ESL learners on social learning sites; the corpus is relatively noisy because the texts are annotated for grammatical errors by native English speakers on forums rather than by professional corpus annotators. Nevertheless, Lang-8 is still by far the largest publicly available learner corpus for the grammatical error correction task, containing 1.04M parallel sentence pairs and about 11.86M tokens, and its quality is higher than that of most synthetic corpora.
JFLEG
Unlike NUCLE and the other corpora above, the annotation of the JFLEG corpus includes not only corrections of grammatical errors but also sentence rewriting at the fluency level, requiring a grammar error correction system not only to fix the grammatical errors but also to make the corrected sentences read more fluently.
Few publicly available manually annotated corpora for the grammar error correction task exist in China; this subsection focuses on CLEC, the corpus used in this paper.
The CLEC corpus collects English compositions from students at different proficiency levels, including secondary school students, university students at College English Test bands 4 and 6, and English majors at bands 4 and 6, corresponding to the five categories ST2 to ST6, and containing more than one million words in total. The purpose of the corpus is to observe and analyze Chinese students' language errors in English through quantitative and qualitative methods, so as to provide useful information for the development of English education in China. For the grammar correction task, the grammatical errors in CLEC reflect the English usage habits of Chinese students, so training and testing a grammar correction model on CLEC allows it to be better applied to correction scenarios for Chinese English learners.
CoNLL2013 and CoNLL2014 were two shared tasks held specifically for English grammatical error correction. They gave a clearer definition of the problem, attracted many teams working in the field, stimulated academic interest, and advanced research on grammar error correction. The task in CoNLL2013 was to correct five common types of English grammatical error: preposition errors, noun number errors, determiner errors, subject-verb agreement errors, and verb form errors. CoNLL2014, by contrast, targeted the 28 most common types of English grammatical error, which is more difficult than CoNLL2013 but also closer to the practical problem of English grammar correction. This paper studies the CoNLL2014 problem and uses natural language processing techniques and methods to improve the effect of grammatical error correction.
The open-source MaxMatch scorer implements the MaxMatch (M²) algorithm and is simple and easy to use; the tool can be applied directly to evaluate a model's error correction performance. The principle of the MaxMatch algorithm is introduced below.
The correction rate (precision) $P$ and recall rate $R$ are defined as:

$$P = \frac{\sum_{i=1}^{n} |g_i \cap e_i|}{\sum_{i=1}^{n} |e_i|}, \qquad R = \frac{\sum_{i=1}^{n} |g_i \cap e_i|}{\sum_{i=1}^{n} |g_i|}$$

where $e_i$ is the set of edits proposed by the model for the $i$-th sentence and $g_i$ is the set of gold-standard edits. The numerator of both equations above is the number of model corrections that match the reference answer; the denominator of the correction rate $P$ is the total number of corrections made by the model, while the denominator of the recall rate $R$ is the total number of corrections in the reference answer. The two are combined into the $F_{0.5}$ score:

$$F_{0.5} = \frac{(1 + 0.5^2) \cdot P \cdot R}{0.5^2 \cdot P + R}$$
From the above equations, it can be seen that $F_{0.5}$ amplifies the contribution of the correction rate $P$ and reduces the contribution of the recall rate $R$. The overall effect is that whenever a correction is made it had better be correct; otherwise it is preferable not to correct at all, penalizing the phenomenon of "correcting" an already correct sentence.
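A small illustration of these metrics follows (the edit sets are hypothetical; the real MaxMatch scorer also extracts the edits by aligning source and hypothesis, which is omitted here):

```python
def f_beta(system_edits, gold_edits, beta=0.5):
    """Precision, recall and F_beta over per-sentence sets of edits.
    Each edit is a ((start, end), replacement) pair for illustration."""
    matched = sum(len(s & g) for s, g in zip(system_edits, gold_edits))
    p = matched / max(1, sum(len(s) for s in system_edits))
    r = matched / max(1, sum(len(g) for g in gold_edits))
    if p + r == 0:
        return p, r, 0.0
    f = (1 + beta**2) * p * r / (beta**2 * p + r)
    return p, r, f

# Hypothetical edits for two sentences: the first matches the gold answer,
# the second is a spurious correction that the F0.5 weighting penalizes.
sys_e = [{((3, 4), "the")}, {((0, 1), "He")}]
gold_e = [{((3, 4), "the")}, {((5, 6), "goes")}]
print(f_beta(sys_e, gold_e))  # precision 0.5, recall 0.5, F0.5 = 0.5
```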
Given that English grammar error correction bears certain similarities to machine translation, and that the Seq2Seq model has produced many breakthroughs in NLP, this paper chooses Seq2Seq as the basic model for grammar error correction, and improves the training efficiency and error correction ability of the basic Seq2Seq model so as to improve the overall effect of English grammar error correction. The main improvements are described in detail below.
In the Seq2Seq model, the encoder compresses the input sequence into a fixed-length context vector, and a large amount of detailed information is lost during this compression; the loss becomes more obvious once the sentence length exceeds a certain limit. Scholars have therefore carried out a series of studies to address this problem, eventually drawing inspiration from research on human vision, which gave rise to the attention mechanism.
Figure 2 shows the attention model. According to the range of the input sequence over which the weights are computed, attention mechanisms can be divided into two kinds, local and global. In this paper, we use globally computed Soft Attention combined with Seq2Seq to handle the grammatical error correction problem, so Soft Attention is introduced below [22].

Figure 2. Soft Attention model
The value of weight $\alpha_{ij}$ reflects how much the $j$-th encoder hidden state $h_j$ contributes to the $i$-th output. It is obtained by normalizing alignment scores $e_{ij} = a(s_{i-1}, h_j)$, where $s_{i-1}$ is the previous decoder state, with a softmax; the context vector for decoding step $i$ is then the weighted sum of the encoder states:

$$\alpha_{ij} = \frac{\exp(e_{ij})}{\sum_{k} \exp(e_{ik})}, \qquad c_i = \sum_{j} \alpha_{ij} h_j$$
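A sketch of global Soft Attention for one decoding step follows; the dot-product alignment score is an assumption (additive Bahdanau-style scores are equally common):

```python
import numpy as np

def soft_attention(decoder_state, encoder_states):
    """Global Soft Attention: score every encoder state, softmax-normalize,
    and return the weighted-sum context vector."""
    scores = encoder_states @ decoder_state  # e_ij for every position j
    weights = np.exp(scores - scores.max())  # numerically stable softmax
    weights /= weights.sum()                 # alpha_ij
    context = weights @ encoder_states       # c_i = sum_j alpha_ij * h_j
    return context, weights

H = np.random.default_rng(1).normal(size=(6, 8))  # 6 encoder states, dim 8
context, alpha = soft_attention(H[-1], H)
```

Because the context vector is recomputed at every decoding step, long sentences no longer have to squeeze through a single fixed-length vector.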
Deep neural networks are powerful, but they place high demands on machine computational power, so improving training efficiency plays a key role in model optimization and improvement [23]. Theory shows that normalizing the input activation parameters of neurons can reduce model training time. Batch Normalization (BN), introduced for CNNs, normalizes the input activation parameters of any neuron in each layer of the network so that after the transformation they follow a standard normal distribution; the activations then fall in the region where the nonlinear function has a large slope, gradients are larger, and the model converges faster.
However, the BN operation depends on the first- and second-order statistics of the mini-batch, which are tied to the mini-batch size, and the RNN input length is variable, so BN cannot be used directly in an RNN. To improve RNN training efficiency, researchers proposed Layer Normalization (LN), which significantly reduces model training time; various RNN networks with layer normalization outperform the original networks.
LN normalizes within each training sample, independently of and in parallel with other samples, so it is not as demanding on the data distribution as BN, and the training difficulty decreases drastically. LN is a kind of transverse normalization: it maps the summed inputs of all neurons of an entire layer in an RNN to the same distribution, and all the hidden units share the same normalization terms $\mu$ and $\sigma$:

$$\mu = \frac{1}{H}\sum_{i=1}^{H} a_i, \qquad \sigma = \sqrt{\frac{1}{H}\sum_{i=1}^{H}(a_i - \mu)^2}$$

where $H$ is the number of hidden units in the layer and $a_i$ is the summed input of the $i$-th unit.
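A minimal sketch of the LN transform for one layer's summed inputs (the gain and bias are the usual learnable terms, fixed to constants here for simplicity):

```python
import numpy as np

def layer_norm(a, gain=1.0, bias=0.0, eps=1e-5):
    """Layer Normalization over one layer's summed inputs `a`:
    all hidden units share the same mean and variance."""
    mu = a.mean()
    sigma = np.sqrt(((a - mu) ** 2).mean() + eps)
    return gain * (a - mu) / sigma + bias

a = np.array([2.0, -1.0, 0.5, 3.0])  # toy pre-activations of one layer
print(layer_norm(a))                  # roughly zero mean, unit variance
```

Unlike BN, nothing here depends on the mini-batch or the sequence length, which is what makes the operation applicable inside an RNN.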
Since computers cannot understand natural language, text must be vectorized into numbers before it can be processed. The simplest encoding is the one-hot representation: a word is represented by a vector whose length equals the vocabulary size, with a 1 only at the position of that word and 0 elsewhere; for example, "book" may be encoded as [0, 0, 1, 0, 0, 0, …]. The method is simple, but its disadvantages are also obvious: the dimensionality is too large and too sparse, and the relationships between words are hard to reflect.
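A toy one-hot encoder makes the sparsity problem visible; the five-word vocabulary is, of course, an assumption for illustration:

```python
import numpy as np

def one_hot(word, vocab):
    """One-hot encoding over a toy vocabulary (illustrative only)."""
    vec = np.zeros(len(vocab))
    vec[vocab.index(word)] = 1.0
    return vec

vocab = ["a", "an", "book", "pen", "the"]
print(one_hot("book", vocab))  # [0. 0. 1. 0. 0.] -- one dimension per word
```

With a realistic vocabulary of tens of thousands of words, each vector has that many dimensions with a single nonzero entry, and the vectors for "book" and "pen" are orthogonal even though the words are related.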
For the modification suggestions submitted by users, the varying English proficiency of users must be taken into account, so these suggested utterances need to be screened to identify which revisions are correct and which are not. This screening feeds the system's re-learning and re-training process and is a key module of the system. The filtering process in effect compares which is more credible, the corrected sentence given by the system or the revision suggested by the user: the sentences are scored, and those with higher scores can be considered more likely to be free of grammatical errors, so a user suggestion whose score is lower than that of the system's corrected sentence is not adopted. Therefore, in order to filter invalid suggested text effectively, an n-gram language model is used to score the sentences (a code sketch of this filtering follows the perplexity discussion below). The n-gram model is described next.
Using a string $S$ to represent a sentence, the n-gram model models the string $S$, and $P(S)$ denotes the probability of occurrence of the sentence. Words are the basic units of the n-gram, and a sentence consisting of $m$ words has probability:

$$P(S) = P(w_1, w_2, \ldots, w_m) = P(w_1)\,P(w_2 \mid w_1) \cdots P(w_m \mid w_1, \ldots, w_{m-1})$$
According to the above equation, the conditional probabilities condition on ever longer histories, so the number of parameters grows explosively with sentence length and the probabilities become impossible to estimate reliably.
For this reason, the Markov assumption is introduced: the current word is only related to the previous few words, which simplifies the calculation to:

$$P(w_i \mid w_1, \ldots, w_{i-1}) \approx P(w_i \mid w_{i-n+1}, \ldots, w_{i-1})$$
For the n-gram language model, n is commonly taken as 1, 2, or 3. A start flag and an end flag are usually added at the beginning and end of the sentence to make the computation uniform.
When n = 2, the bigram is:

$$P(S) \approx \prod_{i=1}^{m} P(w_i \mid w_{i-1})$$

When n = 3, the trigram is:

$$P(S) \approx \prod_{i=1}^{m} P(w_i \mid w_{i-2}, w_{i-1})$$
According to the maximum likelihood estimation method, it suffices to maximize the probability of the training corpus. These are the basic elements of the n-gram grammar. In the training corpus, the value of $P(w_i \mid w_{i-n+1}, \ldots, w_{i-1})$ is estimated by relative frequency counting; for a bigram, for example:

$$P(w_i \mid w_{i-1}) = \frac{\mathrm{count}(w_{i-1} w_i)}{\mathrm{count}(w_{i-1})}$$
After training the model, the probability and perplexity of a sentence can be calculated; these two concepts are briefly explained below.
Probability, as the name suggests, is how likely the sentence is to occur; according to the basics of the n-gram model introduced earlier:

$$P(S) \approx \prod_{i=1}^{m} P(w_i \mid w_{i-n+1}, \ldots, w_{i-1})$$
Perplexity rests on the idea that a better language model assigns larger probability to normal sentences. After training, the model is used to score the test set; since the test set consists entirely of normal sentences, a larger score indicates a better model. The relevant formula is:

$$PP(S) = P(w_1 w_2 \cdots w_m)^{-\frac{1}{m}}$$

It can be rewritten according to the chain rule:

$$PP(S) = \sqrt[m]{\prod_{i=1}^{m} \frac{1}{P(w_i \mid w_1, \ldots, w_{i-1})}}$$
From the formula, it can be seen that the smaller the perplexity, the larger the sentence probability. Comparing the two, the calculation of probability is strongly influenced by sentence length and word count, while perplexity largely removes this influence, so perplexity is often used instead of probability to judge sentences and models. This paper therefore also uses perplexity, although probability is sometimes referred to for convenience and semantic readability.
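The scoring-and-filtering mechanism described above can be sketched with a small bigram model; the add-one smoothing and the toy training sentences are assumptions, not the system's actual configuration:

```python
import math
from collections import Counter

class BigramLM:
    """Bigram language model with add-one smoothing, trained on raw counts."""
    def __init__(self, sentences):
        self.uni, self.bi = Counter(), Counter()
        for s in sentences:
            toks = ["<s>"] + s.split() + ["</s>"]   # start/end flags
            self.uni.update(toks)
            self.bi.update(zip(toks, toks[1:]))

    def perplexity(self, sentence):
        toks = ["<s>"] + sentence.split() + ["</s>"]
        V = len(self.uni)
        logp = sum(
            math.log((self.bi[(a, b)] + 1) / (self.uni[a] + V))
            for a, b in zip(toks, toks[1:])
        )
        return math.exp(-logp / (len(toks) - 1))

lm = BigramLM(["i study english in the morning", "i study in the morning"])
system_fix = "i study english in the morning"
user_fix = "i study english in morning"
# Feedback filtering: keep whichever correction the model finds less perplexing.
best = min([system_fix, user_fix], key=lm.perplexity)
```

Here the user's suggestion would be rejected because the language model assigns it a higher perplexity (lower probability) than the system's correction.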
In order to demonstrate the effectiveness of the attention mechanism proposed in this paper, this subsection compares the error correction performance of the model without the attention mechanism against the present model with it, as shown in Figure 3. The values in the figure are the P, R, and F0.5 results obtained on the CoNLL-2014 test set. The model with the corresponding attention mechanism is clearly better than the model without attention on all five types. The attention mechanism helps the noun singular-plural model the most, scoring 25 points higher on the F0.5 metric than the model without attention.

Figure 3. Effect of the attention mechanism
This subsection compares the error correction results of three models (this paper's model, the CUUI model, and the deep contextual model) on five error types: articles, prepositions, verb morphology, noun singular-plural, and subject-verb agreement, as shown in Figure 4. This paper's model achieves the highest results on all types except prepositions; on the F0.5 metric, its margins over the deep contextual model for articles, verb morphology, noun singular-plural, and subject-verb agreement are 6.6, 18.1, 8.0, and 8.0 respectively, a relatively obvious improvement. On the preposition type, this paper's model is slightly inferior to the deep contextual model, probably because there are many preposition classes and the model still cannot grasp their collocation rules well.

Figure 4. Comparison of results based on classification
Two classes at College J were selected as experimental subjects. The experimental class used the corpus-based error analysis constructed in this paper for English learning, while the control class used the traditional English learning method; the experiment lasted 15 weeks.
A pre-study test was administered to the two classes when they first entered their junior year, and the results showed that students in both classes made many grammatical errors in English writing, concentrated on prepositions, verbs, and conjunctions, with many problems in common. After entering the third year, students were more concerned with improving their English scores; after two years of study, their accuracy in reading and cloze exercises had risen, which was closely related to the school's English teachers emphasizing reading. In the freshman and sophomore years, students practiced reading and gap-filling extensively but did little written expression and even less error correction, usually memorizing model essays and writing essays only for exams, so their writing contains many grammatical errors. Accordingly, this study carries out action research on the treatment of grammatical errors in students' English writing.
Before the study and after each of the first, second, and third rounds of this action research, tests were given to the students of both classes, and the test scores were analyzed statistically to judge the effectiveness of the grammar correction material bank constructed in this paper on students' written English.
By marking the pre-test papers, the results across the eight lexical categories examined in the grammar test (preposition errors, pronoun errors, article errors, noun errors, adjective errors, adverb errors, verb errors, and conjunction errors, i.e., syntactic errors) are shown in Table 1.
In general, students' error rate on pronouns was low, but when a compound structure was present, students were distracted by other grammatical points and overlooked incorrect pronoun usage: on item 13, 78.15% of students revised correctly, but on the other pronoun item (item 11) the correct rate was only 34.79%, and most students simply ignored the error and did not understand the grammatical point the sentence was testing.
Table 1. Pre-test paper score rates by item
Item number | Part of speech examined | Score rate | Item number | Part of speech examined | Score rate |
---|---|---|---|---|---|
1 | Nouns | 82.65% | 11 | Pronoun | 34.79% |
2 | Article | 82.65% | 12 | Verbs | 69.36% |
3 | Article | 73.45% | 13 | Pronoun | 78.15% |
4 | Nouns | 26.06% | 14 | Verbs | 21.65% |
5 | Adverb | 85.66% | 15 | Verbs | 69.52% |
6 | Adjective | 65.18% | 16 | Conjunction | 78.23% |
7 | Article | 43.66% | 17 | Conjunction | 78.23% |
8 | Preposition | 56.96% | 18 | Conjunction | 56.58% |
9 | Preposition | 60.83% | 19 | Verbs | 69.15% |
10 | Verbs | 60.84% | 20 | Conjunction | 4.36% |
Table 2 shows the sample mean statistics of the writing pre-test. The mean score of the essay gap-filling items is relatively high at 12.9352. During the freshman and sophomore years, students practiced a great deal of grammar gap-filling using the English grammar correction model in the corpus and mastered this question type well; moreover, the prompt words of essay gap-filling items are more obvious, making it easier for students to write the correct answers than in short-passage correction.
Table 2. Sample mean statistics of the writing pre-test
Type | Mean value | N | Standard deviation | Standard error of mean |
---|---|---|---|---|
Simple sentence | 12.1654 | 50 | 2.7563 | 0.3648 |
Error correction | 5.7556 | 50 | 2.6466 | 0.3896 |
Single fill | 13.6485 | 50 | 2.5489 | 0.3248 |
Single sentence translation | 5.9547 | 50 | 2.9954 | 0.3495 |
Fill in | 12.9352 | 50 | 2.3964 | 0.3685 |
Written expression | 14.9487 | 50 | 3.2874 | 0.3185 |
Table 3 shows the pre- and post-test results of the first round of action research. Across the first round, the score rates of five categories (prepositions, pronouns, nouns, predicate verbs, and coordinating conjunctions) increased, while those of the other five (articles, adjectives, adverbs, non-predicate verbs, and subordinating conjunctions) declined. The largest increase was in the noun category, reaching 29.89%, and the largest decrease was in the adjective category, reaching -31.46%. Taken together, although the fluctuations in individual category score rates varied greatly before and after the first round, the final average score changed little and was strongly influenced by randomness.
Table 3. Pre- and post-test results of the first round of action research
/ | Pre-test of the first round (score rate %) | Post-test of the first round (score rate %) | Fluctuation |
---|---|---|---|
Preposition | 58.64% | 66.34% | 7.70% |
Article | 66.24% | 58.71% | -7.53% |
Pronoun | 56.48% | 82.63% | 26.15% |
Adjective | 65.21% | 33.75% | -31.46% |
Adverb | 86.69% | 71.69% | -15.00% |
Nouns | 54.36% | 84.25% | 29.89% |
Predicate verb | 53.21% | 57.63% | 4.42% |
Non-predicate verb | 65.97% | 53.23% | -12.74% |
Coordinate conjunction | 67.15% | 70.64% | 3.49% |
Subordinate conjunction | 41.36% | 41.26% | -0.10% |
Average score | 6.18 | 6.23 | 0.05 |

Paired-difference statistics:

Mean (post minus pre) | Standard deviation | Standard error of mean | 95% CI lower limit | 95% CI upper limit | t | df | Significance (two-tailed) |
---|---|---|---|---|---|---|---|
0.48% | 18.654% | 5.8965% | -12.856% | 13.851% | 0.0815 | 10 | 0.945 |
A paired-samples t-test was conducted on the scores before and after the first round of action research. The significance value was 0.945, indicating no significant difference between pre- and post-test scores attributable to the first round. Combined with the fluctuations in the score rates of the various grammar categories, it can be concluded that the first round of action research using corpus-based error analysis in EFL did not have a significant impact on students' knowledge of grammar.
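The reported statistics can be reproduced approximately from Table 3 itself; the sketch below uses SciPy's paired t-test (small discrepancies from the reported p-value may stem from rounding and the degrees of freedom used in the original analysis):

```python
from scipy import stats

# Per-category score rates (%) before and after the first round, from Table 3.
pre = [58.64, 66.24, 56.48, 65.21, 86.69, 54.36, 53.21, 65.97, 67.15, 41.36]
post = [66.34, 58.71, 82.63, 33.75, 71.69, 84.25, 57.63, 53.23, 70.64, 41.26]

t, p = stats.ttest_rel(post, pre)   # paired two-tailed t-test
print(f"t = {t:.4f}, p = {p:.3f}")  # t near 0.08, p well above 0.05
```

A p-value this far above 0.05 is what justifies the conclusion that the first round produced no statistically detectable change.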
Table 4 shows the pre- and post-test results of the second round of action research. Across the second round, the score rates of all grammar categories increased except the adverb category, which declined slightly. The largest increase was in the predicate verb category, reaching 32.67%. Taken together, students' grammar scores rose markedly before and after the second round of action research using the corpus-based error analysis constructed in this paper, and the fluctuation of the average score also rose more noticeably than in the first round.
Table 4. Pre- and post-test results of the second round of action research
/ | Pre-test of the second round (score rate %) | Post-test of the second round (score rate %) | Fluctuation |
---|---|---|---|
Preposition | 66.45% | 76.09% | 9.64% |
Article | 58.73% | 86.48% | 27.75% |
Pronoun | 82.63% | 88.63% | 6.00% |
Adjective | 33.54% | 63.15% | 29.61% |
Adverb | 71.85% | 65.05% | -6.80% |
Nouns | 84.59% | 92.38% | 7.79% |
Predicate verb | 57.56% | 90.23% | 32.67% |
Non-predicate verb | 53.69% | 75.63% | 21.94% |
Coordinate conjunction | 70.65% | 71.26% | 0.61% |
Subordinate conjunction | 41.65% | 43.42% | 1.77% |
Average score | 6.15 | 6.69 | 0.54 |

Paired-difference statistics:

Mean (post minus pre) | Standard deviation | Standard error of mean | 95% CI lower limit | 95% CI upper limit | t | df | Significance (two-tailed) |
---|---|---|---|---|---|---|---|
13.10% | 13.76% | 4.43% | 3.36% | 22.92% | 3.039 | 9 | 0.0154 |
A paired-samples t-test was conducted on the scores before and after the second round of action research. This time the t-value was 3.039 and the significance value was 0.0154, indicating that the change between pre- and post-test scores was statistically significant. Combined with the fluctuations in the score rates of the various grammar categories, it can be concluded that the second round of action research significantly improved students' mastery of the various grammar types.
Table 5 compares the pre- and post-test score rates for the third round of action research. The significant increases on items 1 and 4 show a clear improvement in students' understanding of noun usage. Items 2, 3, and 7 showed an increase of 9.11%, a decline of 8.40%, and an increase of 25.50% respectively, indicating that students' grasp of article usage is not yet consistent and may need strengthening in places. Adverbs were examined in item 5, where the score rate rose from 86.93% to 100%, showing that students have a basic grasp of adverb usage. Adjectives were examined in item 6, with a 21.03% increase, showing some improvement in mastery of adjectives. Prepositions were examined in items 8 and 9, with increases of 13.31% and 30.42% respectively, a significant improvement in students' understanding of preposition usage. Verbs appeared in items 10, 12, 14, 15, and 19, with score-rate gains ranging from 17.1% to 25.36%, showing some improvement in verb usage. Pronouns were examined in items 11 and 13, with increases of 34.49% and 17.28%, a larger improvement in pronoun usage. Conjunctions were examined in items 16, 17, 18, and 20, all with increases of more than 10%, showing a larger improvement in students' understanding of conjunction usage.
Table 5. Comparison of pre- and post-test score rates for the third round
Item number | Part of speech examined | Pre-test score rate | Post-test score rate | Fluctuation |
---|---|---|---|---|
1 | Nouns | 82.63% | 100% | 17.37% |
2 | Article | 82.14% | 91.25% | 9.11% |
3 | Article | 73.66% | 65.26% | -8.40% |
4 | Nouns | 26.06% | 60.25% | 34.19% |
5 | Adverb | 86.93% | 100% | 13.07% |
6 | Adjective | 65.23% | 86.26% | 21.03% |
7 | Article | 43.98% | 69.48% | 25.50% |
8 | Preposition | 56.31% | 69.62% | 13.31% |
9 | Preposition | 60.84% | 91.26% | 30.42% |
10 | Verbs | 60.89% | 86.25% | 25.36% |
11 | Pronoun | 34.66% | 69.15% | 34.49% |
12 | Verbs | 69.36% | 91.62% | 22.26% |
13 | Pronoun | 78.15% | 95.43% | 17.28% |
14 | Verbs | 21.96% | 39.65% | 17.69% |
15 | Verbs | 69.26% | 86.36% | 17.10% |
16 | Conjunction | 78.56% | 91.26% | 12.70% |
17 | Conjunction | 78.26% | 100% | 21.74% |
18 | Conjunction | 56.23% | 78.26% | 22.03% |
19 | Verbs | 69.48% | 86.64% | 17.16% |
20 | Conjunction | 4.36% | 26.97% | 22.61% |
Taken together, most students improved their understanding of all the parts of speech examined. Although the score rate fell on one article item, it rose on the other article items, so this may be an isolated case. The data show improvement in students' grammatical understanding and application.
Table 6 shows the paired-samples t-test comparing the writing pre-test and post-test. Compared with the pre-test, the post-test improved by an average of 5.0645 points for single-sentence correction, 3.0354 points for short-passage correction, 3.3214 points for single-sentence gap-filling, 1.7258 points for single-sentence translation, 3.3454 points for essay gap-filling, and 1.4598 points for written expression. Although the gains differ across question types, they show that students made real progress.
Table 6. Paired-samples t-test of the writing pre-test and post-test
Item | Mean (post minus pre) | Standard deviation | Standard error of mean | 95% CI lower limit | 95% CI upper limit | t | df | Significance (two-tailed) |
---|---|---|---|---|---|---|---|---|
Simple sentence | 5.0645 | 3.4215 | 0.5261 | 4.0069 | 6.0684 | 9.7184 | 45 | 0.000 |
Error correction | 3.0354 | 4.0695 | 0.6598 | 1.8935 | 4.2254 | 5.1254 | 45 | 0.000 |
Single fill | 3.3214 | 3.8545 | 0.5621 | 2.1658 | 4.4825 | 5.8596 | 45 | 0.000 |
Single sentence translation | 1.7258 | 3.1698 | 0.4628 | 0.7985 | 2.6365 | 3.7185 | 45 | 0.000 |
Fill in | 3.3454 | 4.6284 | 0.9156 | 1.4585 | 4.2158 | 3.3985 | 45 | 0.000 |
Written expression | 1.4598 | 2.3654 | 0.3515 | 0.7856 | 2.2365 | 4.2615 | 45 | 0.000 |
In summary, the post-test performed better than the pre-test on every item above. The t-values show that the improvement is statistically significant, and the significance value of 0.000 for each item confirms it. This means that all the instructional activities, whether single-sentence correction, short-passage correction, single-sentence gap-filling, single-sentence translation, essay gap-filling, or written expression, achieved effective improvement.
Figure 5 shows the ontology (mechanics) errors of the experimental class and the control class before and after the test. Regarding capitalization errors, before the experiment the difference between the two classes was small; after the experiment, the control class remained essentially as before, while the experimental class made almost no capitalization errors. Regarding punctuation, before the experiment the experimental and control classes made 16 and 18 errors respectively, a small difference; after the experiment, the experimental class made only 4 punctuation errors compared with the previous 16, which shows that the teaching of punctuation in the corpus-based error analysis instruction was basically successful. Regarding spelling errors, the two classes differed little before the experiment, but afterwards the control class's errors increased while the experimental class's error rate was cut by half. This is not dramatic progress, but vocabulary improves gradually over time, so to some extent this aspect still warrants students' attention.

Figure 5. Ontology errors of the experimental and control classes before and after the test
To sum up, the teaching strategies and methods of corpus-based error analysis applied in the experiment help students correct ontology errors (capitalization, punctuation, and spelling).
According to the survey of students’ English grammar test scores, it can be seen that students’ poor knowledge of grammar and vague concepts of grammar lead to a large number of errors in oral or written expression in English.
Language knowledge and language skills are the basis of comprehensive language competence, and grammar is an important part of language knowledge, so cultivating students' comprehensive language competence requires them to master a certain amount of grammatical knowledge. Without a solid foundation of grammatical rules it is impossible to learn any foreign language, and language learning must follow its own laws. Acquisition without explanation or grammar describes young children's natural acquisition of their mother tongue, whereas foreign languages today must be acquired through classroom learning for lack of an immersive environment. Teachers should therefore pay attention to grammar teaching: familiarize themselves with the syllabus, teach and summarize basic grammatical knowledge, and reinforce grammatical rules through extensive training in listening, speaking, reading, and writing, so that students become familiar with the rules of English grammar. Teachers should also encourage students to communicate with each other using the language forms they have learned, organically combining language forms with communicative functions in concrete scenarios and improving students' comprehensive language use.
In a word, the cultivation of comprehensive language ability should be emphasized in English teaching, but the teaching of language knowledge should not be taken lightly, and attention should be paid to the cultivation of learners’ grammatical awareness.
Grammar has to be taught, and taught well, and method is the key. In view of the problems in secondary school students' grammar learning, this section discusses methods of teaching English grammar in secondary schools: teaching grammar with the comparative method, teaching grammar in context, and teaching grammar with the task-based teaching method.
Due to the differences between English and Chinese, students often make mistakes when using English because of mother-tongue transfer. It is therefore quite important to teach English grammar with appropriate use of the comparative method so that students can understand the differences between the two languages.
In grammar teaching, if grammar is taught only through static forms such as words, phrases, and sentences, with repeated drills to make students memorize the rules, students will find learning boring, will not truly understand the rules, and will not be able to use the language flexibly. Teaching grammar in context means designing different contexts according to the specific teaching content and its characteristics, turning abstract grammar rules into concrete language facts, and letting students actively perceive and discover the rules in the corresponding contexts. In this way students truly learn and remember the rules of grammar, avoid overgeneralizing them, and reduce intralingual grammatical errors.
The task-based teaching method emerged in the late 20th century. Task-based language teaching advocates letting students do things, i.e., mastering the language by accomplishing various tasks, and cultivating independent learning and inquiry by having students discover, generalize, and summarize. Its core idea is to simulate the kinds of activities people engage in when using language in school life and society, to combine language teaching with language use, and to let learners actively attempt to communicate in the target language, thereby cultivating comprehensive language competence. Advocates of task-based language teaching do not reject grammar teaching; on the contrary, they emphasize it, holding that classroom instruction should teach grammatical forms and how those forms are used for communicative purposes.
It is one of the important tasks of the English program for teachers to consciously strengthen guidance on learning strategies so that students develop good study habits. Teachers should help students gradually learn how to learn English and how to master English grammar concepts while learning and using the language, and should cultivate the habit of thinking in English. Teachers can strengthen the explanation and training of grammatical knowledge so that students master and use it skillfully. Major language phenomena such as key sentence patterns, tenses, and morphology should be practiced repeatedly; this continually reinforces the stimulation of students' memory and is an effective means of overcoming interference from Chinese sentence patterns. Teachers can also combine regular reading classes with writing training, having students extract the typical sentence patterns learned in reading class and drill them repeatedly until mastered. At the right time, students can be introduced to sentence transformation methods, such as the various subordinate clauses and connectives, and the emphatic, exclamatory, and inverted forms of common sentence patterns. In this way students can master the various points of grammar and syntax and make fewer mistakes.
Teaching grammar in the right way enables students to master grammatical knowledge better and reduces grammatical errors in language use. However, language learning is a process accompanied by mistakes, and it is inevitable that students will make grammatical errors. Teachers should deal with students' grammatical mistakes appropriately in grammar teaching, so that students can learn from their mistakes and improve their grammatical level and language ability.
In this paper, we use deep neural networks and domestic and foreign learner corpora to establish a Seq2Seq-based English grammar error correction model, add an attention mechanism to handle the error correction problem, introduce layer normalization, and vectorize the text before processing. To filter invalid suggestion text effectively, an n-gram language model is used to score sentences and complete the feedback filtering of English semantics. On the F0.5 metric, this paper's model outperforms the deep contextual model by 6.631, 18.107, 7.944, and 8.007 on articles, verb morphology, noun singular-plural, and subject-verb agreement respectively. In the grammatical error correction pre-test, 78.15% of students corrected the pronoun usage on one item, but the correct rate was only 34.79% on the other pronoun item; most students simply ignored the error and did not understand the grammatical point being tested. In the third-round post-test, the score rate of every item improved to varying degrees: on item 5, which examined adverbs, the score rate rose from 86.93% to 100%, showing that students have basically mastered adverb usage; on items 11 and 13, which examined pronouns, the score rates rose by 34.49% and 17.28%, showing improved pronoun usage. Comparing the ontology errors before and after the corpus-based error analysis experiment, the experimental and control classes made 16 and 18 punctuation errors respectively before the experiment, while the experimental class made only 4 afterwards, a good result that shows the teaching of punctuation in English grammar during the corpus-based experimental teaching was basically successful.