For decades the differences in human capital have been considered to be one of the major factors that explain why some countries are richer than others. While neoclassical and endogenous growth theory identified and analyzed numerous important factors affecting these output differences such as foreign trade, government consumption, geography, and political institutions to mention just a few [Barro, 1991; Acemoglu et al., 2001; Jones, 2014; Cieślik and Goczek, 2018], one of these important determinants has been the human capital which up to a point would be associated with education-related indicators frequently understood as average educational attainment. Along these policy lines, the vast majority of countries have targeted the Organization for Economic Co-operation and Development (OECD) leaders in their efforts to improve their educational attainment in the past decades. However, some countries have consistently lagged in output per capita despite having introduced an education reform aimed at increasing average years of schooling. This brings about the idea that the lion's share of the income gap between poor and rich countries can be attributed not just to the length or population enrolment of education but also to its quality. Recent works by Erosa et al. [2022], Hanushek and Woessmann [2008, 2012], Manuelli and Seshadri [2014], Tamura [2001], and Schoellman [2012] confirm that the role of quality of education may be underappreciated as opposed to average attainment or years of schooling.
The main objective of this paper is to identify the role of the quality of education in economic growth. We extend the classical Nelson and Phelps [1966] approach by introducing differences in education quality in a leader-follower type of growth model with knowledge diffusion. As in Hanushek and Woessmann [2012, 2015], we use students' performance in a standardized international test (the Program for International Student Assessment [PISA] test) to approximate the quality of education in a set of all countries observed over the years 1975–2018. This issue is important also from a political point of view: a confirmed crucial role of education quality may encourage governments to rethink their policy of financing education. This, however, raises a question about the stability of the education quality and economic growth nexus: we intend to verify whether the short and long-run effects are equal and what the pattern otherwise is. Moreover, we consider the possible reverse causality that characterizes economic development and the quality of education Finally, in most papers, any form of co-variation of education and GDP is assumed to be the confirmation of an existing impact of the education factor on economic growth. This, however, is not at all obvious and we set as our target in identifying the direction of this relation.
This paper is arranged as follows. In Section 2, we briefly outline the data and procedures used in this paper to allow for the reproduction of procedures by an independent researcher. We review the relevant theory in Section 3, presenting adequate literature discoveries and a theoretical model that provides a theoretical background for the role of education quality in GDP growth. In Section 4 we discuss the empirical model and the econometric approach used in this paper. Section 5 presents the main empirical findings and their discussion and briefly concludes.
To verify the research hypothesis, we use two main sources of data. The quality of education is assessed with PISA test results which are freely available from the OECD website. These data are extrapolated backward with the use of linear panel data Genearalized Least Squares approach allowing for autocorrelation and heteroskedasticity of the errors as described by Goczek et al. [2021]. The macroeconomic country data used in the calculation are taken from the World Bank (WB), International Money Fund (IMF), and the OECD. Table 1 presents the descriptive statistics.
Descriptive statistics
logGDP | 4,942 | 8.942929 | 1.233535 | 5.508054 | 11.82894 |
OPEN | 7,538 | 80.11214 | 51.46048 | 0.0209992 | 531.7374 |
FDI | 5,535 | 1.86752 | 12.44171 | −202.8239 | 247.9023 |
PISAAVE | 4,154 | 420.173 | 50.2345 | 312.482 | 550.1743 |
PISASCIE | 4,154 | 424.9734 | 48.47972 | 323.4689 | 550.6454 |
PISAMAT | 4,154 | 419.5068 | 51.97938 | 319.7289 | 566.1837 |
PISAREAD | 4,154 | 416.0389 | 51.054 | 294.2483 | 543.4544 |
The main part of the research is based on the vector error correction model (VECM; Keblowski, 2016, for instance) explaining GDP as a function of different factors, including the education quality. Its application requires the processes to be co-integrated, thus we test for the series cross-correlation with Pesaran's [2004] test, and having found cross-dependence we use the appropriate stationarity Im, Pesaran, and Shin's (CIPS) test (2003). Next, we use Westerlund's [2007] co-integration test to confirm the existence of the co-integrating relation. The existence of co-integration allows for the estimation of the considered VECM which we carry out with the use of the dynamic fixed effects (DFEs) estimator. As the last step, we test for causality by extending the classical Granger's method with Dumitrescu and Hurlin's [2012] individual coefficients approach. All the estimations are carried out with Stata with the use of either the standard procedures or add-ins available freely for download and easy to find with Stata's search engine.
Human capital can be defined as “the set of intangible resources embedded in the labor factor which have improved its productivity” [Goldin, 2016]. These resources are related to knowledge and skills acquired through education and experience [Schultz, 1961; Becker, 1962]. There exists a large body of microeconomic evidence of a strong relationship between human capital and earnings, stemming from the classical Mincer model which uses years of schooling as a key determinant of wages. Mincer's model implies that a change in a country's average level of schooling should be the fundamental factor of economic growth [Krueger and Lindahl, 2001]. Owing to this finding, several studies adjust the workforce for improvements in human capital measured by their educational attainment. However, on the macroeconomic level, numerous studies have failed to find a significant relationship between economic growth and educational attainment [Bosworth and Collins, 2003]. Elbakidze and Jin [2015] go even one step further and show that the increased secondary attainment resulted mostly in an increased amount of violence rather than educational effects.
While on the microeconomic level the accumulation of human capital improves labor productivity and increases the returns to capital and a well-educated workforce is also essential for the creation and diffusion of technology, the issue is the distinction between the quantity (an example of which could be the educational attainment) and the quality of education. Stern [2000] documents one example of the increased quantity not necessarily being followed by the desired economic effects. This could be attributed to the measurement problem. In a comprehensive study, Benos and Zotou [2014] show that educational attainment, frequently used in empirical studies, is an inaccurate measure of human capital since it measures education quantity, which is more or less stable, while education quality varies widely across countries and periods. To investigate this difference, Schoellman [2012] adjusts years of schooling by considering the education quality to explain the cross-country output differences per worker. He finds that the cross-country differences in education quality are roughly as important as cross-country differences in its quantity. This increases the total contribution of education differences from 10% to 20% of output. In consequence, as long as it can be measured, the quality of education seems a better measure to describe the quality of human capital in the macroeconomic production function than the years of schooling or educational attainment. In this paper, we follow the approach of Hanushek and Woessmann [2012] and use the OECD's PISA results to describe each country's quality of education in the primary and early secondary stages. The PISA measures 15-year-olds' ability to solve real-life-like types of problems with their reading, mathematics, and science knowledge and skills. The main strength of this cyclical program that has lasted over the last two decades is the fact that in each participating country a sample of at least a few thousand students is drawn. Those take a test whose results can be compared across different countries. The fact that the analysis is repeated every three years with an increasing number of participating countries allows for the cross-country dynamic comparison of educational achievements measured by the test results.
There exists some research on the impact of educational quality measured by PISA scores on economic growth, however, most of it is based on cross-sectional regressions or correlations [Goczek et al., 2021]. Breton [2011] challenges Hanushek and Woessmann's [2008] contention that the quality and not the quantity of schooling determines a nation's rate of economic growth by arguing this cross-sectional method is flawed. Nevertheless, a few methodologically more advanced papers, mostly by Hanushek and Woessman, should be also mentioned. In their paper, Hanushek and Woessmann [2012] address the problem of causality by trying to eliminate gradually the influence of potentially relevant features on growth. This is aimed at avoiding the potential spuriousness of the regression. The authors argue that the education quality-GDP growth relation is not instantaneous and they use lagged education measures, which enforces them to use the results of tests performed even before PISA was introduced (such as Trends in International Mathematics and Science Study TIMSS). In line with this, OECD [2010] mentions the problem of causality and performs some robustness checks to determine the direction of the existing relation, others do not always do so – for example, Chen and Luoh [2010] simply regress growth on test scores. They examined the relationship between test scores (mathematics and science) and cross-country income differences. The authors pose a question of whether test scores are good indicators of workforce quality. The obtained results suggest that the strong link between test scores and cross-country income differences cannot be confirmed. Instead, they discover that the per capita number of researchers involved in R&D as well as the per capita number of scientific and technical journal articles could better account for the cross-country income differences. This might be due to improper lagging of the test scores. However, it might also mean that the general proxy for the country's education quality (which PISA intends to measure at the mid-teen level) is not sufficient. Indeed, it can be construed that the skills as they are measured in PISA tests are not key factors to the future human capital quality. Campbell and Üngör [2020] develop a national accounting exercise with human capital to account for schooling, cognitive skills, and health. We present group comparisons, finding for some regions values as large as 40–50%. The group comparisons of education quality indicate, for some regions, values as large as 40–50%. Moreover, Balart et al. [2018] decompose PISA test scores into a cognitive- and non-cognitive components to find that both components are positively associated with economic growth. We believe that the data could be decomposed further due to increased data availability. In addition, one of the reasons for the lack of statistical significance of education in the convergence process could be the presence of omitted control variables like openness as indicated in the meta-analysis of the literature by Benos and Zotou [2014], which could be seen as a significant critique of many previous studies.
Hanushek and Woessmann [2015] present evidence that when controlling for cognitive skills acquired during the education process (a measure of education quality), the initial value of years of schooling does not have any effect on the average annual growth rate of GDP per capita in the period 1960–2010. The authors address the problem of causality in yet one more way, namely instead of analyzing the level-level relation, they regress deviation on deviation. Still, most research is based just on cross-sectional data with few researchers applying instrumental variables regression. We found no trace of dynamic panel data model analysis in the literature that makes use of PISA (and similar) test measurement of human capital.
We argue that the possibility of controlling for endogeneity is not appropriately undertaken in the surveyed articles. To our knowledge, no co-integration analysis is present in the literature. The latter seems to be particularly important when time-series (or panel) data from different countries are used. While it is commonly known that the GDP series are almost always non-stationary, the human capital quality measured by the results of such tests as PISA is likely to share the same property due to the existence of the so-called “Flynn effect.” The latter expression is used to name the regular and systematic worldwide increase of mean results in standardized intelligence test performance, observed since the beginning of the 20th century [Flynn, 1984, 1987]. Based on the analysis of data from 14 different countries a considerable increase in intelligence test scores of 5 to 25 points on the deviational intelligence quotient scale for each generation has been demonstrated. The rise of scores is usually found to be greater on tests of fluid intelligence, such as Raven Progressive Matrices [Brouwers et al., 2009], than on tests of crystallized intelligence, especially nonverbal IQ tests. However, a considerable increase is recorded also for tests that measure crystallized intelligence [Flynn, 1987] such as the Similarities Scale in Wechsler's Intelligence Test. Given the above – and also the inspection of empirical data – one can expect the test results to be generally improving over time. The co-integration analysis would then seem a reasonable tool to avoid the risk of attaining spurious regression or reverse causality in which GDP increase can lead to better education quality as countries can invest in better education to make their income higher. Although there exists much more research whose authors try to attain the PISA score determinants as well as PISA score implications, there seems to be space for a lot more, especially on the country aggregated data, as well as the potential non-stationarity of the data, which suggests the use of co-integration techniques, not present in the literature in this context.
The model which is employed investigates the relationship between education quality and growth in an open economy version of the leader-follower model of Nelson and Phelps [1966] that allows for conditional convergence in growth rates between countries but not in the levels of income. In the steady state, the leader and the follower exhibit the same growth rate. The convergence of the follower to the leader occurs through the international diffusion of knowledge through international trade with the leader economy. In our setting increasing education quality allows for a quicker diffusion of technology.
The model builds on earlier studies by Cieślik and Goczek [2018] as well as Cieślik and Tarselewska [2011] who employed the leader-follower model in a different context of privatization and corruption. In this paper, we extend this setting to show that education quality can impact the steady-state level of income and the rate of growth in the follower economy during its transition to the steady-state but not its steady-state level of growth. In the steady state, the growth in the follower economy matches the growth in the leader economy.
We set out by considering an autarky case with no international knowledge diffusion. We then introduce knowledge diffusion enabled by international openness in the next step. Let the technological leader be denoted as object 1 and the follower as object 2. The aggregate production functions can be then written as:
In the case of autarky, there is no trade and investment so no international knowledge diffusion takes place as each country has to come up with all varieties of capital goods alone. In this case, the behavior of the follower equals the behavior of the leader. The critical difference lies in the costs of inventing a new variety of capital goods in the
In each country, the output
We expect that better education makes it easier to innovate. Thus, the variable
The capital
In the beginning, economies are closed to both international trade and FDI. This imposes that the national output equals domestic expenditures which are distributed between consumption (
In both countries, both innovating and imitating firms face a demand for a variety of capital goods, which comes from the profit maximization problem of the final goods producers. The demand is given by:
To make the innovation profitable, patents are introduced. The patents yield monopoly power to the owner – the innovating representative firm. As the firms are homogenous, profit maximization causes prices to be the same across all varieties. Thus, the profit maximization of the innovator
Consumers in both countries are assumed to be Ramsey-type with infinite time horizons who maximize standard intertemporal utility function at time zero:
Standard maximization of utility, subject to the intertemporal budget constraint yields the rate of consumption growth:
The growth rate of consumption is constant since the interest rate in each country is constant. Therefore, the growth rates of the number of varieties
Let us now consider a case of an open economy. In this case, a firm in the follower country can invent a new variety of goods or imitate a product that is already known in the leader country. However, it is cheaper to imitate pre-existing goods from the leader country because the imitation cost (ν2) is lower than the innovation cost (η2). For this reason, in the open economy equilibrium, the companies in the follower's economy do not innovate. It is assumed that the imitation cost decreases with the number of varieties manufactured by multinational firms (
In the steady state, the growth rate of per capita income in the follower country must equal the growth rate in the leader country. The equality of consumer preferences in both countries causes the international interest rate to equalize. Therefore, the steady-state ratio of per capita incomes (
Eq. (8) demonstrates that the steady-state per capita income of both countries is not equal and a constant income gap may continue, depending on, among others, the national productivity parameters as well as external openness. In this model, education quality does not have permanent effects on the rate of growth in the steady state as the rate of growth in the follower country converges in the long run to the rate of growth of the leader country. Nevertheless, these parameters determine the level of income in the steady state and affect the rate of growth during the transition to the steady state. In particular, a higher level of education quality translates into quicker transitional growth, having controlled for country characteristics that impact their steady-state levels of income, namely their international openness to trade and FDI.
Therefore, the main hypothesis is that education quality has a positive impact on economic growth, having controlled for other effects. As a result, the hypotheses based on the formal theoretical model above that may be tested empirically are:
We base the analysis on the model proposed in Section 3.2 of the article. Importantly and contrary to Hanushek and Woessmann [2010], we construct a model in which we explain the GDP change as a function of not only the quality of education but also the necessary and relevant economic characteristics captured in the theoretical GDP growth model. Furthermore, a key issue in economics is the distinction between the short run and the long run while the most frequently used estimators were derived under the assumption of short- and long-run homogeneity. In such models, the estimated coefficients are held constant across all units in the sample; therefore, the coefficients are the same for each analyzed country over time. However, Pesaran [2007] note that this situation does not have to be in line with reality, especially in the short run. For this reason, the commonly used estimators may not be consistent, and long-run coefficients may be biased. The details of this methodology are explained in detail by Blackburne and Frank [2007]. The methodology fits the needs of our theoretical model and data sample characteristics, since it allows us to decompose the short and the long run, allows for non-stationarity of the variables, panel cross dependence, and last, not least allows us to tackle the short country dimension as compared to the time dimension of the panel.
Therefore, the theoretical results from the formal model suggest the use of the VECM and cointegration analysis approach, which we provide within the panel framework:
φ α β ε
The cointegration approach allows us to tackle reverse causality where a GDP increase can lead to better education attainment as countries can invest in better education. This obvious risk of endogeneity raises doubts regarding the causality inference as it may lead to giving too much weight to the impact of education on GDP and may overestimate its true impact. Our method provides consistent coefficients (despite the possible presence of endogeneity) because it includes lags of both dependent and independent variables which are included in the estimation which allows us to interpret the results as Granger-type causality.
The PISA scores in different types of skills and their average across all the considered skill groups are considered separately. Thus, in empirical models, the PISA variable in Eq. (10) is replaced with PISAAVE (global PISA score), PISAREAD (PISA score for the reading skills), PISAMATH (PISA score for the mathematical skills), and PISASCIE (PISA score for the science skills) respectively in separately estimated versions of Eq. (10).
It should be observed that the PISA test results in the model are taken from the period lagged by
Testing for cointegration is essential before the estimation of Eq. (9). Determining the issue of nonstationarity of the PISA results time series hinges, however, on the issue of cross-sectional dependency between individual countries which is expected to take place because of many reasons. The most obvious is the openness of the economies that results in the global trends of the education processes as well as the global existence of the Flynn effect itself. In light of these arguments, the results of the contemporaneous correlation across the countries' cross-sectional dependence (CD) test developed by Pesaran [2004] that confirm the existence of cross-country correlation of the particular PISA series would not be a surprise.
This dependency may be troublesome for the first-generation unit root tests used to determine the issue of stationarity of the time series that are used in Eq. (10). The first-generation unit root tests assume cross-sectional independence between observational units. This is restrictive and unrealistic in the majority of macroeconomic applications, like the study of convergence where co-movements of business cycles between economies are obvious. The second-generation unit root tests strive to overcome this issue, relaxing the assumption of cross-sectional independence and tackling potential heterogeneity. The most commonly known second-generation test is the cross-sectional Im, Pesaran, Shin (2003) (CIPS) test. Instead of the standard Augumented Dickey Fuller regression, CIPS tests are based on the cross-sectional ADF regression (CADF), which adds lagged cross-sectional means of individuals to control for effects of the common factors, while the computation of the test statistics and the inference follows the IPS procedure. The CIPS statistic is then computed as a group mean of
Similar to panel unit root tests, panel cointegration tests strive to provide more reliable results by checking for the presence of cointegration or cross-dependency issues. The most frequently used panel cointegration tests are based on unit root testing of residuals from an Ordinary Least Squares r egression, however, the major limitation of such tests is that they cannot accommodate structural breaks and account for cross-dependency which exists in the data which is confirmed by the results of the tests shown in Table 2. The panel cointegration test proposed by Westerlund [2007] copes with this issue by determining structural breaks endogenously. More importantly, however, the Westerlund test accounts for cross-sectional dependency evident in the business cycle data across countries with the use of bootstrapping to compute robust critical values. This is an important issue since the application of cointegration tests assuming independence (Kao, Pedroni) to series that are characterized by cross-sectional dependencies leads to size distortions and low power.
Pesaran CD test results
Breusch-Pagan LM | 47,950.26 | 47,579.72 | 46,412.74 | 48,879.8 |
<0.001 | <0.001 | <0.001 | <0.001 | |
Pesaran scaled LM | 563.4106 | 558.7497 | 544.0704 | 575.1033 |
<0.001 | <0.001 | <0.001 | <0.001 | |
Pesaran CD | 202.8806 | 202.8742 | 197.609 | 205.7417 |
<0.001 | <0.001 | <0.001 | <0.001 |
CD, Pesaran's cross-sectional dependence test statistics; LM, Lagrange multiplier test statistics.
Identifying the existence of co-integrating relation allows for the estimation of the panel VECM Eq. (10). The most frequently used methods to estimate Eq. (10) are the dynamic ordinary least squares (DOLS) and the fully modified ordinary least squares (FMOLS). However, an important shortcoming of both methods is that they do not estimate the parameter of the short-run relations that are as important as the long-run relations. Alternative methods, such as the pooled mean group (PMG) regression, mean group (MG) regression, and the DFE approaches, are available to consider different levels of heterogeneity across countries while estimating both the short-run and the long-run effects simultaneously. Due to insufficient data, only the DFE estimation is feasible. However, the application of the DFE approach is justified in this case given that the countries available in the sample have important similarities and most of the heterogeneity between them is captured by country-specific intercepts. In consequence, the DFE model is appropriate in the considered situation.
The last issue that requires consideration in the paper is the problem of causality. The seminal research by Hanushek and Woessmann seems not to shed sufficient light on this crucial issue which, as emphasized by Altman [2020] is often the case. Thus, we adopt two approaches to causality testing in the panel data framework. The first one consists in treating the panel data as one large stacked set of data, and then performing the Granger Causality test in the standard way, except for not letting data from one cross-section enter the lagged values of data from the next cross-section. This method assumes that all coefficients are the same across all cross-sections. Since this might be doubtful, we also consider another approach adopted by Dumitrescu and Hurlin [2012], who make an opposite assumption, allowing all coefficients to be different across cross-sections.
The main concern associated with the use of PISA results as a measure of the quality of schooling δ and growth factor in the empirical growth model is that the PISA tests have only been performed over approximately the last 20 years in general and only a few years in a numerous group of countries. To overcome the problem which immediately stems from a low number of available observations we apply the strategy described by Goczek et al. [2021]: we use exponential smoothing to interpolate the PISA scores in the years when the tests were not performed in a given country and apply backward extrapolation of the PISA results to obtain an adequately long series. As an extrapolation tool, we use a dynamic regression model in which we explain PISA scores (for each of the three groups of skills distinguished in the PISA tests: mathematics, science, and reading) as a function of lagged education expenditures [Heyneman, 1995; Hanushek and Woessmann, 2012; Barro and Lee, 2015] and individual country effects. The countries included in the sample include Albania, Argentina, Australia, Austria, Azerbaijan, Belgium, Chile, Colombia, Costa Rica, Croatia, Czech Republic, Denmark, Estonia, Finland, France, Georgia, Germany, Greece, Hong Kong, Hungary, Iceland, Indonesia, Ireland, Israel, Italy, Japan, Kazakhstan, Korea, Kyrgyz Republic, Latvia, Lithuania, Luxembourg, Macao, Malaysia, Malta, Mexico, Moldova, Netherlands, New Zealand, Norway, Peru, Poland, Portugal, Qatar, Romania, Russia, Singapore, Slovak Republic, Slovenia, Spain, Sweden, Switzerland, Thailand, Tunisia, Turkey, UK, USA, and Uruguay.
An adequate GLS estimator allowing for autocorrelation and heteroskedasticity is then applied to yield theoretically lagged PISA country-average results. Naturally, possession of true data without any forms of attrition or imputation is an ideal situation, however, this often is not the case, especially in cross-country studies. In consequence, we partly estimate the missing values. However, it should be emphasized that we use an algorithm based on the estimation of the missing values with the use of an appropriate regression equation with an individual effect, individual autocorrelation, and inclusion of educational expenditures as a regressor.
The PISA results for different skills are country correlated, as was expected. The results of Pesaran's test of CD are provided in Table 2.
The results that confirm the existence of correlation across different PISA test item scores are by no means surprising. On the one hand, to some extent, the 15-year-olds that are tested within the frame of PISA can be – for simplification – considered as members of the groups of either the more or of the less involved in studying. This division does not fully determine the results of the PISA test for each skill but has a certain effect on it. In consequence, the conditional and the unconditional mean result for one skill given the results regarding other skills are not equal. On the other hand, the cross-dependence is partly caused by the construction of the PISA test itself.
This dependency determines the use of CIPS test to determine the orders of integration of the incurred series. Table 3 summarizes the results for the variables included in the model.
CIPS unit root test results
logGDP | 44.523 | 1.00 | −1.854 | 0.03 |
OPEN | 0.01 | 0.50 | −5.525 | <0.01 |
FDI | 0.72 | 0.76 | −16.309 | <0.01 |
PISAAVE | 1.318 | 0.91 | −17.421 | <0.01 |
PISAREAD | 1.159 | 0.88 | −17.024 | <0.01 |
PISAMAT | 3.338 | 1.00 | −18.784 | <0.01 |
PISASCIE | −1.907 | 0.03 | −16.632 | <0.01 |
CIPS, Cross-sectional Im, Pesaran, and Shin.
The results are clear and indicate the I(1) character of all the series of interest (assuming a 5% level of significance). This is crucial for further steps of analysis. Having identified the existence of cross-correlation as well as equal order of integration across the series, a proper co-integration test is required. As was stated in Section 3, few tools are adequate given the dependence across different variables, however, Westerlund's approach does have this feature. Besides, the Westerlund test, which results are provided in Table 4, accounts for cross-sectional dependency by using bootstrapping to compute robust critical values. Indeed, the issue of cross-dependency has been shown to affect the results of cointegration as can be seen in the comparison between
Westerlund cointegration test results
PISAAVE | ||||
Gt | −2.655 | 2.798 | <0.01*** | 0.01** |
Ga | −9.173 | 1.592 | 0.94 | <0.01*** |
Pt | −15.039 | 2.881 | <0.01*** | 0.01** |
Pa | −8.567 | 1.054 | 0.15 | <0.01*** |
PISAREAD | ||||
Gt | −2.738 | −3.347 | <0.01*** | <0.01*** |
Ga | −8.582 | 2.117 | 0.98 | 0.01** |
Pt | −14.547 | −2.415 | 0.01** | <0.01*** |
Pa | −8.166 | −0.673 | 0.25 | <0.01*** |
PISAMAT | ||||
Gt | −2.569 | −2.225 | 0.01** | 0.04** |
Ga | −8.584 | 2.115 | 0.98 | <0.01*** |
Pt | −14.21 | −2.097 | 0.02** | 0.01** |
Pa | −8.025 | −0.538 | 0.30 | <0.01*** |
PISASCIE | ||||
Gt | −2.528 | −1.951 | 0.03** | 0.04** |
Ga | −9.397 | 1.393 | 0.91 | <0.01*** |
Pt | −14.082 | −1.975 | 0.02** | 0.01** |
Pa | −8.61 | −1.095 | 0.14 | <0.01*** |
The use of robust
The results provided in Tables 2–4 allow us to conclude that the necessary conditions are met and the main Eq. (9) can be estimated credibly without the risk of being interpreted as spurious regression. This is an important issue: the non-stationarity of the incurred series would otherwise raise serious doubts about the validity of the results. The results of the extrapolation procedure described by Goczek et al. [2021] can also be used to predict the theoretical PISA scores in other countries which have not performed the PISA tests so far upon the assumption of them being typical in terms of their individual country effects. This approach also allows for further analysis to be based on the much broader group of 111 diverse world countries conditional upon the availability of the considered growth factors and the educational expenditures as the latter can be used to estimate the theoretical PISA test results even in the countries where the test itself has not been organized. It should be mentioned though, that all the following results have been verified also in the initial group of countries that have undergone the PISA testing and the results are consistent. To estimate Eq. (9), a total of 1,731 observations in the period 1994–2015 are used. In Eq. (9), we apply the PISA scores lagged by a decade (i.e., we use
The theoretical model displayed an output distance between the country's leader and follower. If the follower increases the quality of education, then in a steady state he catches up with the leader and grows at the same pace as the leader. This corresponds to the idea of a cointegrating relationship which is a simultaneous long-run positive feedback loop. For cointegration identification purposes, the lagged GDP value has its coefficient normalized to 1. This relates to the long-run equilibrium (Table 5, upper panel), in which the quality of education, as captured by all classes of PISA scores, has a significantly positive impact on growth. Therefore, this confirms the main hypothesis. Therefore, a significant coefficient in the long-run cointegrating equation means that whenever the steady-state level of GDP in the follower changes due to an increase in education quality, it follows that this positive spillover impacts the leader as well. Therefore, improvement in the quality of education brings increases in the level of GDP, and the follower, as well as the leader, benefit from this improvement. The obvious rationale behind this result is that the flow of technical thought and human capital in an open economy spillover from the leader to follower and back. Success stories of catching up in terms of development require the absorption of technological expertise from abroad in ways that correspond approximately to our theoretical framework.
Panel VECM dynamic fixed-effects estimates
Long-run | |||||
OPEN | 0.0043*** | 0.0044*** | 0.0041*** | 0.0042*** | |
(0.0014) | (0.0014) | (0.0014) | (0.0014) | ||
FDI (% GDP) | −0.0029 | −0.0029 | −0.0027 | −0.0028 | |
(0.0027) | (0.0028) | (0.0026) | (0.0027) | ||
L10PISAAVE | 0.0221*** | ||||
(0.0085) | |||||
L10PISAREAD | 0.0164** | ||||
(0.0071) | |||||
L10PISAMAT | 0.0236*** | ||||
(0.0084) | |||||
L10PISASCIE | 0.0258*** | ||||
(0.0094) | |||||
Short-run | |||||
Error Correction | −0.0316*** | −0.0312*** | −0.0327*** | −0.0321*** | |
(0.0049) | (0.0050) | (0.0049) | (0.0049) | ||
ΔOPEN | 0.0002** | 0.0002** | 0.0002** | 0.0002** | |
(0.0001) | (0.0001) | (0.0001) | (0.0001) | ||
ΔFDI (% GDP) | 0.0001* | 0.0001* | 0.0001* | 0.0001* | |
(0.0001) | (0.0001) | (0.0001) | (0.0001) | ||
ΔL10PISAAVE(−1) | −0.0007 | ||||
(0.0007) | |||||
ΔL10PISAREAD(−1) | −0.0008 | ||||
(0.0005) | |||||
ΔL10PISAMAT(−1) | −0.0001 | ||||
(0.0009) | |||||
ΔL10PISASCIE(−1) | −0.0008 | ||||
(0.0007) | |||||
Constant | 0.0018 | 0.0780 | −0.0184 | −0.0517 | |
(0.1130) | (0.0992) | (0.1136) | (0.1247) |
VECM, vector error correction model.
In contrast, we cannot confirm that the increases in education quality bring a temporary acceleration of economic growth given the result (Table 5, lower panel) that short-run increases in the quality of education do not significantly impact transitory growth to the steady state, though as indicated above, in the steady state, the quality of education has a positive effect on the level of income.
In addition, we find that external openness as captured by trade to GDP is significant and positive both in the short and in the long run. Besides, the FDI variable impacts transitionary dynamics in the short run (openness to FDI increases the speed of attaining the steady state), which is in line with the theoretical model. Besides, to test the robustness of the specification, we have carried out DOLS and FMOLS estimations that provide further evidence in support of the DFE results (omitted due to lack of space). We can interpret the significant variables as an increase in steady-state growth. Therefore, we need both crucial factors to make this growth happen. First, we need country openness in terms of trade, second, we need education quality.
Lastly, having found that the variables are first-difference stationary, an attempt can be made to identify the causal direction of the relationship between them. Information about the exact direction of the causal link enables a more nuanced discussion of the policy implications of the findings – otherwise, it is difficult to clearly state that there exists any “impact” of the quality of education on the GDP growth and formally the only thing that can be confirmed is their co-variation.
Two approaches are performed and described in Table 6. We provide the results of tests under two sets of assumptions: imposing parameter equality across countries (“common coefficients”) and allowing for their differences, as in Dumitrescu and Hurlin (“individual coefficients”).
Panel VECM Granger causality
PISAAVE does not cause Log(GDP) | 127.347 | 0.000 | 9.534 | 15.982 | 0.000 |
LOG(GDP) does not cause PISAAVE | 17.455 | <0.001 | 4.692 | 3.111 | 0.002 |
PISAREAD does not cause Log(GDP) | 22.825 | 0.000 | 1.482 | −2.938 | 0.003 |
LOG(GDP) does not cause PISAREAD | 1.740 | 0.157 | 1.704 | −2.109 | 0.035 |
PISAMAT does not cause Log(GDP) | 83.056 | 0.000 | 9.422 | 15.683 | 0.000 |
LOG(GDP) does not cause PISAMAT | 11.649 | <0.001 | 5.699 | 5.788 | <0.001 |
PISASCIE does not cause Log(GDP) | 214.826 | 0.000 | 11.419 | 20.993 | 0.000 |
LOG(GDP) does not cause PISASCIE | 23.700 | <0.001 | 7.167 | 9.690 | <0.001 |
VECM, vector error correction model.
As could be expected, the results strongly indicate a bi-directional causal link between education quality and economic growth that can be thought of as a positive feedback loop: increases in the quality of education impact the level of development, and the level of development positively increases the quality of education resulting in a positive synergy effect. This is an important result, which on the one hand confirms that the conclusions achieved in the milestone articles in the field that point to the existence of the influence of education quality on GDP growth are indeed correct in the qualitative sense, while on the other hand – suggests that the quantitative results should be taken very carefully: unless the techniques used take care of the bilateral relation of the dependent and the independent variables, the quantitative interpretation of the results may be misleading.
The notion that quality of education is important is prevalent in the educational literature and there is an overwhelming body of knowledge to support that claim. However, the methods used to obtain the results that confirm the relevance of the quality of education for the GDP growth that can be found in the literature are often excessively simple compared to the current state of the art. Most notably, they do not allow answering whether increases in education quality cause increases in economic growth allowing poorer countries to catch up. To answer this question, we propose a formal theoretical model using Nelson and Phelps [1966] classic dynamics by introducing differences in education quality (proxied by students' performance on the PISA test) in a leader-follower type of growth model to formulate several hypotheses regarding the analyzed education quality, external openness and economic growth nexus.
A panel DFE model has been estimated to examine in both the short run and the long run in our model. The results can be interpreted on a Granger-type causality basis. Education quality as captured by various PISA skills was found to have a positive significant impact on economic growth in the long run while no significant short-run relationship between these variables was observed. External openness was found to increase economic growth in the short run, while in the long run, only openness to trade was significantly positive.
The results discussed here should be viewed as a formal backing of the frequently expressed notion that expenditures on education, bring about the increased output that may finance the reform. The return is not immediate, however, but coincides with the external openness of countries – the open countries can use their increasing quality of education to catch up with the technological leaders. In this context, the countries that want to provide sustainable economic growth coping with technological leaders should allocate more resources to the quality of education. Importantly, the results are based on the quality – not the quantity of education, which is a confirmation of a key role in the way that education processes are planned, organized, and administrated. While considered in pair with numerous results from the literature that do not confirm the important role of the average educational attainment or the average years of schooling, can also be treated as an important voice in the discussion whether from the economic point of view the governments should concentrate on massive scale tertiary education, perhaps covering its costs for all the interested students or, instead, invest in the quality of the primary and secondary education. There is more evidence for the long-run relevance of the latter.
Having reviewed the field one can find out that most of the authors make use of quite simple research methods, while reverse causality and the omitted variable bias seem to be the biggest threat. Considering the importance of the research problem, it is worthwhile to adopt a more up-to-date research methodology, which would allow for validation of the long-standing argument justifying the value of high-quality education. To the best of our knowledge, no co-integration analysis is present in the literature in the field. The latter seems to be particularly important when time-series (or panel) data from different countries are used. With the use of country-level panel data, we consider the possible lag that characterizes this education-growth nexus and use statistical methods to address the risk of reversed causality of economic performance affecting the quality of education, and we extend the model by inclusion of other potential growth factors. This we believe is the main value-added aspect of this paper. Obviously, attempts to modernize the econometric approach by developing further and further technical innovations should not be viewed as a target in itself. However, as noticed by Altman [2020], most authors neglect the fact that basing the research and drawing conclusions about relations simply basing on the test of significance might be misleading: not only can the regression be spurious but also the “influence” cannot be treated as a simple consequence of the existence of correlation which seems to be the case in most papers that explore the GDP-education nexus, including the early Hanushek and Woessmann's milestone analysis.
The main limitation of the study is data availability. While there is no problem with the availability of the macroeconomic variables used in the study, the seven waves of PISA test that have been carried out so far (2000, 2003,…, 2015, 2018) is already a substantial sample, it still requires extrapolation to base the study with lagged PISA results on a reasonable number of observations. This problem is diminishing over time as there is a new wave every three years and the fraction of the sample that is extrapolated – not just collected – is decreasing. This of course also means the possibility to apply more estimators than just the DFEs, which gives a chance to attain more robustness checks and verify the stability of the results to an even bigger extent. On the other hand, there is a natural limitation with the measurement of the quality of education. While it is difficult to point out a better cross-country education quality measurement tool than the PISA scores, there exists a wave of criticism against PISA. The critiques raise different arguments, whose summary can be found in Zhao [2020], however, while criticizing the PISA itself, no superior alternatives are proposed by the opponents of the OECD's initiative.
The results of our formal theoretical and empirical analysis confirm the huge relevance of earlier education quality for GDP growth. Furthermore, we have shown that the changes in education quality shall have effects on the GDP growth only on a longer time horizon. In policy terms, this is an important voice in the discussion on the role of education and provides an additional argument for directing a stream of investment toward education. This seems particularly important since politicians may find it tempting to limit educational expenditures to assign more resources to other sectors that have suffered during the COVID-19 epidemic. This in turn may have an effect on the quality of education and further – on the rate of economic growth in the future. Thus, the true threat could be associated with the fact that the positive effects of increased education quality are not immediate, while the same is true in reverse – cutting education expenditures may hurt growth only in the long run. Lowering educational expenditures may result in negative selection in teaching professions, lower workforce motivation, or worse studying conditions – all these effects are in the long term. Since politicians are usually focused on immediate electoral success, they might be willing to reduce educational expenditures motivating this policy with a lack of instantly observable negative consequences.