1. bookAHEAD OF PRINT
Journal Details
License
Format
Journal
eISSN
2444-8656
First Published
01 Jan 2016
Publication timeframe
2 times per year
Languages
English
access type Open Access

Research on identifying psychological health problems of college students by logistic regression model based on data mining

Published Online: 20 May 2022
Volume & Issue: AHEAD OF PRINT
Page range: -
Received: 14 Mar 2022
Accepted: 10 Apr 2022
Journal Details
License
Format
Journal
eISSN
2444-8656
First Published
01 Jan 2016
Publication timeframe
2 times per year
Languages
English
Abstract

With the popularisation of education, the number of college students is increasing day by day, and there are also more students with psychological health problems. Whether students’ psychological abnormalities can be detected in time is one of the main problems faced by colleges and universities at present. Adopting digital technology to mine, collect and analyse the data generated by psychological health education in colleges can effectively solve the dynamic development of students’ psychological health problems. Therefore, in this paper, the psychological health problems of college students are identified and classified by establishing an improved logistic regression model. The behaviour characteristics are quantified and the differences are combined according to students’ relationships with their classmates, life rules and economic conditions. The test results show that the regression effect of the model is excellent, which can identify college students’ psychological health problems and improve the intervention and treatment of educators on students’ psychological problems.

Keywords

Introduction

With rapid development of the knowledge economy and the popularisation of higher education, the number of college students is increasing day by day, and there are also more students with psychological problems. At present, the main means of investigating students’ psychological health problems in colleges is through paper or online questionnaires. For the convenience of later tracking, students are generally required to leave more detailed personal information. However, due to considerations of personal privacy, many students are worried that they will be given special treatment or be labelled especially when filling out the questionnaire, which cannot objectively reflect the true psychological status, and leads to false information in most feedback [1,2,3]. In psychological research, big data technology has had a profound impact on the research logic, research methods and research tools. In the traditional research of psychological health education, the difficulty of data statistics limits the in-depth development of related research to a certain extent [4]. The arrival of big data technology has expanded thinking, innovated research platform and easily solved the problem of untoward data collection. Therefore, it is urgent to mine, collect and utilise the data generated by psychological health education in colleges, and combine big data technology with psychological health education in colleges and universities.

At present, researchers have made an in-depth analysis of the source, causes and countermeasures of college students’ psychological health, which is of vital significance for finding the abnormal psychological problems of college students in time and providing scientific theoretical support for their psychological health education [5,6,7]. At the same time, all colleges and universities are trying to equip full-time psychology teachers to undertake the tasks of teaching and psychological consultation of psychological health courses. Although they attach great importance to psychological health education, owing to the lack of professionals, digital technology cannot be effectively used, and thus psychological health education cannot be implemented efficiently [8, 9]. In addition, students’ psychological health is a process of dynamic development. However, there are problems such as insufficient attention to evaluation, weak teachers and imperfect evaluation system in most colleges [10], so it is difficult to capture the dynamic psychological development and changes in students during their study. Meanwhile, with the help of digital technology, under the condition of ensuring the scientificity, stability and sensitivity of indicators, establishing a dynamic identification and evaluation system for students’ psychological problems can effectively promote the development of college students’ psychological health.

Logical regression model of students’ psychological health problems

In the psychological health identification model, on the basis of analysing and summarising numerous data of students, deeper information and features are extracted, which is to identify whether students have a tendency of psychological abnormality through the data of students’ behaviour and other data in college, thus guiding the direction for college teachers and psychological education staff.

The process of data mining

Data mining is a complicated process. Through a series of calculations in a large amount of data, it is necessary to find the potential and hidden internal relations in the data and extract valuable information. The specific process is shown in Figure 1:

Selection and preparation of data set. According to the actual demand, select the initial data set and collect as much relevant data as possible. The more relevant the data, the higher the accuracy, but the amount of calculation will also increase [11].

Data preprocessing. The collected data may contain some isolated points and discrete points, so it is necessary to preprocess the data to make it useful. Methods of data preprocessing include data cleaning, transformation, integration and specification [12].

Data mining. Select appropriate methods and models for data mining, and find the intrinsic relationship between data and extract hidden valuable information, according to the data characteristics after data preprocessing and the actual needs of users,

Analysis and application of results. Analyse the obtained results in connection with the actual situation, and if it is not applicable to the actual situation, the above steps need to be repeated.

Fig. 1

Data mining process

Logistic regression model

Logistic regression is a kind of data mining technology. For a given data set, it can get a nonlinear model through linear model transformation to predict the actual test value as much as possible and output it [13, 14]. Actually, it is a classification model. Compared with other algorithms, logical regression has a simple form, strong interpretability and fast training speed, whose main idea is to fit the decision boundary as much as possible and output the predicted value. On the basis of linear regression, that is, for multivariate input X = (x1, x2, …, xn)T, the linear expression of predicted output y is as follows: f(x)=w1x1+w2x2++wnxn+b f(x) = {w_1}{x_1} + {w_2}{x_2} + \ldots + {w_n}{x_n} + b

Sigmoid function is usually selected as the mapping function, and its expression is: g(z)=11+ez g(z) = {1 \over {1 + {e^{ - z}}}}

Introduce Formula (1) so that: g(x)=11+e(wTx+b) g(x) = {1 \over {1 + {e^{ - \left( {{w^T}x + b} \right)}}}}

The value of g(x) is: g(x)={1,wTx+b>00.5,wTx+b=00,wTx+b<0 g(x) = \left\{ {\matrix{ {1,} \hfill & {{w^T}x + b > 0} \hfill\cr {0.5,} \hfill& {{w^T}x + b = 0} \hfill\cr {0,} \hfill& {{w^T}x + b < 0} \hfill\cr } } \right.

It can be seen that through sigmoid the function can define the domain as R and any real number is mapped between [0, 1], and it can be judged whether the sample belongs to the positive or negative class by judging whether the value of g(x) is larger than the threshold value of 0.5. Different thresholds can be adjusted according to the actual situation, and logarithmic conversion is usually obtained by: g(x)lng(x)1g(x)=wTx+b g(x)\ln {{g(x)} \over {1 - g(x)}} = {w^T}x + b

Regarding g(x) as the probability that x is a positive example, then 1 − g(x) represents the probability of being a counterexample. Therefore, the classification problem using logistic regression algorithm is finally transformed into finding the best solution for w. And through the loss function of the model, that is, the gap between the predicted value and the actual value, the classification result of the model is optimised. When solving the parameters in the model, the gradient descent method is used to solve the minimum value of loss function, which is simple to implement.

Due to the serious data imbalance between normal and abnormal samples, classification without other operations will affect the results of the model [15]. Therefore, before the dichotomy, it is necessary to balance the data, and combine their features with the differences.

The samples of normal students are marked as negative examples, whose code is 0, and the samples of abnormal students are marked as positive examples, whose code is 1. Before training the model, in order to avoid the influence of dimensional units between feature samples, the data should first be standardised. Data standardisation refers to scaling all feature data to a fixed range to avoid the influence of different units and different numerical values when training the model. Similarly, because there is a large numerical span in the features selected in this experiment, in order to eliminate the influence of numerical value on the classification model, the data should be normalised at first. That is, the result is mapped to [0–1] by linear transformation.

For sample X1, X2, Xn, the maximum value is max (Xi) and the minimum value is min (Xi), whose formula is: x*=Ximin(Xi)max(Xi)min(Xi) {x^*} = {{{X_i} - \min ({X_i})} \over {\max ({X_i}) - min({X_i})}}

According to this formula, the range of sample data can be compressed to [0,1], so as to eliminate the differences among different feature samples and avoid the influence of numerical dimension among feature samples.

Improved logistic regression model

The logistic regression model has an important influence on the results of sample classification, so the parameters of the logistic regression model are optimised. According to the transformation of sigmoid function, the assumed function is: x*=Ximin(Xi)max(Xi)min(Xi) {x^*} = {{{X_i} - \min ({X_i})} \over {\max ({X_i}) - min({X_i})}}

Among them, θ is the parameter variable, x is the eigenvector. In logistic regression, loss function is generally used to measure the accuracy of classification results. That is, according to a given sample {(x1,y1) … (xi,yi)}, the degree of deviation between the value of predicted samples ŷi and the value of the actual sample yi are obtained by the loss function, that is, the error between the measured actual value and the predicted value. The logistic regression loss function here is defined as: J=1mi=1mL(y^i,yi)=1mi=1m[yilog(y^i)+(1yi)log(1y^i)] J = {1 \over m}\sum\limits_{i = 1}^m L({\hat y_i},{y_i}) = - {1 \over m}\sum\limits_{i = 1}^m \left[ {{y_i}\log ({{\hat y}_i}) + (1 - {y_i})\log (1 - {{\hat y}_i})} \right]

To prevent the model from over-fitting, a regularisation term is added to the loss function and is represented as: x*=J=1mi=1m[yilog(y^i)+(1yi)log(1y^i)]+λ2mj=1λθj2 {x^*} = J = - {1 \over m}\sum\limits_{i = 1}^m \left[ {{y_i}\log ({{\hat y}_i}) + (1 - {y_i})\log (1 - {{\hat y}_i})} \right] + {\lambda \over {2m}}\sum\limits_{j = 1}^\lambda \theta _j^2 where λ is the regularisation parameter, the gradient descent method is used to solve the parameter θ, and the learning rate α is used to control the step size of gradient descent. The iterative update formula is: θ0=θ0αmi=1m(h0(xi)yi)x0(i) {\theta _0} = {\theta _0} - {\alpha \over m}\sum\limits_{i = 1}^m \left( {{h_0}({x_i}) - {y_i}} \right){x_{{0_{(i)}}}} θj=θj(1λam)αmi=1m(hθ(xi)yi)xi(j),j=1,2,n {\theta _j} = {\theta _j}\left( {1 - {{\lambda a} \over m}} \right) - {\alpha \over m}\sum\limits_{i = 1}^m \left( {{h_\theta }({x_i}) - {y_i}} \right)x_i^{(j)},\quad j = 1,2, \ldots n

Introducing features into the model: hθ(x)=11+e(θ0+θ1x1+θ2x2+θ3x3+θ4x4) {h_\theta }(x) = {1 \over {1 + {e^{ - \left( {{\theta _0} + {\theta _1}{x_1} + {\theta _2}{x_2} + {\theta _3}{x_3} + {\theta _4}{x_4}} \right)}}}} the gradient value is obtained by gradient descent method, and the solution is θ = [− 3.294, 5.145, − 0.042, 2.238, 3.479]T. That is, according to the optimised parameters, the prediction model for identifying students with abnormal psychology is constructed as follows: hθ(x)=11+e(3.294+5.145x10.042x2+2.238x3+3.479x4) {h_\theta }(x) = {1 \over {1 + {e^{ - \left( { - 3.294 + 5.145{x_1} - 0.042{x_2} + 2.238{x_3} + 3.479{x_4}} \right)}}}}

In this paper, the model updated with the above parameters is used to predict students’ psychological problems, in which x1, x2, x3, x4, and y, respectively, represent the number of friends, sleep condition, economic condition, academic achievement and sample category.

Feature extraction of students’ psychological health problems

In the mining of college education data, experts and scholars often pay attention to the characteristics related to students’ achievements, ignoring students’ psychological activities [16]. Therefore, based on the five-factor model [17,18], this paper extracts and quantifies the behavioural characteristics of students’ psychological health problems on the basis of a large amount of original data. By referring to psychology, pedagogy and other related knowledge, indicators are established to extract students’ characteristics, and perfect and complete behavioural characteristics are constructed relatively to measure students’ psychological health problems. Based on the characteristics of responsibility and extraversion of the five-factor model, the behaviour of students is quantified from three perspectives: students-classmate relationship, regularity and economic situation.

Classmate relationship

Most of the students’ studies in colleges and universities are closed or semi-closed, so their health problems can be reflected through their classmates. The co-occurrence times of two students are directly proportional to the probability of becoming friends, therefore, association rules can be used to determine the classmate relationship.

In order to obtain students’ friends at college, it is necessary to calculate the co-occurrence data set between pairs of students, which is represented as: U={(i,j,τ(tj)|i,ju1,2,N)} U = \{ (i,j,\tau (t \cup j)|i,j \in {u_{1,2, \ldots N}})\} where i and j represent students, τ(ij) and refers to the number of simultaneous occurrences of students i and j.

In order to eliminate contingency, if the co-occurrence times of student A and student B are greater than T, the two students are considered as friends. Considering that the threshold value of each location is related to the total number of times, it is defined as: Til=sumilαl T_i^l = {{{\rm{sum}}_i^l} \over {{\alpha ^l}}}

Among them, for students i in position l the threshold of co-occurrence friends at, sumil {\rm{sum}}_i^l refers to students i in position l the total number of occurrences, αl. For location l the ratio coefficient, remember students. i and j the co-occurrence of times are τ(ij), when τ(ij) > Ti. When, think students j are i in position l friends, recorded as G(ij). On the basis of co-occurrence data pairs, use association rules to determine the scale coefficient. αl, which is used here with ij.

Form to explore the relationship, that is ij, if there is the association rules, the students i and j have good friends. Support is defined here. S(ij) and confidence. C(ij), the support degree is:

where Til T_i^l is the threshold of student's i the co-occurrence friends at position l, sumil {\rm{sum}}_i^l is the total number of times student i appears at position l, αl is the ratio coefficient at position l, and the co-occurrence times of student i and j are τ(ij). When τ(ij) > Ti, student J is considered to be a friend of i at position l and is denoted as G(ij). On the basis of co-occurrence data pairs, association rules are used to determine the proportional coefficient αl. Here, the relationship is mined by using the form ij, that is, if there is an association rule ij, then students i and j have a friend relationship. The support degree S(ij) and confidence degree C(ij) are defined here, and the support degree is: S(ij)=τ(ij)D,ij S(i \to j) = {{\tau (i \cup j)} \over D},\quad i \ne j

Confidence is defined as: C(ij)=τ(ij)τ(i),ij C(i \to j) = {{\tau (i \cup j)} \over {\tau (i)}},\quad i \ne j

Among them D, indicates the total number of credit card records in the whole data set, τ(ij) for students i and j. The number of co-occurrences of, τ(i) are for students i. All credit card record entries of support degree S(ij) represent students i and j Frequency and confidence of simultaneous occurrence in all data sets C(ij) can be used to indicate students i and j the degree of relevance. Support and confidence are important indicators in social relationship association analysis. Association rules with low support are mostly random co-occurrence and generally meaningless.

Where D represents the total number of credit card records in the entire data set, τ(ij) is the number of co-occurrences of students i and j, and τ(i) is all credit card records of student i. Support degree S(ij) represents the frequency of simultaneous occurrence of students i and j in all data sets, and confidence degree C(ij) can be used to represent the degree of association between students i and j. Support degree and confidence degree are important indicators in social relationship association analysis. Association rules with low support degree are mostly from random co-occurrence and are generally meaningless.

Rules of college life

The regularity of students’ behaviour reflects their self-discipline and orderliness in college. Students with strong regularity have better binding force, and the regularity of students’ behaviour is closely related to their academic performance [19,20,21]. Therefore, it can be considered that students with strong behavioural regularity can arrange their own plans in life. In this paper, the regularity of students’ behaviour is mainly quantified, such as eating and bathing, and whether there are significant differences between students with abnormal psychology and normal students in the regularity of students’ campus life behaviours, are explored. Shannon entropy is used to calculate and measure, and its definition is as follows: H=ip(i)logp(i) H = - \sum\limits_i p(i)\log p(i)

Here p(i) is said to be the first in the specified unit time i. In this experiment, 24 h a day is defined as the total time period, and students’ daily behaviour data is divided into 48 time series (30 minutes is a time series). For example, the behaviour data of a student in a certain period of time is {2021-12-13, 10:05; 2021-12-14, 12:23; 2021-12-15, 11:32; 2021-12-16, 11:23} and its chronological order is in the information entropy of {22, 24, 23, 22}.

where p(i) represents the probability of doing an activity in the ith time period within a specified unit time. In this experiment, 24 h a day is defined as the total time period, and students’ daily behaviour data is divided into 48 time series (30 minutes is a time series). For example, the behavioural data of a student in a certain period is {2021-12-13, 10:05; 2021-12-14, 12:23; 2021-12-15, 11:32; 2021-12-16, 11:23}, and its time sequence is listed as the information entropy of {22, 24, 23, 22,}. The result calculated by the formula is 1.58 Bit, in statistical informatics. Shannon entropy is used to measure the uncertainty of information [22], so it is used to measure the regularity of students’ behaviour. In this model, entropy is used to measure the regularity of three meals and fetching water. In the data set, all students have restaurant consumption records; about 96% of the students have a record of drawing water from the dormitory. The behaviour of fetching water usually occurs in the evening, and the time of washing face and rinsing mouth can indirectly reflect the bedtime of students, so these two high-frequency behaviours can objectively measure the regular habits of students. The greater the entropy of a student in a time period, the higher is the uncertainty and the lower the regularity.

Economic condition

Studies [23] have shown that the psychological status of college students is influenced by the family economic situation, and many students from poor families will suffer from long-term inferiority and depression; also they seem to be unwilling to communicate with others and other behaviours. However, due to the limitation of conditions, it is impossible to know the family status of students’ original families. In order to explore whether there are significant differences in the college performance between psychologically normal and abnormal students, in this paper, students’ financial situation is measured from the students’ financial aid and their consumption at college.

Given the time and amount of students’ consumption each time, first of all, the annual consumption data of 2,000 students (436,044 items in total) were processed and analysed, and the statistics of the average consumption of two kinds of samples between one semester and 1 year showed that there was no obvious difference in data distribution. Through Wilcoxon S hypothesis test, the zero hypothesis is as follows: There is no significant difference in consumption level between normal students and psychologically abnormal students. The alternative hypothesis is that there are significant differences in consumption level between normal students and students with psychological disorders. It is found that under the condition of 0.05 confidence, the consumption level of the two groups is P = 0.07>0.05, which means that there is no significant difference between them. In addition, a comparative analysis of the students’ financial aid in college has been made, where no difference is found.

Evaluation of the model on students’ psychological health problems
Evaluation index

According to the above characteristics, the quantitatively extracted sample data is introduced into the improved logistic regression model obtained before, which can effectively reflect the psychological health problems of college students. The evaluation index of this model is defined as:

Accuracy, that is, the proportion of correctly predicted samples to all samples. In the confusion matrix, T P and T N represent positive and negative samples, respectively, the formula for calculating the accuracy rate is: Accuracy=TP+TNTP+FP+TN+FN {\rm{Accuracy}} = {{TP + TN} \over {TP + FP + TN + FN}}

Precision, which refers to the proportion of correctly predicted positive samples (T P) to the positive samples (T P and FP), the calculation formula is: Precision=TPTP+FP {\rm{Precision}} = {{TP} \over {TP + FP}}

Recall represents the proportion of correctly predicted positive samples (T P) to the total of positive samples (T P and FP), which also use T Prate to indicate the formula as: Recall=TPrate=TPTP+FN {\rm{Recall}} = T{P_{rate}} = {{TP} \over {TP + FN}}

In this model, for the positive sample in the original data tag, if it is predicted as a positive example, the prediction result of this sample is a True Positive example, or TP in short. On the contrary, if it is predicted as a negative example, the result is a False Negative example. Similarly, a False Positive example (FP) means that a negative example sample in the original label is predicted, and a True Negative example (T N) is to predict a negative example sample in the original label.

Evaluation results

The original sample ratio of this model is about 10:1, as shown in Figures 24; if it is directly used to train the model, the logistic regression effect of the sample data is poor, but with the approaching of positive and negative proportion, the accuracy, precision and recall of the sample are better. When the positive-negative ratio of the sample is 1:2, the model regression effect is the best, with the accuracy, precision and recall reaching 76.2% and 83.4%, respectively. At the same time, when the positive–negative ratio of the sample is less than 1:2, the regression effect of the model is somewhat reduced.

Fig. 2

Comparison of accuracy

Fig. 3

Comparison of precision

Fig. 4

Comparison of recall

In addition, it is necessary to identify students with psychological disorders in the shortest time. For this reason, the data sets of one semester, 1 year, are selected for comparison. The data set used in this model is for students in the second semester of sophomore year (one semester), sophomore year (1 year) and college stage (freshman to junior year; there is no data for senior students here, because there are few exam classes for senior students in our college, and most of the students are in internship outside during their senior years). Feature training model as previously described is used to classify these data.

The recall rate indicates the proportion of the number of samples correctly predicted as positive cases in the sum of all samples divided into positive cases. The purpose of the experiment in this paper is to identify students with abnormal psychology among all students, that is, to identify a relatively large number of positive examples, so recall rate is an important evaluation index for this model. The higher is the recall rate, the more students with abnormal psychology are identified by the model. The results show that the improved logistic regression model performs well when adopting all 3-year data. In addition, the recall rate obtained by using 1 year's data is slightly lower than that obtained by using all the data. Combined with the classification accuracy, it can be considered that it takes at least 1 year's student behaviour data to identify students with psychological disorders, and the more data used, the better the robustness of the logistic regression model.

Conclusion

Researching the dynamic evaluation of college students’ psychological health based on digital technology can provide new research methods and theoretical practice for the follow-up psychological health education in colleges and universities. Therefore, based on the logistic regression model in data mining, by combining the characteristics of responsibility and extraversion of personality theory, students-classmates relationship, life rules and economic condition are quantified. Then, through the differential combination of data about students’ behaviour characteristics, an improved logistic regression model is constructed to classify and predict students’ psychological health problems. Finally, accuracy, precision and recall are used as the evaluation indexes of the model. The results show that when the positive–negative ratio of the sample is 1:2, the regression effect of the model is best with the accuracy, precision and recall reaching 76.2%, 83.4% and 86.6%, respectively. Meanwhile. more than 1 year's data of students’ behaviour is more conducive to identifying students’ psychological abnormalities, and the more data adopted, the better the robustness of the logistic regression model.

Fig. 1

Data mining process
Data mining process

Fig. 2

Comparison of accuracy
Comparison of accuracy

Fig. 3

Comparison of precision
Comparison of precision

Fig. 4

Comparison of recall
Comparison of recall

Lu Xin-en, Chen Huiling. Research on prediction model of personality behavior information processing in big data environment [J]. Information Science, 2019(10):108–113. (in Chinese). LuXin-en ChenHuiling Research on prediction model of personality behavior information processing in big data environment [J]. Information Science 2019 10 108 113 (in Chinese). Search in Google Scholar

Union I T. The World in 2014:ICT Facts and Figures [M]. Geneva, Switzerland: International Telecommunication, 2014. Union I T The World in 2014:ICT Facts and Figures [M]. Geneva, Switzerland International Telecommunication 2014 Search in Google Scholar

Pan Mingyun. Research on the integration of college students’ life education and psychological health education [J]. Journal of Qiqihar University (Philosophy and Social Sciences Edition), 2018,(09):164–166. (in Chinese). PanMingyun Research on the integration of college students’ life education and psychological health education [J]. Journal of Qiqihar University (Philosophy and Social Sciences Edition) 2018 09 164 166 (in Chinese). Search in Google Scholar

Zhuang Meijin. Innovative research on psychological health education of college students from the perspective of big data [J]. Journal of Heilongjiang Ecological Engineering Vocational College, 2019,(05):104–107. (in Chinese). ZhuangMeijin Innovative research on psychological health education of college students from the perspective of big data [J]. Journal of Heilongjiang Ecological Engineering Vocational College 2019 05 104 107 (in Chinese). Search in Google Scholar

Morelli S A, Ong D C, Makati R, et al. Empathy and well-being correlate with centrality in different social networks [J]. Proc Natl Acad Sci U S A, 2017, 114(37):9843–9847. MorelliS A OngD C MakatiR Empathy and well-being correlate with centrality in different social networks [J]. Proc Natl Acad Sci U S A 2017 114 37 9843 9847 10.1073/pnas.1702155114 Search in Google Scholar

Zhang Jiaming, Wang Chunjing. Research on early warning of college students’ psychological crisis based on big data technology [J]. Education and Occupation, 2015. (30):75–77. (in Chinese). ZhangJiaming WangChunjing Research on early warning of college students’ psychological crisis based on big data technology [J]. Education and Occupation 2015 30 75 77 (in Chinese). Search in Google Scholar

Peng Jinxiang. Research on early warning of college students’ psychological crisis based on big data technology [J]. Digital Technology and Application, 2019,(10):210–212. (in Chinese). PengJinxiang Research on early warning of college students’ psychological crisis based on big data technology [J]. Digital Technology and Application 2019 10 210 212 (in Chinese). Search in Google Scholar

Chordiya M, Bagal S B. Comparative Research of Clustering Algorithms for Prediction of Academic Performance of Students [C] International Journal of Engineering Research & Technology. ESRSA Publications, 2015. ChordiyaM BagalS B Comparative Research of Clustering Algorithms for Prediction of Academic Performance of Students [C] International Journal of Engineering Research & Technology ESRSA Publications 2015 Search in Google Scholar

Liang Juan, Luo Haizhu. Application of Big Data Mining Method in College Students’ Psychological Early Warning System [J]. Chinese College Health, 2018, 39 (12): 18211824. (in Chinese). LiangJuan LuoHaizhu Application of Big Data Mining Method in College Students’ Psychological Early Warning System [J]. Chinese College Health 2018 39 12 18211824. (in Chinese). Search in Google Scholar

Geng Zhiming. Analysis of the status quo and solutions of psychological health education function of university library students [J]. Education Modernization, 2019, 6(64):297–298. (in Chinese). GengZhiming Analysis of the status quo and solutions of psychological health education function of university library students [J]. Education Modernization 2019 6 64 297 298 (in Chinese). Search in Google Scholar

Wang J, Han J, Lu Y, et al. TFP: An Efficient Algorithm for Mining Top-K Frequent Closed Item sets [J]. IEEE Transactions on Knowledge & Data Engineering, 2005, 17(5):652–664. WangJ HanJ LuY TFP: An Efficient Algorithm for Mining Top-K Frequent Closed Item sets [J]. IEEE Transactions on Knowledge & Data Engineering 2005 17 5 652 664 10.1109/TKDE.2005.81 Search in Google Scholar

Hao Linqian. Analysis of data mining algorithm based on association rules [J]. Journal of Taiyuan University (Natural Science Edition), 2020, 38(03):42–45. (in Chinese). HaoLinqian Analysis of data mining algorithm based on association rules [J]. Journal of Taiyuan University (Natural Science Edition) 2020 38 03 42 45 (in Chinese). Search in Google Scholar

Mo X, Zeng Y. The relationship between ovarian endometriosis and clinical pregnancy and abortion rate based on logistic regression model [J]. Saudi Journal of Biological Sciences,2020, 27(1):561–566. MoX ZengY The relationship between ovarian endometriosis and clinical pregnancy and abortion rate based on logistic regression model [J]. Saudi Journal of Biological Sciences 2020 27 1 561 566 10.1016/j.sjbs.2019.11.021 Search in Google Scholar

Heba M, Chlebus M. Impact of using industry benchmark financial ratios on performance of bankruptcy prediction logistic regression model [J]. Working Papers, 2020 (7):633–639. HebaM ChlebusM Impact of using industry benchmark financial ratios on performance of bankruptcy prediction logistic regression model [J]. Working Papers 2020 7 633 639 Search in Google Scholar

Tang Huai. Campus personalized learning resource recommendation system based on logistic regression model [J]. Electronic Technology and Software Engineering, 2019(22):164–165. (in Chinese). TangHuai Campus personalized learning resource recommendation system based on logistic regression model [J]. Electronic Technology and Software Engineering 2019 22 164 165 (in Chinese). Search in Google Scholar

Ouyang Wenzhen. Influence of interpersonal relationship training on college students’ psychological health [J]. chinese psychological health journal, 2000. (in Chinese). OuyangWenzhen Influence of interpersonal relationship training on college students’ psychological health [J]. chinese psychological health journal 2000 (in Chinese). Search in Google Scholar

Sing C P, Love P E D, Fung I W Hetc. Personality and occupational accidents: Bar benders in Guangdong Province, Shenzhen, China [J]. Journal of Construction Engineering & Management, 2014, 140(7): 4005. SingC P LoveP E D FungI W H etc. Personality and occupational accidents: Bar benders in Guangdong Province, Shenzhen, China [J]. Journal of Construction Engineering & Management 2014 140 7 4005 10.1061/(ASCE)CO.1943-7862.0000858 Search in Google Scholar

Yvonne Wang. Research on senior high college English teaching from the perspective of Big Five Personality Theory [D]. Xihua Normal University, 2016. (in Chinese). YvonneWang Research on senior high college English teaching from the perspective of Big Five Personality Theory [D]. Xihua Normal University 2016 (in Chinese). Search in Google Scholar

Schulz C, Nocaj A, Goertler J, et al. Probabilistic Graph Layout for Uncertain Network Visualization [J]. IEEE Transactions on Visualization & Computer Graphics, 2016, 23(1):531–540. SchulzC NocajA GoertlerJ Probabilistic Graph Layout for Uncertain Network Visualization [J]. IEEE Transactions on Visualization & Computer Graphics 2016 23 1 531 540 10.1109/TVCG.2016.259891927875169 Search in Google Scholar

Cao, Jian, Gao, et al. Orderliness predicts academic performance: behavioural analysis oncampus lifestyle. [J]. Journal of the Royal Society, Interface, 2018. Cao Jian Gao Orderliness predicts academic performance: behavioural analysis oncampus lifestyle [J]. Journal of the Royal Society, Interface 2018 10.1098/rsif.2018.0210617076530232241 Search in Google Scholar

Zhou M, Ma M, Zhang Y, et al. EDUM: Classroom Education Measurements via Large-scale Wi Fi Networks [C]//The 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing. ACM, 2016. ZhouM MaM ZhangY EDUM: Classroom Education Measurements via Large-scale Wi Fi Networks [C]// The 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing ACM 2016 10.1145/2971648.2971657 Search in Google Scholar

Liu Y, Chen X, Peng H, et al. Multi-focus image fusion with a deep convolutional neural network [J]. Information Fusion, 2017, 36: 191–207. LiuY ChenX PengH Multi-focus image fusion with a deep convolutional neural network [J]. Information Fusion 2017 36 191 207 10.1016/j.inffus.2016.12.001 Search in Google Scholar

Long Xiaodong, Liao Xiangrong, Deng Zhiwen. Investigation and reflection on the psychological health status of poor college students [J]. Hunan Social Sciences, 2003(04):119–121. (in Chinese). LongXiaodong LiaoXiangrong DengZhiwen Investigation and reflection on the psychological health status of poor college students [J]. Hunan Social Sciences 2003 04 119 121 (in Chinese). Search in Google Scholar

Recommended articles from Trend MD

Plan your remote conference with Sciendo