Parsing Korean Classical Literature by Integrating Text Mining and Semantic Analysis
et
19 mars 2025
À propos de cet article
Publié en ligne: 19 mars 2025
Reçu: 23 oct. 2024
Accepté: 29 janv. 2025
DOI: https://doi.org/10.2478/amns-2025-0517
Mots clés
© 2025 Lai Wei, published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Figure 1.

Figure 2.

Figure 3.

Figure 4.

Keywords TF-IDF weights
Number | Keywords | TF-IDF | Number | Keywords | TF-IDF |
---|---|---|---|---|---|
1 | Legend | 0.0359 | 11 | Earth | 0.0200 |
2 | Folklore | 0.0347 | 12 | Buddhism | 0.0196 |
3 | Dynasty | 0.0345 | 13 | Morality | 0.0182 |
4 | Goli | 0.0341 | 14 | Maidservant | 0.016 |
5 | Aristocracy | 0.0310 | 15 | Pariah | 0.015 |
6 | Confucian | 0.0305 | 16 | Taoism | 0.0144 |
7 | Heaven | 0.0300 | 17 | Court | 0.0132 |
8 | Imperial envoy | 0.0280 | 18 | People | 0.0126 |
9 | China | 0.0246 | 19 | Benevolence | 0.0109 |
10 | Identity | 0.0211 | 20 | Analects | 0.0088 |
Attribute keyword lift value analysis results
No. | Work | Cons | Lift | Work | Cons | Lift | Work | Cons | Lift |
---|---|---|---|---|---|---|---|---|---|
1 | A | Legend | 0.4093 | B | Legend | 0.3246 | C | Heaven | 0.3458 |
2 | A | Morality | 0.3637 | B | Identity | 0.3176 | C | Earth | 0.3152 |
3 | A | Maidservant | 0.3052 | B | Folklore | 0.3102 | C | Morality | 0.2891 |
4 | A | Benevolence | 0.2519 | B | Morality | 0.2551 | C | Goli | 0.2675 |
5 | A | Folklore | 0.2003 | B | Aristocracy | 0.2173 | C | Identity | 0.2326 |
6 | A | Court | 0.1833 | B | Confucian | 0.1878 | C | People | 0.2188 |
7 | A | People | 0.1284 | B | Pariah | 0.1803 | C | Dynasty | 0.2019 |
8 | A | Confucian | 0.0721 | B | People | 0.1093 | C | Pariah | 0.1476 |
9 | A | Goli | 0.0504 | B | Earth | 0.0491 | C | Court | 0.1463 |
10 | A | Pariah | 0.0333 | B | Dynasty | 0.0383 | C | Legend | 0.0963 |
High frequency keywords in Korean classical literature works
Keywords | Frequency | Keywords | Frequency |
---|---|---|---|
Heaven | 2909 | Mythology | 857 |
Earth | 1965 | Official | 790 |
Folklore | 1594 | Poetry | 759 |
Confucian | 1403 | Goli | 748 |
Buddhism | 1392 | Three Kingdoms | 743 |
Taoism | 1287 | Chinese | 732 |
Morality | 1266 | Benevolence | 726 |
Dynasty | 1149 | Politeness | 716 |
Legend | 1101 | Filial piety | 703 |
Aristocracy | 986 | Loyalty | 684 |
Drama | 956 | Ethics | 675 |
Art | 947 | North | 621 |
Qu Yuan | 943 | Religious belief | 579 |
Identity | 942 | Analects | 569 |
Maidservant | 931 | Compassion | 568 |
Imperial envoy | 916 | Tao Yuanming | 557 |
People | 912 | Happiness | 551 |
Pariah | 903 | Disaster | 529 |
Court | 901 | Elegance | 516 |
China | 881 | Root | 500 |
Keywords common matrix
K1 | K2 | K3 | K4 | K5 | K6 | K7 | K8 | K9 | |
---|---|---|---|---|---|---|---|---|---|
K1 | 0 | 685 | 823 | 542 | 293 | 77 | 112 | 302 | 20 |
K2 | 685 | 0 | 326 | 187 | 83 | 227 | 101 | 128 | 27 |
K3 | 823 | 326 | 0 | 152 | 54 | 89 | 115 | 378 | 23 |
K4 | 542 | 187 | 152 | 0 | 85 | 26 | 31 | 11 | 43 |
K5 | 293 | 83 | 54 | 85 | 0 | 38 | 62 | 48 | 29 |
K6 | 77 | 227 | 89 | 26 | 38 | 0 | 53 | 92 | 4 |
K7 | 112 | 101 | 115 | 31 | 62 | 53 | 0 | 221 | 9 |
K8 | 302 | 128 | 378 | 11 | 48 | 92 | 221 | 0 | 3 |
K9 | 20 | 27 | 23 | 43 | 29 | 4 | 9 | 3 | 0 |
Korean classical literature works TF-IDF weights
Weight rank | Literature works | TF-IDF weight |
---|---|---|
1 | A | 0.0342 |
7 | C | 0.0179 |
28 | B | 0.0113 |
103 | E | 0.0098 |
112 | D | 0.0082 |