Open Access

Parsing Korean Classical Literature by Integrating Text Mining and Semantic Analysis

March 19, 2025


Figure 1. LDA topic model structure

Figure 2. The topic generation matrix represents

Figure 3. Topic document generation process

Figure 4. Semantic analysis diagram of keywords of Korean classical literature works

Keywords TF-IDF weights

| Number | Keyword | TF-IDF | Number | Keyword | TF-IDF |
|---|---|---|---|---|---|
| 1 | Legend | 0.0359 | 11 | Earth | 0.0200 |
| 2 | Folklore | 0.0347 | 12 | Buddhism | 0.0196 |
| 3 | Dynasty | 0.0345 | 13 | Morality | 0.0182 |
| 4 | Goli | 0.0341 | 14 | Maidservant | 0.016 |
| 5 | Aristocracy | 0.0310 | 15 | Pariah | 0.015 |
| 6 | Confucian | 0.0305 | 16 | Taoism | 0.0144 |
| 7 | Heaven | 0.0300 | 17 | Court | 0.0132 |
| 8 | Imperial envoy | 0.0280 | 18 | People | 0.0126 |
| 9 | China | 0.0246 | 19 | Benevolence | 0.0109 |
| 10 | Identity | 0.0211 | 20 | Analects | 0.0088 |
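
The weights in the table above are TF-IDF scores. The weighting variant the authors used is not shown in this excerpt, so the following is only a minimal sketch under a common assumption: term frequency normalized by document length, multiplied by log(N / df). The function name `tf_idf` and the toy corpus are hypothetical and are not the paper's data.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed weighting (not confirmed by the source):
    tf = count / document length, idf = log(N / document frequency).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Toy corpus of already-tokenized documents (illustrative only).
docs = [
    ["legend", "heaven", "legend", "court"],
    ["heaven", "earth", "morality"],
    ["heaven", "folklore", "legend"],
]
weights = tf_idf(docs)
```

Under this variant a term that occurs in every document (here `heaven`) receives weight zero, which is why high raw frequency alone does not guarantee a high TF-IDF rank.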

Attribute keyword lift value analysis results

| No. | Work | Cons | Lift | Work | Cons | Lift | Work | Cons | Lift |
|---|---|---|---|---|---|---|---|---|---|
| 1 | A | Legend | 0.4093 | B | Legend | 0.3246 | C | Heaven | 0.3458 |
| 2 | A | Morality | 0.3637 | B | Identity | 0.3176 | C | Earth | 0.3152 |
| 3 | A | Maidservant | 0.3052 | B | Folklore | 0.3102 | C | Morality | 0.2891 |
| 4 | A | Benevolence | 0.2519 | B | Morality | 0.2551 | C | Goli | 0.2675 |
| 5 | A | Folklore | 0.2003 | B | Aristocracy | 0.2173 | C | Identity | 0.2326 |
| 6 | A | Court | 0.1833 | B | Confucian | 0.1878 | C | People | 0.2188 |
| 7 | A | People | 0.1284 | B | Pariah | 0.1803 | C | Dynasty | 0.2019 |
| 8 | A | Confucian | 0.0721 | B | People | 0.1093 | C | Pariah | 0.1476 |
| 9 | A | Goli | 0.0504 | B | Earth | 0.0491 | C | Court | 0.1463 |
| 10 | A | Pariah | 0.0333 | B | Dynasty | 0.0383 | C | Legend | 0.0963 |
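
For orientation, the textbook association-rule lift of a rule X → Y is P(X and Y) / (P(X) · P(Y)). Whether the values tabulated above follow exactly this definition or a normalized variant is not stated in this excerpt, so the sketch below illustrates only the standard formula. The function name `lift` and the toy transactions are hypothetical.

```python
def lift(transactions, antecedent, consequent):
    """Standard lift of the rule antecedent -> consequent.

    lift = P(antecedent and consequent) / (P(antecedent) * P(consequent)).
    Returns 0.0 when either item never occurs, to avoid division by zero.
    """
    n = len(transactions)
    n_a = sum(1 for t in transactions if antecedent in t)
    n_c = sum(1 for t in transactions if consequent in t)
    n_ac = sum(1 for t in transactions
               if antecedent in t and consequent in t)
    if n_a == 0 or n_c == 0:
        return 0.0
    return (n_ac / n) / ((n_a / n) * (n_c / n))


# Each transaction is the set of labels attached to one text unit:
# a work label ("A", "B") plus attribute keywords (illustrative only).
transactions = [
    {"A", "legend", "morality"},
    {"A", "legend"},
    {"B", "legend", "identity"},
    {"B", "identity"},
]
lift_a_legend = lift(transactions, "A", "legend")
lift_b_identity = lift(transactions, "B", "identity")
lift_a_identity = lift(transactions, "A", "identity")
```

With the standard definition, a value above 1 indicates that the work and the keyword appear together more often than independence would predict, and a value below 1 indicates the opposite.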

High frequency keywords in Korean classical literature works

| Keyword | Frequency | Keyword | Frequency |
|---|---|---|---|
| Heaven | 2909 | Mythology | 857 |
| Earth | 1965 | Official | 790 |
| Folklore | 1594 | Poetry | 759 |
| Confucian | 1403 | Goli | 748 |
| Buddhism | 1392 | Three Kingdoms | 743 |
| Taoism | 1287 | Chinese | 732 |
| Morality | 1266 | Benevolence | 726 |
| Dynasty | 1149 | Politeness | 716 |
| Legend | 1101 | Filial piety | 703 |
| Aristocracy | 986 | Loyalty | 684 |
| Drama | 956 | Ethics | 675 |
| Art | 947 | North | 621 |
| Qu Yuan | 943 | Religious belief | 579 |
| Identity | 942 | Analects | 569 |
| Maidservant | 931 | Compassion | 568 |
| Imperial envoy | 916 | Tao Yuanming | 557 |
| People | 912 | Happiness | 551 |
| Pariah | 903 | Disaster | 529 |
| Court | 901 | Elegance | 516 |
| China | 881 | Root | 500 |

Keyword co-occurrence matrix

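| | K1 | K2 | K3 | K4 | K5 | K6 | K7 | K8 | K9 |
|---|---|---|---|---|---|---|---|---|---|
| K1 | 0 | 685 | 823 | 542 | 293 | 77 | 112 | 302 | 20 |
| K2 | 685 | 0 | 326 | 187 | 83 | 227 | 101 | 128 | 27 |
| K3 | 823 | 326 | 0 | 152 | 54 | 89 | 115 | 378 | 23 |
| K4 | 542 | 187 | 152 | 0 | 85 | 26 | 31 | 11 | 43 |
| K5 | 293 | 83 | 54 | 85 | 0 | 38 | 62 | 48 | 29 |
| K6 | 77 | 227 | 89 | 26 | 38 | 0 | 53 | 92 | 4 |
| K7 | 112 | 101 | 115 | 31 | 62 | 53 | 0 | 221 | 9 |
| K8 | 302 | 128 | 378 | 11 | 48 | 92 | 221 | 0 | 3 |
| K9 | 20 | 27 | 23 | 43 | 29 | 4 | 9 | 3 | 0 |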
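
The matrix above is symmetric with a zero diagonal, which is the shape of a pairwise keyword co-occurrence count. As a rough sketch of how such a matrix could be built, the code below counts, for each keyword pair, the number of text units in which both appear. The counting unit (document, paragraph, or sentence) is not given in this excerpt and is an assumption here. The function name `cooccurrence_matrix` and the toy data are hypothetical.

```python
from itertools import combinations


def cooccurrence_matrix(docs, keywords):
    """Count, for every keyword pair, the text units containing both.

    The result is symmetric and its diagonal stays at zero because a
    keyword is never paired with itself.
    """
    index = {kw: i for i, kw in enumerate(keywords)}
    size = len(keywords)
    matrix = [[0] * size for _ in range(size)]
    for doc in docs:
        # Distinct tracked keywords present in this unit, as indices.
        present = sorted({index[t] for t in doc if t in index})
        for i, j in combinations(present, 2):
            matrix[i][j] += 1
            matrix[j][i] += 1
    return matrix


# Toy data (illustrative only, not the paper's corpus).
keywords = ["heaven", "earth", "folklore"]
docs = [
    ["heaven", "earth", "heaven"],
    ["heaven", "folklore"],
    ["heaven", "earth", "folklore"],
]
matrix = cooccurrence_matrix(docs, keywords)
```

Repeated mentions inside one unit are counted once in this sketch; counting every repeated pair instead would give larger values but the same symmetric structure.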

Korean classical literature works TF-IDF weights

| Weight rank | Literature work | TF-IDF weight |
|---|---|---|
| 1 | A | 0.0342 |
| 7 | C | 0.0179 |
| 28 | B | 0.0113 |
| 103 | E | 0.0098 |
| 112 | D | 0.0082 |
Language:
English
Publication frequency:
1 issue per year
Journal subjects:
Biology, Biology, other, Mathematics, Applied Mathematics, Mathematics, General, Physics, Physics, other