
A Topic Detection Method Based on Word-attention Networks

Published: Aug 18, 2021


Purpose

We propose a method that represents scientific papers as complex networks, combining the approaches of neural networks and complex networks.

Design/methodology/approach

Its novelty lies in representing a paper by word branches, which carry the sequential structure of words in sentences. The branches are generated by the attention mechanism of deep learning models. We connect these branches at the positions of their common words to form networks, called word-attention networks, and then detect their communities, which are defined as topics.
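As an illustration only (not the authors' implementation), the following Python sketch shows one way word branches derived from attention weights could be joined at their common words into a word-attention network whose communities are read as topics; the attention-threshold rule, the use of networkx, and greedy modularity maximization are all assumptions.

```python
# Illustrative sketch only, not the authors' implementation.
# Assumes each sentence comes with a token list and a token-by-token
# attention matrix (e.g. averaged over the heads of a transformer layer);
# the 0.1 threshold and greedy modularity maximization are our choices.
import networkx as nx
from networkx.algorithms.community import greedy_modularity_communities

def branches_from_attention(tokens, attention, threshold=0.1):
    """One branch per token: the token followed by the tokens it attends
    to above the threshold, kept in sentence order."""
    branches = []
    for i, tok in enumerate(tokens):
        attended = [tokens[j] for j, w in enumerate(attention[i])
                    if j != i and w >= threshold]
        if attended:
            branches.append([tok] + attended)
    return branches

def build_word_attention_network(sentences):
    """Connect branches at their common words: consecutive words in a
    branch become weighted edges, so shared words merge branches."""
    g = nx.Graph()
    for tokens, attention in sentences:
        for branch in branches_from_attention(tokens, attention):
            for a, b in zip(branch, branch[1:]):
                if g.has_edge(a, b):
                    g[a][b]["weight"] += 1.0
                else:
                    g.add_edge(a, b, weight=1.0)
    return g

def detect_topics(g):
    """Communities of the word-attention network are read as topics."""
    return [set(c) for c in greedy_modularity_communities(g, weight="weight")]
```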

Findings

The detected topics carry the sequential structure of words in sentences, represent the intra- and inter-sentential dependencies among words, and reveal the roles that words play in them through network indexes.
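A minimal sketch of how such network indexes might be computed with networkx is given below; the choice of within-topic degree and betweenness centrality as the indexes is our assumption for illustration, not a commitment of the paper.

```python
# Sketch of reading off word roles with standard network indexes
# (within-topic degree and global betweenness); the index choice is
# an assumption for illustration.
import networkx as nx

def word_roles(g, topics):
    """Rank each topic's words: high within-topic degree suggests a core
    term, high betweenness in the full network suggests a bridging word."""
    betweenness = nx.betweenness_centrality(g)
    roles = {}
    for i, topic in enumerate(topics):
        degree = dict(g.subgraph(topic).degree(weight="weight"))
        roles[i] = sorted(topic,
                          key=lambda w: (degree[w], betweenness[w]),
                          reverse=True)
    return roles
```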

Research limitations

The parameter settings of our method may depend on the data at hand; thus, human experience is needed to find proper settings.

Practical implications

Our method is applied to papers from PNAS, where the discipline designations provided by the authors are used as the gold-standard labels of the papers' topics.

Originality/value

This empirical study shows that the proposed method outperforms Latent Dirichlet Allocation (LDA) and is more stable.
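As a rough illustration of how such a comparison against the authors' discipline labels could be set up (the paper-to-topic assignment rule, the NMI metric, and the gensim LDA baseline are all our assumptions, not the paper's protocol), consider:

```python
# Hypothetical evaluation sketch, not the paper's protocol: assign each
# paper to the detected topic sharing the most words with it, fit an LDA
# baseline with gensim, and score both against the discipline labels
# with normalized mutual information.
from gensim import corpora
from gensim.models import LdaModel
from sklearn.metrics import normalized_mutual_info_score

def assign_to_topics(tokenized_papers, topics):
    # Largest word overlap decides the topic of a paper (our assumption).
    return [max(range(len(topics)),
                key=lambda i: len(set(words) & topics[i]))
            for words in tokenized_papers]

def lda_baseline(tokenized_papers, num_topics):
    dictionary = corpora.Dictionary(tokenized_papers)
    corpus = [dictionary.doc2bow(doc) for doc in tokenized_papers]
    lda = LdaModel(corpus, num_topics=num_topics, id2word=dictionary,
                   random_state=0)
    # Most probable topic per document.
    return [max(lda[doc], key=lambda t: t[1])[0] for doc in corpus]

def compare(tokenized_papers, gold_labels, topics):
    ours = assign_to_topics(tokenized_papers, topics)
    lda_pred = lda_baseline(tokenized_papers, num_topics=len(topics))
    return (normalized_mutual_info_score(gold_labels, ours),
            normalized_mutual_info_score(gold_labels, lda_pred))
```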
