Mining Related Articles for Automatic Journal Cataloging

This paper is an investigation of the effectiveness of the method of clustering biomedical journals through mining the content similarity of journal articles.

Design/methodology/approach

3,265 journals in PubMed are analyzed based on article content similarity and Web usage, respectively. Comparisons of the two analysis approaches and a citation-based approach are given.

Findings

Our results suggest that article content similarity is useful for clustering biomedical journals, and the content-similarity-based journal clustering method is more robust and less subject to human factors compared with the usage-based approach and the citation-based approach.

Research limitations

Our paper currently focuses on clustering journals in the biomedical domain because there are a large volume of freely available resources such as PubMed and MeSH in this field. Further investigation is needed to improve this approach to fit journals in other domains.

Practical implications

Our results show that it is feasible to catalog biomedical journals by mining the article content similarity. This work is also significant in serving practical needs in research portfolio analysis.

Originality/value

To the best of our knowledge, we are among the first to report on clustering journals in the biomedical field through mining the article content similarity. This method can be integrated with existing approaches to create a new paradigm for future studies of journal clustering.

eISSN:: 2543-683X
Lingua:: Inglese

Frequenza di pubblicazione:: 4 volte all'anno
Argomenti della rivista:: Computer Sciences, Information Technology, Project Management, Databases and Data Mining

Feed RSS della rivista

Mining Related Articles for Automatic Journal Cataloging

Article Category: Research Paper

Pubblicato online: 01 set 2017

Pagine: 45 - 59

Ricevuto: 14 dic 2015

Accettato: 26 feb 2016

DOI: https://doi.org/10.20309/jdis.201613

Parole chiavePubMed, Journals, Cluster, Catalog, Text mining, Research evaluation

© 2016 Yuqing Mao, Zhiyong Lu

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Purpose

Design/methodology/approach

Findings

Research limitations

Practical implications

Originality/value

Parole chiave
PubMed, Journals, Cluster, Catalog, Text mining, Research evaluation