A Novel Method for Resolving and Completing Authors’ Country Affiliation Data in Bibliographic Records
, y
09 jul 2020
Acerca de este artículo
Categoría del artículo: Research Paper
Publicado en línea: 09 jul 2020
Páginas: 97 - 115
Recibido: 01 feb 2020
Aceptado: 11 jun 2020
DOI: https://doi.org/10.2478/jdis-2020-0020
Palabras clave
© 2020 Ba Xuan Nguyen et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Figure 1

Figure 2

Figure 3

Figure 4

Summary of data sets used_
Features | ACM DL | MAG |
---|---|---|
Total works | 182,791 | 212,689,976 |
Unique, co-authored, computer science works | 121,672 | 557,730 |
Results of country identification_
Code | Results | ACM DL | MAG |
---|---|---|---|
C | Affiliations in co-authored works | 384,672 | 853,482 |
C1 | “NA”, “None”, etc values | 52,454 (13.64%) | 66,924 (7.84%) |
C2 | Identified | 273,245 (71.02%) | 643,678 (75.42%) |
C2.1 | Identified by string matching | 236,100 (61.38%) | 594,911 (69.70%) |
C2.2 | Identified by Wikidata | 37,106 (9.65%) | 48,767 (5.71%) |
C3 | Not identified (Other values) | 59,012 (15.34%) | 142,888 (16.74%) |
Summary statistics of the method's results_
ACM DL | MAG | |
---|---|---|
Mean | 5.70% | 5.42% |
Standard error | 0.69% | 1.29% |
Median | 1.20% | 0.91% |
Mode | 0% | 0% |
Standard deviation | 8.10% | 17.88% |
Interquartile range | 6.30% | 10.18% |
Count | 137 | 192 |
The accuracy of the method using Wikidata query_
ACM DL | MAG | |
---|---|---|
False match rate (FMR) | 0 % | 0 % |
False non-match rate (FNMR) | 73 % | 75 % |