Masked Sentence Model Based on BERT for Move Recognition in Medical Scientific Abstracts
, , y
27 dic 2019
Acerca de este artículo
Categoría del artículo: Research Paper
Publicado en línea: 27 dic 2019
Páginas: 42 - 55
Recibido: 27 sept 2019
Aceptado: 05 nov 2019
DOI: https://doi.org/10.2478/jdis-2019-0020
Palabras clave
© 2019 Gaihong Yu, Zhixiong Zhang, Huan Liu, Liangping Ding, published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
Figure 1

Figure 2

Figure 3

The results of Exp3: based on MSM integrated information_
Label | P | R | F1 | Support |
---|---|---|---|---|
Background | 75.26 | 81.18 | 78.11 | 3,077 |
Objectives | 78.08 | 61.98 | 69.10 | 2,333 |
Methods | 92.98 | 97.48 | 95.17 | 9,884 |
Results | 96.02 | 93.74 | 94.87 | 9,713 |
Conclusions | 94.70 | 94.51 | 94.60 | 4,571 |
Avg / Total | 91.22 | 91.30 | 91.15 | 29,578 |
Data format for integrating sentence content and context_
Label | The content & context of the sentence |
---|---|
Methods | We selected the major journals (11 journals) collecting papers (more than 7,000) over the last five years from the top members of the research community, and read and analyzed the papers (more than 200) covering the topics. |
Methods | This survey aims at reviewing the literature related to Clinical Information Systems (CIS), Hospital Information Systems (HIS), Electronic Health Record (EHR) systems, and how collected data can be analyzed by Artificial Intelligence (AI) techniques. |
The results of Exp2: based on the context of sentences_
Label | P | R | F1 | Support |
---|---|---|---|---|
Background | 72.27 | 79.72 | 75.82 | 3,077 |
Objectives | 70.51 | 60.27 | 64.99 | 2,333 |
Methods | 90.70 | 89.80 | 90.25 | 9,884 |
Results | 87.71 | 89.20 | 88.45 | 9,713 |
Conclusions | 90.19 | 89.30 | 89.74 | 4,571 |
Avg / Total | 86.13 | 86.15 | 86.09 | 29,578 |
Data format of sentence content_
Label | The content of the sentence |
---|---|
Methods | We selected the major journals (11 journals) collecting papers (more than 7,000) over the last five years from the top members of the research community, and read and analyzed the papers (more than 200) covering the topics. |
The results of Exp1: based on the content of sentences_
Label | P | R | F1 | Support |
---|---|---|---|---|
Background | 64.37 | 75.85 | 69.64 | 3,077 |
Objectives | 73.55 | 56.97 | 64.20 | 2,333 |
Methods | 92.42 | 94.97 | 93.68 | 9,884 |
Results | 92.08 | 91.09 | 91.58 | 9,713 |
Conclusions | 84.95 | 81.38 | 83.13 | 4,571 |
Avg / Total | 86.75 | 86.61 | 86.53 | 29,578 |
Comparison of the results of the experiments_
Label | Exp1 | Exp2 | Exp3 | Exp3-Exp1 | Exp3-Exp2 |
---|---|---|---|---|---|
F1 | F1 | F1 | +F1 | +F1 | |
Background | 69.64 | 75.82 | 78.11 | 2.29 | |
Objectives | 64.20 | 64.99 | 69.10 | 4.11 | |
Methods | 93.68 | 90.25 | 95.17 | 1.49 | |
Results | 91.58 | 88.45 | 94.87 | 3.29 | |
Conclusions | 83.13 | 89.74 | 94.60 | 11.47 | 4.86 |
Avg / Total | 86.53 | 86.09 | 91.15 | 4.62 | 5.06 |
Data format of the sentence’s context_
Label | The context of the sentence |
---|---|
Methods | This survey aims at reviewing the literature related to Clinical Information Systems (CIS), Hospital Information Systems (HIS), Electronic Health Record (EHR) systems, and how collected data can be analyzed by Artificial Intelligence (AI) techniques. |
PubMed 20k RCT results_
Models | F1 (PubMed 20k RCT) | |
---|---|---|
Our Model | MaskedSentenceModel_BERT | |
Others | ||
BERT-Base (Beltagy et al., 2018) | 86.19 | |
Sci BERT (SciVocab) (Beltagy et al., 2018) | 86.80 | |
Sci BERT (BaseVocab) (Beltagy et al., 2018) | 86.81 |