Open Access

A Named Entity Recognition Model Based on Multi-Task Learning and Cascading Pointer Network


Cite

Figure 1.

MLT-NER structure diagram
MLT-NER structure diagram

Figure 2.

Words and meanings in How Net
Words and meanings in How Net

Figure 3.

Structure diagram of entity recognition model
Structure diagram of entity recognition model

Figure 4.

Example of entity recognition
Example of entity recognition

Figure 5.

Entity classification model
Entity classification model

Figure 6.

MSRA dataset dropout
MSRA dataset dropout

Figure 7.

OntoNotes4.0 dataset dropout
OntoNotes4.0 dataset dropout

Figure 8.

CLUENER2020 dataset dropout
CLUENER2020 dataset dropout

Figure 9.

CMeEE dataset dropout
CMeEE dataset dropout

Experimental parameters

Parameter Value
Optimizer SGD
Learning rate 5e–6
Activate function ReLU
Entity category length limit 16
Enter length limit 128
Batch size 6
Deep learning framework Pytorch
Number of GPUs 1

CMEeE dataset model indicators

CMeEE
Model P (%) R (%) F1 (%)
BiLSTM-CRF 56.41 49.52 52.74
BERT-BiLSTM-CRF 68.98 66.25 67.59
BERT-MRC 71.26 69.34 70.29
MTL-NER 73.13 70.68 71.89

MSRA dataset model indicators

MSRA
Model P (%) R (%) F1 (%)
BiLSTM-CRF 87.47 85.23 83.34
BERT-BiLSTM-CRF 95.15 94.85 95.00
BERT-MRC 96.28 95.74 96.01
MTL-NER 97.07 96.43 96.75

OntoNotes4.0 dataset model indicators

OntoNotes4.0
Model P (%) R (%) F1 (%)
BiLSTM-CRF 73.45 60.07 61.71
BERT-BiLSTM-CRF 79.23 79.58 79.40
BERT-MRC 82.49 81.23 81.56
MTL-NER 84.87 82.56 83.70

Example of comprehensive description of domain entity categories

Entity Category Comprehensive description of entity category
color Yellow green blue, environment-friendly color, hue, lightness, saturation and various phenomena of light
Environmental-friendly Characteristic value, protection, positive evaluation, low carbon, energy conservation and emission reduction, life, agriculture, circular economy, wind and solar power generation
O General text

CLUENER2020 dataset model indicators

CLUENER2020
Model P (%) R (%) F1 (%)
BiLSTM-CRF 67.23 65.42 66.31
BERT-BiLSTM-CRF 77.42 78.15 77.78
BERT-MRC 79.04 80.26 79.65
MTL-NER 82.14 80.79 81.46
eISSN:
2470-8038
Language:
English
Publication timeframe:
4 times per year
Journal Subjects:
Computer Sciences, other