B-Morpher: Automated Learning of Morphological Language Characteristics for Inflection and Morphological Analysis
Pubblicato online: 10 nov 2022
Pagine: 111 - 128
Ricevuto: 01 apr 2022
Accettato: 16 set 2022
DOI: https://doi.org/10.2478/cait-2022-0042
Parole chiave
© 2022 László Kovács et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
The automated induction of inflection rules is an important research area for computational linguistics. In this paper, we present a novel morphological rule induction model called B-Morpher that can be used for both inflection analysis and morphological analysis. The core element of the engine is a modified Bayes classifier in which class categories correspond to general string transformation rules. Beside the core classification module, the engine contains a neural network module and verification unit to improve classification accuracy. For the evaluation, beside the large Hungarian dataset the tests include smaller non-Hungarian datasets from the SIGMORPHON shared task pools. Our evaluation shows that the efficiency of B-Morpher is comparable with the best results, and it outperforms the state-of-theart base models for some languages. The proposed system can be characterized by not only high accuracy, but also short training time and small knowledge base size.