B-Morpher: Automated Learning of Morphological Language Characteristics for Inflection and Morphological Analysis
Publié en ligne: 10 nov. 2022
Pages: 111 - 128
Reçu: 01 avr. 2022
Accepté: 16 sept. 2022
DOI: https://doi.org/10.2478/cait-2022-0042
Mots clés
© 2022 László Kovács et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
The automated induction of inflection rules is an important research area for computational linguistics. In this paper, we present a novel morphological rule induction model called B-Morpher that can be used for both inflection analysis and morphological analysis. The core element of the engine is a modified Bayes classifier in which class categories correspond to general string transformation rules. Beside the core classification module, the engine contains a neural network module and verification unit to improve classification accuracy. For the evaluation, beside the large Hungarian dataset the tests include smaller non-Hungarian datasets from the SIGMORPHON shared task pools. Our evaluation shows that the efficiency of B-Morpher is comparable with the best results, and it outperforms the state-of-theart base models for some languages. The proposed system can be characterized by not only high accuracy, but also short training time and small knowledge base size.