B-Morpher: Automated Learning of Morphological Language Characteristics for Inflection and Morphological Analysis
Publicado en línea: 10 nov 2022
Páginas: 111 - 128
Recibido: 01 abr 2022
Aceptado: 16 sept 2022
DOI: https://doi.org/10.2478/cait-2022-0042
Palabras clave
© 2022 László Kovács et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
The automated induction of inflection rules is an important research area for computational linguistics. In this paper, we present a novel morphological rule induction model called B-Morpher that can be used for both inflection analysis and morphological analysis. The core element of the engine is a modified Bayes classifier in which class categories correspond to general string transformation rules. Beside the core classification module, the engine contains a neural network module and verification unit to improve classification accuracy. For the evaluation, beside the large Hungarian dataset the tests include smaller non-Hungarian datasets from the SIGMORPHON shared task pools. Our evaluation shows that the efficiency of B-Morpher is comparable with the best results, and it outperforms the state-of-theart base models for some languages. The proposed system can be characterized by not only high accuracy, but also short training time and small knowledge base size.