Identification of Keywords for Legal Documents Categories Using SOM
Published online: 31 Mar 2025
Pages: 33-41
Received: 27 Apr 2024
Accepted: 01 Nov 2024
DOI: https://doi.org/10.14313/jamris-2025-004
© 2025 Paulina Puchalska et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
This study aims to support the decision-making process of categorizing legal documents by identifying the keywords that characterize each legal domain class. It combines the Kohonen Self-Organizing Map (SOM) method with the Global Vectors for Word Representation (GloVe) model to build an efficient document classification system, which achieved a satisfactory classification accuracy of 71.69%. The article also discusses alternative approaches implemented to improve classification accuracy, namely Named Entity Recognizer (NER) tools and the RoBERTa model, and compares their effectiveness. Finally, it notes the challenges posed by the uneven distribution of categories in the dataset and outlines potential directions for further research to improve the classification of legal documents.
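To make the core idea concrete, the following is a minimal sketch of how a Kohonen SOM can organize word vectors on a two-dimensional grid. It is not the authors' implementation: the function names (`train_som`, `best_matching_unit`), the grid size, the learning-rate and neighborhood schedules, and the toy data are all assumptions chosen for illustration. In particular, random 4-dimensional vectors drawn around two centers stand in for real GloVe embeddings of words from two legal categories.

```python
import numpy as np


def best_matching_unit(weights, x):
    """Return the (row, col) of the neuron whose weight vector is closest to x."""
    distances = np.linalg.norm(weights - x, axis=-1)
    return np.unravel_index(np.argmin(distances), distances.shape)


def train_som(data, rows=3, cols=3, epochs=50, lr0=0.5, sigma0=1.0, seed=0):
    """Train a small SOM; returns weights of shape (rows, cols, dim)."""
    rng = np.random.default_rng(seed)
    dim = data.shape[1]
    weights = rng.normal(size=(rows, cols, dim))
    # Grid coordinates of every neuron, needed by the neighborhood function.
    grid = np.stack(
        np.meshgrid(np.arange(rows), np.arange(cols), indexing="ij"), axis=-1
    )
    n_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for x in rng.permutation(data):  # shuffle samples each epoch
            frac = step / n_steps
            lr = lr0 * (1.0 - frac)                 # linearly decaying learning rate
            sigma = sigma0 * (1.0 - frac) + 1e-3    # shrinking neighborhood radius
            bmu = best_matching_unit(weights, x)
            dist2 = np.sum((grid - np.array(bmu)) ** 2, axis=-1)
            h = np.exp(-dist2 / (2.0 * sigma ** 2))  # Gaussian neighborhood
            weights += lr * h[..., None] * (x - weights)
            step += 1
    return weights


# Toy stand-in for word embeddings: two well-separated groups of vectors
# (hypothetical data, NOT real GloVe vectors).
rng = np.random.default_rng(42)
center_a = np.full(4, 2.0)
center_b = np.full(4, -2.0)
data = np.vstack([
    center_a + 0.1 * rng.normal(size=(20, 4)),
    center_b + 0.1 * rng.normal(size=(20, 4)),
])

som = train_som(data)
bmu_a = best_matching_unit(som, center_a)
bmu_b = best_matching_unit(som, center_b)
print("Neuron for group A:", bmu_a)
print("Neuron for group B:", bmu_b)
```

In a keyword-identification setting, each grid neuron could then be associated with the words (or document categories) that map to it, so that words landing on the same or neighboring neurons are treated as related. How the study itself derives keywords from the trained map is not specified in the abstract, so this association step is left out of the sketch.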