Open Access

Through the Thicket: A Study of Number-Oriented LLMs Derived from Random Forest Models

March 18, 2025


This paper introduces a novel approach to training Large Language Models (LLMs) using knowledge transfer from a Random Forest (RF) ensemble. By converting RF decision paths into natural language, this method enhances both the classification accuracy and the explanation capabilities of LLMs. Our approach integrates three preprocessing techniques tailored for numerical data: Relation Encoding, Integer Normalisation, and Verbal Description of Values, improving the model's ability to interpret structured inputs effectively. Leveraging RF's ensemble properties, we generate rule-based explanations that can be objectively validated, offering a cost-effective alternative to human evaluations. Experiments on well-known datasets demonstrate high classification accuracy, highlighting the potential of our framework for numerical and structured data applications. This study also contributes to Explainable Artificial Intelligence (XAI) by providing LLMs with structured, objectively verifiable explanations, making them more accessible and interpretable for real-world decision-making tasks.
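The core idea of the abstract, converting an RF decision path into a natural-language rule, can be sketched as follows. This is an illustrative assumption about how such a conversion might look using scikit-learn's tree internals, not the authors' actual implementation; the function name `verbalise_path` and the wording of the clauses are hypothetical.

```python
# Hypothetical sketch: verbalising one tree's decision path for one sample.
# Not the paper's implementation; function name and phrasing are illustrative.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier


def verbalise_path(tree, feature_names, sample):
    """Describe the decision path of a single tree for one sample in words."""
    t = tree.tree_
    # decision_path returns a sparse indicator of the nodes the sample visits
    node_ids = tree.decision_path(sample.reshape(1, -1)).indices
    clauses = []
    for node_id in node_ids:
        if t.children_left[node_id] == t.children_right[node_id]:
            continue  # leaf node: no split test to verbalise
        name = feature_names[t.feature[node_id]]
        threshold = t.threshold[node_id]
        if sample[t.feature[node_id]] <= threshold:
            clauses.append(f"{name} is at most {threshold:.2f}")
        else:
            clauses.append(f"{name} is greater than {threshold:.2f}")
    return " and ".join(clauses)


iris = load_iris()
rf = RandomForestClassifier(n_estimators=3, random_state=0).fit(iris.data, iris.target)
# One rule sentence per tree in the ensemble could then be fed to an LLM.
sentence = verbalise_path(rf.estimators_[0], iris.feature_names, iris.data[0])
print(sentence)
```

Each tree in the ensemble yields one such sentence, so the forest produces a set of rule-based explanations that can be checked directly against the data.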

Language:
English
Publication frequency:
4 issues per year
Journal subjects:
Computer Science, Artificial Intelligence, Databases and Data Mining