Comparative analysis of the performance of selected machine learning algorithms depending on the size of the training sample
Publicado en línea: 26 sept 2024
Recibido: 13 mar 2024
Aceptado: 29 jul 2024
DOI: https://doi.org/10.2478/rgg-2024-0015
Palabras clave
© 2024 Przemysław Kupidura et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.
The article presents an analysis of the effectiveness of selected machine learning methods: Random Forest (RF), Extreme Gradient Boosting (XGB), and Support Vector Machine (SVM) in the classification of land use and cover in satellite images. Several variants of each algorithm were tested, adopting different parameters typical for each of them. Each variant was classified multiple (20) times, using training samples of different sizes: from 100 pixels to 200,000 pixels. The tests were conducted independently on 3 Sentinel-2 satellite images, identifying 5 basic land cover classes: