Comparison of outlier detection approaches in a Smart Cities sensor data context
oraz
14 lut 2024
O artykule
Kategoria artykułu: Research article
Data publikacji: 14 lut 2024
Otrzymano: 06 wrz 2023
DOI: https://doi.org/10.2478/ijssis-2024-0004
Słowa kluczowe
© 2024 Sofia Zafeirelli et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
This study examines outlier detection in time-series sensor data from PurpleAir low-cost sensors in Athens, Greece. Focusing on key environmental parameters such as temperature, humidity, and particulate matter (PM) levels, the study utilizes the Interquartile Range (IQR) and Generalized Extreme Studentized Deviate (GESD) methods on hourly and daily basis. GESD detected more outliers than IQR, most of them in PM, while temperature and humidity data had fewer outliers; applying filters before outlier detection and adjusting alpha values based on time scales were crucial, and outliers significantly affected spatial interpolation, emphasizing the need for spatial statistics in smart city air quality management.