Open Access

Outlier Detection in Ocean Wave Measurements by Using Unsupervised Data Mining Methods


Cite

Outliers are considerably inconsistent and exceptional objects in the data set that do not adapt to expected normal condition. An outlier in wave measurements may be due to experimental and configuration errors, technical defects in equipment, variability in the measurement conditions, rare or unknown conditions such as tsunami, windstorm and etc. To improve the accuracy and reliability of an built ocean wave model, or to extract important and valuable information from collected wave data, detecting of outlying observations in wave measurements is very important. In this study, three typical outlier detection algorithms:Box-plot (BP), Local Distance-based Outlier Factor (LDOF), and Local Outlier Factor (LOF) methods are used to detect outliers in significant wave height (Hs) records. The historical wave data are taken from National Data Buoy Center (NDBC). Finally, those data points are considered as outlier identified by at least two methods which are presented and discussed. Then, Hs prediction has been modelled with and without the presence of outliers by using Regression trees (RTs).

eISSN:
2083-7429
Language:
English
Publication timeframe:
4 times per year
Journal Subjects:
Engineering, Introductions and Overviews, other, Geosciences, Atmospheric Science and Climatology, Life Sciences