Fast fourier transform based new pooling layer for deep learning

Convolution is considered most significant layer in deep learning because it can extract best features of data through the network but it may result in huge volume of data. This problem can be solved by using pooling. In this paper, A novel pooling method is proposed by using discrete Fourier transform (DFT), this method is used DFT technique to transform the data from spatial domain into frequency domain to preserve the most important information from the details coefficients, where the details information of the image is less significant, therefore it can be discarded to down sample the size of dimensions. Its effect will be great with advantage of reducing the eliminated details information as compared with other standard methods. After applying DFT, the most significant coefficients, which represent most important features are cropped while less important details will be discarded then the data are reconstructed by applying inverse DF, therefore the high quality of features are extracted, which solve the problem of losing significant information during the pooling layer. Different methods are proposed based on the scenario of using DFT. The proposed methods are tested by extracting pooled image then the original images were retrieved using only the pooled images. Then the retrieved images are compared with original images by using different measures such as SNR, correlation and SSIM. Then the proposed layers used for image classification for two different datasets. The results proved that the proposed methods outperformed standard methods, thus it can be used for deep learning application.

eISSN:: 1178-5608
Język:: Angielski

Częstotliwość wydawania:: Volume Open
Dziedziny czasopisma:: Engineering, Introductions and Overviews, other

Kanał RSS czasopisma

Fast fourier transform based new pooling layer for deep learning

Data publikacji: 16 kwi 2022

Zakres stron: 1 - 14

Otrzymano: 01 lip 2021

DOI: https://doi.org/10.21307/ijssis-2022-0003

Słowa kluczoweCNN, DL, FFT, FTM and Features

© 2022 Aqeel Mohsin Hamad et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Słowa kluczowe
CNN, DL, FFT, FTM and Features