Smooth Non-increasing Square Spatial Extents of Filters in Convolutional Layers of CNNs for Image Classification Problems

The present paper considers the open problem of setting hyperparameters for convolutional neural networks aimed at image classification. Since selecting filter spatial extents for convolutional layers is a topical problem, it is approximately solved by accumulating statistics of the neural network performance. The network architecture is chosen on the basis of experience with the MNIST database. The eight-layer architecture with four convolutional layers is close to the best suited for classifying small and medium-sized images. Image databases are formed of grayscale images whose sizes range from 28 × 28 to 64 × 64 in steps of 2. Except for the filter spatial extents, the remaining hyperparameters of those eight layers are fixed, and they are chosen carefully based on rules of thumb. A sequence of possible filter spatial extents is generated for each size. Then the sets of four filter spatial extents producing the best performance are extracted. The extraction rule that allows selecting the best filter spatial extents is formalized with two conditions. The main condition is that the difference between the maximal and minimal extents must be as small as possible. A unit filter spatial extent is not recommended. The secondary condition is that the filter spatial extents should constitute a non-increasing set. Validation on the MNIST and CIFAR-10 databases justifies this solution, which can be extended to building convolutional neural network classifiers of colour and larger images.
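The two conditions stated in the abstract can be illustrated with a minimal Python sketch. It is not the paper's procedure: the function name `rank_extent_sets`, the candidate range of 1 to 5, and the tie-breaking order are assumptions made for illustration only. The sketch does not check that a set of extents actually fits a given image size, and it does not train or evaluate any network.

```python
from itertools import product

# Image sizes named in the abstract: 28 x 28 to 64 x 64 in steps of 2.
image_sizes = list(range(28, 65, 2))


def rank_extent_sets(extents, n_layers=4):
    """Rank sets of square filter extents by the abstract's two conditions.

    Hypothetical helper: keeps only sets with no unit extent that are
    non-increasing from the first convolutional layer to the last, then
    orders them by the spread (max - min), smallest first.
    """
    kept = []
    for combo in product(extents, repeat=n_layers):
        if min(combo) <= 1:
            continue  # a unit filter spatial extent is not recommended
        if any(a < b for a, b in zip(combo, combo[1:])):
            continue  # secondary condition: non-increasing set
        kept.append(combo)
    # Main condition: smallest difference between max and min extents first.
    # Ties are broken lexicographically (an arbitrary, assumed choice).
    kept.sort(key=lambda c: (max(c) - min(c), c))
    return kept


# Assumed candidate extents 1..5, purely for illustration.
ranked = rank_extent_sets(range(1, 6))
for extents in ranked[:5]:
    print(extents, "spread =", max(extents) - min(extents))
```

In practice, such a ranking would only narrow the candidates; the abstract describes the final choice as based on accumulated performance statistics.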

eISSN: 2255-8691
Language: English

Frequency: 2 times per year
Journal subjects: Computer Sciences, Artificial Intelligence, Information Technology, Project Management, Software Development


Published online: 30 May 2018

Pages: 52–62

DOI: https://doi.org/10.2478/acss-2018-0007

Keywords: Convolutional layer, convolutional neural networks, filters, hyperparameters, network architecture, square spatial extents of filters

© 2018 Vadim V. Romanuke, published by Sciendo

This work is licensed under the Creative Commons Attribution 4.0 Public License.
