Feature Map Augmentation to Improve Scale Invariance in Convolutional Neural Networks
Published online: 28 Nov 2022
Pages: 51 - 74
Received: 21 Feb 2022
Accepted: 19 Oct 2022
DOI: https://doi.org/10.2478/jaiscr-2023-0004
© 2023 Dinesh Kumar et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.
Introducing variation into the training dataset through data augmentation is a popular technique for making Convolutional Neural Networks (CNNs) spatially invariant, but it increases dataset volume and computation cost. Instead of augmenting the data, we propose augmenting the feature maps to introduce variation into the features extracted by a CNN. To achieve this, we develop a rotation transformer layer called the Rotation Invariance Transformer (RiT), which applies a rotation transformation to augment CNN features. The RiT layer can augment the output features of any convolution layer within a CNN, but it is most effective when placed at the output of the final convolution layer. We test RiT on the task of scale invariance, where we attempt to classify scaled images from benchmark datasets. Our results show promising improvements in the network's ability to be scale invariant while keeping the model's computation cost low.
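The abstract does not specify how the RiT layer applies its rotation transformation internally, so the following is only an illustrative sketch of the general idea of feature-map augmentation: rather than rotating input images, rotated copies of intermediate feature maps are produced after a convolution layer. The function names (`rotate_feature_maps`, `rit_augment`), the restriction to 90-degree rotations, and the choice to stack augmented copies along the batch axis are all assumptions made for this example, not the paper's actual method.

```python
import numpy as np

def rotate_feature_maps(feature_maps, k=1):
    """Rotate every channel of a batch of feature maps by k * 90 degrees.

    feature_maps: array of shape (batch, channels, height, width).
    Rotation is applied in the spatial plane (axes 2 and 3).
    """
    return np.rot90(feature_maps, k=k, axes=(2, 3))

def rit_augment(feature_maps, quarter_turns=(1, 2, 3)):
    """Concatenate the original feature maps with rotated copies along the
    batch axis, so later layers see rotational variants of the same features
    without the input images ever being augmented.
    """
    rotated = [rotate_feature_maps(feature_maps, k) for k in quarter_turns]
    return np.concatenate([feature_maps] + rotated, axis=0)
```

In this sketch, a batch of shape `(B, C, H, W)` grows to `(4B, C, H, W)`, analogous to how image-level augmentation would multiply the dataset, but applied only to the compact feature maps of the final convolution layer rather than to full-resolution inputs.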