An Efficient Technique for Size Reduction of Convolutional Neural Networks after Transfer Learning for Scene Recognition Tasks

A complex classification task as scene recognition is considered in the present research. Scene recognition tasks are successfully solved by the paradigm of transfer learning from pretrained convolutional neural networks, but a problem is that the eventual size of the network is huge despite a common scene recognition task has up to a few tens of scene categories. Thus, the goal is to ascertain possibility of a size reduction. The modelling recognition task is a small dataset of 4485 grayscale images broken into 15 image categories. The pretrained network is AlexNet dealing with much simpler image categories whose number is 1000, though. This network has two fully connected layers, which can be potentially reduced or deleted. A regular transfer learning network occupies about 202.6 MB performing at up to 92 % accuracy rate for the scene recognition. It is revealed that deleting the layers is not reasonable. The network size is reduced by setting a fewer number of filters in the 17^th and 20^th layers of the AlexNet-based networks using a dichotomy principle or similar. The best truncated network with 384 and 192 filters in those layers performs at 93.3 % accuracy rate, and its size is 21.63 MB.

eISSN:: 2255-8691
Lingua:: Inglese

Frequenza di pubblicazione:: 2 volte all'anno
Argomenti della rivista:: Computer Sciences, Artificial Intelligence, Information Technology, Project Management, Software Development

Feed RSS della rivista

An Efficient Technique for Size Reduction of Convolutional Neural Networks after Transfer Learning for Scene Recognition Tasks

Pubblicato online: 31 dic 2018

Pagine: 141 - 149

DOI: https://doi.org/10.2478/acss-2018-0018

Parole chiaveAlexNet, convolutional neural network, pretrained network, scene recognition, size reduction, transfer learning, truncated network

© 2018 Vadim Romanuke, published by Sciendo

This work is licensed under the Creative Commons Attribution 4.0 Public License.

Parole chiave
AlexNet, convolutional neural network, pretrained network, scene recognition, size reduction, transfer learning, truncated network