Article


Article Code : 13970703164554

Article Title : Improvement in Accuracy and Speed of Image Semantic Segmentation via Convolution Neural Network Encoder-Decoder

Keywords :

Journal Number : 23 Summer 2018

Visited : 427

Files : 452 KB


List of Authors

  Full Name Email Grade Degree Corresponding Author
1 Hanieh Zamanian hanieh.zamanian@birjand.ac.ir Post Graduate Student PhD.Student
2 Hassan Farsi hfarsi@birjand.ac.ir Professor PhD
3 Sajjad Mohammadzadeh s.mohamadzadeh@birjand.ac.ir Assistant Professor PhD

Abstract

Recent researches on pixel-wise semantic segmentation, use deep neural networks to improve accuracy and speed of these networks in order to increase the efficiency in practical applications such as automatic driving. These approaches have used deep architecture to predict pixel tags, but the results seem to be undesirable. The reason for these unacceptable results is mainly due to the existence of max pooling operators, which reduces the resolution of the feature maps. In this paper, we present a convolutional neural network composed of encoder-decoder segments based on successful SegNet network. The encoder section has a depth of 2, which in the first part has 5 convolutional layers, in which each layer has 64 filters with dimensions of 3×3. In the decoding section, the dimensions of the decoding filters are adjusted according to the convolutions used at each step of the encoding. So, at each step, 64 filters with the size of 3×3 are used for coding, the weights of these filters are adjusted by network training and adapted to the educational data. Due to having the low depth of 2, and the low number of parameters in the proposed network, the speed and the accuracy improve in comparison with the popular networks such as SegNet and DeepLab. For the CamVid dataset, after a total of 60,000 iterations, we attain the 91% for global accuracy, which indicates improvements in the efficiency of the proposed method