This paper studies the segmentation demands of vineyard images using Convolutional Neural Networks (CNNs). To this end, eleven CNN models that produce semantically segmented images are examined as part of the sensing subsystem of an autonomous agricultural robot. The task is challenging because grapes, leaves and the image background have similar colors. Moreover, the lack of controlled lighting conditions results in varying color representation of grapes and leaves. The studied CNN model architectures combine three different feature learning sub-networks with five meta-architectures for segmentation. Experiments on three different datasets of vineyard images of grape clusters and leaves yielded segmentation results, measured by the mean pixel intersection over union (IU) performance index, of up to 87.89% for grape clusters and 83.45% for leaves, achieved by the ResNet50_FRRN and MobileNetV2_PSPNet models, respectively. Comparative results demonstrate the efficacy of CNNs in separating grape clusters and leaves from the image background. Thus, the proposed models can be used in in-field applications for real-time localization of grapes and leaves, towards the automation of harvest, green harvest and defoliation activities by an autonomous robot.
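As an illustration of the performance index quoted above, the following is a minimal sketch of pixel intersection over union (IU) for a single class, averaged over a set of images. It assumes label maps are given as equal-sized 2D lists of integer class ids and that the mean is taken per image; the abstract does not state the exact averaging scheme, and the function names `pixel_iou` and `mean_iou` are hypothetical, not taken from the paper.

```python
import math


def pixel_iou(pred, target, class_id):
    """IU of one class between a predicted and a ground-truth label map.

    Both maps are assumed to be equal-sized 2D lists of integer class ids.
    Returns NaN when the class is absent from both maps (IU is undefined).
    """
    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            in_pred = p == class_id
            in_target = t == class_id
            if in_pred and in_target:
                intersection += 1
            if in_pred or in_target:
                union += 1
    if union == 0:
        return math.nan
    return intersection / union


def mean_iou(preds, targets, class_id):
    """Average the per-image IU of one class, skipping undefined (NaN) images."""
    scores = [pixel_iou(p, t, class_id) for p, t in zip(preds, targets)]
    valid = [s for s in scores if not math.isnan(s)]
    return sum(valid) / len(valid) if valid else math.nan


# Toy example: class 1 stands in for "grape cluster", class 0 for background.
pred_a = [[0, 1], [1, 1]]
target_a = [[0, 1], [0, 1]]
pred_b = [[1, 0], [0, 0]]
target_b = [[1, 0], [0, 0]]
score = mean_iou([pred_a, pred_b], [target_a, target_b], class_id=1)
```

In the toy example the first image has two overlapping pixels out of three in the union and the second image matches exactly, so the per-image scores are averaged into a single value for the class.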
T. Kalampokas, K. Tziridis, A. Nikolaou, E. Vrochidou, G. A. Papakostas, T. Pachidis, V. G. Kaburlasos. “Semantic segmentation of vineyard images using convolutional neural networks”, 21st International Conference on Engineering Applications of Neural Networks (EANN 2020), Porto Carras Grand Resort, Halkidiki, Greece, 5–7 June, 2020. In: L. Iliadis, P. P. Angelov, C. Jayne, E. Pimenidis (Eds.): EANN 2020. Heidelberg, Germany: Springer Nature Switzerland AG 2020, series: Proceedings of the International Neural Networks Society (INNS), series editors: P. Angelov, R. Kozma, vol. 2, pp. 292–303, 2020. https://doi.org/10.1007/978-3-030-48791-1_22