The After paper aims to develop an automatic solution to replace a manual process by human experts for classi- fying mangosteen and their ripe status in images. The classification solutions are developed based on deep learning techniques. These classification models are con- structed by attempting on four architectures (i.e. DenseNet, EfficientNet, ResNet, and VGG) of convolutional neural networks (CNN).
The models are trained using well-known and new prepared datasets. Two training strategies of multi-class and binary classifications are attempted in our experiments for distinguishing man- gosteen from other fruits. It is reported that the multi-class classification performs better than the binary classification, with the precision, recall, and f1-score of 100%.
In addition, a gradient-weighted class activation mapping (Grad-CAM) is used to demonstrate the reliability of the trained models. The proposed solution based on EfficientNetB0 performs the best for classification of mangosteens and their ripe statuses with the average accuracies of 100% and 98% respectively.
The multi-class CNN-based classification is developed for solving a real-world prob- lem of the ripe status classification. Alternative CNN architectures are attempted for finding the best-fit solution on a publicly available dataset and a self-collected dataset from a web scraping. The computed heatmaps show that it is not nec- essary to perform the mangosteen segmentation, the classification task could be performed directly where background and irrelevant parts of images are not/or less used.