High-Precision Tomato Maturity Detection Using a Genetic Algorithm Optimized Swin-YOLO Network

Authors

  • Lujie Fan

DOI:

https://doi.org/10.62051/ijcsit.v2n2.01

Keywords:

Tomato Maturity Detection, Swin-YOLO, Genetic Algorithm, Deep Learning, Agricultural Automation

Abstract

In the realm of precision agriculture, the determination of tomato maturity stages plays a crucial role in optimizing harvest timings, ensuring produce quality, and maximizing yields. Traditional approaches, largely reliant on manual inspection, are not only labor-intensive but also subject to human error, making them unsuitable for modern, large-scale agricultural operations. Addressing these challenges, this study pioneers the use of a Genetic Algorithm (GA) optimized Swin-YOLO (You Only Look Once) network, aiming to automate and enhance the precision of tomato maturity detection. By integrating the advanced capabilities of the Swin Transformer for feature extraction with the efficiency of the YOLO object detection framework, the proposed Swin-YOLO model is meticulously designed to identify varying stages of tomato maturity from complex agricultural imagery. The introduction of a GA facilitates the systematic optimization of the network's architecture and hyperparameters, focusing on improving accuracy and computational efficiency across diverse environmental conditions. Our research involved compiling a comprehensive dataset of annotated tomato images, reflecting a wide array of maturity stages under different lighting and occlusional challenges. The findings from our study indicate a significant leap in performance, with the GA-optimized Swin-YOLO network outshining existing deep learning models and conventional manual methods in terms of both detection accuracy and processing speed. Notably, the model exhibits exceptional adeptness at handling images affected by variable lighting and partial occlusions, marking a substantial advancement in automated tomato maturity detection. The implications of this research are profound, offering a scalable, efficient, and highly accurate solution to one of agriculture's longstanding challenges. This breakthrough not only stands to reduce labor costs and enhance operational efficiency but also contributes to the sustainability of tomato farming practices by facilitating more informed and timely harvesting decisions. Furthermore, the successful application of GA for network optimization underscores the potential of combining deep learning with evolutionary algorithms to tackle complex agricultural challenges, setting the stage for future innovations in precision agriculture and beyond.

Downloads

Download data is not yet available.

References

Yan Gong, Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, et al. "Swin-Transformer-Based YOLOv5 for Small-Object Detection in Remote Sensing Images", Sensors, Vol. 23, No. 7, pp. 3634, 2023. https://doi.org/10.3390/s23073634

Gi-Sang Choi, Department of Electrical and Computer Engineering, Graduate School, University of Seoul, et al. "Improved YOLOv5: Efficient Object Detection Using Drone Images under Various Conditions", Applied Sciences, Vol. 12, No. 14, pp. 7255, 2022. https://doi.org/10.3390/app12147255

YANG R, HU Y, YAO Y, et al. Fruit target detection based on BCo-YOLOv5 model. Mobile Information Systems, 2022, 2022: 1-8.

ZHANG W, XIA X, DU J, et al. Recognition and detection of wolfberry in the naturalbackground based on improved YOLOv5 network. 2022 3rd International Conference on Computer Vision, Image and Deep Learning&International Conference on Computer Engineering and Applications(CVIDL&ICCEA), 2022: 256-262.

WANG J, ZHANG Z, LUO L, et al. SwinGD: A Robust Grape Bunch Detection Model Based on Swin Transformer in Complex Vineyard Environment[J]. Horticulturae, 2021, 7(11).

ZHENG Z, WANG P, LIU W, et al. Distance-IoU loss: Faster and better learning for bounding box regression[C]. Proceedings of the AAAI conference on artificial intelligence, 2020, 34(07): 12993-13000.

ZHENG Z, WANG P, REN D, et al. Enhancing geometric factors in model learning and inference for object detection and instance segmentation[J]. IEEE Transactions on Cybernetics, 2021, 52(8): 8574-8586.

CARON M, TOUVRON H, MISRA I, et al. Emerging properties in self-supervised vision transformers. Proceedings of the IEEE/CVF international conference on computer vision, 2021: 9650-9660.

RANFTL R, BOCHKOVSKIY A, KOLTUN V, et al. Vision Transformers for Dense Prediction. 18th IEEE/CVF International Conference on Computer Vision(ICCV), 2021: 12159-12168.

WANG C-Y, LIAO H-Y M, WU Y-H, et al. CSPNet: A new backbone that can enhance learning capability of CNN. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, 2020: 390-391.

LIU Z, LIN Y, CAO Y, et al. Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE/CVF international conference on computer vision, 2021:10012-10022.

DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16x16 words: Transformers for image recognition at scale.ar Xiv preprint ar Xiv: 201011929, 2020.

PAN X, GE C, LU R et al. On the integration of self-attention and convolution. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022: 815-825.

Downloads

Published

26-04-2024

Issue

Section

Articles

How to Cite

Fan, L. (2024). High-Precision Tomato Maturity Detection Using a Genetic Algorithm Optimized Swin-YOLO Network. International Journal of Computer Science and Information Technology, 2(2), 1-20. https://doi.org/10.62051/ijcsit.v2n2.01