Augmentation for Accuracy Improvement of YOLOv8 in Blind Navigation System
Abstract
This study addresses the critical need for enhanced accuracy in YOLOv8 models designed for visually impaired navigation systems. Existing models often struggle with consistency in object detection and distance estimation under varying environmental conditions, leading to potential safety risks. To overcome these challenges, this research implements a rigorous approach combining data augmentation and meticulous model optimization techniques. The process begins with the meticulous collection of a diverse dataset, essential for training a robust model. Subsequent preprocessing of images in the HSV color space ensures standardized input features, crucial for consistency in model training. Augmentation techniques are then applied to enrich the dataset, enhancing model generalization and robustness. The YOLOv8 model is trained using this augmented dataset, leading to significant enhancements in key performance metrics. Specifically, mean average precision (mAP) improved by 13.3%, from 0.75 to 0.85, precision increased by 10%, from 0.80 to 0.88, and recall rose by 10.3%, from 0.78 to 0.86. Further optimization efforts, including parameter tuning and the strategic integration of a Kalman Filter, notably improved object tracking and distance estimation capabilities. Final validation in real-world scenarios confirms the efficacy of the optimized model, demonstrating its readiness for practical deployment. This comprehensive approach showcases tangible advances in navigational assistance technology, significantly improving safety and reliability for visually impaired users.
Downloads
References
Z. J. Khow, Y. F. Tan, H. A. Karim, and H. A. A. Rashid, “Improved YOLOv8 Model for a Comprehensive Approach to Object Detection and Distance Estimation,” IEEE Access, vol. 12, pp. 63754–63767, 2024, doi: 10.1109/ACCESS.2024.3396224.
M. Safaldin, N. Zaghden, and M. Mejdoub, “An Improved YOLOv8 to Detect Moving Objects,” IEEE Access, vol. 12, pp. 59782–59806, 2024, doi: 10.1109/ACCESS.2024.3393835.
S. Muhamad Itikap, M. Syahid Abdurrahman, E. B. Soewono, and T. Gelar, “Geometry and Color Transformation Data Augmentation for YOLOV8 in Beverage Waste Detection,” Journal of Software Engineering, Information and Communication Technology (SEICT), vol. 4, no. 2, pp. 123–138, 2023, doi: 10.17509/seict.
A. Aboah, B. Wang, U. Bagci, and Y. Adu-Gyamfi, “Real-time Multi-Class Helmet Violation Detection Using Few-Shot Data Sampling Technique and YOLOv8,” Apr. 2023, [Online]. Available: http://arxiv.org/abs/2304.08256
M. I. Thariq Hussan, D. Saidulu, P. T. Anitha, A. Manikandan, and P. Naresh, “Object Detection and Recognition in Real Time Using Deep Learning for Visually Impaired People,” International Journal of Electrical and Electronics Research, vol. 10, no. 2, pp. 80–86, 2022, doi: 10.37391/IJEER.100205.
S. Sun, B. Mo, J. Xu, D. Li, J. Zhao, and S. Han, “Multi-YOLOv8: An infrared moving small object detection model based on YOLOv8 for air vehicle,” Neurocomputing, vol. 588, Jul. 2024, doi: 10.1016/j.neucom.2024.127685.
Y. Li, Q. Fan, H. Huang, Z. Han, and Q. Gu, “A Modified YOLOv8 Detection Network for UAV Aerial Image Recognition,” Drones, vol. 7, no. 5, May 2023, doi: 10.3390/drones7050304.
Y. Lei, S. L. Phung, A. Bouzerdoum, H. Thanh Le, and K. Luu, “Pedestrian Lane Detection for Assistive Navigation of Vision-Impaired People: Survey and Experimental Evaluation,” IEEE Access, vol. 10, pp. 101071–101089, 2022, doi: 10.1109/ACCESS.2022.3208128.
J. Ye, Z. Yuan, C. Qian, and X. Li, “CAA-YOLO: Combined-Attention-Augmented YOLO for Infrared Ocean Ships Detection,” Sensors, vol. 22, no. 10, May 2022, doi: 10.3390/s22103782.
S. X. Tan, J. Y. Ong, K. Ong, M. Goh, and C. Tee, “Boosting Vehicle Classification with Augmentation Techniques across Multiple YOLO Versions,” INTERNATIONAL JOURNAL ON INFORMATICS VISUALIZATION, vol. 8, no. 1, pp. 45–54, 2024, [Online]. Available: www.joiv.org/index.php/joiv
D. Kumar and N. Muhammad, “Object Detection in Adverse Weather for Autonomous Driving through Data Merging and YOLOv8,” Sensors (Basel), vol. 23, no. 20, Oct. 2023, doi: 10.3390/s23208471.
A. Inui et al., “Detection of Elbow OCD in the Ultrasound Image by Artificial Intelligence Using YOLOv8,” Applied Sciences (Switzerland), vol. 13, no. 13, Jul. 2023, doi: 10.3390/app13137623.
L. Da Quach, K. N. Quoc, A. N. Quynh, and H. T. Ngoc, “Evaluating the Effectiveness of YOLO Models in Different Sized Object Detection and Feature-Based Classification of Small Objects,” Journal of Advances in Information Technology, vol. 14, no. 5, pp. 907–917, 2023, doi: 10.12720/jait.14.5.907-917.
E. Casas, L. Ramos, C. Romero, and F. Rivas-Echeverría, “A comparative study of YOLOv5 and YOLOv8 for corrosion segmentation tasks in metal surfaces,” Array, p. 100351, Jun. 2024, doi: 10.1016/j.array.2024.100351.
B. Zakariya, “System for detecting social distance during COVID-19 using YOLOv3 and OpenCV,” 2022, [Online]. Available: https://www.researchgate.net/publication/365098883
M. Hussain, “YOLO-v1 to YOLO-v8, the Rise of YOLO and Its Complementary Nature toward Digital Manufacturing and Industrial Defect Detection,” Jul. 01, 2023, Multidisciplinary Digital Publishing Institute (MDPI). doi: 10.3390/machines11070677.
M. Zha, W. Qian, W. Yi, and J. Hua, “A lightweight yolov4-based forestry pest detection method using coordinate attention and feature fusion,” Entropy, vol. 23, no. 12, Dec. 2021, doi: 10.3390/e23121587.
J. Kaur and W. Singh, “Tools, techniques, datasets and application areas for object detection in an image: a review,” Multimed Tools Appl, vol. 81, no. 27, pp. 38297–38351, Nov. 2022, doi: 10.1007/s11042-022-13153-y.
Z. Zong, G. Song, and Y. Liu, “DETRs with Collaborative Hybrid Assignments Training,” ArXiv, vol. 2211, no. 1286v5, pp. 1–13, Nov. 2022, [Online]. Available: http://arxiv.org/abs/2211.12860
S. Norkobil Saydirasulovich, A. Abdusalomov, M. K. Jamil, R. Nasimov, D. Kozhamzharova, and Y. I. Cho, “A YOLOv6-Based Improved Fire Detection Approach for Smart City Environments,” Sensors, vol. 23, no. 6, Mar. 2023, doi: 10.3390/s23063161.
P. Azevedo and V. Santos, “Comparative analysis of multiple YOLO-based target detectors and trackers for ADAS in edge devices,” Rob Auton Syst, vol. 171, Jan. 2024, doi: 10.1016/j.robot.2023.104558.
T. J. Alahmadi, A. U. Rahman, H. K. Alkahtani, and H. Kholidy, “Enhancing Object Detection for VIPs Using YOLOv4_Resnet101 and Text-to-Speech Conversion Model,” Multimodal Technologies and Interaction, vol. 7, no. 8, Aug. 2023, doi: 10.3390/mti7080077.
S. Y. Lin and H. Y. Li, “Integrated Circuit Board Object Detection and Image Augmentation Fusion Model Based on YOLO,” Front Neurorobot, vol. 15, Nov. 2021, doi: 10.3389/fnbot.2021.762702.
S. X. Tan, J. Y. Ong, K. Ong, M. Goh, and C. Tee, “Boosting Vehicle Classification with Augmentation Techniques across Multiple YOLO Versions,” INTERNATIONAL JOURNAL ON INFORMATICS VISUALIZATION, pp. 45–54, 2024, [Online]. Available: www.joiv.org/index.php/joiv
D. K. Baroroh, C. H. Chu, and L. Wang, “Systematic literature review on augmented reality in smart manufacturing: Collaboration between human and computational intelligence,” J Manuf Syst, vol. 61, pp. 696–711, Oct. 2021, doi: 10.1016/j.jmsy.2020.10.017.
A. Buslaev, V. I. Iglovikov, E. Khvedchenya, A. Parinov, M. Druzhinin, and A. A. Kalinin, “Albumentations: Fast and flexible image augmentations,” Information (Switzerland), vol. 11, no. 2, Feb. 2020, doi: 10.3390/info11020125.
A. Buslaev, A. Parinov, E. Khvedchenya, V. I. Iglovikov, and A. A. Kalinin, “Albumentations: fast and flexible image augmentations,” Sep. 2018, doi: 10.3390/info11020125.
S. E. Ryu and K. Y. Chung, “Detection model of occluded object based on yolo using hard-example mining and augmentation policy optimization,” Applied Sciences (Switzerland), vol. 11, no. 15, Aug. 2021, doi: 10.3390/app11157093.
F. J. Du and S. J. Jiao, “Improvement of Lightweight Convolutional Neural Network Model Based on YOLO Algorithm and Its Research in Pavement Defect Detection,” Sensors, vol. 22, no. 9, May 2022, doi: 10.3390/s22093537.
R. Wang, F. Liang, B. Wang, and X. Mou, “ODCA-YOLO: An Omni-Dynamic Convolution Coordinate Attention-Based YOLO for Wood Defect Detection,” Forests, vol. 14, no. 9, Sep. 2023, doi: 10.3390/f14091885.
W. Zhao, M. Syafrudin, and N. L. Fitriyani, “CRAS-YOLO: A Novel Multi-Category Vessel Detection and Classification Model Based on YOLOv5s Algorithm,” IEEE Access, vol. 11, pp. 11463–11478, 2023, doi: 10.1109/ACCESS.2023.3241630.
D. N. Triwibowo, E. Utami, Sukoco, and S. Raharjo, “Analysis of Classification and Calculation of Vehicle Type at APILL Intersection Using YOLO Method and Kalman Filter,” in 3rd International Conference on Cybernetics and Intelligent Systems, ICORIS, IEEE, Institute of Electrical and Electronics Engineers Inc., 2021. doi: 10.1109/ICORIS52787.2021.9649607.
R. Arifando, S. Eto, and C. Wada, “Improved YOLOv5-Based Lightweight Object Detection Algorithm for People with Visual Impairment to Detect Buses,” Applied Sciences (Switzerland), vol. 13, no. 9, May 2023, doi: 10.3390/app13095802.
S. Wang, “Research towards Yolo-Series Algorithms: Comparison and Analysis of Object Detection Models for Real-Time UAV Applications,” in Journal of Physics: Conference Series, IOP Publishing Ltd, Jun. 2021. doi: 10.1088/1742-6596/1948/1/012021.
D. Ganga, V. Bharath, P. N. Sri, T. Tulasi, and S. K. Sharook, “SOCIAL DISTANCE DETECTOR USING OPENCV YOLO, CNN ALGORITHM IN DEEP LEARNING,” ZKG International, vol. VIII, no. I, pp. 893–897, 2023, [Online]. Available: www.zkginternational.com
Y. Hui, J. Wang, and B. Li, “DSAA-YOLO: UAV remote sensing small target recognition algorithm for YOLOV7 based on dense residual super-resolution and anchor frame adaptive regression strategy,” Journal of King Saud University - Computer and Information Sciences, vol. 36, no. 1, Jan. 2024, doi: 10.1016/j.jksuci.2023.101863.
Y. Cao, Z. Liu, F. Wang, S. Su, Y. Sun, and W. Wang, “An improved YOLOv7 for the state identification of sliding chairs in railway turnout,” High-speed Railway, Apr. 2024, doi: 10.1016/j.hspr.2024.04.002.
C.-Y. Wang, A. Bochkovskiy, and H.-Y. M. Liao, “YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors,” 2023.
M. Konaite et al., “Smart Hat for the blind with Real-Time Object Detection using Raspberry Pi and TensorFlow Lite,” Association for Computing Machinery (ACM), Dec. 2021, pp. 1–6. doi: 10.1145/3487923.3487929.
A. Hendrawan, R. Gernowo, O. D. Nurhayati, and C. Dewi, “A Novel YOLO-ARIA Approach for Real-Time Vehicle Detection and Classification in Urban Traffic,” International Journal of Intelligent Engineering and Systems, vol. 17, no. 1, pp. 428–446, 2024, doi: 10.22266/ijies2024.0229.38.
I. Purwita Sary, E. Ucok Armin, S. Andromeda, E. Engineering, and U. Singaperbangsa Karawang, “Performance Comparison of YOLOv5 and YOLOv8 Architectures in Human Detection Using Aerial Images,” Ultima Computing : Jurnal Sistem Komputer, vol. 15, no. 1, 2023.
J. Yan et al., “Enhanced object detection in pediatric bronchoscopy images using YOLO-based algorithms with CBAM attention mechanism,” Heliyon, vol. 10, no. 12, Jun. 2024, doi: 10.1016/j.heliyon.2024.e32678.
Copyright (c) 2024 Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright in each article belongs to the author
- The author acknowledges that the RESTI Journal (System Engineering and Information Technology) is the first publisher to publish with a license Creative Commons Attribution 4.0 International License.
- Authors can enter writing separately, arrange the non-exclusive distribution of manuscripts that have been published in this journal into other versions (eg sent to the author's institutional repository, publication in a book, etc.), by acknowledging that the manuscript has been published for the first time in the RESTI (Rekayasa Sistem dan Teknologi Informasi) journal ;