Skip to main content
Research
Computer VisionYOLOv8Precision AgricultureDeep Learning

Comparative Study of Deep Learning Methods for Rice Leaf Disease Detection

Download PDF
Abstract

Rice is a staple crop for millions of people worldwide and a cornerstone of Vietnam's agricultural economy. This paper presents a comprehensive comparative evaluation of YOLOv8, Faster R-CNN, and SSD for automated rice leaf disease detection on a unified dataset of 8,040 annotated images covering four disease classes from Vietnamese rice paddies. YOLOv8 achieves the best overall balance with mAP@50-95 of 0.732 and inference speed of 1,873 FPS.

Introduction

Vietnam is among the world's top three rice-exporting nations, cultivating approximately 7.5 million hectares and producing 40-45 million tons per year. Four major diseases threaten yields:

  • Bacterial Blight (Xanthomonas oryzae) — 20-50% yield loss
  • Blast (Magnaporthe oryzae) — epidemic-level damage in Northern Vietnam
  • Brown Spot (Cochliobolus miyabeanus) — degrades grain quality
  • Twisted Draft / Tungro (SRBSDV) — stunting and leaf curling in Mekong Delta

Conventional manual inspection is labor-intensive and often too late. Deep learning offers automated, rapid, objective detection.

Bacterial Blight symptoms
Fig. 1 — Bacterial Blight: leaf desiccation caused by Xanthomonas oryzae

Methodology

Three architectures evaluated on the same dataset:

YOLOv8

Single-stage, anchor-free, CSPDarknet backbone. Single forward pass. Loss: CIoU + BCE + DFL. Trained 100 epochs on NVIDIA A100, ~16 min.

Faster R-CNN

Two-stage, ResNet-50 FPN backbone. RPN generates 15 anchors/position. RoI Pooling to classification head. Trained 50 epochs on L4 GPU, ~1.5 hr.

SSD512

Single-stage, ResNet-50 backbone. 6 feature map levels, 16,512 anchors. Trained 250 epochs on L4 GPU, ~2 hr.

Dataset

SplitImagesBBoxesPercentage
Train6,6225,95882.4%
Validation9461,37211.8%
Test4724725.8%
Total8,0407,802100%

Class Distribution

Disease ClassBBoxesProportion
Bacterial Blight1,48925.4%
Blast1,19120.3%
Brown Spot1,59827.3%
Twisted Draft1,52426.0%

Results

Model Comparison

ModelPrecisionRecallF1mAP@50mAP@50-95FPS
YOLOv80.9230.8580.8890.8980.7321,873
Faster R-CNN0.8800.8500.8650.8700.68020
SSD0.8500.8000.8240.8900.72038.5

ROI Size Impact

DiseaseROI SizeYOLOv8 mAP@50FRCNN mAP@50SSD mAP@50Avg FP
Brown SpotLarge0.9570.9200.900127
Bacterial BlightMedium0.9100.9000.880160
BlastMedium0.9490.8800.860136
Twisted DraftSmall0.7700.7500.780258
YOLOv8 training curves
Fig. 6 — Visual comparison of Precision, Recall, mAP@50, and mAP@50-95 across all three models

Discussion

ROI Size Impact

Twisted Draft consistently lowest across all models (mAP@50 0.750-0.780) due to small lesion areas <0.2x0.2 pixels after backbone downsampling. False positives highest: 221-286 vs 67-128 for other classes.

Speed-Accuracy Trade-off

  • YOLOv8: optimal for real-time (mobile, drone) — 1,873 FPS, 3.2M params (6.3MB)
  • SSD: middle ground — 38.5 FPS, competitive mAP@50
  • Faster R-CNN: best for lab analysis — highest IoU (0.71), too slow for real-time (20 FPS)

Deployment Scenarios

  • Mobile app: YOLOv8 (under 80ms on CPU, on-device without connectivity)
  • Drone field scanning: YOLOv8 or SSD
  • Lab analysis: Faster R-CNN

Conclusion

Key Findings

  1. YOLOv8 optimal overall: highest mAP@50-95 (0.732), Precision (0.923), F1 (0.889), speed (1,873 FPS), lowest FP (483)
  2. SSD: highest mAP@50 (0.890), near-real-time at 38.5 FPS
  3. Faster R-CNN: best localization (IoU 0.71), unsuitable for real-time at 20 FPS
  4. Twisted Draft unsolved — small ROI challenge remains

Future Directions

Small-ROI optimization (16x16 anchors + FPN attention), DETR/RT-DETR architectures, dataset expansion (+500-1,000 Twisted Draft images), INT8 quantization for Jetson Nano, IoT integration with environmental sensors, instance segmentation (Mask R-CNN / U-Net).