CUBIT: A High-resolution Infrastructure Defect Dataset
Fully Evaluated with Autonomous Detection Framework
Submitted to International Conference on Acoustics, Speech, & Signal Processing 2024 (ICASSP 2024)
2.School of Electronics and Information Engineering,Tongji University
- The visualization of defect detection framework based on CUBIT dataset is illustrated below *, which encompasses the entire process: data collection by autonomous unmanned system; the baseline network integrated with our GIPFPP module; the output of defect detection results.
Dataset | Num. of Images | Resolution | Data Collection Platform | Category | Scenario | Material | Experiments |
---|---|---|---|---|---|---|---|
RDD-2018 | 9053 | 600x600 | Smartphones | Crack, Corrosion | Pavement | Asphalt | SSD |
RDD-2019 | 13135 | 600x600 | Smartphones | Crack, Corrosion | Pavement | Asphalt | SSD |
RDD-2020 | 26336 | 600x600, 720x720 | Smartphones | Crack, Pothole | Pavement | Asphalt | SSD |
RDD-2022 | 47420 | 512x512, 600x600, 720x720, 3650x2044 | Smartphones, Hand-held cameras, UAV cameras, Google street view | Crack, Pothole | Pavement | Asphalt | - |
PID | 7237 | 640x640 | Crawled from Internet | Crack | Pavement | Asphalt | YOLOv2, Fast R-CNN |
Murad | 2620 | up to 838x809 | Smartphones | Crack | Pavement | Asphalt | Faster R-CNN |
CODEBRIM | 1590 | up to 6000x4000 | Hand-held cameras, UAV Cameras | Crack, Corrosion | Bridge | Concrete | MetaQNN, ENAS |
CUBIT | 5527 | 4624x3472 and 8000x6000 | Cameras in Unmanned Systems | Crack, Spallinig, Moisture | Building (65%), Pavement (29%), Bridge (6%) | Concrete, Asphalt, Stone | Faster R-CNN, PP-YOLO, PP-YOLOv2, YOLOX, YOLOv5, YOLOv7, YOLOv6, YOLOv6+GIPFPP(ours), Real-site experiment |
The evaluation results of SOTA real-time detection methods and YOLOv6-n with our GIPFPP module are benchmarked in the table below. After switching from the original module to GIPFPP module, the AP of YOLOv6-n is improved by 3%, while its number of parameters is reduced by 10%. The enhancements made to the model will facilitate the real-time defect detection using unmanned systems.
Model | #Params.(M) | FLOPs(G) | Size | mAP${50}^{test}$ / mAP${50:95}^{test}$ | Latency(ms) |
---|---|---|---|---|---|
Faster R-CNN(Res50) | 42.62 | 477.24 | 1024 | 71.5% / 43.3% | 76.9 |
PP-YOLO | 48.99 | 136.43 | 1024 | 76.4% / 45.1% | 14.5 |
PP-YOLOv2 | 56.91 | 146.50 | 1024 | 77.3% / 47.1% | 13.8 |
YOLOv5-n | 1.76 | 4.10 | 1024 | 73.4% / 39.9% | 1.8 |
YOLOv5-s | 7.18 | 15.80 | 1024 | 78.5% / 47.2% | 3.3 |
YOLOv7-t | 6.01 | 13.01 | 1024 | 71.1% / 39.7% | 1.9 |
YOLOX-n | 2.24 | 17.75 | 1024 | 73.0% / 39.5% | 4.4 |
YOLOX-t | 5.03 | 39.00 | 1024 | 75.3% / 49.2% | 5.8 |
YOLOX-s | 8.94 | 68.51 | 1024 | 77.9% / 49.4% | 7.6 |
YOLOv6-n(baseline) | 4.63 | 29.03 | 1024 | 76.3% / 47.9% | 2.2 |
YOLOv6-s | 18.50 | 115.64 | 1024 | 79.0% / 48.2% | 5.3 |
YOLOv6-n+GIFPFF(ours) | 4.14 (-0.49) | 28.02 (-1.01) | 1024 | 77.5% (+1.2) / 50.3% (+3.1) | 2.2 |
We enlarge the prediction results in the bottom right corner of framework images above. CUBIT dataset covers three infrastructure types: Building facade, Pavement, and Bridge, and aims for three types of defect: Crack, Spalling, and Moisture. Rectangles indicate the output prediction box Red for Crack, Pink for Spalling, and Orange for Moisture with inferred defect type and confidence score from YOLOv6-l trained on the training set of our proposed dataset.
Qualitative visualization of UAV-based real-world experiment is shown below On the left, our multi-UAVs inspection schematics is illustrated. On the right, the detection results of four direction façades of the building are displayed.
This work was supported by the InnoHK of the Government of the Hong Kong Special Administrative Region via the Hong Kong Centre for Logistics Robotics.