Single-stage object detection algorithm based on optimizing position prediction
|
Na ZHANG (张娜), Xu-lei QI (戚旭磊), Xiao-an BAO (包晓安), Biao WU (吴彪), Xiao-mei TU (涂小妹), Yu-ting JIN (金瑜婷)
|
|
Tab.1 Comparison of mean average precision on VOC2007 test dataset |
|
Method | Backbone | Input size | GPU | mAP/% | v/(frame·s⁻¹) |
Faster R-CNN[3] | VGGNet | 1000×600 | Titan X | 73.2 | 7.0 |
Faster R-CNN[3] | ResNet-101 | 1000×600 | 1080Ti | 78.8 | 2.3 |
Mask R-CNN[36] | ResNet-50 | 1000×600 | 1080Ti | 77.4 | 4.2 |
Cascade R-CNN[37] | VGGNet | 1000×600 | 1080Ti | 79.6 | 5.3 |
YOLOv2[6] | Darknet-19 | 352×352 | Titan X | 73.7 | 81.0 |
RefineDet320[18] | VGGNet | 320×320 | 1080Ti | 80.0 | 22.1 |
FCOS[32] | ResNet-50 | 1333×800 | 1080Ti | 73.5 | 17.6 |
ATSS[11] | ResNet-50 | 1333×800 | 1080Ti | 75.2 | 14.9 |
RetinaNet400[10] | ResNet-101 | ~640×400 | 1080Ti | 79.4 | 12.4 |
FSSD300[16] | VGGNet | 300×300 | 1080Ti | 78.8 | 65.0 |
SSD300[13] | VGGNet | 300×300 | 1080Ti | 77.2 | 42.1 |
ASSD321[38] | ResNet-101 | 321×321 | K40 | 79.5 | 11.4 |
DSSD321[15] | ResNet-101 | 321×321 | Titan X | 78.6 | 9.5 |
EL-SSD300 | VGGNet | 300×300 | 1080Ti | 79.8 | 27.0 |