Coupling the Power of YOLOv9 with Transformer for Small Object Detection in Remote-Sensing Images

在线阅读下载PDF

导出

摘要 Recent years have seen a surge in interest in object detection on remote sensing images for applications such as surveillance andmanagement.However,challenges like small object detection,scale variation,and the presence of closely packed objects in these images hinder accurate detection.Additionally,the motion blur effect further complicates the identification of such objects.To address these issues,we propose enhanced YOLOv9 with a transformer head(YOLOv9-TH).The model introduces an additional prediction head for detecting objects of varying sizes and swaps the original prediction heads for transformer heads to leverage self-attention mechanisms.We further improve YOLOv9-TH using several strategies,including data augmentation,multi-scale testing,multi-model integration,and the introduction of an additional classifier.The cross-stage partial(CSP)method and the ghost convolution hierarchical graph(GCHG)are combined to improve detection accuracy by better utilizing feature maps,widening the receptive field,and precisely extracting multi-scale objects.Additionally,we incorporate the E-SimAM attention mechanism to address low-resolution feature loss.Extensive experiments on the VisDrone2021 and DIOR datasets demonstrate the effectiveness of YOLOv9-TH,showing good improvement in mAP compared to the best existing methods.The YOLOv9-TH-e achieved 54.2% of mAP50 on the VisDrone2021 dataset and 92.3% of mAP on the DIOR dataset.The results confirmthemodel’s robustness and suitability for real-world applications,particularly for small object detection in remote sensing images.

作者 Mohammad Barr

机构地区 Department of Electrical Engineering

出处《Computer Modeling in Engineering & Sciences》 2025年第4期593-616,共24页 工程与科学中的计算机建模(英文)

关键词 Remote sensing images YOLOv9-TH multi-scale object detection transformer heads VisDrone2021 dataset

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1Qingping LI,Qin GUAN,Aijuan BAI,Jinhai LI,Yujun ZHU.Construction of Forecast and Early Warning System of Meteorological and Geological Disasters in Qinghai Province[J].Meteorological and Environmental Research,2022,13(3):49-55. 被引量：1
2Rong Yao,Zhiming Kang,Yong Li,Xiangning Cai.Application and Verification of Multi-Model Products in Medium Range Forecast[J].Journal of Geoscience and Environment Protection,2018,6(7):178-193.
3孙明辰,金辉,王英.融合胶囊网络与因果推理的疾病预测[J].应用科学学报,2025,43(1):1-19.
4王新良,王璐莹.多感受野增强的爆破现场安全帽检测算法[J].计算机工程与应用,2025,61(7):315-324.
5周宇,肖健梅,王锡淮.基于GCN和HGP-SL的电力系统暂态稳定评估[J].电气工程学报,2024,19(4):246-254.
6CHEN Lei,XIONG Qingbo,ZHANG Wei,LI Runde.Adaptive-basis decomposition-based low-rank network for efficient non-uniform motion deblurring[J].Optoelectronics Letters,2025,21(1):43-50.
7SUNPeng.Research on Widening Design Methods of Old Hollow Slab Bridges[J].外文科技期刊数据库(文摘版)工程技术,2022(4):184-188.
8MA Xianda,LAN Zhaohui,CHEN Zhitang,MONISHA M L,HE Xinyi,LI Weidong.Significant Retest Effects in Spatial Working Memory Task[J].Journal of Shanghai Jiaotong university(Science),2025,30(1):115-120.
9Songle Chen,Hongbo Sun,Yuxin Wu,Lei Shang,Xiukai Ruan.A Helmet Detection Algorithm Based on Transformers with Deformable Attention Module[J].Chinese Journal of Electronics,2025,34(1):229-241.
10张瑜舟,成丽波.基于组稀疏混合模型的遥感图像去噪方法[J].应用数学进展,2025,14(2):69-80.

Computer Modeling in Engineering & Sciences

2025年第4期

浏览历史

内容加载中请稍等...

Coupling the Power of YOLOv9 with Transformer for Small Object Detection in Remote-Sensing Images

相关作者

相关机构

相关主题

浏览历史