
An Approach to Parallelization of SIFT Algorithm on GPUs for Real-Time Applications (cited by 4)

Abstract: The Scale Invariant Feature Transform (SIFT) algorithm is a widely used computer vision algorithm that detects and extracts local feature descriptors from images. SIFT is computationally intensive, making it infeasible for a single-threaded implementation to extract local feature descriptors from high-resolution images in real time. In this paper, an approach to parallelization of the SIFT algorithm is demonstrated using NVIDIA's Graphics Processing Unit (GPU). The parallelization design for SIFT on GPUs is divided into two stages: a) algorithm design, generic design strategies that focus on the data, and b) implementation design, architecture-specific design strategies that focus on optimally using GPU resources for maximum occupancy. Increasing memory latency hiding, eliminating branches, and data blocking achieve a significant decrease in average computational time. Furthermore, it is observed via the Paraver tools that our approach to parallelization, while optimizing for maximum occupancy, allows the GPU to execute the memory-bound SIFT algorithm at optimal levels.
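The abstract names the optimization levers without showing them. The sketch below is not the authors' code; the kernel name, tile width TILE_W, and filter radius RADIUS are illustrative assumptions. It applies two of the stated strategies to a row-wise Gaussian blur, the memory-bound workhorse of SIFT's scale-space construction: data blocking, where each thread block stages a tile plus halo in shared memory so every input pixel is fetched from global memory once, and branch elimination, where image borders are handled by clamping load indices rather than by divergent if/else tests.

#include <cuda_runtime.h>
#include <math.h>
#include <stdio.h>

#define RADIUS 4    // assumed filter radius (sigma-dependent in SIFT)
#define TILE_W 256  // assumed threads (and output pixels) per block

__constant__ float d_kernel[2 * RADIUS + 1];  // Gaussian taps, broadcast-cached

__global__ void blurRowTiled(const float* in, float* out, int width)
{
    // Shared-memory tile: TILE_W outputs plus a halo of RADIUS on each side,
    // so each block reads its input span from global memory exactly once.
    __shared__ float tile[TILE_W + 2 * RADIUS];

    int row = blockIdx.y;
    int x   = blockIdx.x * TILE_W + threadIdx.x;

    // Branch-free border handling: clamp the load index instead of testing
    // it, so all threads in a warp follow the same execution path.
    int load = min(max(x - RADIUS, 0), width - 1);
    tile[threadIdx.x] = in[row * width + load];
    if (threadIdx.x < 2 * RADIUS) {  // a few threads also fetch the right halo
        int tail = min(max(x - RADIUS + TILE_W, 0), width - 1);
        tile[threadIdx.x + TILE_W] = in[row * width + tail];
    }
    __syncthreads();

    if (x < width) {
        float acc = 0.0f;
        #pragma unroll                        // fully unrolled, divergence-free loop
        for (int k = 0; k <= 2 * RADIUS; ++k)
            acc += d_kernel[k] * tile[threadIdx.x + k];
        out[row * width + x] = acc;
    }
}

int main()
{
    const int W = 1024, H = 768;  // assumed image size for the demo

    // Build normalized Gaussian taps on the host (sigma = 1.6, the
    // conventional base sigma of SIFT's scale space) and upload them.
    float h_kernel[2 * RADIUS + 1], sum = 0.0f;
    for (int i = -RADIUS; i <= RADIUS; ++i) {
        h_kernel[i + RADIUS] = expf(-(float)(i * i) / (2.0f * 1.6f * 1.6f));
        sum += h_kernel[i + RADIUS];
    }
    for (int i = 0; i <= 2 * RADIUS; ++i) h_kernel[i] /= sum;
    cudaMemcpyToSymbol(d_kernel, h_kernel, sizeof(h_kernel));

    // A zero image stands in for real input; the point is the launch shape.
    float *d_in, *d_out;
    cudaMalloc(&d_in,  W * H * sizeof(float));
    cudaMalloc(&d_out, W * H * sizeof(float));
    cudaMemset(d_in, 0, W * H * sizeof(float));

    // One block row per image row; TILE_W output pixels per block.
    dim3 grid((W + TILE_W - 1) / TILE_W, H);
    blurRowTiled<<<grid, TILE_W>>>(d_in, d_out, W);
    cudaDeviceSynchronize();
    printf("kernel status: %s\n", cudaGetErrorString(cudaGetLastError()));

    cudaFree(d_in);
    cudaFree(d_out);
    return 0;
}

A kernel like this covers only the horizontal pass of one pyramid level; in the paper's two-stage scheme, a choice such as TILE_W would fall under implementation design and be tuned per architecture to maximize occupancy.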
Authors: Raghu Raj Prasanna Kumar; Suresh Muknahallipatna; John McInroy (Department of Electrical Engineering, University of Wyoming, Laramie, USA)
Source: Journal of Computer and Communications, 2016, No. 17, pp. 18-50 (33 pages)
Keywords: Scale Invariant Feature Transform (SIFT); Parallel Computing; GPU; GPU Occupancy; Portable Parallel Programming; CUDA