3 articles found
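1. Compiler Optimization of Complex-Number Operations on Multi-Cluster VLIW DSPs (cited by: 3)
Authors: Ding Chenfei (丁陈飞), Zheng Qilong (郑启龙), Xu Huaye (徐华叶), Fu Heping (付和萍), Chen Yuan (陈元). Computer Applications and Software (《计算机应用与软件》), CSCD, 2015, No. 2, pp. 14-17 (4 pages)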
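Multi-cluster very long instruction word (VLIW) DSPs provide a variety of complex-number instructions, but the compiler cannot directly use these instructions to improve performance. This paper therefore proposes a compiler optimization method for complex-number operations. Within a basic block that contains complex operations, load instructions are moved earlier and store instructions are moved later, so that the complex arithmetic operations gather into a contiguous operation slice in which opportunities to synthesize complex instructions can be found. Experimental results show that the optimization reduces the clock cycles of complex-number programs.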
Keywords: multi-cluster; very long instruction word; complex-number instructions; compiler optimization
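2. Support and Optimization of the Complex Type on the Hunxin (魂芯) DSP (cited by: 2)
Authors: Wang Yulin (王玉林), Zheng Qilong (郑启龙), Zhao Gaoyi (赵高义). Computer Systems & Applications (《计算机系统应用》), 2017, No. 9, pp. 40-45 (6 pages)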
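The Hunxin (魂芯) DSP is a 32-bit static scalar digital signal processor that adopts VLIW and SIMD architectures and is designed for high-performance computing. To meet the performance requirements of high-performance digital computing, the Hunxin DSP provides a rich set of complex-number instructions, but the compiler cannot directly use these instructions to improve performance. Because the chip offers a large number of complex-operation instructions, this work builds on the compilation framework of the traditional open-source compiler Open64 and implements support for complex numbers as a compiler base type, together with support for complex arithmetic operations. In addition, the compiler recognizes specific patterns of complex operations and uses the complex-number instructions of the Hunxin DSP to optimize the compiled program. Experimental results show that, with this implementation, the Hunxin DSP compiler achieves an average speedup of 5.28 on complex-number programs.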
Keywords: compiler optimization; clustered-architecture DSP; complex-number instructions; Open64 compiler
3. Sorting Data Elements by SOCD Using Centralized Diamond Architecture
Authors: Masumeh Damrudi, Kamal Jadidy Aval. Computer Technology and Application, 2011, No. 5, pp. 374-377 (4 pages)
Parallel sorting techniques on a variety of architectures have been studied for many years. Given today's demand for faster systems, parallelism can be used to accelerate applications, and parallel operations are now applied to problems such as sorting and searching with reasonable speed. Sorting is one of the most important operations in computing, and among the qualities sought in a sorting method, speedup is foremost. In this paper, the authors present a sort with O(log n) time complexity on PRAM EREW (Parallel Random Access Machine Exclusive Read Exclusive Write). The algorithm is designed to balance the number of processing elements in the architecture against execution time. Simulation of the algorithm confirms the theoretical analysis. The results of this research can be used in developing faster embedded systems. The Sorting on Centralized Diamond (SOCD) algorithm runs on the novel Centralized Diamond architecture, which takes advantage of the Single Instruction Multiple Data (SIMD) architecture. The authors describe this architecture and the sort on it as intuitive and optimal.
Keywords: parallel sorting; diamond architecture; single instruction multiple data (SIMD); parallel random access machine exclusive read exclusive write (PRAM EREW); sorting on centralized diamond (SOCD)