We present novel vector permutation and branch reduction methods to minimize the number of execution cycles for bit reversal algorithms. The new methods are applied to a single instruction multiple data (SIMD) parallel implementation of the complex-data floating-point fast Fourier transform (FFT). Compared with conventional implementations, the number of operational clock cycles can be reduced by an average factor of 3.5 by using our vector permutation methods and by 1.1 by using our branch reduction methods. Experiments on the MPC7448 (a well-known SIMD reduced instruction set computing processor) demonstrate that our optimal bit-reversal algorithm consistently takes fewer than two cycles per element in complex array operations.
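For readers unfamiliar with the reordering step that this abstract optimizes, the following is a minimal scalar sketch of a bit-reversal permutation for a power-of-two-length array. It is not the authors' SIMD vector permutation or branch reduction method, which the abstract does not describe in detail; the function names `bit_reverse_indices` and `bit_reverse_permute` are hypothetical and chosen only for illustration.

```python
def bit_reverse_indices(n):
    """Return the bit-reversed index table for a power-of-two length n.

    Illustrative scalar reference only; not the paper's SIMD method.
    """
    if n <= 0 or n & (n - 1):
        raise ValueError("n must be a positive power of two")
    bits = n.bit_length() - 1  # number of index bits, log2(n)
    table = [0] * n
    for i in range(1, n):
        # Reuse the entry for i >> 1: shift it right by one and
        # place the lowest bit of i into the highest bit position.
        table[i] = (table[i >> 1] >> 1) | ((i & 1) << (bits - 1))
    return table


def bit_reverse_permute(data):
    """Reorder a list in place into bit-reversed order."""
    table = bit_reverse_indices(len(data))
    for i, j in enumerate(table):
        # Swap each pair exactly once; indices that are their own
        # reversal (i == j) stay where they are.
        if i < j:
            data[i], data[j] = data[j], data[i]
    return data
```

The `if i < j` test inside the swap loop is a data-dependent branch of the kind that branch reduction techniques aim to remove, and the element-by-element swaps are what a vector permutation instruction would perform several at a time.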
The three-dimensional inverse transient thermoelastic problem for a thin rectangular object is considered within the context of the theory of generalized thermoelasticity. The rectangular object occupies the space D: -a ≤ x ≤ a; -b ≤ y ≤ b; 0 ≤ z ≤ h, with known boundary conditions. Laplace and finite Marchi-Fasulo transform techniques are used to determine the unknown temperature, temperature distribution, displacement, and thermal stresses on the upper plane surface of the thin rectangular object. The distributions of the considered physical variables are obtained and represented graphically.
Funding: University Grants Commission, New Delhi, which provided partial financial assistance under the major research project scheme.