A head-related transfer function (HRTF) model for fast and real-time synthesizing multiple virtual sound sources is proposed. A head-related impulse response (HRIR, time- domain version of HRTF) is first decompose...A head-related transfer function (HRTF) model for fast and real-time synthesizing multiple virtual sound sources is proposed. A head-related impulse response (HRIR, time- domain version of HRTF) is first decomposed by a two-level wavelet packet and then represented by a model composed of subband filters and reconstruction filters. The coefficients of the subband filters are the zero interpolation of the wavelet coefficients of the HRIR. The coefficients of the reconstruction filters can be calculated from the wavelet function. The model is simplified by applying a threshold method to reduce the wavelet coefficients. The calculated results indicate that for a model with 30 wavelet coefficients, the error of reconstructed HRIR is about 1%. And the result of a psychoacoustic test shows that a model with 35 wavelet coefficients is perceptually indistinguishable from the original HRIR. When multiple virtual sound sources are synthesized simultaneously, the computational cost of the proposed model is much less than the traditional HRTF filters.展开更多
Near-field head-related transfer functions (HRTFs) are essential to scientific re- searches of binaural hearing and practical applications of virtual auditory display. High ef- ficiency, accuracy and repeatability a...Near-field head-related transfer functions (HRTFs) are essential to scientific re- searches of binaural hearing and practical applications of virtual auditory display. High ef- ficiency, accuracy and repeatability are required in a near-field HRTF measurement. Hence, there is no reference which intents on solving the measuring difficulties of near-field HRTF for human subjects. In present work, an efficient near-field HRTF measurement system based on computer control is designed and implemented, and a fast calibration method for the system is proposed to first solve the measurement of near-field HRTF for human subjects. The efficiency of measurement is enhanced by a comprehensive design on the acoustic, electronic and mechanical parts of the system. And the accuracy and repeatability of the measurement are greatly im- proved by carefully calibrating the positions of sound source, subject and binaural microphones. This system is suitable for near-field HRTF measurement at various source distances within 1.0 m, for both human subject and artificial head. The time costs of HRTF measurement at a single sound source distance and full directions has been reduced to less than 20 minutes. The measurement results indicate that the accuracy of the system satisfies the actual requirements. The system is applicable to scientific research and can be used to establish an individualized near-field HRTF database for human subjects.展开更多
针对传统的多重信号分类(multiple signal classification,简称MUSIC)算法定位声源位置时存在计算量大的问题,提出了一种基于宏微导向的蚁群(ant colony optimization,简称ACO)-MUSIC两级相控声源定位算法。首先,利用ACO估算出声源所在...针对传统的多重信号分类(multiple signal classification,简称MUSIC)算法定位声源位置时存在计算量大的问题,提出了一种基于宏微导向的蚁群(ant colony optimization,简称ACO)-MUSIC两级相控声源定位算法。首先,利用ACO估算出声源所在的宏观位置,再用MUSIC算法精确搜索声源所在的微观方位;其次,对提出的算法进行数值仿真,并搭建实验系统进行验证。仿真和实验结果表明,所提出的算法可以高精度、快速地定位出声源所在的位置;在搜索步距为0.05°时,算法的计算复杂度和计算时间仅为传统MUSIC算法的0.25%和2.8%。展开更多
针对三维多重信号分类(Multiple Signal Classification,MUSIC)算法估计声源位置时计算速度慢,计算量大等缺点,提出了一种基于鸡群优化(Chicken Swarm Optimization,CSO)算法的近场声源三维定位算法。首先建立近场声源信号接收的数学模...针对三维多重信号分类(Multiple Signal Classification,MUSIC)算法估计声源位置时计算速度慢,计算量大等缺点,提出了一种基于鸡群优化(Chicken Swarm Optimization,CSO)算法的近场声源三维定位算法。首先建立近场声源信号接收的数学模型,并选取三维MUSIC算法中的空间谱函数为文章算法中的适应度函数。通过不断迭代和局部搜索,以适应度值为指标对鸡群个体进行排序,最终得到最优鸡群个体的位置,即近场待测声源的坐标。仿真和实验结果表明:文中算法具有定位精度高、计算效率高、实时性好等优点,文中算法的平均用时仿真时为三维MUSIC算法平均用时的1.9%,实验时为三维MUSIC算法用时的3.2%。展开更多
基金supported by the National Nature Science Fund of China(50938003,10774049)State Key Lab of Subtropical Building Science,South China University of Technology
文摘A head-related transfer function (HRTF) model for fast and real-time synthesizing multiple virtual sound sources is proposed. A head-related impulse response (HRIR, time- domain version of HRTF) is first decomposed by a two-level wavelet packet and then represented by a model composed of subband filters and reconstruction filters. The coefficients of the subband filters are the zero interpolation of the wavelet coefficients of the HRIR. The coefficients of the reconstruction filters can be calculated from the wavelet function. The model is simplified by applying a threshold method to reduce the wavelet coefficients. The calculated results indicate that for a model with 30 wavelet coefficients, the error of reconstructed HRIR is about 1%. And the result of a psychoacoustic test shows that a model with 35 wavelet coefficients is perceptually indistinguishable from the original HRIR. When multiple virtual sound sources are synthesized simultaneously, the computational cost of the proposed model is much less than the traditional HRTF filters.
基金supported by the National Natural Science Foundation of China(11104082,11574090)Fundamental Research Funds for the Central Universities of South China University of Technology(2015ZZ135)
文摘Near-field head-related transfer functions (HRTFs) are essential to scientific re- searches of binaural hearing and practical applications of virtual auditory display. High ef- ficiency, accuracy and repeatability are required in a near-field HRTF measurement. Hence, there is no reference which intents on solving the measuring difficulties of near-field HRTF for human subjects. In present work, an efficient near-field HRTF measurement system based on computer control is designed and implemented, and a fast calibration method for the system is proposed to first solve the measurement of near-field HRTF for human subjects. The efficiency of measurement is enhanced by a comprehensive design on the acoustic, electronic and mechanical parts of the system. And the accuracy and repeatability of the measurement are greatly im- proved by carefully calibrating the positions of sound source, subject and binaural microphones. This system is suitable for near-field HRTF measurement at various source distances within 1.0 m, for both human subject and artificial head. The time costs of HRTF measurement at a single sound source distance and full directions has been reduced to less than 20 minutes. The measurement results indicate that the accuracy of the system satisfies the actual requirements. The system is applicable to scientific research and can be used to establish an individualized near-field HRTF database for human subjects.
文摘针对传统的多重信号分类(multiple signal classification,简称MUSIC)算法定位声源位置时存在计算量大的问题,提出了一种基于宏微导向的蚁群(ant colony optimization,简称ACO)-MUSIC两级相控声源定位算法。首先,利用ACO估算出声源所在的宏观位置,再用MUSIC算法精确搜索声源所在的微观方位;其次,对提出的算法进行数值仿真,并搭建实验系统进行验证。仿真和实验结果表明,所提出的算法可以高精度、快速地定位出声源所在的位置;在搜索步距为0.05°时,算法的计算复杂度和计算时间仅为传统MUSIC算法的0.25%和2.8%。
文摘针对三维多重信号分类(Multiple Signal Classification,MUSIC)算法估计声源位置时计算速度慢,计算量大等缺点,提出了一种基于鸡群优化(Chicken Swarm Optimization,CSO)算法的近场声源三维定位算法。首先建立近场声源信号接收的数学模型,并选取三维MUSIC算法中的空间谱函数为文章算法中的适应度函数。通过不断迭代和局部搜索,以适应度值为指标对鸡群个体进行排序,最终得到最优鸡群个体的位置,即近场待测声源的坐标。仿真和实验结果表明:文中算法具有定位精度高、计算效率高、实时性好等优点,文中算法的平均用时仿真时为三维MUSIC算法平均用时的1.9%,实验时为三维MUSIC算法用时的3.2%。