期刊文献+

基于非线性时频掩蔽的语音盲分离方法 被引量:9

Blind speech source separation via nonlinear time-frequency masking
在线阅读 下载PDF
导出
摘要 针对语音信号的欠定卷积混合模型,利用独立语音在时频域上的近似W-分离正交性(W-DO),提出了一种基于非线性时频掩蔽的盲分离方法。首先对多传声器观测信号在时频域上进行规范化处理,使混合信号在每个时频槽的表示与频率无关,然后采用动态聚类算法获取时频槽对应的活跃源信息,选择关于簇中心偏角的非线性函数进行时频掩蔽,从而实现语音信号的盲分离。该方法解决了经典频域盲分离算法中的频率置换问题,能有效抑制分离矩阵的空间方向扩散。仿真实验表明,与BLUES方法相比具有更优的分离性能,信噪比增益平均增加1.58 dB。 A blind speech source separation method for the underdetermined convolutive mixture model is proposed via nonlinear time-frequency masking, the approximate W-disjoint orthogonality (W-DO) property of independent speech signals in the time-frequency domain is exploited. Firstly the observation mixture signal from multi-microphones is normalized to be independent of frequency in the time-frequency domain, then the dynamic clustering algorithm is developed to obtain the active source information in each time-frequency slot, a nonlinear function of deflection angle from the clustering center is selected for time-frequency masking, finally the blind separation of mixture speech signals can be achieved. This novel method can not only overcome the problem of frequency permutation which may be met in most classic frequency-domain blind separation techniques, but suppress the spatial direction diffusion of the separation matrix. Simulation results demonstrate that our proposed separation method outperform the typical BLUES method, the signal-noise-ratio gain (SNRG) is improved 1.58 dB averagely.
出处 《声学学报》 EI CSCD 北大核心 2007年第4期375-381,共7页 Acta Acustica
基金 国家自然科学基金(60672157 60672158)
关键词 非线性函数 语音信号 分离方法 时频域 掩蔽 盲分离算法 动态聚类算法 信噪比增益 Clustering algorithms Computer simulation Signal to noise ratio
  • 相关文献

参考文献13

  • 1Haykin S.Unsupervised adaptive filtering,volume 1:blind source separation.John Wiley & Sons Canada,Ltd.2000.
  • 2Araki S,Makino S et al.Blind separation of more speech than sensors with less distortion by combining sparseness and ICA.In:Proc.IWAENC2003,2003:271-274.
  • 3Parra L,Spence C.Convolutive blind separation of nonstationary sources.IEEE Trans.Speech Audio Process,2000; 8(3):320-327.
  • 4Yilmaz O,Rickard S.Blind separation of speech mixtures via time-frequency masking.IEEE Trans.Signal Processing,2004; 52(7):1830-1847.
  • 5Pedersen M S,Wang D et al.Separating underdetermined convolutive speech mixtures.ICA2006,2006(3889):674-681.
  • 6Pedersen M S,Wang D et al.Overcomplete blind source separation by combining ICA and binary time-frequency masking.In:Proc.MLSP workshop,2005.
  • 7Belouchrani A,Amin M G.Blind source separation based on time-frequency signal representations.IEEE Trans.Signal Processing,1998; 46(11):2888-2897.
  • 8Fevotte C,Doncarli C.Two contributions to blind source separation using time -frequency distributions.IEEE Signal Processing Letters,2004; 11(3):1-10.
  • 9Li Y,Cichocki A et al.Analysis of sparse representation and blind source separation.Neural Computatio,2004;16(6):1193-1234.
  • 10陈健,陆佶人.噪声背景下双输入时延混合系统的盲源分离[J].声学学报,2002,27(5):477-480. 被引量:7

二级参考文献20

  • 1饶丹,谢菠荪,谢志文.双通路立体声条件下的双耳掩蔽[J].电声技术,2005,29(2):53-56. 被引量:8
  • 2Freymaaa et al. The role of perceived spatial separation in the unmasking of speech. J. Acoust. Soc. Am., 1999; 106:3578-3588
  • 3Good et al. The relation between detection in noise and localization in noise in the free field. Binaural and Spatial Heaving in Real and Virtual Environments, Edited by R.Gilkey and T. Anderson Erlbaum, New York, 1997: 349-376
  • 4Doll T J, Hanna T E. Spatial and spectral release from masking in three-dimensional auditory displays. Hum.Factors, 1995; 37:341-355
  • 5Gatehouse R W. Further research on free-field masking. J.Acoust. Soc. Am. 1987; 82(Suppl.1): S108
  • 6Moore B C J. An introduction to the psychology of hearing. Second Edition, Academic Press, Orlando, F1, USA,1982, Chapter 5
  • 7Johnston J D, Ferreira A J. Sum-difference stereo transfer coding. In: Proc. IEEE ICASSP, 1992:569-571
  • 8Douglas S et al. The effects of spatial separation in distance on the informational and energetic masking of a nearby speech signal. J. Acoust. Soc. Am., 2002; 112(2): 664-676
  • 9Zwicker E, Flottorp G, Stevens S S. Critical Bandwid thin Loudness Summation. J. Acoust. Soc. Am., 1957; 29:548-557
  • 10Zwicker E. Psychoacoustics facts and models. Springer-Verlag, 1990

共引文献14

同被引文献149

引证文献9

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部