期刊文献+

一种适于计算声场景分析的混叠语音基音检测方法 被引量:3

A Multi-Pitch Detecting Method Suitable for CASA
在线阅读 下载PDF
导出
摘要 本文提出了一种在混叠语音信号中检测各自语音分量基音信息的方法 .该方法采用小波变换作为基音检测模型中的滤波处理 ,并用广义自相关运算突出基音信息 ,用增强自相关累和消除冗余信息 ,并提出了用基音概率函数来预测并跟踪不同基音的变化以提高基音检测的准确性 .本文提出的方法可应用于计算声场景分析中 .实验结果表明 ,该方法对于混叠语音的基音检测是非常有效的 . This paper puts up a method suitable for multi pitch detecting under overlapping speech signals environment.In this method,wavelet transform is used as filtering analysis part of this pitch detecting model.Besides that,generalized autocorrelation function is used to strengthen pitch information and enhanced summary autocorrelation function is used to weaken redundant information.It is the most important that a pitch probability function is given to predict and tail after each pitch tracking to improve the veracity of pitch detecting.Above mentioned method could be applied to computational auditory scene analysis.From the experiment results provided,we can infer that this method is very useful and efficient.
出处 《电子学报》 EI CAS CSCD 北大核心 2003年第1期123-126,共4页 Acta Electronica Sinica
基金 国家自然科学基金 (No 60 1 72 0 1 6)
关键词 声场景分析 混叠语音 基音检测 小波变换 overlapping speech pitch detecting wavelet transform
  • 相关文献

参考文献3

二级参考文献6

  • 1杨行峻 迟惠生.语音信号数学处理[M].北京:电子工业出版社,1995.8-21.
  • 2程俊,Proc of Inter Conf on Signal Processing Vol.1,1993年
  • 3拉宾纳 L R,语音信号数字处理,1983年
  • 4杨行峻,语音信号数字处理,1995年
  • 5Gu Y H,Proc of IEEE ICASSP.2,1992年,21页
  • 6林焘,语音学教程,1992年

共引文献53

同被引文献30

  • 1王珊,许刚.基于计算听觉场景分析的语音混叠信号分离[J].计算机工程,2007,33(18):211-213. 被引量:1
  • 2Van der Kouwe J W,Wang D L,Brown G L.A Comparison of Auditory and Blind Separation Techniques for Speech Segregation[J].IEEE Trans.on Speech Audio Processing,2001,9(3):189-195.
  • 3Roman N,Wang D L.Binaural Sound Segregation for Multisource Reverberant Environment[C]//Proc.of Int'l Conference on Acoustics,Speech,and Signal Processing.2004:373-376.
  • 4Wang D L,Brown G L.Separation of Speech from Interfering Sounds Based on Oscillatory Correlation[J].IEEE Trans.on Neural Networks,1999,10(3):684-697.
  • 5Carlyon R P,Shackleton T M.Comparing the Fundamental Frequencies of Resolved and Unresolved Harmonics:Evidence for Two Pitch Mechanisms?[J].Journal of the Acoustic Society of America,1994,95(6):3541-3554.
  • 6Ellis D P W,Rosenthal D.Mid-level Representations for Computational Auditory Scene Analysis:The Weft Element[C]//Proc.of Int'l Joint Conference on Artificial Intelligence.Mahwah,NJ:Lawrence Erlbaum,1998.
  • 7Hu G,Wang D L.Monaural Speech Segregation Based on Pitch Tracking and Amplitude Modulation[J].IEEE Trans.on Neural Networks,2004,15(5):1135-1150.
  • 8Voiers W D.Evaluating Processed Speech Using the DiagnosticRhyme Test[J].Speech Technology,1983,1(4):30-39.
  • 9Meddis R.Simulation of Auditory-neural Transduction:Further Studies[J] Journal of the Acoustic Society of America,1988,83(3):1056-1063.
  • 10Cooke M P.Modeling Auditory Processing and Organization[D].CS Dept.,Univ.of Sheffield,1991.

引证文献3

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部