期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
语音声特征提取的总变分正则化流形学习方法
1
作者 张开业 赵化良 +2 位作者 刘志红 徐希鑫 李建华 《噪声与振动控制》 北大核心 2025年第2期97-104,共8页
语音声信号具有显著的时频稀疏性、时变性和高维非线性,为具体表征和有效提取其声特征,提出一种总变分正则化流形学习方法。以局部线性嵌入算法为基础,对预处理后的语音声信号先后进行二次傅里叶变换,再经统计分析提取长时幅值特征,构... 语音声信号具有显著的时频稀疏性、时变性和高维非线性,为具体表征和有效提取其声特征,提出一种总变分正则化流形学习方法。以局部线性嵌入算法为基础,对预处理后的语音声信号先后进行二次傅里叶变换,再经统计分析提取长时幅值特征,构造包含短时和长时幅值特征的声特征向量,生成高维特征矩阵;在利用总变分对其k邻域进行优化,最后构造基于权重值能量最小化约束的总变分正则化流形学习声特征提取数学模型,经凸优化得出最优权重,解析语音声特征的低维流形。经分析与方法对比,该方法不仅可以明确声特征流形的物理意义,避免流形的扭曲变形,而且还能大幅降低数值计算量,提升计算速度,为智能语音的机器学习和模式识别提供方法技术支持。 展开更多
关键词 语音声信号 正则化流形 总变分 高维特征矩阵 k邻域 特征提取
在线阅读 下载PDF
An Analysis of the Hoarse Speech Signals by the Three Mass Model of Vocal Cords *
2
作者 程启明 陈雪丽 万德钧 《Journal of Southeast University(English Edition)》 EI CAS 1998年第1期81-85,共5页
A three mass model of vocal cords as well as mathematical expression of the model are discussed. Different kinds of typical hoarse speech due to laryngeal diseases are simulated on microcomputer and the effects of di... A three mass model of vocal cords as well as mathematical expression of the model are discussed. Different kinds of typical hoarse speech due to laryngeal diseases are simulated on microcomputer and the effects of different pathological factors of vocal cords on model parameters are studied. Some typical spectrum distribution of the simulated speech signals are given. Moreover, hoarse speech signals of some typical cases are analyzed by the methods of digital signal processing, including FFT, LPC, Cepstrum technique, Pseudocolor encoding, etc. The experiment results show that the three mass model analysis of vocal cords is an efficient method for analysis of hoarse speech signals. 展开更多
关键词 hoarse speech signal three mass model of vocal cords laryngeal diseases
在线阅读 下载PDF
A continuous differentiable wavelet threshold function for speech enhancement 被引量:3
3
作者 贾海蓉 张雪英 白静 《Journal of Central South University》 SCIE EI CAS 2013年第8期2219-2225,共7页
Enhanced speech based on the traditional wavelet threshold function had auditory oscillation distortion and the low signal-to-noise ratio (SNR). In order to solve these problems, a new continuous differentiable thresh... Enhanced speech based on the traditional wavelet threshold function had auditory oscillation distortion and the low signal-to-noise ratio (SNR). In order to solve these problems, a new continuous differentiable threshold function for speech enhancement was presented. Firstly, the function adopted narrow threshold areas, preserved the smaller signal speech, and improved the speech quality; secondly, based on the properties of the continuous differentiable and non-fixed deviation, each area function was attained gradually by using the method of mathematical derivation. It ensured that enhanced speech was continuous and smooth; it removed the auditory oscillation distortion; finally, combined with the Bark wavelet packets, it further improved human auditory perception. Experimental results show that the segmental SNR and PESQ (perceptual evaluation of speech quality) of the enhanced speech using this method increase effectively, compared with the existing speech enhancement algorithms based on wavelet threshold. 展开更多
关键词 continuous differentiable wavelet threshold fimction speech enhancement Bark wavelet packet non-fixed deviation noise
在线阅读 下载PDF
Filter algorithm based on cochlear mechanics and neuron filter mechanism and application on enhancement of audio signals 被引量:1
4
作者 GAO Wa KAN Yue ZHA Fu-sheng 《Journal of Central South University》 SCIE EI CAS CSCD 2021年第6期1813-1828,共16页
A filter algorithm based on cochlear mechanics and neuron filter mechanism is proposed from the view point of vibration.It helps to solve the problem that the non-linear amplification is rarely considered in studying ... A filter algorithm based on cochlear mechanics and neuron filter mechanism is proposed from the view point of vibration.It helps to solve the problem that the non-linear amplification is rarely considered in studying the auditory filters.A cochlear mechanical transduction model is built to illustrate the audio signals processing procedure in cochlea,and then the neuron filter mechanism is modeled to indirectly obtain the outputs with the cochlear properties of frequency tuning and non-linear amplification.The mathematic description of the proposed algorithm is derived by the two models.The parameter space,the parameter selection rules and the error correction of the proposed algorithm are discussed.The unit impulse responses in the time domain and the frequency domain are simulated and compared to probe into the characteristics of the proposed algorithm.Then a 24-channel filter bank is built based on the proposed algorithm and applied to the enhancements of the audio signals.The experiments and comparisons verify that,the proposed algorithm can effectively divide the audio signals into different frequencies,significantly enhance the high frequency parts,and provide positive impacts on the performance of speech enhancement in different noise environments,especially for the babble noise and the volvo noise. 展开更多
关键词 COCHLEA neuron filter audio signal processing speech enhancement
在线阅读 下载PDF
A noise cross PSD estimator for dual-microphone speech enhancement based on minimum statistics 被引量:3
5
作者 Mohsen RAHMANI Ahmad AKBARI +1 位作者 Beghdad AYAD Nima DERAKHSHAN 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2009年第6期805-809,共5页
Some two-microphone noise reduction techniques that work in the frequency domain exploit coherence function between two noisy signals. They have shown good results when noise signals on two sensors are uncorrelated, b... Some two-microphone noise reduction techniques that work in the frequency domain exploit coherence function between two noisy signals. They have shown good results when noise signals on two sensors are uncorrelated, but their per-formance decreases with correlated noises. Coherence based methods can be improved when the cross power spectral density (CPSD) of correlated noise signals is available. In this paper, we propose a new method for estimation of the CPSD of the noise, which is based on the minimum tracking technique. Despite the fact that the proposed estimator does not need to implement a voice activity detector (VAD), its performance is comparable to a CPSD estimator that uses an ideal VAD. 展开更多
关键词 Two-channel noise reduction Noise estimation Minima tracking
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部