期刊文献+

Initialization for NMF-Based Audio Source Separation Using Priors on Encoding Vectors 被引量:2

Initialization for NMF-Based Audio Source Separation Using Priors on Encoding Vectors
在线阅读 下载PDF
导出
摘要 Nonnegative matrix factorization(NMF)has shown good performances on blind audio source separation(BASS).While the NMF analysis is a non-convex optimization problem when both the basis and encoding matrices need to be estimated simultaneously,the source separation step of the NMF-based BASS with a fixed basis matrix has been considered convex.However,because the basis matrix for the BASS is typically constructed by concatenating the basis matrices trained with individual source signals,the subspace spanned by the basis vectors for one source may overlap with that for other sources.In this paper,we have shown that the resulting encoding vector is not unique when the subspaces spanned by basis vectors for the sources overlap,which implies that the initialization of the encoding vector in the source separation stage is not trivial.Furthermore,we propose a novel method to initialize the encoding vector for the separation step based on the prior model of the encoding vector.Experimental results showed that the proposed method outperformed the uniform random initialization by 1.09 and 2.21dB in the source-to-distortion ratio,and 0.20 and 0.23 in PESQ scores for supervised and semi-supervised cases,respectively. Nonnegative matrix factorization(NMF) has shown good performances on blind audio source separation(BASS). While the NMF analysis is a non-convex optimization problem when both the basis and encoding matrices need to be estimated simultaneously,the source separation step of the NMF-based BASS with a fixed basis matrix has been considered convex. However, because the basis matrix for the BASS is typically constructed by concatenating the basis matrices trained with individual source signals, the subspace spanned by the basis vectors for one source may overlap with that for other sources. In this paper, we have shown that the resulting encoding vector is not unique when the subspaces spanned by basis vectors for the sources overlap,which implies that the initialization of the encoding vector in the source separation stage is not trivial. Furthermore, we propose a novel method to initialize the encoding vector for the separation step based on the prior model of the encoding vector. Experimental results showed that the proposed method outperformed the uniform random initialization by 1.09 and2.21 dB in the source-to-distortion ratio, and0.20 and 0.23 in PESQ scores for supervised and semi-supervised cases, respectively.
出处 《China Communications》 SCIE CSCD 2019年第9期177-186,共10页 中国通信(英文版)
基金 supported by the research fund of Signal Intelligence Research Center supervised by the Defense Acquisition Program Administration and Agency for Defense Development of Korea
关键词 blind AUDIO source separation NONNEGATIVE matrix FACTORIZATION speech enhancement blind audio source separation nonnegative matrix factorization speech enhancement
  • 相关文献

同被引文献2

引证文献2

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部