期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
一种新的支持向量机主动学习策略及其在文本分类中的应用 被引量:4
1
作者 刘宏 屠轶清 黄上腾 《计算机科学》 CSCD 北大核心 2003年第6期110-112,135,共4页
There are two well-known characteristics about text classification. One is that the dimension of the sample space is very high, while the number of examples available usually is very small. The other is that the examp... There are two well-known characteristics about text classification. One is that the dimension of the sample space is very high, while the number of examples available usually is very small. The other is that the example vectors are sparse. Meanwhile, we find existing support vector machines active learning approaches are subject to the influence of outliers. Based on these observations, this paper presents a new hybr/d active learning approach. In this approach, to select the unlabelled example(s) to query, the learner takes into account both sparseness and high-dimension characteristics of examples as well as its uncertainty about the examples' categorization. This way, the active learner needs less labeled examples, but still can get a good generalization performance more quickly than competing methods. Our empirical results indicate that this new approach is effective. 展开更多
关键词 支持向量机 主动学习策略 文本分类 机器学习
在线阅读 下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部