
A decision tree algorithm based on attribute frequency splitting
Abstract: Decision trees are a common classification method in data mining. When a decision tree is constructed, the criterion used to select the splitting attribute at each node directly affects classification performance. Building on the rough set notion of attribute importance, as measured by functions such as the attribute frequency function, this paper uses that measure to select the splitting attribute for each branch and proposes a decision tree learning algorithm. The method computes the attribute frequency function value from the discernibility matrix alone, so the computation is simple. Experimental results show that, compared with decision trees built by the traditional information-entropy-based method, the trees built by this method have a simpler structure and effectively improve classification performance.
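Source: Journal of Guangxi University of Technology (《广西工学院学报》), CAS, 2007, No. 4, pp. 1-4 (4 pages)
Funding: Guangxi Natural Science Foundation project (桂科自0481016); Guangxi Department of Education 2006 Scientific Research Fund project (149); Guangxi University of Technology Doctoral Fund project
Keywords: decision tree; rough set; attribute importance; attribute frequency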
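
The abstract describes selecting each splitting attribute by its frequency in the discernibility matrix. The record does not give the exact frequency function, so the following is only a minimal sketch under an assumption: an attribute's frequency is taken to be the plain count of discernibility-matrix entries that contain it, where an entry is the set of attributes on which two objects with different decision labels differ. All function names and the toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations


def discernibility_entries(rows, labels):
    """Yield the non-empty discernibility-matrix entries: for each pair of
    objects with different decision labels, the set of attribute indices
    on which the two objects take different values."""
    for (i, a), (j, b) in combinations(enumerate(rows), 2):
        if labels[i] != labels[j]:
            diff = frozenset(k for k, (x, y) in enumerate(zip(a, b)) if x != y)
            if diff:
                yield diff


def attribute_frequency(rows, labels):
    """Assumed frequency function: number of entries containing each attribute."""
    counts = Counter()
    for entry in discernibility_entries(rows, labels):
        counts.update(entry)
    return counts


def build_tree(rows, labels, candidates):
    """Recursively split on the most frequent candidate attribute.
    Returns a class label (leaf) or a tuple (attribute_index, branches)."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not candidates:
        return majority
    freq = attribute_frequency(rows, labels)
    # Ties are broken in favour of the lowest attribute index.
    best = max(sorted(candidates), key=lambda k: freq[k])
    if freq[best] == 0:
        # No candidate attribute distinguishes the remaining classes.
        return majority
    remaining = [k for k in candidates if k != best]
    branches = {}
    for value in sorted({row[best] for row in rows}):
        sub_rows = [r for r in rows if r[best] == value]
        sub_labels = [l for r, l in zip(rows, labels) if r[best] == value]
        branches[value] = build_tree(sub_rows, sub_labels, remaining)
    return (best, branches)


# Hypothetical toy decision table: (outlook, temperature, humidity) -> label
rows = [
    ("sunny", "hot", "high"),
    ("sunny", "mild", "high"),
    ("rain", "mild", "high"),
    ("rain", "hot", "normal"),
]
labels = ["no", "no", "yes", "yes"]

print(attribute_frequency(rows, labels))
print(build_tree(rows, labels, [0, 1, 2]))  # (0, {'rain': 'yes', 'sunny': 'no'})
```

In this toy table, attribute 0 appears in all four discernibility entries while attributes 1 and 2 appear in two each, so attribute 0 is chosen at the root and a single split separates the classes. A weighted frequency function (for example, one that favours attributes in shorter entries) could be substituted in `attribute_frequency` without changing the rest of the sketch.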