摘要
多粒度形式概念分析是数据挖掘与知识发现的重要工具,但现有的多粒度形式概念分析理论中并未提出选择最优形式背景的标准,这导致只能对多个单粒度形式背景逐一研究其知识发现问题,因此无法应对含有多个粒度属性的形式背景.鉴于此,对多粒度形式背景的粒度树上的属性块进行组合,将信息熵作为组合形式背景优劣的判别标准以评价最优粒度选择的性能.首先,基于粒度树提出广义介粒度剪枝形式背景,它既能实现属性块内部跨粒度组合,又能实现属性块之间跨层组合;其次,给出广义介粒度剪枝形式背景的信息熵,以评价广义介粒度剪枝形式背景的优劣,并设计出最优粒度选择算法;接着,利用信息熵度量多粒度剪枝类属性块和粒度树的重要性;最后,通过实验分析表明基于信息熵的最优粒度选择和粒度树重要性度量方法是有效的.
Multi-granularity formal concept analysis is an important tool for data mining and knowledge discovery.However,there is no standard to select an optimal formal context in the existing multi-granularity formal concept analysis theory,which leads to the fact that multiple single-granularity formal contexts have to be studied separately one by one for achieving the task of knowledge discovery,leaving the formal contexts with multi-granularity attributes unexplored.In this paper,how to combine attribute blocks of the granularity tree of a multi-granularity formal context is studied,and information entropy is used as a criterion to judge whether a combined formal context is good or not,so as to evaluate the performance of the obtained optimal granularity selection results.Firstly,based on a granularity tree,the notion of a generalized meso-granularity pruning formal context is proposed,which can realize not only inter-layer cross-granularity combination but also cross-layer combination of attribute blocks.Secondly,the information entropy of a generalized mesogranularity pruning formal context is defined to evaluate its advantages and disadvantages,and an optimal granularity selection algorithm is designed.Then,the information entropy is used to measure the importance of the multi-granularity pruning class-attribute block and granularity tree.Finally,experimental analysis shows the effectiveness of the proposed methods of optimal granularity selection and importance measurement of a granularity tree based on information entropy.
作者
李金海
贺建君
LI Jin-hai;HE Jian-jun(Data Science Research Center,Kunming University of Science and Technology,Kunming 650500,China;Faculty of Science,Kunming University of Science and Technology,Kunming 650500,China)
出处
《控制与决策》
EI
CSCD
北大核心
2022年第5期1299-1308,共10页
Control and Decision
基金
国家自然科学基金项目(11971211)。
关键词
多粒度形式背景
多粒度类属性块
粒计算
信息熵
剪枝形式背景
最优粒度选择
multi-granularity formal context
multi-granularity class-attribute block
granular computing
information entropy
pruning formal context
optimal granularity selection