期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
A Fast Algorithm for Support Vector Clustering
1
作者 吕常魁 姜澄宇 王宁生 《Journal of Southwest Jiaotong University(English Edition)》 2004年第2期136-140,共5页
Support Vector Clustering (SVC) is a kernel-based unsupervised learning clustering method. The main drawback of SVC is its high computational complexity in getting the adjacency matrix describing the connectivity for ... Support Vector Clustering (SVC) is a kernel-based unsupervised learning clustering method. The main drawback of SVC is its high computational complexity in getting the adjacency matrix describing the connectivity for each pairs of points. Based on the proximity graph model [3], the Euclidean distance in Hilbert space is calculated using a Gaussian kernel, which is the right criterion to generate a minimum spanning tree using Kruskal's algorithm. Then the connectivity estimation is lowered by only checking the linkages between the edges that construct the main stem of the MST (Minimum Spanning Tree), in which the non-compatibility degree is originally defined to support the edge selection during linkage estimations. This new approach is experimentally analyzed. The results show that the revised algorithm has a better performance than the proximity graph model with faster speed, optimized clustering quality and strong ability to noise suppression, which makes SVC scalable to large data sets. 展开更多
关键词 support vector machines support vector clustering Proximity graph Minimum spanning tree
在线阅读 下载PDF
Convex Decomposition Based Cluster Labeling Method for Support Vector Clustering 被引量:5
2
作者 平源 田英杰 +1 位作者 周亚建 杨义先 《Journal of Computer Science & Technology》 SCIE EI CSCD 2012年第2期428-442,共15页
Support vector clustering (SVC) is an important boundary-based clustering algorithm in multiple applications for its capability of handling arbitrary cluster shapes.However,SVC's popularity is degraded by its highl... Support vector clustering (SVC) is an important boundary-based clustering algorithm in multiple applications for its capability of handling arbitrary cluster shapes.However,SVC's popularity is degraded by its highly intensive time complexity and poor label performance.To overcome such problems,we present a novel efficient and robust convex decomposition based cluster labeling (CDCL) method based on the topological property of dataset.The CDCL decomposes the implicit cluster into convex hulls and each one is comprised by a subset of support vectors (SVs).According to a robust algorithm applied in the nearest neighboring convex hulls,the adjacency matrix of convex hulls is built up for finding the connected components;and the remaining data points would be assigned the label of the nearest convex hull appropriately.The approach's validation is guaranteed by geometric proofs.Time complexity analysis and comparative experiments suggest that CDCL improves both the efficiency and clustering quality significantly. 展开更多
关键词 support vector clustering convex decomposition convex hull GEOMETRIC
原文传递
PARAMETRIC AND NON-PARAMETRIC COMBINATION MODEL TO ENHANCE OVERALL PERFORMANCE ON DEFAULT PREDICTION 被引量:1
3
作者 LI Jun PAN Liang +1 位作者 CHEN Muzi YANG Xiaoguang 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2014年第5期950-969,共20页
The probability of default(PD) is the key element in the New Basel Capital Accord and the most essential factor to financial institutions' risk management.To obtain good PD estimation,practitioners and academics h... The probability of default(PD) is the key element in the New Basel Capital Accord and the most essential factor to financial institutions' risk management.To obtain good PD estimation,practitioners and academics have put forward numerous default prediction models.However,how to use multiple models to enhance overall performance on default prediction remains untouched.In this paper,a parametric and non-parametric combination model is proposed.Firstly,binary logistic regression model(BLRM),support vector machine(SVM),and decision tree(DT) are used respectively to establish models with relatively stable and high performance.Secondly,in order to make further improvement to the overall performance,a combination model using the method of multiple discriminant analysis(MDA) is constructed.In this way,the coverage rate of the combination model is greatly improved,and the risk of miscarriage is effectively reduced.Lastly,the results of the combination model are analyzed by using the K-means clustering,and the clustering distribution is consistent with a normal distribution.The results show that the combination model based on parametric and non-parametric can effectively enhance the overall performance on default prediction. 展开更多
关键词 Binary logistic regression combination model decision tree K-means clustering multiple discriminant analysis probability of default support vector machine
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部