TY - JOUR
T1 - Proximity multi-sphere support vector clustering
AU - TRAN, Dat
AU - MA, Wanli
AU - SHARMA, Dharmendra
PY - 2013
Y1 - 2013
N2 - Support vector data description constructs an optimal hypersphere in feature space as a description of a data set. This hypersphere when mapped back to input space becomes a set of contours, and support vector clustering (SVC) employs these contours as cluster boundaries to detect clusters in the data set. However real-world data sets may have some distinctive distributions and hence a single hypersphere cannot be the best description. As a result, the set of contours in input space does not always detect all clusters in the data set. Another issue in SVC is that in some cases, it cannot preserve proximity notation which is crucial for cluster analysis, that is, two data points that are close to each other can be assigned to different clusters using cluster labelling method of SVC. To overcome these drawbacks, we propose Proximity Multi-sphere Support Vector Clustering which employs a set of hyperspheres to provide a better data description for data sets having distinctive distributions and a proximity graph to favour the proximity notation. Experimental results on different data sets are presented to evaluate the proposed clustering technique and compare it with SVC and other clustering techniques
AB - Support vector data description constructs an optimal hypersphere in feature space as a description of a data set. This hypersphere when mapped back to input space becomes a set of contours, and support vector clustering (SVC) employs these contours as cluster boundaries to detect clusters in the data set. However real-world data sets may have some distinctive distributions and hence a single hypersphere cannot be the best description. As a result, the set of contours in input space does not always detect all clusters in the data set. Another issue in SVC is that in some cases, it cannot preserve proximity notation which is crucial for cluster analysis, that is, two data points that are close to each other can be assigned to different clusters using cluster labelling method of SVC. To overcome these drawbacks, we propose Proximity Multi-sphere Support Vector Clustering which employs a set of hyperspheres to provide a better data description for data sets having distinctive distributions and a proximity graph to favour the proximity notation. Experimental results on different data sets are presented to evaluate the proposed clustering technique and compare it with SVC and other clustering techniques
KW - Clustering
KW - Multi-sphere support vector clustering
KW - Multi-sphere support vector data description
KW - Proximity graph
KW - Support vector clustering
KW - Support vector data description
U2 - 10.1007/s00521-012-1001-7
DO - 10.1007/s00521-012-1001-7
M3 - Article
SN - 0941-0643
VL - 22
SP - 1309
EP - 1319
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 7-8
ER -