TY - JOUR
T1 - Composite kernels for semi-supervised clustering
AU - Domeniconi, Carlotta
AU - Peng, Jing
AU - Yan, Bojun
PY - 2011/7
Y1 - 2011/7
N2 - A critical problem related to kernel-based methods is how to select optimal kernels. A kernel function must conform to the learning target in order to obtain meaningful results. While solutions to the problem of estimating optimal kernel functions and corresponding parameters have been proposed in a supervised setting, it remains a challenge when no labeled data are available, and all we have is a set of pairwise must-link and cannot-link constraints. In this paper, we address the problem of optimizing the kernel function using pairwise constraints for semi-supervised clustering. We propose a new optimization criterion for automatically estimating the optimal parameters of composite Gaussian kernels, directly from the data and given constraints. We combine our proposal with a semi-supervised kernel-based algorithm to demonstrate experimentally the effectiveness of our approach. The results show that our method is very effective for kernel-based semi-supervised clustering.
AB - A critical problem related to kernel-based methods is how to select optimal kernels. A kernel function must conform to the learning target in order to obtain meaningful results. While solutions to the problem of estimating optimal kernel functions and corresponding parameters have been proposed in a supervised setting, it remains a challenge when no labeled data are available, and all we have is a set of pairwise must-link and cannot-link constraints. In this paper, we address the problem of optimizing the kernel function using pairwise constraints for semi-supervised clustering. We propose a new optimization criterion for automatically estimating the optimal parameters of composite Gaussian kernels, directly from the data and given constraints. We combine our proposal with a semi-supervised kernel-based algorithm to demonstrate experimentally the effectiveness of our approach. The results show that our method is very effective for kernel-based semi-supervised clustering.
KW - Clustering
KW - Kernel methods
KW - Semi-supervised clustering
UR - http://www.scopus.com/inward/record.url?scp=79959506150&partnerID=8YFLogxK
U2 - 10.1007/s10115-010-0318-8
DO - 10.1007/s10115-010-0318-8
M3 - Article
AN - SCOPUS:79959506150
SN - 0219-1377
VL - 28
SP - 99
EP - 116
JO - Knowledge and Information Systems
JF - Knowledge and Information Systems
IS - 1
ER -