5. Conclusions
In this paper the CLSVM methodology is proposed, based on the SVM with the linear kernel and performing a clustering for categorical features and building an SVM classifier in the clustered feature space. Four strategies are presented to build the CLSVM classifier by means of QCQP, MIQP and QP formulations. When using two clusters, the CLSVM classifier has a comparable classi- fication accuracy to the SVMO classifier, in nine of the twelve benchmark datasets. In the remaining three datasets, the CLSVM classifier outperforms the SVMO classifier in two datasets, and is outperformed in the other one. In terms of sparsity of the classifier with respect to the categorical features, the CLSVM methodology shows a dramatic improvement over the SVMO.