ACTIVE CONSTRAINTS SELECTION BASED ON DENSITY PEAK ESTIMATION 

Pham Gia Bao1, Dinh Quoc Viet1, Le Tuan Linh1, Phung Van Linh1, Ha Khanh1, Vu Viet Vu2, , Tran Doan Vinh1, Vu Viet Thang1
1 CMC University
2 Trường Đại học CMC

Nội dung chính của bài viết

Tóm tắt

Semi-supervised clustering, which integrates side information from users to enhance clustering performance, has gained considerable attention in the research community. However, the quality of clustering is highly dependent on the side information provided, and different inputs can lead to different results. In this paper, we propose an active learning approach for selection good constraints, which employs a min-max strategy and density-based estimation of data points to optimize the constraints selection process. Experimental evaluations on datasets from UCI and an real face image data show the effectiveness of our method.

Chi tiết bài viết

Tài liệu tham khảo

[1] S. Basu, I. Davidson, and K. L. Wagstaff (2008), Constrained Clustering: Advances in Algorithms, Theory, and Applications, Chapman and Hall/CRC Data Mining and Knowledge Discovery Series, 1st edn.,.
[2] Kiri Wagstaff, Claire Cardie (2000), Clustering with Instance-level Constraints. ICML: 1103-1110.
[3] Viet-Vu Vu et al. (2022), Active constraints selection based on density peak. Proc. of International Conference on Advanced Communications Technology, 447-452.
[4] Nizar Grira, Michel Crucianu, Nozha Boujemaa (2008), Active semi-supervised fuzzy clustering. Pattern Recognit. 41(5): 1834-1844.
[5] Ahmad Ali Abin, Viet-Vu Vu (2020), A density-based approach for querying informative constraints for clustering. Expert Syst. Appl. 161: 113690.
[6] Viet-Vu Vu, Nicolas Labroche, Bernadette Bouchon-Meunier (2012), Improving constrained clustering with active query selection. Pattern Recognit. 45(4): 1749-1758.
[7] Sugato Basu, Arindam Banerjee, Raymond J. Mooney (2004), Active Semi-Supervision for Pairwise Constrained Clustering. SDM, 333-344.
[8] Viet-Vu Vu, Hong-Quan Do, Vu-Tuan Dang, Nang-Toan Do (2019), An efficient density-based clustering with side information and active learning: A case study for facial expression recognition task. Intell. Data Anal. 23(1): 227-240.
[9] W. M. Rand (1971), Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association. American Statistical Association. 66, 846–850.
[10] Rodriguez, A.; Laio (2014), A. Clustering by fast search and find of density peaks. Science, 344, 1492.
[11] Mehmood, R.; Zhang, G.; Bie, R.; Dawood, H.; Ahmad, H. (2016), Clustering by fast search and find of density peaks via heat diusion. Neurocomputing, 208, 210-217.
[12] Wang, S.; Wang, D.; Li, C.; Li, Y.; Ding, G. (2016), Clustering by Fast Search and Find of Density Peaks with Data Field. Chin. J. Electron, 25, 397-402.
[13] Du, M.; Ding, S.; Jia, H. (2016), Study on density peaks clustering based on k-nearest neighbors and principal component analysis. Knowl.-Based Syst., 99, 135-145.
[14] Burr Settles (2009), Active Learning Literature Survey. Computer Sciences Tech nical Report 1648, University of Wisconsin-Madison.