Harris’ Hawks Optimization-Tuned Density-based Clustering

Authors

  • Kashif Talpur Solent University, Southampton
  • Muhammad Shoaib Omar Department of Computer Science, Bahria University, Karachi, Pakistan
  • Syed Muhammad Waqas Department of Computer Science, Muhammad Ali Jinnah University, Karachi, Pakistan
  • Kashif Talpur Department of Science and Engineering, Solent University, Southampton, United Kingdom
  • Sumra Khan Department of Information Technology, Salim Habib University, Karachi, Pakista
  • Shakeel Ahmad Department of Science and Engineering, Solent University, Southampton, United Kingdom

DOI:

https://doi.org/10.30537/sjet.v6i1.1305

Keywords:

Machine learning, density-based clustering, metaheuristic algorithm, Harris’ hawk optimization, clustering

Abstract

Clustering is a machine learning technique that groups data samples based on similarity and identifies outliers with distinct features. Density-based clustering outperforms other methods because it can handle arbitrary shapes of clustering distributions. However, it has a limitation of requiring empirical values for the cluster center and the nominal distance between the cluster center and other data points. These values affect the accuracy and the number of clusters obtained by the algorithm. This paper proposes a solution to optimize these parameters using Harris’ hawks optimization (HHO), an efficient optimization technique that balances exploration and exploitation and avoids stagnation in later iterations. The proposed HHO-tuned density-based clustering achieves better performance as compared to other optimizers used in this work. This research also provides a reference for designing efficient clustering techniques for complex-shaped datasets.

Downloads

Download data is not yet available.

References

Bie, R., Mehmood, R., Ruan, S., Sun, Y., Dawood, H.: Adaptive fuzzy clustering by fast search and find of density peaks. Personal and Ubiquitous Computing 20, 785-793 (2016)

Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, KDD'96, p. 226-231. AAAI Press (1996).

Ali, T., Asghar, S., Sajid, N.A.: Critical analysis of dbscan variations. In: 2010 international conference on information and emerging technologies, pp.1-6. IEEE (2010).

Birant, D., Kut, A.: St-dbscan: An algorithm for clustering spatial{temporal data. Data & knowledge engineering 60(1), 208-221 (2007).

Viswanath, P., Babu, V.S.: Rough-dbscan: A fast hybrid density based clustering method for large data sets. Pattern Recognition Letters 30(16), 1477-1488 (2009).

Andrade, G., Ramos, G., Madeira, D., Sachetto, R., Ferreira, R., Rocha, L.: G-dbscan: A gpu accelerated algorithm for density-based clustering. Procedia Computer Science 18, 369-378 (2013).

Lai, W., Zhou, M., Hu, F., Bian, K., Song, Q.: A new dbscan parameters determination method based on improved mvo. Ieee Access 7, 104,085-104,095 (2019).

Rad, M.H., Abdolrazzagh-Nezhad, M.: Data cube clustering with improved dbscan based on fuzzy logic and genetic algorithm: Designing and improving data cube clustering. Information Technology and Control 49(1), 127-143 (2020).

Zhu, Q., Tang, X., Elahi, A.: Application of the novel harmony search optimization algorithm for dbscan clustering. Expert Systems with Applications 178, 115,054 (2021).

Eberhart, R., Kennedy, J.: A new optimizer using particle swarm theory. In: MHS'95. Proceedings of the sixth international symposium on micro machine and human science, pp. 39-43. IEEE (1995).

Karaboga, D., et al.: An idea based on honey bee swarm for numerical optimization. Tech. rep., Technical report-tr06, Erciyes university, engineering faculty, computer . . . (2005).

Holland, J.H.: Adaptation in natural and artificial systems: an introductory analysis with applications to biology, control, and artificial intelligence. MIT press (1992).

Dorigo, M., Gambardella, L.M.: Ant colony system: a cooperative learning approach to the traveling salesman problem. IEEE Transactions on evolutionary computation 1(1), 53-66 (1997).

Mirjalili, S., Mirjalili, S.M., Lewis, A.: Grey wolf optimizer. Advances in engineering software 69, 46-61 (2014).

Hashim, F.A., Hussain, K., Houssein, E.H., Mabrouk, M.S., Al-Atabany, W.: Archimedes optimization algorithm: a new metaheuristic algorithm for solving optimization problems. Applied Intelligence 51, 1531-1551 (2021).

Hashim, F.A., Houssein, E.H., Hussain, K., Mabrouk, M.S., Al-Atabany, W.: Honey badger algorithm: New metaheuristic algorithm for solving optimization problems. Mathematics and Computers in Simulation 192, 84-110 (2022).

Talatahari, S., Azizi, M.: Chaos game optimization: a novel metaheuristic algorithm. Artificial Intelligence Review 54, 917-1004 (2021).

Dhaenens, C., Jourdan, L.: Metaheuristics for data mining: survey and opportunities for big data. Annals of Operations Research 314(1), 117-140 (2022).

Tsai, C.W., Tsai, P.W., Pan, J.S., Chao, H.C.: Metaheuristics for the deployment problem of wsn: A review. Microprocessors and Microsystems 39(8), 1305-1317 (2015).

Papadimitrakis, M., Giamarelos, N., Stogiannos, M., Zois, E., Livanos, N.I., Alexandridis, A.: Metaheuristic search in smart grid: A review with emphasis on planning, scheduling and power flow optimization applications. Renewable and Sustainable Energy Reviews 145, 111,072 (2021).

Shahvaroughi Farahani, M., Razavi Hajiagha, S.H.: Forecasting stock price using integrated artificial neural network and metaheuristic algorithms compared to time series models. Soft computing 25(13), 8483-8513 (2021).

Heidari, A.A., Mirjalili, S., Faris, H., Aljarah, I., Mafarja, M., Chen, H.: Harris hawks optimization: Algorithm and applications. Future generation computer systems 97, 849-872 (2019).

Gupta, S., Abderazek, H., Yildiz, B.S., Yildiz, A.R., Mirjalili, S., Sait, S.M.: Comparison of metaheuristic optimization algorithms for solving constrained mechanical design optimization problems. Expert Systems with Applications 183, 115,351 (2021).

Jia, H., Lang, C., Oliva, D., Song, W., Peng, X.: Dynamic harris hawks optimization with mutation mechanism for satellite image segmentation. Remote sensing 11(12), 1421 (2019).

Houssein, E.H., Hosney, M.E., Oliva, D., Mohamed, W.M., Hassaballah, M.: A novel hybrid harris hawks optimization and support vector machines for drug design and discovery. Computers & Chemical Engineering 133, 106,656 (2020).

Houssein, E.H., Saad, M.R., Hussain, K., Zhu, W., Shaban, H., Hassaballah, M.: Optimal sink node placement in large scale wireless sensor networks based on harris' hawk optimization algorithm. IEEE Access 8, 19,381-19,397 (2020).

Rodriguez-Esparza, E., Zanella-Calzada, L.A., Oliva, D., Heidari, A.A., Zaldivar, D., Perez-Cisneros, M., Foong, L.K.: An efficient harris hawks-inspired image segmentation method. Expert Systems with Applications 155, 113,428 (2020)

Ewees, A.A., Abd Elaziz, M.: Performance analysis of chaotic multi-verse harris hawks optimization: a case study on solving engineering problems. Engineering Applications of Artificial Intelligence 88, 103,370 (2020).

Fan, Q., Chen, Z., Xia, Z.: A novel quasi-reflected harris hawks optimization algorithm for global optimization problems. Soft Computing 24, 14,825-14,843 (2020).

Dua, D., Graff, C.: UCI machine learning repository (2017). URL http://archive.ics.uci.edu/ml.

Alswaitti, M., Albughdadi, M., Isa, N.A.M.: Density-based particle swarm optimization algorithm for data clustering. Expert Systems with Applications 91, 170-186 (2018).

Jiang, H., Li, J., Yi, S., Wang, X., Hu, X.: A new hybrid method based on partitioning-based dbscan and ant clustering. Expert Systems with Applications 38(8), 9373-9381 (2011).

Downloads

Published

2023-07-10