The Application of Silhouette Coefficient, Elbow Method, and Gap Statistics for Determining the Optimal Clusters in Grouping Provinces in Indonesia Based on Happiness Index

  • Afifa Atira Universitas Singaperbangsa Karawang
  • Betha Nurina Sari Universitas Singaperbangsa Karawang
Keywords: Clustering, K-Means, Algorithm, RStudio, Happiness

Abstract

Currently, the assessment of regional development success is still limited to economic growth and poverty factors. However, evaluation based solely on economic factors is not accurate enough because high economic growth does not always guarantee the happiness of society and can even worsen social inequality. Therefore, one of the important priorities in development is to create economic growth that can improve the welfare of society equally, without creating gaps between social groups. The well-being and social progress in a region can be influenced by the happiness of its people. Happiness can be used as a measure to assess the welfare obtained by individuals. In this study, clustering of happiness index based on provinces in Indonesia was conducted using the K-Means algorithm in RStudio. Through the K-Means clustering analysis, the results of this study can be used as a reference by the government in formulating strategic plans or making improvements to increase the level of happiness and welfare of the Indonesian society. There are three methods to determine the optimal number of clusters: Silhouette method, Elbow method, and Gap Statistics. In the comparison of methods, it is observed that the optimal cluster formation occurs with 2 clusters, where the Elbow and Silhouette methods yield the best results. The data was processed using the K-Means method, resulting in 2 groups: 16 provinces with a high-level happiness group and 18 provinces with a low-level happiness group. From the evaluation using the Silhouette Index, a value of 0.4005945 was obtained at k=2, indicating that this cluster falls into the weak structure category.

References

Ali, A. (2019). Klasterisasi Data Rekam Medis Pasien Menggunakan Metode K-Means Clustering di Rumah Sakit Anwar Medika Balong Bendo Sidoarjo. MATRIK : Jurnal Manajemen, Teknik Informatika Dan Rekayasa Komputer, 19(1), 186–195. https://doi.org/10.30812/matrik.v19i1.529

Badan Pusat Statistik, (2022). Statistik Indonesia 2022. Jakarta: Badan Pusat Statistik. ISSN: 0126-2912

Banusu, Y., & Firmanto, A. D. (2020). Kebahagiaan Dalam Ruang Keseharian Manusia.

Fathia Palembang, C., Yahya Matdoan, M., & Permatasari Palembang, S. (2022). Perbandingan Algoritma K-Means Dan K-Medoids Dalam Pengelompokkan Tingkat Kebahagiaan Provinsi Di Indonesia. Jurnal Multidisiplin Ilmu, 01(5), 830–839. https://journal.mediapublikasi.id/index.php/bullet/article/download/1135/550

Fatmawati, K., & Windarto, A. P. (2018). Data Mining: Penerapan Rapidminer Dengan K-Means Cluster Pada Daerah Terjangkit Demam Berdarah Dengue (Dbd) Berdasarkan Provinsi (Vol. 3, Issue 2). https://doi.org/https://doi.org/10.24114/cess.v3i2.9661

Gustientiedina, Adiya, M. H., & Desnelita, Y. (2019). Penerapan Algoritma K-Means Untuk Clustering Data Obat-Obatan Pada RSUD Pekanbaru. Jurnal Nasional Teknologi Dan Sistem Informasi, 5(1), 17–24. https://doi.org/10.25077/teknosi.v5i1.2019.17-24

Handoyono, N. A. (2022). Apakah Semakin Tinggi Ipm Akan Semakin Bahagia? Analisis Kluster Ditinjau Dari Kualitas Perekonomian. AKUNTANSI DEWANTARA, 6(3), 1–11. https://doi.org/https://doi.org/10.26460/ad.v6i3.12220

Kumalasari, D. A., & Yasa, I. G. W. M. (2020). Pengaruh Faktor-Faktor Yang Mempengaruhi Tingkat Kebahagiaan Negara Di Dunia. E-JURNAL EKONOMI PEMBANGUNAN UNIVERSITAS UDAYANA, 9(5), 963–992.

Putra, G. B. B., & Sudibia, I. K. (2019). Faktor-Faktor Penentu Kebahagiaan Sesuai Dengan Kearifan Lokal Di Bali.

Sulistiyawan, E., Hapsery, A., & Junita Ayu Arifahanum, L. (2021). Perbandingan Metode Optimasi Untuk Pengelompokan Provinsi Berdasarkan Sektor Perikanan Di Indonesia (Studi Kasus Dinas Kelautan dan Perikanan Indonesia). JURNAL GAUSSIAN, 10(1), 76–84. https://doi.org/https://doi.org/10.14710/j.gauss.10.1.76-84

Syamhuri, W. B., Furqon, M. T., & Dewi, C. (2022). Pengelompokan Film Berdasarkan Alur Cerita menggunakan Metode Self Organizing Maps dan Silhouette Coefficient (Vol. 6, Issue 12). http://j-ptiik.ub.ac.id

Utomo, D. P., & Mesran. (2020). Analisis Komparasi Metode Klasifikasi Data Mining dan Reduksi Atribut Pada Data Set Penyakit Jantung. JURNAL MEDIA INFORMATIKA BUDIDARMA, 4(2), 437–444. https://doi.org/10.30865/mib.v4i2.2080

Published
2023-08-25
How to Cite
Atira, A., & Sari, B. (2023). The Application of Silhouette Coefficient, Elbow Method, and Gap Statistics for Determining the Optimal Clusters in Grouping Provinces in Indonesia Based on Happiness Index. Jurnal Ilmiah Wahana Pendidikan, 9(17), 76-86. https://doi.org/10.5281/zenodo.8282638

Most read articles by the same author(s)

<< < 1 2