Sentiment Analysis of Twitter Data on Economic and Industrial Topics Using the Naive Bayes and Random Forest Methods
Abstract
Twitter has become a valuable source of information, and sentiment analysis can provide insight into public views and attitudes toward economic and industrial issues. This research develops and compares the performance of two widely used classification methods, Naive Bayes and Random Forest, for sentiment analysis of Twitter data related to the economy and industry. By addressing a gap in existing work on sentiment analysis with Naive Bayes and Random Forest, the study provides a clear framework that helps companies process and use Twitter data efficiently, yielding decision-making insights in the economic and industrial domain. A total of 11,833 records were split into 70% training data and 30% testing data and then classified with the Naive Bayes and Random Forest algorithms. The results show 28.52% positive, 31.44% negative, and 40.04% neutral sentiment. Of the two algorithms, Naive Bayes achieved the higher accuracy, 71.89%.
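The 70/30 split and the Naive Bayes classification step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the preprocessing or the Naive Bayes variant, so the whitespace tokenizer, the multinomial model with add-one (Laplace) smoothing, the function names, and the ten toy texts are all assumptions made for illustration. Random Forest is left out to keep the example short.

```python
import math
import random
from collections import Counter, defaultdict

# Hypothetical toy examples, NOT taken from the study's 11,833-record dataset.
DATA = [
    ("exports rise and growth is strong", "positive"),
    ("great news as factory jobs grow", "positive"),
    ("strong profit boosts industry growth", "positive"),
    ("inflation hurts and prices are bad", "negative"),
    ("factory layoffs are terrible news", "negative"),
    ("weak currency hurts industry", "negative"),
    ("central bank holds the rate today", "neutral"),
    ("industry report is due tomorrow", "neutral"),
    ("the ministry releases trade data", "neutral"),
    ("market opens at nine today", "neutral"),
]


def split_70_30(rows, seed=0):
    """Shuffle a copy of the rows and cut it into 70% train / 30% test."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 7 // 10
    return shuffled[:cut], shuffled[cut:]


def train_nb(rows):
    """Count documents per class and word occurrences per class."""
    class_counts = Counter(label for _, label in rows)
    word_counts = defaultdict(Counter)
    for text, label in rows:
        word_counts[label].update(text.lower().split())
    vocab = {word for counts in word_counts.values() for word in counts}
    return class_counts, word_counts, vocab


def predict_nb(model, text):
    """Return the class with the highest log prior plus log likelihood."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    scores = {}
    for label, n_docs in class_counts.items():
        total_words = sum(word_counts[label].values())
        score = math.log(n_docs / total_docs)
        for word in text.lower().split():
            if word in vocab:  # words never seen in training are skipped
                # Add-one smoothing keeps unseen class/word pairs above zero.
                score += math.log(
                    (word_counts[label][word] + 1) / (total_words + len(vocab))
                )
        scores[label] = score
    return max(scores, key=scores.get)


def accuracy(model, rows):
    """Fraction of rows whose predicted label matches the true label."""
    hits = sum(predict_nb(model, text) == label for text, label in rows)
    return hits / len(rows)


train_rows, test_rows = split_70_30(DATA)
model = train_nb(train_rows)
print(f"train={len(train_rows)} test={len(test_rows)}")
print(f"toy test accuracy: {accuracy(model, test_rows):.2f}")
```

With only ten toy texts the printed accuracy says nothing about the study's reported 71.89%; the sketch only shows how a held-out 30% portion is scored against a model fitted on the remaining 70%.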