A predictive model for fake news detection using ensemble learning techniques

Main Article Content

Thongchai Kaewkiriya
Kanchana Silawarawet

Abstract

Nowadays, social media is one of the important activities for the general public for the purpose of sharing news and various stories with people. As a result, there are ill-intentioned individuals who seek to benefit themselves and cause chaos in society by creating and spreading fake news to the public. Recently, efforts have been made to develop systems for automatically detecting fake news on social media, such as using machine learning to analyze news data or using systems that learn the behavior of people on social media. From these methods, the concept of integrating multi-factor data analysis has emerged to enhance the accuracy and efficiency of fake news detection systems. This research aims to present a predictive model for detecting fake news using ensemble learning techniques with voting methods. It includes a dataset of 20,000 data points, considering factors such as media sources and user interactions. The test results, in conjunction with the fake news database recorded from social media like Fakeddit.com, show that the proposed model has a prediction accuracy of 96.97%, which is higher than other learning techniques used for comparison.

Article Details

Section
Original Articles

References

Al Asaad, B., & Erascu, M. (2018, September). A tool for fake news detection. In 2018 20th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC) (pp. 379–386). IEEE.

Allcott, H., & Gentzkow, M. (2017). Social media and fake news in the 2016 election. Journal of Economic Perspectives, 31(2), 211–236.

Amini, A., Bayiz, Y. E., Ram, A., Marculescu, R., & Topcu, U. (2024). News source credibility assessment: A Reddit case study. arXiv preprint arXiv:2402.10938.

Collins, B., Hoang, D. T., Nguyen, N. T., & Hwang, D. (2021). Trends in combating fake news on social media–a survey. Journal of Information and Telecommunication, 5(2), 247–266.

Dietterich, T. G. (2000, June). Ensemble methods in machine learning. In International Workshop on Multiple Classifier Systems (pp. 1–15). Springer Berlin Heidelberg.

Duguay, P. A. (2022). Read it on Reddit: Homogeneity and ideological segregation in the age of social news. Social Science Computer Review, 40(5), 1186–1202.

Gimpel, H., Heger, S., Olenberger, C., & Utz, L. (2021). The effectiveness of social norms in fighting fake news on social media. Journal of Management Information Systems, 38(1), 196–221.

Hamed, S. K., Ab Aziz, M. J., & Yaakub, M. R. (2023). Fake news detection model on social media by leveraging sentiment analysis of news content and emotion analysis of users’ comments. Sensors, 23(4), 1748.

Jain, A., Shakya, A., Khatter, H., & Gupta, A. K. (2019, September). A smart system for fake news detection using machine learning. In 2019 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT) (Vol. 1, pp. 1–4). IEEE.

Kaushik, D., & Nadeem, M. (2024, May). Fake news detection using evolutionary ensemble deep learning. In 2024 International Conference on Communication, Computer Sciences and Engineering (IC3SE) (pp. 161–166). IEEE.

Mustafaraj, E., & Metaxas, P. T. (2017, June). The fake news' spreading plague: Was it preventable? In Proceedings of the 2017 ACM on Web Science Conference (pp. 235–239).

Paschen, J. (2020). Investigating the emotional appeal of fake news using artificial intelligence and human contributions. Journal of Product & Brand Management, 29(2), 223–233.

Pizarro, J. (2020, October). Profiling bots and fake news spreaders at PAN’19 and PAN’20: Bots and gender profiling 2019, profiling fake news spreaders on Twitter 2020. In 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA) (pp. 626–630). IEEE.

Proferes, N., Jones, N., Gilbert, S., Fiesler, C., & Zimmer, M. (2021). Studying Reddit: A systematic overview of disciplines, approaches, methods, and ethics. Social Media+ Society, 7(2), 20563051211019004.

Przybyla, P. (2020, April). Capturing the style of fake news. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 34, No. 01, pp. 490–497).

Reshmi, T. S. (2022, July). Fake news detection using voting ensemble classifier. In 2022 International Conference on Inventive Computation Technologies (ICICT) (pp. 1241–1244). IEEE.

Shu, K., Sliva, A., Wang, S., Tang, J., & Liu, H. (2017). Fake news detection on social media: A data mining perspective. ACM SIGKDD Explorations Newsletter, 19(1), 22–36.

Shu, K., Wang, S., & Liu, H. (2019, January). Beyond news contents: The role of social context for fake news detection. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining (pp. 312–320).

Tandoc Jr, E. C., Lim, D., & Ling, R. (2020). Diffusion of disinformation: How social media users respond to fake news and why. Journalism, 21(3), 381–398.

Yu, X., Gil-López, T., Shen, C., & Wojcieszak, M. (2022). Engagement with social media posts in experimental and naturalistic settings: How do message incongruence and incivility influence commenting? International Journal of Communication, 16, 24.