Optimized Fake News Classification: Leveraging Ensembles Learning and Parameter Tuning in Machine and Deep Learning Methods


Alguttar A. A., Shaaban O. A., Yildirim R.

Applied Artificial Intelligence, vol.38, no.1, 2024 (SCI-Expanded) identifier

  • Publication Type: Article / Review
  • Volume: 38 Issue: 1
  • Publication Date: 2024
  • Doi Number: 10.1080/08839514.2024.2385856
  • Journal Name: Applied Artificial Intelligence
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Aerospace Database, Applied Science & Technology Source, Business Source Elite, Business Source Premier, Communication Abstracts, Compendex, Computer & Applied Sciences, INSPEC, Metadex, Psycinfo, Directory of Open Access Journals, Civil Engineering Abstracts
  • Ankara Yıldırım Beyazıt University Affiliated: Yes

Abstract

The proliferation of misinformation across various domains necessitates robust detection mechanisms. With its ability to analyze vast datasets, machine learning emerges as a powerful tool. This research aims to explore fake news detection and emphasize the crucial role of preprocessing techniques. In addition to using individual models for training like SVMs, Logistic Regression, and LSTMs, we investigated the combined power of these methods through Stacking and Delegation. This paper analyzes frequently used preprocessing techniques like counter-vectorizer and TF-IDF to understand their impact on detection effectiveness. This is also aligned with the United Nations SDG 16, which seeks to promote informed decision-making by fighting against misinformation. The outcomes of this study underscored the significance of preprocessing steps for optimal classification performance. The ensemble methods consistently excelled, particularly with the probability-based stacking, achieving an AUC of 0.9394 and 0.9509 along with F1 Scores of 0.956248 and 0.945644. On the other hand, the delegation strategies also emerged as solid alternatives, with delegation (iterated) reaching AUCs of 0.9280 and 0.9477. The findings of this study confirm the efficacy of ensemble techniques and delegation for effective fake news detection. This study offers insights into model selection and optimization.