Assessment of iterative semi-supervised feature selection learning for sentiment analyses: Digital currency markets


Akba F., MEDENİ İ. T., GÜZEL M. S., ASKERBEYLİ İ.

14th IEEE International Conference on Semantic Computing, ICSC 2020, California, United States Of America, 3 - 05 February 2020, pp.459-463 identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/icsc.2020.00088
  • City: California
  • Country: United States Of America
  • Page Numbers: pp.459-463
  • Keywords: Digital currency, Exchange market, Machine learning, Natural language process, Random forest, Semi-supervised feature selection, Sentiment analysis, Support vector machine, Word embedding
  • Ankara Yıldırım Beyazıt University Affiliated: Yes

Abstract

Currently, the cryptocurrencies have been highly attracted by investors. Due to the lack of a central money authority, the stock markets are experiencing large price fluctuations based on speculation. In this study, our goal is to try to determine the effects of social media topics in digital money markets by using sentiment analysis methods on natural data. First, related comments on Twitter about digital currencies were collected by using a web crawler. The comments investigated with feature selection metrics, semi-supervised feature selection metrics, Random Forest, Support Vector Machine (SVM) methods. The Random Forest was observed to perform the most successful and fastest sentiment classification process using a combination of semi-supervised feature selection metrics by iterative actions. Iterative Semi-Supervised Feature Selection (ISSFS) method proposed and evaluated in this paper. As a result of the experiments, the optimum number of words for the English language was calculated in order to perform sentiment analysis of digital currency markets. Classifiers and methods used were shared as results.