A comparison study on active learning integrated ensemble approaches in sentiment analysis


ALDOGAN D., Yaslan Y.

COMPUTERS & ELECTRICAL ENGINEERING, vol.57, pp.311-323, 2017 (SCI-Expanded)

  • Publication Type: Article / Full Article
  • Volume: 57
  • Publication Date: 2017
  • DOI: 10.1016/j.compeleceng.2016.11.015
  • Journal Name: COMPUTERS & ELECTRICAL ENGINEERING
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Pages: pp.311-323
  • Keywords: Active learning, Ensemble learning, Sentiment analysis, Machine learning, Artificial intelligence
  • Affiliated with Istanbul Technical University: Yes

Abstract

One of the most challenging problems in sentiment analysis on social media is that labelling large numbers of instances can be very expensive. Active learning has been proposed to overcome this problem by providing a means of choosing the most useful training instances. In this study, we introduce active learning into a framework comprising the most popular base and ensemble approaches for sentiment analysis. In addition, the implemented framework contains two ensemble approaches, i.e. a probabilistic algorithm and a derived version of the Behavior Knowledge Space (BKS) algorithm. The Shannon entropy approach was used to select training instances during the active learning process, and it was compared with the maximum disagreement method and random selection of instances. The Shannon entropy approach was observed to yield higher accuracy in fewer iterations. These methods were tested on the Cornell movie review dataset and a popular multi-domain product review dataset. (C) 2016 Elsevier Ltd. All rights reserved.
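
The entropy-based selection step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `shannon_entropy` and `select_most_uncertain`, the batch size `k`, and the class probabilities in `pool_probs` are all hypothetical, chosen only to show how the unlabelled instances with the most uncertain predictions would be picked for labelling.

```python
import math


def shannon_entropy(probs):
    """Shannon entropy (in bits) of a discrete probability distribution."""
    # Terms with p == 0 contribute nothing and are skipped to avoid log(0).
    return -sum(p * math.log2(p) for p in probs if p > 0)


def select_most_uncertain(pool_probs, k):
    """Return indices of the k pool instances with the highest entropy.

    pool_probs holds one predicted class distribution per unlabelled
    instance (hypothetical input; any probabilistic classifier could
    supply it).
    """
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: shannon_entropy(pool_probs[i]),
        reverse=True,
    )
    return ranked[:k]


# Made-up positive/negative probabilities for four unlabelled reviews.
pool_probs = [
    [0.95, 0.05],  # confident prediction, low entropy
    [0.55, 0.45],  # uncertain
    [0.70, 0.30],
    [0.50, 0.50],  # maximally uncertain for two classes
]

# Indices of the two reviews an annotator would be asked to label next.
chosen = select_most_uncertain(pool_probs, k=2)
print(chosen)
```

In an active learning loop, the selected instances would be labelled, added to the training set, and the classifiers retrained before the next selection round. Random selection, one of the baselines in the comparison, would replace the entropy ranking with a uniform draw from the pool.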