A comparison study on active learning integrated ensemble approaches in sentiment analysis

ALDOGAN, Deniz; Yaslan, Yusuf

doi:10.1016/j.compeleceng.2016.11.015

ALDOGAN D., Yaslan Y.

COMPUTERS & ELECTRICAL ENGINEERING, vol.57, pp.311-323, 2017 (SCI-Expanded)

Publication Type: Article / Full Article
Volume: 57
Publication Date: 2017
DOI: 10.1016/j.compeleceng.2016.11.015
Journal Name: COMPUTERS & ELECTRICAL ENGINEERING
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Page Numbers: pp.311-323
Keywords: Active learning, Ensemble learning, Sentiment analysis, Machine learning, Artificial intelligence
Istanbul Technical University Affiliated: Yes

Abstract

One of the most challenging problems in sentiment analysis on social media is that labelling large numbers of instances can be very expensive. Active learning has been proposed to overcome this problem by providing a means of choosing the most useful training instances. In this study, we introduce active learning into a framework comprising the most popular base and ensemble approaches for sentiment analysis. In addition, the implemented framework contains two ensemble approaches: a probabilistic algorithm and a derived version of the Behavior Knowledge Space (BKS) algorithm. The Shannon entropy approach was used to select training data during the active learning process, and it was compared with the maximum disagreement method and with random selection of instances. The Shannon entropy approach was observed to yield better accuracy in fewer iterations. These methods were tested on the Cornell movie review dataset and a popular multi-domain product review dataset. (C) 2016 Elsevier Ltd. All rights reserved.
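The two query strategies named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the batch size `k`, and the disagreement measure (one minus the majority vote share) are assumptions chosen for illustration, and the abstract does not specify how the paper defines maximum disagreement.

```python
import math
from collections import Counter

def shannon_entropy(probs):
    """Entropy in bits of a discrete distribution; zero-probability terms are skipped."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def select_by_entropy(pool_probs, k):
    """Return indices of the k unlabelled instances with the highest predictive entropy.

    pool_probs[i] is the class-probability vector predicted for instance i
    (hypothetical input; any probabilistic classifier could supply it).
    """
    ranked = sorted(range(len(pool_probs)),
                    key=lambda i: shannon_entropy(pool_probs[i]),
                    reverse=True)
    return ranked[:k]

def disagreement(votes):
    """Assumed disagreement measure: 1 minus the share of the majority label."""
    majority_count = Counter(votes).most_common(1)[0][1]
    return 1 - majority_count / len(votes)

def select_by_disagreement(pool_votes, k):
    """Return indices of the k instances on which the ensemble members disagree most.

    pool_votes[i] is the list of labels voted by the ensemble members for instance i.
    """
    ranked = sorted(range(len(pool_votes)),
                    key=lambda i: disagreement(pool_votes[i]),
                    reverse=True)
    return ranked[:k]
```

In an active learning loop under these assumptions, the selected indices would be sent for labelling, added to the training set, and the classifiers retrained before the next selection round. Random selection, the third baseline in the abstract, would simply sample `k` indices uniformly from the pool.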