Co-training with relevant random subspaces

Yaslan, Yusuf; Çataltepe, Zehra

doi:10.1016/j.neucom.2010.01.018

Co-training with relevant random subspaces

Atıf İçin Kopyala

Yaslan Y., Çataltepe Z.

NEUROCOMPUTING, cilt.73, ss.1652-1661, 2010 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 73
Basım Tarihi: 2010
Doi Numarası: 10.1016/j.neucom.2010.01.018
Dergi Adı: NEUROCOMPUTING
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.1652-1661
Anahtar Kelimeler: Semi-supervised learning, Co-training, Random subspace methods, Multiple classifier systems, Relevant subspace method, RASCO, MACHINE
İstanbul Teknik Üniversitesi Adresli: Evet

Özet

We introduce the relevant random subspace Co-training (Rel-RASCO) algorithm which produces relevant random subspaces and then does semi-supervised ensemble learning using those subspaces and unlabeled data. Ensemble learning algorithms may benefit from diversity of classifiers used. However, for high dimensional data choosing subspaces randomly, as in RASCO (Random Subspace Method for Co-training, Wang et al. 2008 [5]) algorithm, may produce diverse but inaccurate classifiers. We produce relevant random subspaces by means of drawing features with probabilities proportional to their relevances measured by the mutual information between features and class labels. We show that Rel-RASCO achieves better accuracy by this relevant and random subspace selection scheme. Experiments on five real and one synthetic datasets show that Rel-RASCO algorithm outperforms both RASCO and Co-training in terms of the accuracy achieved at the end of Co-training. (C) 2010 Elsevier B.V. All rights reserved.