A hybrid algorithm with cluster analysis in modelling high dimensional data

Tunga, Burcu

doi:10.1016/j.dam.2017.09.002

A hybrid algorithm with cluster analysis in modelling high dimensional data

Atıf İçin Kopyala

Tunga B.

DISCRETE APPLIED MATHEMATICS, cilt.235, ss.161-168, 2018 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 235
Basım Tarihi: 2018
Doi Numarası: 10.1016/j.dam.2017.09.002
Dergi Adı: DISCRETE APPLIED MATHEMATICS
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.161-168
İstanbul Teknik Üniversitesi Adresli: Evet

Özet

Multivariate data modelling aims to predict unknown function values through an established mathematical model. It is essential to construct an analytical structure using the given set of high dimensional data points with corresponding function values. The level of multivariance directly affects the modelling process. Increase in the number of independent variables makes the standard numerical methods incapable of obtaining the sought analytical structure. This work aims to overcome the difficulties of high multivariance and to improve the modelling quality by carrying out two main steps: data clustering and data partitioning. Data clustering step deals with dividing the whole problem domain into several clusters by performing k-means clustering algorithm. Data partitioning step performs the Enhanced Multivariance Product Representation method to partition the high dimensional data set of each cluster. The analytical structure is obtained through the partitioned data for each cluster and can be used to predict the unknown function values. (C) 2017 Elsevier B.V. All rights reserved.