A Scalable Unsupervised Feature Selection with Orthogonal Graph Representation for Hyperspectral Images

Taskin G., Fatih Yetkin E., Camps-Valls G.

IEEE Transactions on Geoscience and Remote Sensing, vol.61, 2023 (SCI-Expanded) identifier

  • Publication Type: Article / Article
  • Volume: 61
  • Publication Date: 2023
  • Doi Number: 10.1109/tgrs.2023.3284475
  • Journal Name: IEEE Transactions on Geoscience and Remote Sensing
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, Aerospace Database, Applied Science & Technology Source, Aquatic Science & Fisheries Abstracts (ASFA), Business Source Elite, Business Source Premier, CAB Abstracts, Communication Abstracts, Compendex, Computer & Applied Sciences, Geobase, INSPEC, Metadex, Pollution Abstracts, Civil Engineering Abstracts
  • Keywords: Dimensionality reduction, feature selection (FS), global sensitivity analysis, graph embedding (GE), hyperspectral image (HSI) analysis
  • Istanbul Technical University Affiliated: Yes


Feature selection is essential in various fields of science and engineering, from remote sensing to computer vision. Reducing data dimensionality by removing redundant features and selecting the most informative ones improves machine learning algorithms’ performance, especially in supervised classification tasks, while lowering storage needs. Graph-embedding techniques have recently been found efficient for feature selection since they preserve the geometric structure of the original feature space while embedding data into a low-dimensional subspace. However, the main drawback is the high computational cost of solving an eigenvalue decomposition problem, especially for large-scale problems. This paper addresses this issue by combining the graph embedding framework and representation theory for a novel feature selection method. Inspired by the high dimensional model representation, the feature transformation is assumed to be a linear combination of a set of univariate orthogonal functions carried out in the graph embedding framework. As a result, an explicit embedding function is created, which can be utilised to embed out-of-samples into low-dimensional space and provide a feature relevance score. The significant contribution of the proposed method is to divide an n-dimensional generalised eigenvalue problem into n small-sized eigenvalue problems. With this property, the computational complexity of the graph embedding is significantly reduced, resulting in a scalable feature selection method, which could be easily parallelized too. The performance of the proposed method is compared favourably to its counterparts in high-dimensional hyperspectral image processing in terms of classification accuracy, feature stability, and computational time.