| Issue |
BIO Web Conf.
Volume 228, 2026
Biospectrum 2025: International Conference on Biotechnology and Biological Science
|
|
|---|---|---|
| Article Number | 01002 | |
| Number of page(s) | 8 | |
| Section | Use of AI and ML in Biotechnology | |
| DOI | https://doi.org/10.1051/bioconf/202622801002 | |
| Published online | 11 March 2026 | |
A comparative classification framework using PCA and modified PCA with ensemble and kernel-based learning models for mangrove feature analysis
1 Department of Computer Science and Information Technology, Institute of Engineering & Management, Kolkata (Newtown Sector), School of University of Engineering and Management, Kolkata, West Bengal, India
2 Department of Biotechnology, Institute of Engineering & Management, Kolkata (Newtown Sector), School of University of Engineering and Management, Kolkata, West Bengal, India
3 Department of Computer Science and Engineering, National Institute of Technology Patna, Bihar – 800005, India
* Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.
Abstract
Hyperspectral image (HSI) classification remains challenging due to high spectral dimensionality, redundancy among bands, and limited labeled samples, particularly in high–spatial-resolution agricultural and coastal environments. A comparative dimensionality-reduction and classification framework is presented and evaluated on two distinct hyperspectral scenarios: the WHU-Hi benchmark dataset acquired using UAV-borne hyperspectral sensors for precision crop classification, and a mangrove hyperspectral dataset collected over the Henry Island coastal ecosystem. The hyperspectral data cubes, consisting of hundreds of spectral bands and over 386,000 labeled samples, are transformed using Principal Component Analysis (PCA), a Modified PCA (MPCA) strategy with standardized variance normalization, and Kernel PCA to obtain compact and discriminative feature representations. The reduced feature sets, limited to 30 principal components, are evaluated using five supervised machine-learning classifiers, including Random Forest, Light Gradient Boosting Machine, Extreme Gradient Boosting, Support Vector Machine, and K-Nearest Neighbors. Experimental results indicate that PCA- and MPCA-based features achieve consistently high classification performance across all classifiers. The highest overall accuracy of 87.96% is obtained using SVM with PCA/MPCA features, while Random Forest and KNN achieve accuracies of 85.18% and 84.34%, respectively. Notably, MPCA achieves equivalent classification accuracy to conventional PCA while reducing feature extraction time by more than 60%, demonstrating superior computational efficiency. Overall, the framework provides an effective and computationally efficient solution for UAV-based crop classification and large-scale coastal ecosystem monitoring using hyperspectral imagery.
Key words: PCA / Modified PCA / Random Forest / LightGBM / XGBoost / SVM / KNN / Mangroves / Remote Sensing / Classification
© The Authors, published by EDP Sciences, 2026
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

