Ensemble learning prediction framework for EGFR amplification status of glioma based on terahertz spectral features

Spectrochim Acta A Mol Biomol Spectrosc. 2024 Aug 5:316:124351. doi: 10.1016/j.saa.2024.124351. Epub 2024 Apr 26.

Abstract

Epidermal growth factor receptor (EGFR) plays a pivotal role in the initiation and progression of gliomas. In particular, in glioblastoma, EGFR amplification emerges as a catalyst for invasion, proliferation, and resistance to radiotherapy and chemotherapy. Current approaches are not capable of providing rapid diagnostic results of molecular pathology. In this study, we propose a terahertz spectroscopic approach for predicting the EGFR amplification status of gliomas for the first time. A machine learning model was constructed using the terahertz response of the measured glioma tissues, including the absorption coefficient, refractive index, and dielectric loss tangent. The novelty of our model is the integration of three classical base classifiers, i.e., support vector machine, random forest, and extreme gradient boosting. The ensemble learning method combines the advantages of various base classifiers, this model has more generalization ability. The effectiveness of the proposed method was validated by applying an individual test set. The optimal performance of the integrated algorithm was verified with an area under the curve (AUC) maximum of 85.8 %. This signifies a significant stride toward more effective and rapid diagnostic tools for guiding postoperative therapy in gliomas.

Keywords: EGFR amplification status; Ensemble learning; Glioblastoma; Molecular pathology; Terahertz spectra.

MeSH terms

  • Algorithms
  • Brain Neoplasms / genetics
  • Brain Neoplasms / pathology
  • ErbB Receptors* / genetics
  • ErbB Receptors* / metabolism
  • Gene Amplification
  • Glioma* / diagnosis
  • Glioma* / genetics
  • Glioma* / pathology
  • Humans
  • Machine Learning
  • Support Vector Machine
  • Terahertz Spectroscopy* / methods

Substances

  • ErbB Receptors
  • EGFR protein, human