Transparent Machine Learning Reveals Diagnostic Glycan Biomarkers in Subarachnoid Hemorrhage and Vasospasm

Attila, Garami [Garami, Attila (Energetika), szerző] Energia-, Kerámia- és Polimertechnológiai Intézet (ME / AVK); Máté, Czabajszki [Czabajszki, Máté (orvostudomány), szerző]; Béla, Viskolcz [Viskolcz, Béla (Anyagtudomány és ...), szerző] Kémiai Intézet (ME / AVK); Csaba, Oláh [Oláh, Csaba Zsolt (idegsebészet), szerző] Idegsebészeti Tanszék (DE / ÁOK); Csaba, Váradi [Váradi, Csaba (Molekuláris biológia), szerző] Molekuláris Medicina Kutató Központ (DE / ÁOK); Kémiai Intézet (ME / AVK)

Angol nyelvű Szakcikk (Folyóiratcikk) Tudományos
Megjelent: INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES 1661-6596 1422-0067 26 (16) Paper: 7727 , 14 p. 2025
  • SJR Scopus - Organic Chemistry: D1
Azonosítók
Subarachnoid hemorrhage (SAH) and its major complication, cerebral vasospasm (CVS), present significant challenges for early diagnosis and risk stratification. In this study, we developed interpretable decision tree models to differentiate between healthy controls, SAH patients, and SAH patients with vasospasm using serum N-glycomic data. Building on previously published glycomic profiles, we introduced a refined modeling approach combining systematic preprocessing, feature selection, and interpretable machine learning. Our methodology included outlier removal, standard scaling, and a novel correlation-based feature reduction guided by feature importance scores derived from preliminary decision trees. Binary classification tasks (Control vs. SAH and Control vs. CVS, and SAH vs. CVS) were evaluated through stratified repeated cross-validation and hyperparameter optimization. Models achieved high accuracy (up to 0.91) and stable F1-scores across configurations. Key glycans such as FA2(6)G1 (bi-antennary, fucosylated, monogalactosylated), A4G4S3(2) (tetra-antennary, tetra-galactosylated, tri-sialylated), and A3G3S3(5) (tri-antennary, tri-galactosylated, tri-sialylated) emerged as the most discriminative. Visualizations that combine joint feature distributions and decision boundaries provided intuitive insight into the classifier’s logic. These findings support the integration of interpretable glycomics-based models into clinical workflows.
Hivatkozás stílusok: IEEEACMAPAChicagoHarvardCSLMásolásNyomtatás
2026-01-19 12:29