Background/Objectives: Human carbonic anhydrases (hCAs) are metalloenzymes involved in essential physiological processes, and their selective inhibition holds therapeutic potential across a wide range of disorders. However, the high degree of structural similarity among isoforms poses a significant challenge for the design of selective inhibitors. In this work, we present a machine learning (ML)-based platform for the isoform-specific prediction and profiling of small molecules targeting hCA I, II, IX, and XII. Methods: By integrating four molecular representations with four ML algorithms, we built 64 classification models, each extensively optimized and validated. The best-performing models for each isoform were applied in a virtual screening campaign for ~2 million compounds. Results: Following a multi-step refinement process, 12 candidates were identified, purchased, and experimentally tested. Several compounds showed potent inhibitory activity in the nanomolar to submicromolar range, with selectivity profiles across the isoforms. To gain mechanistic insights, SHAP-based feature importance analysis and molecular docking supported by molecular dynamics simulations were employed, highlighting the structural determinants of the predicted activity. Conclusions: This study demonstrates the effectiveness of integrating ML, cheminformatics, and experimental validation to accelerate the discovery of selective carbonic anhydrase inhibitors and provides a generalizable framework for activity profiling across enzyme isoforms.
A Machine Learning Platform for Isoform-Specific Identification and Profiling of Human Carbonic Anhydrase Inhibitors
Piazza, LisaPrimo
;Di Stefano, Miriana;Poles, Clarissa;Bononi, Giulia;Poli, Giulio;Galati, Salvatore;Macchia, Marco;Tuccinardi, Tiziano
Ultimo
2025-01-01
Abstract
Background/Objectives: Human carbonic anhydrases (hCAs) are metalloenzymes involved in essential physiological processes, and their selective inhibition holds therapeutic potential across a wide range of disorders. However, the high degree of structural similarity among isoforms poses a significant challenge for the design of selective inhibitors. In this work, we present a machine learning (ML)-based platform for the isoform-specific prediction and profiling of small molecules targeting hCA I, II, IX, and XII. Methods: By integrating four molecular representations with four ML algorithms, we built 64 classification models, each extensively optimized and validated. The best-performing models for each isoform were applied in a virtual screening campaign for ~2 million compounds. Results: Following a multi-step refinement process, 12 candidates were identified, purchased, and experimentally tested. Several compounds showed potent inhibitory activity in the nanomolar to submicromolar range, with selectivity profiles across the isoforms. To gain mechanistic insights, SHAP-based feature importance analysis and molecular docking supported by molecular dynamics simulations were employed, highlighting the structural determinants of the predicted activity. Conclusions: This study demonstrates the effectiveness of integrating ML, cheminformatics, and experimental validation to accelerate the discovery of selective carbonic anhydrase inhibitors and provides a generalizable framework for activity profiling across enzyme isoforms.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


