This paper reports some recent results from the empirical evaluation of different types of structured molecular representations used in QSPR analysis through a Recursive Neural Network (RNN) model, which allows for their direct use without the need for measuring or computing molecular descriptors. This RNN methodology has been applied to the prediction of the properties of small molecules and polymers. In particular, three different descriptions of cyclic moieties, namely group, template and cycle break have been proposed. The effectiveness of the proposed method in dealing with different representations of chemical structures, either specifically designed or of more general use, has been demonstrated by its application to data sets encompassing various types of cyclic structures. For each class of experiments a test set with data that were not used for the development of the model was used for validation, and the comparisons have been based on the test results. The reported results highlight the flexibility of the RNN in directly treating different classes of structured input data without using input descriptors.
Evaluation of hierarchical structured representations for QSPR studies of small molecules and polymers by recursive neural networks
DUCE, CELIA;MICHELI, ALESSIO;SOLARO, ROBERTO;STARITA, ANTONINA;TINE', MARIA ROSARIA
2009-01-01
Abstract
This paper reports some recent results from the empirical evaluation of different types of structured molecular representations used in QSPR analysis through a Recursive Neural Network (RNN) model, which allows for their direct use without the need for measuring or computing molecular descriptors. This RNN methodology has been applied to the prediction of the properties of small molecules and polymers. In particular, three different descriptions of cyclic moieties, namely group, template and cycle break have been proposed. The effectiveness of the proposed method in dealing with different representations of chemical structures, either specifically designed or of more general use, has been demonstrated by its application to data sets encompassing various types of cyclic structures. For each class of experiments a test set with data that were not used for the development of the model was used for validation, and the comparisons have been based on the test results. The reported results highlight the flexibility of the RNN in directly treating different classes of structured input data without using input descriptors.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.