To build more accurate and trustworthy artificial intelligence algorithms in deep learning, it is essential to understand the mechanisms driving classification systems to identify their targets. Typically, post hoc methods are used to provide insights into this process. By contrast, in this work, we investigate the possibility of using class activation maps in combination with contrastive loss to enhance the reliability of training of a deep learning model. MNIST and Fashion MNIST datasets are considered in our investigation since they have already proven a practical starting point for assessing an almost tautologic training strategy for deep learning algorithms, given that classification targets are the primary significant content of the images in these datasets. Starting from the raw comparison of accuracy and system complexity of the proposed approach, a further investigation of the technique’s feasibility in a deep learning study is conducted over six random seed splits of the training data and model performance. A modern deep learning network, such as ConvNeXT, determines whether a more robust architecture trained with the proposed mechanics provides better insights than a simple convolutional neural network. This investigation also addresses the importance of skip connections, structured learning layers, and feature map dimensions in the learning process.

You’ve Got the Wrong Outfit: Evaluating Deep Learning Paradigms on Digit and Fashion Recognition

Ignesti, Giacomo;Martinelli, Massimo;
2025-01-01

Abstract

To build more accurate and trustworthy artificial intelligence algorithms in deep learning, it is essential to understand the mechanisms driving classification systems to identify their targets. Typically, post hoc methods are used to provide insights into this process. By contrast, in this work, we investigate the possibility of using class activation maps in combination with contrastive loss to enhance the reliability of training of a deep learning model. MNIST and Fashion MNIST datasets are considered in our investigation since they have already proven a practical starting point for assessing an almost tautologic training strategy for deep learning algorithms, given that classification targets are the primary significant content of the images in these datasets. Starting from the raw comparison of accuracy and system complexity of the proposed approach, a further investigation of the technique’s feasibility in a deep learning study is conducted over six random seed splits of the training data and model performance. A modern deep learning network, such as ConvNeXT, determines whether a more robust architecture trained with the proposed mechanics provides better insights than a simple convolutional neural network. This investigation also addresses the importance of skip connections, structured learning layers, and feature map dimensions in the learning process.
2025
9783031876622
9783031876639
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11568/1352608
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact