Interpretable Latent Space to Enable Counterfactual Explanations

Bodria, F.; Guidotti, R.; Giannotti, F.; Pedreschi, D.

doi:10.1007/978-3-031-18840-4_37

Many dimensionality reduction methods have been introduced to map a data space into one with fewer features and enhance machine learning models’ capabilities. This reduced space, called latent space, holds properties that allow researchers to understand the data better and produce better models. This work proposes an interpretable latent space that preserves the similarity of data points and supports a new way of learning a classification model that allows prediction and explanation through counterfactual examples. We demonstrate with extensive experiments the effectiveness of the latent space with respect to different metrics in comparison with several competitors, as well as the quality of the achieved counterfactual explanations.

2022-01-01

Year

2022

ISBN

978-3-031-18839-8
978-3-031-18840-4

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11568/1162775

Warning: the data shown have not been validated by the university.

CINECA IRIS Institutional Research Information System
