Incremental Training of a Recurrent Neural Network Exploiting a Multi-Scale Dynamic Memory

Carta, Antonio; Sperduti, Alessandro; Bacciu, Davide

doi:10.1007/978-3-030-67658-2_39

The effectiveness of recurrent neural networks can be strongly influenced by their ability to store in their dynamic memory information extracted from input sequences at different frequencies and timescales. Such a feature can be introduced into a neural architecture by an appropriate modularization of the dynamic memory. In this paper we propose a novel, incrementally trained recurrent architecture that explicitly targets multi-scale learning. First, we show how to extend the architecture of a simple RNN by separating its hidden state into different modules, each subsampling the network's hidden activations at a different frequency. Then, we discuss a training algorithm in which new modules are iteratively added to the model to learn progressively longer dependencies. Each new module works at a slower frequency than the previous ones and is initialized to encode the subsampled sequence of hidden activations. Experimental results on synthetic and real-world datasets for speech recognition and handwritten characters show that the modular architecture and the incremental training algorithm improve the ability of recurrent neural networks to capture long-term dependencies.
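The idea of a hidden state split into modules that run at different frequencies can be illustrated with a minimal sketch. The abstract does not give the update equations, so everything specific here is an assumption made for illustration: the function name `multiscale_rnn_forward`, the power-of-two update schedule (module k refreshes every 2**k steps), the `tanh` nonlinearity, and the randomly initialized weights. The incremental training procedure and the initialization of new modules are not shown.

```python
import numpy as np


def multiscale_rnn_forward(inputs, n_modules=3, module_size=4, seed=0):
    """Toy forward pass of a modular RNN (hypothetical, not the paper's equations).

    Assumption: module k is refreshed only when t % 2**k == 0, so higher-index
    modules change more slowly and hold their state in between.
    """
    rng = np.random.default_rng(seed)
    input_size = inputs.shape[1]
    hidden_size = n_modules * module_size

    # Hypothetical parameters, randomly initialized for illustration only.
    W_in = rng.normal(scale=0.1, size=(hidden_size, input_size))
    W_rec = rng.normal(scale=0.1, size=(hidden_size, hidden_size))

    h = np.zeros(hidden_size)
    update_counts = np.zeros(n_modules, dtype=int)
    states = []
    for t, x in enumerate(inputs):
        # Candidate activation for the whole hidden state, computed from the
        # previous state before any module is overwritten.
        candidate = np.tanh(W_in @ x + W_rec @ h)
        for k in range(n_modules):
            if t % (2 ** k) == 0:
                block = slice(k * module_size, (k + 1) * module_size)
                h[block] = candidate[block]
                update_counts[k] += 1
        states.append(h.copy())
    return np.stack(states), update_counts


# Usage: 8 time steps of 2-dimensional input.
inputs = np.ones((8, 2))
states, update_counts = multiscale_rnn_forward(inputs)
```

With three modules and eight time steps, the fastest module is refreshed at every step while the slowest is refreshed only at steps 0 and 4, which is what lets the slower blocks retain information over longer spans in this simplified setting.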

2020-01-01

Year: 2020
ISBN: 978-3-030-67664-3
Publication type: 4.1 Conference proceedings contribution

Use this identifier to cite or link to this document: https://hdl.handle.net/11568/1078279
