In this paper, we propose a compilation tool-chain supporting the eective exploitation of multi-core architectures oering hundreds of cores. The tool-chain leverages on both the application requirements and the platform-specic features to provide developers with a powerful parallel-programming environment able to generate ecient parallel code. The design of parallel applications follows a semi-automatic approach enabling the programmer to transfer to back-end tools platform-specic code generation and optimization, thus making possible to avoid the clobbering of code with non-portable and complex directives. The programmer can graphically parallelize the application (mainly data-streaming ones) for the target platform using Thales' Spear Design Environment. The resulting parallelization is generated under the form of an Intermediate Representation, which is then passed to the back-end tools (HPC Project's Par4All) that generates efficient target code. We present the results obtained parallelizing a small subset of the RT-STAP radar algorithm and the Chirp filltering algorithm on standard multi-core and on nVidia GPUs.

An innovative compilation tool-chain for embedded multi-core architectures

TORQUATI, MASSIMO;VANNESCHI, MARCO;
2012

Abstract

In this paper, we propose a compilation tool-chain supporting the eective exploitation of multi-core architectures oering hundreds of cores. The tool-chain leverages on both the application requirements and the platform-specic features to provide developers with a powerful parallel-programming environment able to generate ecient parallel code. The design of parallel applications follows a semi-automatic approach enabling the programmer to transfer to back-end tools platform-specic code generation and optimization, thus making possible to avoid the clobbering of code with non-portable and complex directives. The programmer can graphically parallelize the application (mainly data-streaming ones) for the target platform using Thales' Spear Design Environment. The resulting parallelization is generated under the form of an Intermediate Representation, which is then passed to the back-end tools (HPC Project's Par4All) that generates efficient target code. We present the results obtained parallelizing a small subset of the RT-STAP radar algorithm and the Chirp filltering algorithm on standard multi-core and on nVidia GPUs.
9783645500722
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11568/566469
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact