Streaming nested data parallelism on multicores

Frederik Meisner Madsen, Andrzej Filinski

1 citation (Scopus)

Abstract

The paradigm of nested data parallelism (NDP) allows a variety of semi-regular computation tasks to be mapped onto SIMD-style hardware, including GPUs and vector units. However, some care is needed to keep down space consumption in situations where the available parallelism may vastly exceed the available computation resources. To allow for an accurate space-cost model in such cases, we have previously proposed the Streaming NESL language, a refinement of NESL with a high-level notion of streamable sequences.

In this paper, we report on experience with a prototype implementation of Streaming NESL on a 2-level parallel platform, namely a multicore system in which we also aggressively utilize vector instructions on each core. We show that for several examples of simple, but not trivially parallelizable, text-processing tasks, we obtain single-core performance on par with off-the-shelf GNU Coreutils code, and near-linear speedups for multiple cores.

Original language: English
Title: Proceedings of the 5th International Workshop on Functional High-Performance Computing
Number of pages: 8
Publisher: Association for Computing Machinery
Publication date: 8 Sep 2016
Pages: 44-51
ISBN (Electronic): 978-1-4503-4433-3
Status: Published - 8 Sep 2016
Event: International Workshop on Functional High-Performance Computing - Nara, Japan
Duration: 22 Sep 2016 to 22 Sep 2016
Conference number: 5
https://sites.google.com/site/fhpcworkshops/

Workshop

Workshop: International Workshop on Functional High-Performance Computing
Number: 5
Country/Territory: Japan
City: Nara
Period: 22/09/2016 to 22/09/2016
