Activities per year
Abstract
We address the problem of fusing array operations based on criteria such as shape compatibility, data reuse, and minimizing for data reuse, the fusion problem has been formulated as a static weighted graph partitioning problem (known as the Weighted Loop Fusion problem). We show that this scheme cannot accurately track data reuse between multiple independent loops, since it overestimates total data reuse of certain cases. Our formulation in terms of partitions allows use of realistic cost functions that can track resource usage accurately. We give correctness proofs, and prove that WSP can maximize data reuse in programs exactly, in contrast to prior work. For the exact optimal solution, which is NP-hard to find, we present a branch-and-bound algorithm together with a polynomial-time preconditioner that reduces the problem size significantly in practice. We further present a polynomial-time greedy approximation that is fast enough to use for JIT-compilation and gives near-optimal results in practice. All algorithms have been implemented in the automatic parallelization platform Bohrium, run on a set of benchmarks, and compared to existing methods from the literature.
Original language | English |
---|---|
Title of host publication | PACT' 16 : Proceedings of the 2016 International Conference on Parallel Architectures and Compilation |
Number of pages | 15 |
Publisher | Association for Computing Machinery |
Publication date | 2016 |
Pages | 71-85 |
ISBN (Print) | 978-1-4503-4121-9 |
DOIs | |
Publication status | Published - 2016 |
Event | The 25th International Conference on Parallel Architectures and Compilation Techniques - Haifa, Israel Duration: 11 Sept 2016 → 15 Sept 2016 Conference number: 25 |
Conference
Conference | The 25th International Conference on Parallel Architectures and Compilation Techniques |
---|---|
Number | 25 |
Country/Territory | Israel |
City | Haifa |
Period | 11/09/2016 → 15/09/2016 |
Fingerprint
Dive into the research topics of 'Fusion of parallel array operations'. Together they form a unique fingerprint.Activities
- 1 Participation in workshop, seminar, course
-
The 25th International Conference on Parallel Architectures and Compilation Techniques
James Emil Avery (Participant)
9 Sept 2016 → 13 Sept 2016Activity: Participating in or organising an event types › Participation in workshop, seminar, course