Prefetch Tuning Optimizations

Diana Guttman, Meenakshi Arunachalam, Vlad Calina, Mahmut Taylan Kandemir

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

This chapter looks at methods to improve prefetching effectiveness, and therefore increase performance of applications, through the use of the superior knowledge of the programmer. It is known that prefetching is extremely important for good performance on in-order architectures like the Intel Xeon Phi coprocessor however, the authors surprised even themselves by being able to expose techniques which show value on out-of-order cores as well. Often simply tuning the compiler prefetching distance is an easy way for application developers to get better performance without having to rewrite their code. In some cases, the more labor-intensive method of adding intrinsics for prefetching may be worthwhile.

Original languageEnglish (US)
Title of host publicationHigh Performance Parallelism Pearls
Subtitle of host publicationMulticore and Many-core Programming Approaches
PublisherElsevier Inc.
Pages401-419
Number of pages19
Volume2
ISBN (Electronic)9780128038901
ISBN (Print)9780128038192
DOIs
StatePublished - Jul 23 2015

All Science Journal Classification (ASJC) codes

  • Computer Science(all)

Fingerprint Dive into the research topics of 'Prefetch Tuning Optimizations'. Together they form a unique fingerprint.

  • Cite this

    Guttman, D., Arunachalam, M., Calina, V., & Kandemir, M. T. (2015). Prefetch Tuning Optimizations. In High Performance Parallelism Pearls: Multicore and Many-core Programming Approaches (Vol. 2, pp. 401-419). Elsevier Inc.. https://doi.org/10.1016/B978-0-12-803819-2.00018-5