Optimizing energy consumption in GPUS through feedback-driven CTA scheduling

Amin Jadidi, Mohammad Arjomand, Mahmut Taylan Kandemir, Chita R. Das

Research output: Contribution to journalConference article

1 Scopus citations

Abstract

Emerging GPU architectures offer a cost-effective computing platform by providing thousands of energy-efficient compute cores and high bandwidth memory that facilitate the execution of highly parallel applications. In this paper, we show that different applications, and in fact different kernels from the same application might exhibit significantly varying utilizations of compute and memory resources. In order to improve the energy efficiency of the GPU system, we propose a run-time characterization strategy that classifies kernels as compute- or memory-intensive based on their resource utilizations. Using this knowledge, our proposed mechanism employs core shut-down technique for memory-intensive kernels in order to manage energy in a more efficient way. This strategy uses performance and memory bandwidth utilization information to determine the ideal hardware configuration at run-time. The proposed technique saves on average 21% of total chip energy for memory-intensive applications, which is within 8% of the optimal saving that can be obtained from an oracle scheme.

Original languageEnglish (US)
Pages (from-to)129-140
Number of pages12
JournalSimulation Series
Volume49
Issue number3
StatePublished - Jan 1 2017
Event25th High Performance Computing Symposium, HPC 2017, Part of the 2017 Spring Simulation Multi-Conference, SpringSim 2017 - Virginia Beach, United States
Duration: Apr 23 2017Apr 26 2017

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications

Fingerprint Dive into the research topics of 'Optimizing energy consumption in GPUS through feedback-driven CTA scheduling'. Together they form a unique fingerprint.

  • Cite this