Analysis and optimization of gyrokinetic toroidal simulations on homogenous and heterogenous platforms

Khaled Z. Ibrahim, Kamesh Madduri, Samuel Williams, Bei Wang, Stephane Ethier, Leonid Oliker

Research output: Contribution to journal (Article)

13 Scopus citations


The Gyrokinetic Toroidal Code (GTC) uses the particle-in-cell method to efficiently simulate plasma microturbulence. This work presents novel analysis and optimization techniques to enhance the performance of GTC on large-scale machines. We introduce cell access analysis to better manage locality vs. synchronization tradeoffs on CPU and GPU-based architectures. Our optimized hybrid parallel implementation of GTC uses MPI, OpenMP, and NVIDIA CUDA, achieves up to a 2× speedup over the reference Fortran version on multiple parallel systems, and scales efficiently to tens of thousands of cores.
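The locality vs. synchronization tradeoff mentioned in the abstract can be illustrated with a toy charge-deposition step, the scatter phase of a particle-in-cell method. The sketch below is not taken from GTC or from the paper. It assumes a 1D periodic grid with linear weighting, and the function names `deposit_shared` and `deposit_private` are hypothetical. One variant accumulates into a single shared grid, where `np.add.at` stands in for the atomic updates a threaded code would need. The other gives each worker a private grid copy and reduces the copies afterwards, trading extra memory for conflict-free writes.

```python
import numpy as np


def deposit_shared(pos, charge, ncell):
    """Scatter charge onto one shared periodic grid.

    np.add.at accumulates correctly when several particles hit the same
    cell; it stands in for atomic updates in a threaded implementation.
    """
    grid = np.zeros(ncell)
    cell = np.floor(pos)
    left = cell.astype(int) % ncell      # index of the grid point to the left
    w = pos - cell                       # fractional distance to that point
    np.add.at(grid, left, charge * (1.0 - w))
    np.add.at(grid, (left + 1) % ncell, charge * w)
    return grid


def deposit_private(pos, charge, ncell, nworkers):
    """Scatter charge using one private grid per (simulated) worker.

    Each worker writes only to its own copy, so no synchronization is
    needed during the scatter; the copies are summed in a final reduction.
    """
    grids = np.zeros((nworkers, ncell))
    pos_chunks = np.array_split(pos, nworkers)
    charge_chunks = np.array_split(charge, nworkers)
    for k, (p, q) in enumerate(zip(pos_chunks, charge_chunks)):
        grids[k] = deposit_shared(p, q, ncell)
    return grids.sum(axis=0)
```

Both variants yield the same grid up to floating-point rounding. In a real parallel code the choice between them depends on how many particles contend for each cell and how much memory the private copies cost, which is the kind of question an access analysis would inform.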

Original language: English (US)
Pages (from-to): 454-473
Number of pages: 20
Journal: International Journal of High Performance Computing Applications
Issue number: 4
State: Published - Oct 30 2013


All Science Journal Classification (ASJC) codes

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture
