Congestion-aware memory management on NUMA platforms: A VMware ESXi case study

Jagadish B. Kotra, Seongbeom Kim, Kamesh Madduri, Mahmut T. Kandemir

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

He VMware ESXi hypervisor attracts a wide range of customers and is deployed in domains ranging from desktop computing to server computing. While the software systems are increasingly moving towards consolidation, hardware has already transitioned into multi-socket Non-Uniform Memory Access (NUMA)-based systems. The marriage of increasing consolidation and the multi-socket based systems warrants low-overhead, simple and practical mechanisms to detect and address performance bottlenecks, without causing additional contention for shared resources such as performance counters. In this paper, we propose a simple, practical and highly accurate, dynamic memory latency probing mechanism to detect memory congestion in a NUMA system. Using these dynamic probed latencies, we propose congestion-aware memory allocation, congestion-aware memory migration, and a combination of these two techniques. These proposals, evaluated on Intel Westmere (8 nodes) and Intel Haswell (2 nodes) using various workloads, improve the overall performance on an average by 7.2% and 9.5% respectively.

Original languageEnglish (US)
Title of host publicationProceedings of the 2017 IEEE International Symposium on Workload Characterization, IISWC 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages146-155
Number of pages10
ISBN (Electronic)9781538612323
DOIs
StatePublished - Dec 5 2017
Event2017 IEEE International Symposium on Workload Characterization, IISWC 2017 - Seattle, United States
Duration: Oct 1 2017Oct 3 2017

Publication series

NameProceedings of the 2017 IEEE International Symposium on Workload Characterization, IISWC 2017
Volume2017-January

Other

Other2017 IEEE International Symposium on Workload Characterization, IISWC 2017
CountryUnited States
CitySeattle
Period10/1/1710/3/17

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture
  • Information Systems and Management

Fingerprint Dive into the research topics of 'Congestion-aware memory management on NUMA platforms: A VMware ESXi case study'. Together they form a unique fingerprint.

Cite this