Characterizing AI Model Inference Applications Running in the SGX Environment

Shixiong Jing, Qinkun Bao, Pei Wang, Xulong Tang, Dinghao Wu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Intel Software Guard Extensions (SGX) is a set of extensions built into Intel CPUs for the trusted computation. It creates a hardware-assisted secure container, within which programs are protected from data leakage and data manipulations by privileged software and hypervisors. With the trend that more and more machine learning based programs are moving to cloud computing, SGX can be used in cloud-based Machine Learning applications to protect user data from malicious privileged programs.However, applications running in SGX suffer from several overheads, including frequent context switching, memory page encryption/decryption, and memory page swapping, which significantly degrade the execution efficiency. In this paper, we aim to i) comprehensively explore the execution of general AI applications running on SGX, ii) systematically characterize the data reuses at both page granularity and cacheline granularity, and iii) provide optimization insights for efficient deployment of machine learning based applications on SGX. To the best of our knowledge, our work is the first to study machine learning applications on SGX and explore the potential of data reuses to reduce the runtime overheads in SGX.

Original languageEnglish (US)
Title of host publication2021 IEEE International Conference on Networking, Architecture and Storage, NAS 2021 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728177441
DOIs
StatePublished - 2021
Event15th IEEE International Conference on Networking, Architecture and Storage, NAS 2021 - Riverside, United States
Duration: Oct 24 2021Oct 26 2021

Publication series

Name2021 IEEE International Conference on Networking, Architecture and Storage, NAS 2021 - Proceedings

Conference

Conference15th IEEE International Conference on Networking, Architecture and Storage, NAS 2021
Country/TerritoryUnited States
CityRiverside
Period10/24/2110/26/21

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Hardware and Architecture
  • Information Systems
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'Characterizing AI Model Inference Applications Running in the SGX Environment'. Together they form a unique fingerprint.

Cite this