TY - JOUR
T1 - Service placement and request scheduling for data-intensive applications in edge clouds
AU - Farhadi, Vajiheh
AU - Mehmeti, Fidan
AU - He, Ting
AU - Porta, Thomas F.La
AU - Khamfroush, Hana
AU - Wang, Shiqiang
AU - Chan, Kevin S.
AU - Poularakis, Konstantinos
N1 - Funding Information:
Manuscript received December 19, 2019; revised May 14, 2020; accepted December 17, 2020; approved by IEEE/ACM TRANSACTIONS ON NET-WORKING Editor H. Shen. Date of publication February 3, 2021; date of current version April 16, 2021. This work was supported by the U.S. Army Research Laboratory and the U.K. Ministry of Defence under Agreement W911NF-16-3-0001. This article was presented in part at the 38th 2018 IEEE International Conference on Distributed Computing Systems (IEEE ICDCS 2018) and in part at the 2019 IEEE Conference on Computer Communications Workshops (IEEE INFOCOM 2019). (Corresponding author: Vajiheh Farhadi.) Vajiheh Farhadi, Fidan Mehmeti, Ting He, and Thomas F. La Porta are with the Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA 16802 USA (e-mail: vuf8@cse.psu.edu; fzm82@cse.psu.edu; tzh58@cse.psu.edu; tlp@cse.psu.edu).
Funding Information:
She has received several research grants, including the CRII Award from the National Science Foundation in support of her research.
Publisher Copyright:
© 1993-2012 IEEE.
PY - 2021/4
Y1 - 2021/4
N2 - Mobile edge computing provides the opportunity for wireless users to exploit the power of cloud computing without a large communication delay. To serve data-intensive applications (e.g., video analytics, machine learning tasks) from the edge, we need, in addition to computation resources, storage resources for storing server code and data as well as network bandwidth for receiving user-provided data. Moreover, due to time-varying demands, the code and data placement needs to be adjusted over time, which raises concerns of system stability and operation cost. In this paper, we address these issues by proposing a two-time-scale framework that jointly optimizes service (code and data) placement and request scheduling, while considering storage, communication, computation, and budget constraints. First, by analyzing the hardness of various cases, we completely characterize the complexity of our problem. Next, we develop a polynomial-time service placement algorithm by formulating our problem as a set function optimization, which attains a constant-factor approximation under certain conditions. Furthermore, we develop a polynomial-time request scheduling algorithm by computing the maximum flow in a carefully constructed auxiliary graph, which satisfies hard resource constraints and is provably optimal in the special case where requests have homogeneous resource demands. Extensive synthetic and trace-driven simulations show that the proposed algorithms achieve 90% of the optimal performance.
AB - Mobile edge computing provides the opportunity for wireless users to exploit the power of cloud computing without a large communication delay. To serve data-intensive applications (e.g., video analytics, machine learning tasks) from the edge, we need, in addition to computation resources, storage resources for storing server code and data as well as network bandwidth for receiving user-provided data. Moreover, due to time-varying demands, the code and data placement needs to be adjusted over time, which raises concerns of system stability and operation cost. In this paper, we address these issues by proposing a two-time-scale framework that jointly optimizes service (code and data) placement and request scheduling, while considering storage, communication, computation, and budget constraints. First, by analyzing the hardness of various cases, we completely characterize the complexity of our problem. Next, we develop a polynomial-time service placement algorithm by formulating our problem as a set function optimization, which attains a constant-factor approximation under certain conditions. Furthermore, we develop a polynomial-time request scheduling algorithm by computing the maximum flow in a carefully constructed auxiliary graph, which satisfies hard resource constraints and is provably optimal in the special case where requests have homogeneous resource demands. Extensive synthetic and trace-driven simulations show that the proposed algorithms achieve 90% of the optimal performance.
UR - http://www.scopus.com/inward/record.url?scp=85100767228&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100767228&partnerID=8YFLogxK
U2 - 10.1109/TNET.2020.3048613
DO - 10.1109/TNET.2020.3048613
M3 - Article
AN - SCOPUS:85100767228
SN - 1063-6692
VL - 29
SP - 779
EP - 792
JO - IEEE/ACM Transactions on Networking
JF - IEEE/ACM Transactions on Networking
IS - 2
M1 - 9345766
ER -