TY - JOUR
T1 - Flat and hierarchical system deployment for edge computing systems
AU - Wang, En
AU - Li, Dawei
AU - Dong, Boxiang
AU - Zhou, Huan
AU - Zhu, Michelle
N1 - Publisher Copyright:
© 2019 Elsevier B.V.
PY - 2020/4
Y1 - 2020/4
N2 - In this paper, we consider the server allocation problem for edge computing system deployment where each edge cloud is modeled as an M/M/c queue. Our goal is to minimize the overall average system response time of application requests generated by all mobile devices/users. We consider two approaches for edge cloud deployment: the flat deployment, where all edge clouds are co-located with the base stations, and the hierarchical deployment, where edge clouds can be co-located with other system components besides the base stations. In flat deployment, we demonstrate that the allocation of edge cloud servers should be balanced across all the base stations, if the application request arrival rates at the base stations are equal to each other; if the application request arrival rates are not the same, we propose a Largest Weighted Reduction Time First (LWRTF) algorithm to assign servers to edge clouds. Numerical comparisons of the proposed algorithm against several other reasonably designed heuristics verify that algorithm LWRTF has very good performances in terms of minimizing the average system response time. By theoretical analysis and numerical evaluations, we also show that, the hierarchical deployment approach has great potentials in minimizing the overall average system response time compared to the flat deployment approach. We also investigate the server allocation problem in hierarchical deployment and derive important insights to guide practical edge cloud server allocation in real-world systems.
AB - In this paper, we consider the server allocation problem for edge computing system deployment where each edge cloud is modeled as an M/M/c queue. Our goal is to minimize the overall average system response time of application requests generated by all mobile devices/users. We consider two approaches for edge cloud deployment: the flat deployment, where all edge clouds are co-located with the base stations, and the hierarchical deployment, where edge clouds can be co-located with other system components besides the base stations. In flat deployment, we demonstrate that the allocation of edge cloud servers should be balanced across all the base stations, if the application request arrival rates at the base stations are equal to each other; if the application request arrival rates are not the same, we propose a Largest Weighted Reduction Time First (LWRTF) algorithm to assign servers to edge clouds. Numerical comparisons of the proposed algorithm against several other reasonably designed heuristics verify that algorithm LWRTF has very good performances in terms of minimizing the average system response time. By theoretical analysis and numerical evaluations, we also show that, the hierarchical deployment approach has great potentials in minimizing the overall average system response time compared to the flat deployment approach. We also investigate the server allocation problem in hierarchical deployment and derive important insights to guide practical edge cloud server allocation in real-world systems.
KW - Average system response time
KW - Edge cloud
KW - Edge computing
KW - Flat deployment
KW - Hierarchical deployment
UR - http://www.scopus.com/inward/record.url?scp=85076262273&partnerID=8YFLogxK
U2 - 10.1016/j.future.2019.12.004
DO - 10.1016/j.future.2019.12.004
M3 - Article
AN - SCOPUS:85076262273
SN - 0167-739X
VL - 105
SP - 308
EP - 317
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -