TY - JOUR
T1 - Achieving Accountable MapReduce in cloud computing
AU - Xiao, Zhifeng
AU - Xiao, Yang
N1 - Funding Information:
This work was supported in part by the US National Science Foundation (NSF) under grants CNS-0737325 , CNS-0716211 , CCF-0829827 , and CNS-1059265 .
PY - 2014
Y1 - 2014
N2 - MapReduce is a programming model that is capable of processing large data sets in distributed computing environments. The original MapReduce model was designed to be fault-tolerant in case of various network abnormalities. However, fault-tolerance does not guarantee that each working machine will be completely accountable; when nodes are malicious, they may intentionally misrepresent the processing result during mapping or reducing, and they may thus make the final results inaccurate and untrustworthy. In this paper, we propose Accountable MapReduce, which forces each machine to be held responsible for its behaviors. In our approach, we set up a group of auditors to perform an Accountability Test (A-test) that checks all of the working machines and detects malicious nodes in real time. The A-test can be implemented with different options depending upon how the auditors are assigned. To optimize the utilization resource, we also formalize the Optimal Worker and Auditor Assignment (OWAA) problem, which is aimed at finding the optimal number of workers and auditors in order to minimize the total processing time. Our evaluation results show that the A-test can be practically and effectively applied to existing cloud platforms employing MapReduce.
AB - MapReduce is a programming model that is capable of processing large data sets in distributed computing environments. The original MapReduce model was designed to be fault-tolerant in case of various network abnormalities. However, fault-tolerance does not guarantee that each working machine will be completely accountable; when nodes are malicious, they may intentionally misrepresent the processing result during mapping or reducing, and they may thus make the final results inaccurate and untrustworthy. In this paper, we propose Accountable MapReduce, which forces each machine to be held responsible for its behaviors. In our approach, we set up a group of auditors to perform an Accountability Test (A-test) that checks all of the working machines and detects malicious nodes in real time. The A-test can be implemented with different options depending upon how the auditors are assigned. To optimize the utilization resource, we also formalize the Optimal Worker and Auditor Assignment (OWAA) problem, which is aimed at finding the optimal number of workers and auditors in order to minimize the total processing time. Our evaluation results show that the A-test can be practically and effectively applied to existing cloud platforms employing MapReduce.
UR - http://www.scopus.com/inward/record.url?scp=84883165397&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84883165397&partnerID=8YFLogxK
U2 - 10.1016/j.future.2013.07.001
DO - 10.1016/j.future.2013.07.001
M3 - Article
AN - SCOPUS:84883165397
SN - 0167-739X
VL - 30
SP - 1
EP - 13
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
IS - 1
ER -