TY - GEN
T1 - RoVEr
T2 - 27th International Conference on Computer Communications and Networks, ICCCN 2018
AU - Wang, Teng
AU - Nguyen, Nam Son
AU - Wang, Jiayin
AU - Li, Tengpeng
AU - Zhang, Xiaoqian
AU - Mi, Ningfang
AU - Zhao, Bin
AU - Sheng, Bo
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/10/9
Y1 - 2018/10/9
N2 - Erasure Coding based Storage (ECS) is replacing tradition replica-based systems because of its low storage overhead. In an ECS, however, every task needs to fetch remote pieces of data for its execution, and data verification is missing in the current framework. As security issues keep rising and there have been security incidents occurred in big data platforms, the compromised nodes in a computing cluster may manipulate its hosted data fed for other nodes yielding misleading results. Without replicas, it is quite challenging to efficiently verify the data integrity in ECS. In this paper, we develop ROVER, which is an efficient and verifiable ECS for big data platforms. In ROVER, every piece of data is monitored by its checksums stored on a set of witnesses. Bloom filter technique is used on each witness to efficiently keep the records of the checksums. The data verification is based on the majority voting. ROVER also supports a quick reconstruction of Bloom Filter when a node recovers from a failure. We present a complete system framework, security analysis, and a guideline for setting the parameters. The implementation and evaluation show that ROVER is robust and efficient against the attack from the compromised nodes.
AB - Erasure Coding based Storage (ECS) is replacing tradition replica-based systems because of its low storage overhead. In an ECS, however, every task needs to fetch remote pieces of data for its execution, and data verification is missing in the current framework. As security issues keep rising and there have been security incidents occurred in big data platforms, the compromised nodes in a computing cluster may manipulate its hosted data fed for other nodes yielding misleading results. Without replicas, it is quite challenging to efficiently verify the data integrity in ECS. In this paper, we develop ROVER, which is an efficient and verifiable ECS for big data platforms. In ROVER, every piece of data is monitored by its checksums stored on a set of witnesses. Bloom filter technique is used on each witness to efficiently keep the records of the checksums. The data verification is based on the majority voting. ROVER also supports a quick reconstruction of Bloom Filter when a node recovers from a failure. We present a complete system framework, security analysis, and a guideline for setting the parameters. The implementation and evaluation show that ROVER is robust and efficient against the attack from the compromised nodes.
UR - http://www.scopus.com/inward/record.url?scp=85060496880&partnerID=8YFLogxK
U2 - 10.1109/ICCCN.2018.8487406
DO - 10.1109/ICCCN.2018.8487406
M3 - Conference contribution
AN - SCOPUS:85060496880
T3 - Proceedings - International Conference on Computer Communications and Networks, ICCCN
BT - ICCCN 2018 - 27th International Conference on Computer Communications and Networks
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 30 July 2018 through 2 August 2018
ER -