PraDa

Privacy-preserving data-deduplication-as-a-service

Boxiang Dong, Ruilin Liu, Wendy Hui Wang

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, Research, peer-review

8 Citations (Scopus)

Abstract

The data-cleaning-as-a-service (DCaS) paradigm enables users to outsource their data and data cleaning needs to computationally powerful third-party service providers. This raises several security issues, one of which is how the client can protect the private information in the outsourced data. In this paper, we focus on data deduplication as the main data cleaning task, and design two efficient privacy-preserving data-deduplication methods for the DCaS paradigm. We analyze the robustness of our two methods against attacks that exploit the auxiliary frequency distribution and knowledge of the encoding algorithms. Our empirical study demonstrates the efficiency and effectiveness of our privacy-preserving approaches.

Original language: English
Title of host publication: CIKM 2014 - Proceedings of the 2014 ACM International Conference on Information and Knowledge Management
Publisher: Association for Computing Machinery, Inc
Pages: 1559-1568
Number of pages: 10
ISBN (Electronic): 9781450325981
DOIs: https://doi.org/10.1145/2661829.2661863
State: Published - 3 Nov 2014
Event: 23rd ACM International Conference on Information and Knowledge Management, CIKM 2014 - Shanghai, China
Duration: 3 Nov 2014 to 7 Nov 2014

Publication series

Name: CIKM 2014 - Proceedings of the 2014 ACM International Conference on Information and Knowledge Management

Other

Other: 23rd ACM International Conference on Information and Knowledge Management, CIKM 2014
Country: China
City: Shanghai
Period: 3/11/14 to 7/11/14

Fingerprint

Data as a service (DaaS)
Cleaning
Data privacy
Privacy preserving
Data cleaning

Keywords

  • Data deduplication
  • Data-cleaning-as-a-service
  • Outsourcing
  • Privacy-preserving
  • Security

Cite this

Dong, B., Liu, R., & Wang, W. H. (2014). PraDa: Privacy-preserving data-deduplication-as-a-service. In CIKM 2014 - Proceedings of the 2014 ACM International Conference on Information and Knowledge Management (pp. 1559-1568). (CIKM 2014 - Proceedings of the 2014 ACM International Conference on Information and Knowledge Management). Association for Computing Machinery, Inc. https://doi.org/10.1145/2661829.2661863
@inproceedings{d3aa0da54f0845ca8ab546691a08b170,
title = "PraDa: Privacy-preserving data-deduplication-as-a-service",
keywords = "Data deduplication, Data-cleaning-as-a-service, Outsourcing, Privacy-preserving, Security",
author = "Boxiang Dong and Ruilin Liu and Wang, {Wendy Hui}",
year = "2014",
month = "11",
day = "3",
doi = "10.1145/2661829.2661863",
language = "English",
series = "CIKM 2014 - Proceedings of the 2014 ACM International Conference on Information and Knowledge Management",
publisher = "Association for Computing Machinery, Inc",
pages = "1559--1568",
booktitle = "CIKM 2014 - Proceedings of the 2014 ACM International Conference on Information and Knowledge Management",

}
