K-nearest neighbor classification over semantically secure encrypted relational data

Bharath Kumar Samanthula, Yousef Elmehdwi, Wei Jiang

Research output: Contribution to journalArticle

72 Citations (Scopus)

Abstract

Data Mining has wide applications in many areas such as banking, medicine, scientific research and among government agencies. Classification is one of the commonly used tasks in data mining applications. For the past decade, due to the rise of various privacy issues, many theoretical and practical solutions to the classification problem have been proposed under different security models. However, with the recent popularity of cloud computing, users now have the opportunity to outsource their data, in encrypted form, as well as the data mining tasks to the cloud. Since the data on the cloud is in encrypted form, existing privacy-preserving classification techniques are not applicable. In this paper, we focus on solving the classification problem over encrypted data. In particular, we propose a secure k-NN classifier over encrypted data in the cloud. The proposed protocol protects the confidentiality of data, privacy of user's input query, and hides the data access patterns. To the best of our knowledge, our work is the first to develop a secure k-NN classifier over encrypted data under the semi-honest model. Also, we empirically analyze the efficiency of our proposed protocol using a real-world dataset under different parameter settings.

Original languageEnglish
Article number6930802
Pages (from-to)1261-1273
Number of pages13
JournalIEEE Transactions on Knowledge and Data Engineering
Volume27
Issue number5
DOIs
StatePublished - 1 May 2015

Fingerprint

Data mining
Classifiers
Data privacy
Cloud computing
Medicine

Keywords

  • Encryption
  • Outsourced Databases
  • Security
  • k-NN Classifier

Cite this

@article{d49a3cc079a541e8b57732db39f235c4,
title = "K-nearest neighbor classification over semantically secure encrypted relational data",
abstract = "Data Mining has wide applications in many areas such as banking, medicine, scientific research and among government agencies. Classification is one of the commonly used tasks in data mining applications. For the past decade, due to the rise of various privacy issues, many theoretical and practical solutions to the classification problem have been proposed under different security models. However, with the recent popularity of cloud computing, users now have the opportunity to outsource their data, in encrypted form, as well as the data mining tasks to the cloud. Since the data on the cloud is in encrypted form, existing privacy-preserving classification techniques are not applicable. In this paper, we focus on solving the classification problem over encrypted data. In particular, we propose a secure k-NN classifier over encrypted data in the cloud. The proposed protocol protects the confidentiality of data, privacy of user's input query, and hides the data access patterns. To the best of our knowledge, our work is the first to develop a secure k-NN classifier over encrypted data under the semi-honest model. Also, we empirically analyze the efficiency of our proposed protocol using a real-world dataset under different parameter settings.",
keywords = "Encryption, Outsourced Databases, Security, k-NN Classifier",
author = "Samanthula, {Bharath Kumar} and Yousef Elmehdwi and Wei Jiang",
year = "2015",
month = "5",
day = "1",
doi = "10.1109/TKDE.2014.2364027",
language = "English",
volume = "27",
pages = "1261--1273",
journal = "IEEE Transactions on Knowledge and Data Engineering",
issn = "1041-4347",
publisher = "IEEE Computer Society",
number = "5",

}

K-nearest neighbor classification over semantically secure encrypted relational data. / Samanthula, Bharath Kumar; Elmehdwi, Yousef; Jiang, Wei.

In: IEEE Transactions on Knowledge and Data Engineering, Vol. 27, No. 5, 6930802, 01.05.2015, p. 1261-1273.

Research output: Contribution to journalArticle

TY - JOUR

T1 - K-nearest neighbor classification over semantically secure encrypted relational data

AU - Samanthula, Bharath Kumar

AU - Elmehdwi, Yousef

AU - Jiang, Wei

PY - 2015/5/1

Y1 - 2015/5/1

N2 - Data Mining has wide applications in many areas such as banking, medicine, scientific research and among government agencies. Classification is one of the commonly used tasks in data mining applications. For the past decade, due to the rise of various privacy issues, many theoretical and practical solutions to the classification problem have been proposed under different security models. However, with the recent popularity of cloud computing, users now have the opportunity to outsource their data, in encrypted form, as well as the data mining tasks to the cloud. Since the data on the cloud is in encrypted form, existing privacy-preserving classification techniques are not applicable. In this paper, we focus on solving the classification problem over encrypted data. In particular, we propose a secure k-NN classifier over encrypted data in the cloud. The proposed protocol protects the confidentiality of data, privacy of user's input query, and hides the data access patterns. To the best of our knowledge, our work is the first to develop a secure k-NN classifier over encrypted data under the semi-honest model. Also, we empirically analyze the efficiency of our proposed protocol using a real-world dataset under different parameter settings.

AB - Data Mining has wide applications in many areas such as banking, medicine, scientific research and among government agencies. Classification is one of the commonly used tasks in data mining applications. For the past decade, due to the rise of various privacy issues, many theoretical and practical solutions to the classification problem have been proposed under different security models. However, with the recent popularity of cloud computing, users now have the opportunity to outsource their data, in encrypted form, as well as the data mining tasks to the cloud. Since the data on the cloud is in encrypted form, existing privacy-preserving classification techniques are not applicable. In this paper, we focus on solving the classification problem over encrypted data. In particular, we propose a secure k-NN classifier over encrypted data in the cloud. The proposed protocol protects the confidentiality of data, privacy of user's input query, and hides the data access patterns. To the best of our knowledge, our work is the first to develop a secure k-NN classifier over encrypted data under the semi-honest model. Also, we empirically analyze the efficiency of our proposed protocol using a real-world dataset under different parameter settings.

KW - Encryption

KW - Outsourced Databases

KW - Security

KW - k-NN Classifier

UR - http://www.scopus.com/inward/record.url?scp=84926628839&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2014.2364027

DO - 10.1109/TKDE.2014.2364027

M3 - Article

AN - SCOPUS:84926628839

VL - 27

SP - 1261

EP - 1273

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

SN - 1041-4347

IS - 5

M1 - 6930802

ER -