N-gram based secure similar document detection

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

17 Scopus citations


Secure similar document detection (SSDD) plays an important role in many applications, such as justifying the need-to-know basis and facilitating communication between government agencies. The SSDD problem considers situations where Alice with a query document wants to find similar information from Bob's document collection. During this process, the content of the query document is not disclosed to Bob, and Bob's document collection is not disclosed to Alice. Existing SSDD protocols are developed under the vector space model, which has the advantage of identifying global similar information. To effectively and securely detect similar documents with overlapping text fragments, this paper proposes a novel n-gram based SSDD protocol.
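To make the idea of overlapping text fragments concrete, the following is a minimal plaintext sketch of n-gram based similarity. It is not the paper's protocol: the abstract does not specify the n-gram type, the similarity measure, or the cryptographic machinery, so word-level trigrams, Jaccard similarity, and the function names `word_ngrams` and `jaccard` are all assumptions made for illustration. The privacy-preserving part of SSDD, in which neither party sees the other's n-grams, is deliberately left out.

```python
# Illustrative, non-secure sketch: both documents are visible to one party here,
# which is exactly what an SSDD protocol is designed to avoid.

def word_ngrams(text, n=3):
    """Return the set of word-level n-grams of a text (assumed tokenization: whitespace)."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; defined as 0.0 when both sets are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


# Alice's query document and one document from Bob's collection (made-up text).
query = "the quick brown fox jumps over the lazy dog"
candidate = "a quick brown fox jumps over a sleeping cat"

# Shared trigrams indicate an overlapping text fragment ("quick brown fox jumps over").
similarity = jaccard(word_ngrams(query), word_ngrams(candidate))
print(f"n-gram similarity: {similarity:.3f}")
```

Unlike a vector space comparison over whole-document term weights, a set of n-grams preserves local word order, which is why shared fragments raise the score even when the rest of the two documents differs.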

Original language: English
Title of host publication: Data and Applications Security and Privacy XXV - 25th Annual IFIP WG 11.3 Conference, DBSec 2011, Proceedings
Number of pages: 8
State: Published - 2011
Event: 25th Annual WG 11.3 Conference on Data and Applications Security and Privacy, DBSec 2011 - Richmond, VA, United States
Duration: 11 Jul 2011 to 13 Jul 2011

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 6818 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349


Other: 25th Annual WG 11.3 Conference on Data and Applications Security and Privacy, DBSec 2011
Country/Territory: United States
City: Richmond, VA


  • n-gram
  • privacy
  • security

