VeriDedup: A Verifiable Cloud Data Deduplication Scheme with Integrity and Duplication Proof

Xixun Yu, Hui Bai, Zheng Yan, Rui Zhang

Research output: Contribution to journalArticleScientificpeer-review

7 Citations (Scopus)
57 Downloads (Pure)


Data deduplication is a technique to eliminate duplicate data in order to save storage space and enlarge upload bandwidth, which has been applied by cloud storage systems. However, a cloud storage provider (CSP) may tamper user data or cheat users to pay unused storage for duplicate data that are only stored once. Although previous solutions adopt message-locked encryption along with Proof of Retrievability (PoR) to check the integrity of deduplicated encrypted data, they ignore proving the correctness of duplication check during data upload and require the same file to be derived into same verification tags, which suffers from brute-force attacks and restricts users from flexibly creating their own individual verification tags. In this paper, we propose a verifiable deduplication scheme called VeriDedup to address the above problems. It can guarantee the correctness of duplication check and support flexible tag generation for integrity check over encrypted data deduplication in an integrative way. Concretely, we propose a novel Tag-flexible Deduplication-supported Integrity Check Protocol (TDICP) based on Private Information Retrieval (PIR) by introducing a novel verification tag called noteset, which allows multiple users holding the same file to generate their individual verification tags and still supports tag deduplication at the CSP. Furthermore, we make the first attempt to guarantee the correctness of data duplication check by introducing a novel User Determined Duplication Check Protocol (UDDCP) based on Private Set Intersection (PSI), which can resist a CSP from providing a fake duplication check result to users. Security analysis shows the correctness and soundness of our scheme. Simulation studies based on real data show the efficacy and efficiency of our proposed scheme and its significant advantages over prior arts.

Original languageEnglish
Pages (from-to)680-694
Number of pages15
JournalIEEE Transactions on Dependable and Secure Computing
Issue number1
Early online dateJan 2022
Publication statusPublished - 1 Jan 2023
MoE publication typeA1 Journal article-refereed


  • Cloud computing
  • Cloud Computing
  • Data Deduplication
  • Data integrity
  • Duplication Check
  • Electronic mail
  • Information retrieval
  • Integrity Check
  • Licenses
  • Memory
  • Private Information Retrieval
  • Protocols
  • Verifiable Computation


Dive into the research topics of 'VeriDedup: A Verifiable Cloud Data Deduplication Scheme with Integrity and Duplication Proof'. Together they form a unique fingerprint.

Cite this