Distributed Exact Deduplication for Primary Storage Infrastructures - LNCS 8460: Distributed Applications and Interoperable Systems
Conference Papers, Year: 2014

Distributed Exact Deduplication for Primary Storage Infrastructures

Abstract

Deduplication of primary storage volumes in a cloud computing environment is increasingly desirable, as the resulting space savings contribute to the cost effectiveness of a large-scale multi-tenant infrastructure. However, traditional archival and backup deduplication systems impose prohibitive overhead for latency-sensitive applications deployed on these infrastructures, while current primary deduplication systems rely on special cluster filesystems, centralized components, or restrictive workload assumptions. We present DEDIS, a fully distributed and dependable system that performs exact and cluster-wide background deduplication of primary storage. DEDIS does not depend on data locality and works on top of any unsophisticated storage backend, centralized or distributed, that exports a basic shared block device interface. The evaluation of an open-source prototype shows that DEDIS scales out and adds negligible overhead even when deduplication and intensive storage I/O run simultaneously.
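To make the terms "exact" and "background" in the abstract concrete, the following is a minimal single-process sketch, not the DEDIS implementation. It assumes an in-memory stand-in for a shared block device and SHA-256 content digests; the names `BlockStore` and `deduplicate` are hypothetical and do not come from the paper. Writes bypass hashing entirely, and a separate pass later merges identical blocks using a full digest index, so every duplicate present at scan time is found.

```python
import hashlib


class BlockStore:
    """Toy in-memory block device (hypothetical, for illustration only)."""

    def __init__(self):
        self.physical = {}   # physical address -> block contents
        self.refcount = {}   # physical address -> number of logical references
        self.logical = {}    # (volume, logical block number) -> physical address
        self.next_addr = 0

    def write(self, volume, lbn, data):
        # Writes always go to a fresh physical block, so the I/O path
        # does no hashing and never modifies a block shared by others.
        old = self.logical.get((volume, lbn))
        addr = self.next_addr
        self.next_addr += 1
        self.physical[addr] = data
        self.refcount[addr] = 1
        self.logical[(volume, lbn)] = addr
        if old is not None:
            self._release(old)

    def read(self, volume, lbn):
        return self.physical[self.logical[(volume, lbn)]]

    def _release(self, addr):
        # Drop one reference and free the block when nobody uses it.
        self.refcount[addr] -= 1
        if self.refcount[addr] == 0:
            del self.physical[addr]
            del self.refcount[addr]


def deduplicate(store):
    """Background pass: merge blocks with identical content.

    The index covers every block scanned, which is what makes the
    pass exact rather than approximate. Returns the number of
    physical blocks left after the pass.
    """
    index = {}  # content digest -> canonical physical address
    for key in sorted(store.logical):
        addr = store.logical[key]
        digest = hashlib.sha256(store.physical[addr]).hexdigest()
        canonical = index.setdefault(digest, addr)
        if canonical != addr:
            # Point the logical block at the canonical copy and
            # release the duplicate.
            store.logical[key] = canonical
            store.refcount[canonical] += 1
            store._release(addr)
    return len(store.physical)
```

In this sketch, two volumes that each write the same 4 KiB block occupy two physical blocks until `deduplicate` runs, and one afterwards; a later write to either volume lands in a fresh block, so the other volume's data is unaffected. A distributed system such as the one the abstract describes must additionally coordinate the index and reference counts across nodes, which this sketch does not attempt.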
Main file: 326177_1_En_5_Chapter.pdf (289.34 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01287732, version 1 (14-03-2016)

Cite

João Paulo, José Pereira. Distributed Exact Deduplication for Primary Storage Infrastructures. 4th International Conference on Distributed Applications and Interoperable Systems (DAIS), Jun 2014, Berlin, Germany. pp.52-66, ⟨10.1007/978-3-662-43352-2_5⟩. ⟨hal-01287732⟩