Multi-version coding in distributed storage

Zhiying Wang, Viveck Cadambe

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    10 Scopus citations

    Abstract

    We investigate an information theoretic problem motivated by storing multiple versions of a data object in distributed storage systems. Specifically, in a storage system with n server nodes, where there are ? independent message versions, each server receives message values corresponding to some arbitrary subset of the versions. The versions are assumed to be totally ordered. Each server is unaware of the set of versions at the other servers, and aims to encode the values corresponding to the versions it has. We investigate codes where, from any set of c nodes (c < n), the value corresponding to the highest common version, as per the version ordering, available at this set of c nodes is decodable. We aim to design codes that minimize the storage cost. We present two main results in this paper. First, we show that the storage cost is lower bounded by 1 - (1 - 1 over c)υ, measured in terms of the bits of the values. Second, for the cases of υ = 2 and υ = 3, we provide new code constructions that respectively achieve storage costs of 2c-1 over c2 and 3c-2 over c2, measured in terms of the bits of the values. Our code constructions are simple in that we do not code across versions. We argue that when the number of versions υ is much larger than c, then replication is close to optimal.

    Original languageEnglish (US)
    Title of host publication2014 IEEE International Symposium on Information Theory, ISIT 2014
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages871-875
    Number of pages5
    ISBN (Print)9781479951864
    DOIs
    StatePublished - 2014
    Event2014 IEEE International Symposium on Information Theory, ISIT 2014 - Honolulu, HI, United States
    Duration: Jun 29 2014Jul 4 2014

    Publication series

    NameIEEE International Symposium on Information Theory - Proceedings
    ISSN (Print)2157-8095

    Other

    Other2014 IEEE International Symposium on Information Theory, ISIT 2014
    CountryUnited States
    CityHonolulu, HI
    Period6/29/147/4/14

    All Science Journal Classification (ASJC) codes

    • Theoretical Computer Science
    • Information Systems
    • Modeling and Simulation
    • Applied Mathematics

    Fingerprint Dive into the research topics of 'Multi-version coding in distributed storage'. Together they form a unique fingerprint.

    Cite this