Abstract
We provide novel coded computation strategies for distributed matrix-matrix products that outperform the recent 'Polynomial code' constructions in recovery threshold, i.e., the required number of successful workers. When a fixed 1/m fraction of each matrix can be stored at each worker node, Polynomial codes require m2 successful workers, while our MatDot codes only require 2m-1 successful workers. However, MatDot codes have higher computation cost per worker and higher communication cost from each worker to the fusion node. We also provide a systematic construction of MatDot codes. Furthermore, we propose 'PolyDot' coding that interpolates between Polynomial codes and MatDot codes to trade off computation/communication costs and recovery thresholds. Finally, we demonstrate a novel coding technique for multiplying n matrices (n ≥ 3) using ideas from MatDot and PolyDot codes.
Original language | English (US) |
---|---|
Article number | 8765375 |
Pages (from-to) | 278-301 |
Number of pages | 24 |
Journal | IEEE Transactions on Information Theory |
Volume | 66 |
Issue number | 1 |
DOIs | |
State | Published - Jan 2020 |
All Science Journal Classification (ASJC) codes
- Information Systems
- Computer Science Applications
- Library and Information Sciences