Tornado Codes for MAID Archival Storage

Matthew Woitaszek, Henry M. Tufo. Tornado Codes for MAID Archival Storage. In MSST 2007: Proceedings of the IEEE / NASA Goddard Conference on Mass Storage Systems and Technologies, San Diego, California, USA, September 2007.

This paper examines the application of Tornado Codes, a class of low-density parity-check (LDPC) erasure codes, to archival storage systems based on massive arrays of idle disks (MAID). We present a log-structured, extent-based archival file system based on Tornado Coded stripe storage. The file system is combined with a MAID simulator to emulate the behavior of a large-scale storage system with the goal of employing Tornado Codes to provide fault tolerance and performance in a power-constrained environment. The effect of power conservation constraints on system throughput is examined, and a policy of placing multiple data nodes on a single device is shown to increase read throughput at the cost of a measurable, but negligible, decrease in fault tolerance. Finally, a system prototype is implemented on a 100 TB Lustre storage cluster, providing GridFTP-accessible storage with higher reliability and availability than the underlying storage architecture.

@inproceedings{200709-msst2007-tcms,
      Address = {San Diego, California, USA},
      Author = {Woitaszek, Matthew and Tufo, Henry M.},
      Booktitle = {MSST 2007: Proceedings of the IEEE / NASA Goddard Conference on Mass Storage Systems and Technologies},
      Doi = {10.1109/MSST.2007.33},
      Isbn = {0-7695-3025-7},
      Month = {September},
      Note = {Short paper and poster.},
      Pages = {221--226},
      Title = {Tornado Codes for {MAID} Archival Storage},
      Year = {2007}
}