TY - GEN
T1 - Positioning dynamic storage caches for transient data
AU - Vazhkudai, Sudharshan S.
AU - Thain, Douglas
AU - Xiaosong, Ma
AU - Freeh, Vincent W.
PY - 2006
Y1 - 2006
N2 - Simulations, experiments and observatories are generating a deluge of scientific data. Even more staggering is the ever growing application demand to process and assimilate these datasets. Application users perform a range of data operations, collaborate and share data in many novel ways. The current storage landscape is struggling to keep up with these trends in scientific data processing. Application users pay the price due to over-crowded sharedfilesystems, or expensive storage area networks, or not enough local storage, or high-latency archival or wide-area transfers. In order to sustain and maximize I/O bandwidth relative to increasing CPU speeds, applications must take advantage of large amounts of intermediate commodity storage, However, intermediate storage presents new challenges above and beyond the traditional distributed filesystem paradigm: persistent scheduling, storage/CPU coallocation, namespace management, lifetime management, and novel application interfaces. In this paper, we describe applications that require intermediate storage management, suggest several open research problems, and illustrate two systems - Freeloader and Tactical Storage - that attack different aspects of these problems.
AB - Simulations, experiments and observatories are generating a deluge of scientific data. Even more staggering is the ever growing application demand to process and assimilate these datasets. Application users perform a range of data operations, collaborate and share data in many novel ways. The current storage landscape is struggling to keep up with these trends in scientific data processing. Application users pay the price due to over-crowded sharedfilesystems, or expensive storage area networks, or not enough local storage, or high-latency archival or wide-area transfers. In order to sustain and maximize I/O bandwidth relative to increasing CPU speeds, applications must take advantage of large amounts of intermediate commodity storage, However, intermediate storage presents new challenges above and beyond the traditional distributed filesystem paradigm: persistent scheduling, storage/CPU coallocation, namespace management, lifetime management, and novel application interfaces. In this paper, we describe applications that require intermediate storage management, suggest several open research problems, and illustrate two systems - Freeloader and Tactical Storage - that attack different aspects of these problems.
UR - https://www.scopus.com/pages/publications/46049116478
U2 - 10.1109/CLUSTR.2006.311900
DO - 10.1109/CLUSTR.2006.311900
M3 - Conference contribution
AN - SCOPUS:46049116478
SN - 1424403286
SN - 9781424403288
T3 - Proceedings - IEEE International Conference on Cluster Computing, ICCC
BT - 2006 IEEE International Conference on Cluster Computing, Cluster 2006
T2 - 2006 IEEE International Conference on Cluster Computing, Cluster 2006
Y2 - 25 September 2006 through 28 September 2006
ER -