TY - GEN
T1 - Automatic identification of application I/O signatures from noisy server-side traces
AU - Liu, Yang
AU - Gunasekaran, Raghul
AU - Ma, Xiaosong
AU - Vazhkudai, Sudharshan S.
PY - 2014
Y1 - 2014
N2 - Competing workloads on a shared storage system cause I/O resource contention and application performance vagaries. This problem is already evident in today’s HPC storage systems and is likely to become acute at ex-ascale. We need more interaction between application I/O requirements and system software tools to help alleviate the I/O bottleneck, moving towards I/O-aware job scheduling. However, this requires rich techniques to capture application I/O characteristics, which remain evasive in production systems. Traditionally, I/O characteristics have been obtained using client-side tracing tools, with drawbacks such as non-trivial instrumentation/development costs, large trace traffic, and inconsistent adoption. We present a novel approach, I/O Signature Identifier (IOSI), to characterize the I/O behavior of data-intensive applications. IOSI extracts signatures from noisy, zero-overhead server-side I/O throughput logs that are already collected on today’s supercomputers, without interfering with the compiling/execution of applications. We evaluated IOSI using the Spider storage system at Oak Ridge National Laboratory, the S3D turbulence application (running on 18,000 Titan nodes), and benchmark-based pseudo-applications. Through our experiments we confirmed that IOSI effectively extracts an application’s I/O signature despite significant server-side noise. Compared to client-side tracing tools, IOSI is transparent, interface-agnostic, and incurs no overhead. Compared to alternative data alignment techniques (e.g., dynamic time warping), it offers higher signature accuracy and shorter processing time.
AB - Competing workloads on a shared storage system cause I/O resource contention and application performance vagaries. This problem is already evident in today’s HPC storage systems and is likely to become acute at ex-ascale. We need more interaction between application I/O requirements and system software tools to help alleviate the I/O bottleneck, moving towards I/O-aware job scheduling. However, this requires rich techniques to capture application I/O characteristics, which remain evasive in production systems. Traditionally, I/O characteristics have been obtained using client-side tracing tools, with drawbacks such as non-trivial instrumentation/development costs, large trace traffic, and inconsistent adoption. We present a novel approach, I/O Signature Identifier (IOSI), to characterize the I/O behavior of data-intensive applications. IOSI extracts signatures from noisy, zero-overhead server-side I/O throughput logs that are already collected on today’s supercomputers, without interfering with the compiling/execution of applications. We evaluated IOSI using the Spider storage system at Oak Ridge National Laboratory, the S3D turbulence application (running on 18,000 Titan nodes), and benchmark-based pseudo-applications. Through our experiments we confirmed that IOSI effectively extracts an application’s I/O signature despite significant server-side noise. Compared to client-side tracing tools, IOSI is transparent, interface-agnostic, and incurs no overhead. Compared to alternative data alignment techniques (e.g., dynamic time warping), it offers higher signature accuracy and shorter processing time.
UR - https://www.scopus.com/pages/publications/85073543975
M3 - Conference contribution
AN - SCOPUS:85073543975
T3 - Proceedings of the 12th USENIX Conference on File and Storage Technologies, FAST 2014
SP - 213
EP - 228
BT - Proceedings of the 12th USENIX Conference on File and Storage Technologies, FAST 2014
PB - USENIX Association
T2 - 12th USENIX Conference on File and Storage Technologies, FAST 2014
Y2 - 17 February 2014 through 20 February 2014
ER -