Synthesizing representative I/O workloads for TPC-H

Jianyong Zhang, Anand Sivasubramaniam, Hubertus Franke, Natarajan Gautam, Yanyong Zhang, Shailabh Nagar

Research output: Contribution to journalConference article

39 Citations (Scopus)

Abstract

Synthesizing I/O requests that can accurately capture workload behavior is extremely valuable for the design, implementation and optimization of disk subsystems. This paper presents a synthetic workload generator for TPC-H, an important decision-support commercial workload, by completely characterizing the arrival and access patterns of its queries. We present a novel approach for parameterizing the behavior of inter-mingling streams of sequential requests, and exploit correlations between multiple attributes of these requests, to generate disk block-level traces that are shown to accurately mimic the behavior of a real trace in terms of response time characteristics for each TPC-H query.

Original languageEnglish (US)
Pages (from-to)142-151
Number of pages10
JournalIEEE High-Performance Computer Architecture Symposium Proceedings
Volume10
StatePublished - May 24 2004
EventProceedings - 10th International Symposium on High Performance Computer Architecture - Madrid, Spain
Duration: Feb 14 2004Feb 18 2004

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture

Cite this

Zhang, Jianyong ; Sivasubramaniam, Anand ; Franke, Hubertus ; Gautam, Natarajan ; Zhang, Yanyong ; Nagar, Shailabh. / Synthesizing representative I/O workloads for TPC-H. In: IEEE High-Performance Computer Architecture Symposium Proceedings. 2004 ; Vol. 10. pp. 142-151.
@article{457c236ce77d42fcaa469dac36f1f176,
title = "Synthesizing representative I/O workloads for TPC-H",
abstract = "Synthesizing I/O requests that can accurately capture workload behavior is extremely valuable for the design, implementation and optimization of disk subsystems. This paper presents a synthetic workload generator for TPC-H, an important decision-support commercial workload, by completely characterizing the arrival and access patterns of its queries. We present a novel approach for parameterizing the behavior of inter-mingling streams of sequential requests, and exploit correlations between multiple attributes of these requests, to generate disk block-level traces that are shown to accurately mimic the behavior of a real trace in terms of response time characteristics for each TPC-H query.",
author = "Jianyong Zhang and Anand Sivasubramaniam and Hubertus Franke and Natarajan Gautam and Yanyong Zhang and Shailabh Nagar",
year = "2004",
month = "5",
day = "24",
language = "English (US)",
volume = "10",
pages = "142--151",
journal = "Proceedings - International Symposium on High-Performance Computer Architecture",
issn = "1530-0897",

}

Synthesizing representative I/O workloads for TPC-H. / Zhang, Jianyong; Sivasubramaniam, Anand; Franke, Hubertus; Gautam, Natarajan; Zhang, Yanyong; Nagar, Shailabh.

In: IEEE High-Performance Computer Architecture Symposium Proceedings, Vol. 10, 24.05.2004, p. 142-151.

Research output: Contribution to journalConference article

TY - JOUR

T1 - Synthesizing representative I/O workloads for TPC-H

AU - Zhang, Jianyong

AU - Sivasubramaniam, Anand

AU - Franke, Hubertus

AU - Gautam, Natarajan

AU - Zhang, Yanyong

AU - Nagar, Shailabh

PY - 2004/5/24

Y1 - 2004/5/24

N2 - Synthesizing I/O requests that can accurately capture workload behavior is extremely valuable for the design, implementation and optimization of disk subsystems. This paper presents a synthetic workload generator for TPC-H, an important decision-support commercial workload, by completely characterizing the arrival and access patterns of its queries. We present a novel approach for parameterizing the behavior of inter-mingling streams of sequential requests, and exploit correlations between multiple attributes of these requests, to generate disk block-level traces that are shown to accurately mimic the behavior of a real trace in terms of response time characteristics for each TPC-H query.

AB - Synthesizing I/O requests that can accurately capture workload behavior is extremely valuable for the design, implementation and optimization of disk subsystems. This paper presents a synthetic workload generator for TPC-H, an important decision-support commercial workload, by completely characterizing the arrival and access patterns of its queries. We present a novel approach for parameterizing the behavior of inter-mingling streams of sequential requests, and exploit correlations between multiple attributes of these requests, to generate disk block-level traces that are shown to accurately mimic the behavior of a real trace in terms of response time characteristics for each TPC-H query.

UR - http://www.scopus.com/inward/record.url?scp=2342525809&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=2342525809&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:2342525809

VL - 10

SP - 142

EP - 151

JO - Proceedings - International Symposium on High-Performance Computer Architecture

JF - Proceedings - International Symposium on High-Performance Computer Architecture

SN - 1530-0897

ER -