Parallel Input/Output with Heterogeneous Disks.

Szu-Wen Kuo, Marianne Winslett, Ying Chen, Yong Cho, Mahesh Subramaniam, Kent E. Seamons: Parallel Input/Output with Heterogeneous Disks. SSDBM 1997: 79-90
  author    = {Szu-Wen Kuo and
               Marianne Winslett and
               Ying Chen and
               Yong Cho and
               Mahesh Subramaniam and
               Kent E. Seamons},
  editor    = {Yannis E. Ioannidis and
               David M. Hansen},
  title     = {Parallel Input/Output with Heterogeneous Disks},
  booktitle = {Ninth International Conference on Scientific and Statistical
               Database Management, Proceedings, August 11-13, 1997, Olympia,
               Washington, USA},
  publisher = {IEEE Computer Society},
  year      = {1997},
  isbn      = {0-8186-7952-2},
  pages     = {79-90},
  ee        = {db/conf/ssdbm/KuoWCCSS97.html},
  crossref  = {DBLP:conf/ssdbm/97},
  bibsource = {DBLP,}


Panda is a high-performance library for accessing large multidimensional array data on secondary storage of parallel platforms and networks of workstations. When using Panda as the I/O component of a scientific application, H3expresso, on the IBM SP2 at Cornell Theory Center, we found that some nodes are more powerful with respect to I/O than others, requiring the introduction of load balancing techniques to maintain high performance. We expect that heterogeneity will also be a big issue for DBMSs or parallel I/O libraries designed for scientific applications running on networks of workstations, and the methods of allocating data to servers in these environments will need to be upgraded to take heterogeneity into account, while still allowing users to exert control over data layout. We propose such an approach to load balancing, under which we respect the user's choice of high-level disk layout, but introduce automatic subchunking. The use of subchunks allows us to divide the very large chunks typically specified by the user's disk layout into more manageable-size units that can be allocated to I/O nodes in a manner that fairly distributes the load. We also present two techniques for allocating subchunks to nodes, static and dynamic, and evaluate their performance on the SP2.

Copyright © 1997 by The Institute of Electrical and Electronic Engineers, Inc. (IEEE). Abstract used with permission.

ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 2 Issue 5, SSDBM, DBPL, KRDB, ADBIS, COOPIS, SIGBDP" and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ... BibTeX

Online Edition: IEEE Computer Society DL

Citation Page

Printed Edition

Yannis E. Ioannidis, David M. Hansen (Eds.): Ninth International Conference on Scientific and Statistical Database Management, Proceedings, August 11-13, 1997, Olympia, Washington, USA. IEEE Computer Society 1997, ISBN 0-8186-7952-2
Contents BibTeX


Jean L. Bell: A Specialized Data Management System For Parallel Execution of Particle Physics Codes. SIGMOD Conference 1988: 277-285 BibTeX
Rajesh Bordawekar, Alok N. Choudhary, Ken Kennedy, Charles Koelbel, Michael H. Paleczny: A Model and Compilation Strategy for Out-of-Core Data Parallel Programs. PPOPP 1995: 1-10 BibTeX
Clemens H. Cap, Volker Strumpen: Efficient Parallel Computing in Distributed Workstation Environments. Parallel Computing 19(11): 1221-1234(1993) BibTeX
Ying Chen, Marianne Winslett, Kent E. Seamons, Szu-Wen Kuo, Yong Cho, Mahesh Subramaniam: Scalable Message Passing in Panda. IOPADS 1996: 109-121 BibTeX
Ying Chen, Jarek Nieplocha, Ian T. Foster, Marianne Winslett: Optimizing Collective I/O Performance on Parallel Computers: A Multisystem Study. International Conference on Supercomputing 1997: 28-35 BibTeX
David J. DeWitt, Navin Kabra, Jun Luo, Jignesh M. Patel, Jie-Bing Yu: Client-Server Paradise. VLDB 1994: 558-569 BibTeX
Derek L. Eager, Edward D. Lazowska, John Zahorjan: Adaptive Load Sharing in Homogeneous Distributed Systems. IEEE Trans. Software Eng. 12(5): 662-675(1986) BibTeX
John F. Karpovich, James C. French, Andrew S. Grimshaw: High Performance Access to Radio Astronomy Data: A Case Study. SSDBM 1994: 240-249 BibTeX
Leonid Libkin, Rona Machlin, Limsoon Wong: A Query Language for Multidimensional Arrays: Design, Implementation, and Optimization Techniques. SIGMOD Conference 1996: 228-239 BibTeX
Sunita Sarawagi, Michael Stonebraker: Efficient Organization of Large Multidimensional Arrays. ICDE 1994: 328-336 BibTeX
Kent E. Seamons, Marianne Winslett: Physical Schemas for Large Multidimensional Arrays in Scientific Computing Applications. SSDBM 1994: 218-227 BibTeX
Kent E. Seamons, Ying Chen, Marianne Winslett, Yong Cho, Szu-Wen Kuo, Mahesh Subramaniam: Persistent Array Access Using Server-Directed I/O. SSDBM 1996: 98-107 BibTeX
Michael Stonebraker, James Frew, Kenn Gardels, Jeff Meredith: The Sequoia 2000 Benchmark. SIGMOD Conference 1993: 2-11 BibTeX

Referenced by

  1. Szu-Wen Kuo, Marianne Winslett, Ying Chen, Yong Cho: Efficient I/O of Grid Hierarchies for AMR Computations on Parallel Disks. SSDBM 1998: 12-21
ACM SIGMOD Anthology - DBLP: [Home | Search: Author, Title | Conferences | Journals]
SSDBM 1997: Copyright © by IEEE,
ACM SIGMOD Anthology: Copyright © by ACM (, Corrections:
DBLP: Copyright © by Michael Ley (, last change: Sat May 16 23:42:53 2009