Seeking the Truth About ad hoc Join Costs.
Laura M. Haas, Michael J. Carey, Miron Livny, Amit Shukla:
Seeking the Truth About ad hoc Join Costs.
VLDB J. 6(3): 241-256(1997)@article{DBLP:journals/vldb/HaasCLS97,
author = {Laura M. Haas and
Michael J. Carey and
Miron Livny and
Amit Shukla},
title = {Seeking the Truth About ad hoc Join Costs},
journal = {VLDB J.},
volume = {6},
number = {3},
year = {1997},
pages = {241-256},
ee = {db/journals/vldb/HaasCLS97.html},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX
Abstract
In this paper, we re-examine the results of prior work on methods for computing ad hoc joins.
We develop a detailed cost model for predicting join algorithm performance, and we use the model to develop cost formulas for the major ad hoc join methods found in the relational database literature.
We show that various pieces of "common wisdom" about join algorithm performance fail to hold up when analyzed carefully, and we use our detailed cost model to derive optimal buffer allocation schemes for each of the join methods examined here.
We show that optimizing their buffer allocations can lead to large performance improvements, e.g., as much as a 400% improvement in some cases.
We also validate our cost model's predictions by measuring an actual implementation of each join algorithm considered.
The results of this work should be directly useful to implementors of relational query optimizers and query processing systems.
Key Words
Optimization, Cost models, Join methods, Buffer allocation, Performance
Copyright © 1997 by Springer, Berlin, Heidelberg.
Permission to make digital or hard copies of the abstract is granted provided that copies are not made or distributed for profit or
direct commercial advantage, and that copies show this notice along with the full citation.
Citation Page
CDROM Version: Load the CDROM "Volume 4 Issue 1, Books, VLDB-j, TODS, ..." and ...
DVD Version: Load ACM SIGMOD Anthology DVD 2" and ...
BibTeX
References
- [1]
- Mike W. Blasgen, Kapali P. Eswaran:
Storage and Access in Relational Data Bases.
IBM Systems Journal 16(4): 362-377(1977) BibTeX
- [2]
- Kjell Bratbergsengen:
Hashing Methods and Relational Algebra Operations.
VLDB 1984: 323-333 BibTeX
- [3]
- ...
- [4]
- Michael J. Carey, Laura M. Haas, Miron Livny:
Tapes Hold Data, Too: Challenges of Tuples on Tertiary Store.
SIGMOD Conference 1993: 413-417 BibTeX
- [5]
- Diane L. Davison, Goetz Graefe:
Dynamic Resource Brokering for Multi-User Query Execution.
SIGMOD Conference 1995: 281-292 BibTeX
- [6]
- David J. DeWitt, Randy H. Katz, Frank Olken, Leonard D. Shapiro, Michael Stonebraker, David A. Wood:
Implementation Techniques for Main Memory Database Systems.
SIGMOD Conference 1984: 1-8 BibTeX
- [7]
- ...
- [8]
- Goetz Graefe:
Query Evaluation Techniques for Large Databases.
ACM Comput. Surv. 25(2): 73-170(1993) BibTeX
- [9]
- Goetz Graefe, Ann Linville, Leonard D. Shapiro:
Sort versus Hash Revisited.
IEEE Trans. Knowl. Data Eng. 6(6): 934-944(1994) BibTeX
- [10]
- Robert B. Hagmann:
An Observation on Database Buffering Performance Metrics.
VLDB 1986: 289-293 BibTeX
- [11]
- Yannis E. Ioannidis, Stavros Christodoulakis:
On the Propagation of Errors in the Size of Join Results.
SIGMOD Conference 1991: 268-277 BibTeX
- [12]
- Won Kim:
A New Way to Compute the Product and Join of Relations.
SIGMOD Conference 1980: 179-187 BibTeX
- [13]
- Donald E. Knuth:
The Art of Computer Programming, Volume III: Sorting and Searching.
Addison-Wesley 1973, ISBN 0-201-03803-X
BibTeX
- [14]
- ...
- [15]
- Lothar F. Mackert, Guy M. Lohman:
R* Optimizer Validation and Performance Evaluation for Local Queries.
SIGMOD Conference 1986: 84-95 BibTeX
- [16]
- Michael V. Mannino, Paicheng Chu, Thomas Sager:
Statistical Profile Estimation in Database Systems.
ACM Comput. Surv. 20(3): 191-221(1988) BibTeX
- [17]
- Priti Mishra, Margaret H. Eich:
Join Processing in Relational Databases.
ACM Comput. Surv. 24(1): 63-113(1992) BibTeX
- [18]
- HweeHwa Pang, Michael J. Carey, Miron Livny:
Partially Preemptive Hash Joins.
SIGMOD Conference 1993: 59-68 BibTeX
- [19]
- Jignesh M. Patel, Michael J. Carey, Mary K. Vernon:
Accurate Modeling of the Hybrid Hash Join Algorithm.
SIGMETRICS 1994: 56-66 BibTeX
- [20]
- Betty Salzberg:
Merging Sorted Runs Using Large Main Memory.
Acta Inf. 27(3): 195-215(1989) BibTeX
- [21]
- Betty Salzberg, Alex Tsukerman, Jim Gray, Michael Stewart, Susan Uren, Bonnie Vaughan:
FastSort: A Distributed Single-Input Single-Output External Sort.
SIGMOD Conference 1990: 94-101 BibTeX
- [22]
- Patricia G. Selinger, Morton M. Astrahan, Donald D. Chamberlin, Raymond A. Lorie, Thomas G. Price:
Access Path Selection in a Relational Database Management System.
SIGMOD Conference 1979: 23-34 BibTeX
- [23]
- Eugene J. Shekita, Michael J. Carey:
A Performance Evaluation of Pointer-Based Joins.
SIGMOD Conference 1990: 300-311 BibTeX
- [24]
- Leonard D. Shapiro:
Join Processing in Database Systems with Large Main Memories.
ACM Trans. Database Syst. 11(3): 239-264(1986) BibTeX
- [25]
- Patrick Valduriez:
Join Indices.
ACM Trans. Database Syst. 12(2): 218-246(1987) BibTeX
- [26]
- ...
Referenced by
- Reinhard Braumandl, Jens Claußen, Alfons Kemper, Donald Kossmann:
Functional-Join Processing.
VLDB J. 8(3-4): 156-177(2000)
- Till Westmann, Donald Kossmann, Sven Helmer, Guido Moerkotte:
The Implementation and Performance of Compressed Databases.
SIGMOD Record 29(3): 55-67(2000)
- Zhe Li, Kenneth A. Ross:
Fast Joins Using Join Indices.
VLDB J. 8(1): 1-24(1999)
- Alfons Kemper, Donald Kossmann, Christian Wiesner:
Generalised Hash Teams for Join and Group-by.
VLDB 1999: 30-41
- Achim Kraiss, Peter Muth, Michael Gillmann:
Tape-Disk Join Strategies under Disk Contention.
ICDE 1999: 552-559
- Sven Helmer, Till Westmann, Guido Moerkotte:
Diag-Join: An Opportunistic Join Algorithm for 1:N Relationships.
VLDB 1998: 98-109
- Reinhard Braumandl, Jens Claußen, Alfons Kemper:
Evaluating Functional Joins Along Nested Reference Sets in Object-Relational and Object-Oriented Databases.
VLDB 1998: 110-122
- Jussi Myllymaki, Miron Livny:
Relational Joins for Data on Tertiary Storage.
ICDE 1997: 159-168
- Wilburt Labio, Hector Garcia-Molina:
Efficient Snapshot Differential Algorithms for Data Warehousing.
VLDB 1996: 63-74
BibTeX
ACM SIGMOD Anthology - DBLP:
[Home | Search: Author, Title | Conferences | Journals]
VLDB Journal: 1992-1995 Copyright © by VLDB Endowment / 1996-... Copyright © by Springer Verlag,
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sun May 17 00:31:30 2009