Index Scans Using a Finite LRU Buffer: A Validated I/O Model.
Lothar F. Mackert, Guy M. Lohman:
Index Scans Using a Finite LRU Buffer: A Validated I/O Model.
ACM Trans. Database Syst. 14(3): 401-424(1989)@article{DBLP:journals/tods/MackertL89,
author = {Lothar F. Mackert and
Guy M. Lohman},
title = {Index Scans Using a Finite LRU Buffer: A Validated I/O Model},
journal = {ACM Trans. Database Syst.},
volume = {14},
number = {3},
year = {1989},
pages = {401-424},
ee = {http://doi.acm.org/10.1145/68012.68016, db/journals/tods/MackertL89.html},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX
Abstract
Indexes are commonly employed to retrieve a portion of a file
or to retrieve its records in a particular order. An accurate
performance model of indexes is essential to the design,
analysis, and tuning of file management and database systems,
and particularly to database query optimization. Many previous
studies have addressed the problem of estimating the number of
disk page fetches when randomly accessing k records out
of N given records stored on T disk pages. This
paper generalizes these results, relaxing two assumptions that
usually do not hold in practice: unlimited buffer and unique
records for each key value. Experiments show that the
performance of an index scan is very sensitive to buffer size
limitations and multiple records per key value. A model for
these more practical situations is presented and a formula
derived for estimating the performance of an index scan.
We also give a closed-form approximation that is easy to
compute. The theoretical results are validated using the R*
distributed relational database system. Although we use
database terminology throughout the paper, the model is more
generally applicable whenever random accesses are made
using keys.
Copyright © 1989 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
CDROM Version: Load the CDROM "Volume 3 Issue 1, TODS 1976-1990" and ...
DVD Version: Load ACM SIGMOD Anthology DVD 2" and ...
BibTeX
References
- [1]
- Morton M. Astrahan, Mario Schkolnick, Won Kim:
Performance of the System R Access Path Selection Mechanism.
IFIP Congress 1980: 487-491 BibTeX
- [2]
- Dina Bitton, David J. DeWitt:
Duplicate Record Elimination in Large Data Files.
ACM Trans. Database Syst. 8(2): 255-265(1983) BibTeX
- [3]
- Alfonso F. Cardenas:
Analysis and Performance of Inverted Data Base Structures.
Commun. ACM 18(5): 253-263(1975) BibTeX
- [4]
- To-Yat Cheung:
Estimating Block Accesses and Number of Recorde in File Management.
Commun. ACM 25(7): 484-487(1982) BibTeX
- [5]
- Hong-Tai Chou, David J. DeWitt:
An Evaluation of Buffer Management Strategies for Relational Database Systems.
VLDB 1985: 127-141 BibTeX
- [6]
- Stavros Christodoulakis:
Estimating Block Transfers and Join Sizes.
SIGMOD Conference 1983: 40-54 BibTeX
- [7]
- Stavros Christodoulakis:
Estimating Block Selectivities.
Inf. Syst. 9(1): 69-79,(1984) BibTeX
- [8]
- Stavros Christodoulakis:
Implications of Certain Assumptions in Database Performance Evaluation.
ACM Trans. Database Syst. 9(2): 163-186(1984) BibTeX
- [9]
- Douglas Comer:
The Ubiquitous B-Tree.
ACM Comput. Surv. 11(2): 121-137(1979) BibTeX
- [10]
- Wolfgang Effelsberg, Theo Härder:
Principles of Database Buffer Management.
ACM Trans. Database Syst. 9(4): 560-595(1984) BibTeX
- [11]
- Alle IJbema, Henk M. Blanken:
Estimating Bucket Accesses: A Practical Approach.
ICDE 1986: 30-37 BibTeX
- [12]
- ...
- [13]
- W. S. Luk:
On Estimating Block Accesses in Database Organizations.
Commun. ACM 26(11): 945-947(1983) BibTeX
- [14]
- ...
- [15]
- Lothar F. Mackert, Guy M. Lohman:
R* Optimizer Validation and Performance Evaluation for Local Queries.
SIGMOD Conference 1986: 84-95 BibTeX
- [16]
- Lothar F. Mackert, Guy M. Lohman:
R* Optimizer Validation and Performance Evaluation for Distributed Queries.
VLDB 1986: 149-159 BibTeX
- [17]
- Giovanni Maria Sacco, Mario Schkolnick:
A Mechanism for Managing the Buffer Pool in a Relational Database System Using the Hot Set Model.
VLDB 1982: 257-262 BibTeX
- [18]
- Patricia G. Selinger, Morton M. Astrahan, Donald D. Chamberlin, Raymond A. Lorie, Thomas G. Price:
Access Path Selection in a Relational Database Management System.
SIGMOD Conference 1979: 23-34 BibTeX
- [19]
- Michael Stonebraker:
Operating System Support for Database Management.
Commun. ACM 24(7): 412-418(1981) BibTeX
- [20]
- Brad T. Vander Zanden, Howard M. Taylor, Dina Bitton:
Estimating Block Accessses when Attributes are Correlated.
VLDB 1986: 119-127 BibTeX
- [21]
- S. J. Waters:
Hit Ratios.
Comput. J. 19(1): 21-24(1976) BibTeX
- [22]
- Kyu-Young Whang, Gio Wiederhold, Daniel Sagalowicz:
Estimating Block Accesses in Database Organizations: A Closed Noniterative Formula.
Commun. ACM 26(11): 940-944(1983) BibTeX
- [23]
- S. Bing Yao:
Approximating the Number of Accesses in Database Organizations.
Commun. ACM 20(4): 260-261(1977) BibTeX
Referenced by
- Michael Gesmann:
A Cost Model for Parallel Navigational Access in Complex-Object DBMSs.
DASFAA 1997: 1-10
- Silvio Salza, Massimiliano Renzetti:
A Modeling Tool for Workload Analysis and Performance Tuning of Parallel Database Applications.
ADBIS 1997: 152-161
- Peter A. Buhr, Anil K. Goel, Naomi Nishimura, Prabhakar Ragde:
Parallel Pointer-Based Join Algorithms in Memory-mapped Environments.
ICDE 1996: 266-275
- Arun N. Swami, K. Bernhard Schiefer:
Estimating Page Fetches for Index Scans with Finite LRU Buffers.
VLDB J. 4(4): 675-701(1995)
- Paolo Ciaccia, Dario Maio:
Domains and Active Domains: What This Distinction Implies for the Estimation of Projection Sizes in Relational Databases.
IEEE Trans. Knowl. Data Eng. 7(4): 641-655(1995)
- George Diehr, Aditya N. Saharia:
Estimating Block Accesses in Database Organizations.
IEEE Trans. Knowl. Data Eng. 6(3): 497-499(1994)
- Arun N. Swami, K. Bernhard Schiefer:
Estimating Page Fetches for Index Scans with Finite LRU Buffers.
SIGMOD Conference 1994: 173-184
- Richard L. Cole, Goetz Graefe:
Optimization of Dynamic Query Evaluation Plans.
SIGMOD Conference 1994: 150-160
- Philip S. Yu, Douglas W. Cornell:
Buffer Management Based on Return on Consumption in a Multi-Query Environment.
VLDB J. 2(1): 1-37(1993)
- Goetz Graefe:
Query Evaluation Techniques for Large Databases.
ACM Comput. Surv. 25(2): 73-170(1993)
- Pai-Cheng Chu:
Estimating Block Selectivities for Physical Database Design.
IEEE Trans. Knowl. Data Eng. 4(1): 89-98(1992)
- M. Seetha Lakshmi, Philip S. Yu:
Effectiveness of Parallel Joins.
IEEE Trans. Knowl. Data Eng. 2(4): 410-424(1990)
- Eugene J. Shekita, Michael J. Carey:
A Performance Evaluation of Pointer-Based Joins.
SIGMOD Conference 1990: 300-311
- Michael V. Mannino, Paicheng Chu, Thomas Sager:
Statistical Profile Estimation in Database Systems.
ACM Comput. Surv. 20(3): 191-221(1988)
- Mavis K. Lee, Johann Christoph Freytag, Guy M. Lohman:
Implementing an Interpreter for Functional Rules in a Query Optimizer.
VLDB 1988: 218-229
- Ming-Chien Shan:
Optimal Plan Search in a Rule-Based Query Optimizer.
EDBT 1988: 92-112
- C. Mohan, Bruce G. Lindsay, Ron Obermarck:
Transaction Management in the R* Distributed Database Management System.
ACM Trans. Database Syst. 11(4): 378-396(1986)
- Lothar F. Mackert, Guy M. Lohman:
R* Optimizer Validation and Performance Evaluation for Distributed Queries.
VLDB 1986: 149-159
- Guy M. Lohman:
Do semantically equivalent SQL queries perform differently?
ICDE 1986: 225-226
BibTeX
ACM SIGMOD Anthology - DBLP:
[Home | Search: Author, Title | Conferences | Journals]
TODS, ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Tue Jun 24 18:39:07 2008