A Cache Filtering Optimisation for Queries to Massive Datasets on Tertiary Storage.
Koen Holtman, Peter van der Stok, Ian Willers:
A Cache Filtering Optimisation for Queries to Massive Datasets on Tertiary Storage.
DOLAP 1999: 94-100@inproceedings{DBLP:conf/dolap/HoltmanSW99,
author = {Koen Holtman and
Peter van der Stok and
Ian Willers},
title = {A Cache Filtering Optimisation for Queries to Massive Datasets
on Tertiary Storage},
booktitle = {DOLAP '99, ACM Second International Workshop on Data Warehousing
and OLAP, November 6, 1999, Kansas City, Missouri, USA, Proceedings},
publisher = {ACM},
year = {1999},
pages = {94-100},
ee = {db/conf/dolap/HoltmanSW99.html, http://doi.acm.org/10.1145/319757.319797},
crossref = {DBLP:conf/dolap/99},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX
Abstract
We consider a system in which many users run queries to examine subsets of a large object set.
The object set is partitioned into files on tape.
A single subset of objects will be visited by multiple queries in the workload.
This locality of access creates the opportunity for caching on disk.
We introduce and evaluate a novel optimisation, cache filtering, in which the 'hot' objects are automatically extracted from the files that are staged on disk, and then cached separately in new files on disk.
Cache filtering can lead to complex situations in the disk cache.
We show that these do not prevent effective caching and we introduce a special cache replacement algorithm to maximise efficiency.
Through simulations we evaluate the system over a broad range of likely workloads.
Depending on workload and system parameters, the cache filtering optimisation yields speedup factors up to 6.
Copyright © 1999 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
CDROM Version: Load the CDROM "Volume 2 Issue 4, CIKM, DOLAP, GIS, SIGFIDET, ..." and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
BibTeX
Printed Edition
DOLAP '99, ACM Second International Workshop on Data Warehousing and OLAP, November 6, 1999, Kansas City, Missouri, USA, Proceedings.
ACM 1999
Contents BibTeX
Online Edition
Citation Page
BibTeX
References
- [1]
- Luis M. Bernardo, Henrik Nordberg, Doron Rotem, Arie Shoshani:
Determining the Optimal File Size on Tertiary Storage Systems Based on the Distribution of Query Sizes.
SSDBM 1998: 22-31 BibTeX
- [2]
- Ling Tony Chen, R. Drach, M. Keating, S. Louis, Doron Rotem, Arie Shoshani:
Efficient organization and access of multi-dimensional datasets on tertiary storage systems.
Inf. Syst. 20(2): 155-183(1995) BibTeX
- [3]
- ...
- [4]
- Jim Gray, Goetz Graefe:
The Five-Minute Rule Ten Years Later, and Other Computer Storage Rules of Thumb.
SIGMOD Record 26(4): 63-68(1997) BibTeX
- [5]
- Robert L. Grossman, David Hanley, Xiao Qin:
Caching and Migration for Multilevel Persistent Object Stores.
IEEE Symposium on Mass Storage Systems 1995: 127-135 BibTeX
- [6]
- Andrew Hanushevsky, Marcia Nowark:
Pursuit of a Scalable High Performance Multi-Petabyte Database.
IEEE Symposium on Mass Storage Systems 1999: 169-175 BibTeX
- [7]
- Koen Holtman, Peter van der Stok, Ian Willers:
Automatic Reclustering of Objects in Very Large Databases for High Energy Physics.
IDEAS 1998: 132-140 BibTeX
- [8]
- ...
- [9]
- ...
- [10]
- Sunita Sarawagi, Michael Stonebraker:
Efficient Organization of Large Multidimensional Arrays.
ICDE 1994: 328-336 BibTeX
- [11]
- Jie-Bing Yu, David J. DeWitt:
Query Pre-Execution and Batching in Paradise: A Two-Pronged Approach to the Efficient Processing of Queries on Tape-Resident Raster Images.
SSDBM 1997: 64-78 BibTeX
BibTeX
ACM SIGMOD Anthology - DBLP:
[Home | Search: Author, Title | Conferences | Journals]
DOLAP 1999 Proceedings, ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:07:21 2009