
A Cache Filtering Optimisation for Queries to Massive Datasets on Tertiary Storage.

Koen Holtman, Peter van der Stok, Ian Willers: A Cache Filtering Optimisation for Queries to Massive Datasets on Tertiary Storage. DOLAP 1999: 94-100
@inproceedings{DBLP:conf/dolap/HoltmanSW99,
  author    = {Koen Holtman and
               Peter van der Stok and
               Ian Willers},
  title     = {A Cache Filtering Optimisation for Queries to Massive Datasets
               on Tertiary Storage},
  booktitle = {DOLAP '99, ACM Second International Workshop on Data Warehousing
               and OLAP, November 6, 1999, Kansas City, Missouri, USA, Proceedings},
  publisher = {ACM},
  year      = {1999},
  pages     = {94-100},
  ee        = {db/conf/dolap/HoltmanSW99.html, http://doi.acm.org/10.1145/319757.319797},
  crossref  = {DBLP:conf/dolap/99},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

We consider a system in which many users run queries to examine subsets of a large object set. The object set is partitioned into files on tape. A single subset of objects will be visited by multiple queries in the workload. This locality of access creates the opportunity for caching on disk. We introduce and evaluate a novel optimisation, cache filtering, in which the 'hot' objects are automatically extracted from the files that are staged on disk, and then cached separately in new files on disk. Cache filtering can lead to complex situations in the disk cache. We show that these do not prevent effective caching and we introduce a special cache replacement algorithm to maximise efficiency. Through simulations we evaluate the system over a broad range of likely workloads. Depending on workload and system parameters, the cache filtering optimisation yields speedup factors up to 6.
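The filtering step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `filter_hot_objects`, the access-count criterion for "hot", and the `threshold` value are all assumptions made for illustration, and the paper's cache replacement algorithm is not modelled.

```python
from collections import Counter


def filter_hot_objects(staged_file, access_counts, threshold=2):
    """Return the objects in a staged file that count as 'hot'.

    staged_file: dict mapping object id -> payload (a file staged from tape).
    access_counts: Counter of how often each object id was requested.
    threshold: hypothetical cut-off for "hot"; not taken from the paper.
    """
    return {
        oid: data
        for oid, data in staged_file.items()
        if access_counts[oid] >= threshold
    }


# Toy workload: two queries visit overlapping subsets of one tape file.
tape_file = {oid: f"payload-{oid}" for oid in range(6)}
queries = [{0, 1, 2}, {1, 2, 5}]

# Count how often each object is requested across the workload.
access_counts = Counter(oid for query in queries for oid in query)

# The hot objects would be written to a new, smaller file in the disk cache,
# so the full staged file could later be evicted without losing them.
hot_file = filter_hot_objects(tape_file, access_counts)
```

In this toy workload only the objects requested by both queries end up in the new cache file; a real system would also need a policy for deciding when filtering is worthwhile and which files to evict.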

Copyright © 1999 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.

