@inproceedings{DBLP:conf/dolap/GoilC98,
author = {Sanjay Goil and
Alok N. Choudhary},
title = {High Performance Multidimensional Analysis of Large Datasets},
booktitle = {DOLAP '98, ACM First International Workshop on Data Warehousing
and OLAP, November 7, 1998, Bethesda, Maryland, USA, Proceedings},
publisher = {ACM},
year = {1998},
pages = {34--39},
ee = {db/conf/dolap/GoilC98.html, http://doi.acm.org/10.1145/294260.294269},
crossref = {DBLP:conf/dolap/98},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Summary information from data in large databases is used to answer queries in On-Line Analytical Processing (OLAP) systems and to build decision support systems over them. The Data Cube is used to calculate and store summary information on a variety of dimensions, and it is computed only partially if the number of dimensions is large. Queries posed on such systems are quite complex and require different views of the data. These may either be answered from a materialized cube in the data cube or calculated on the fly. Further, data mining for associations can be performed on the data cube. Analytical models need to capture the multidimensionality of the underlying data, a task for which multidimensional databases are well suited. Multidimensional databases store data in a multidimensional structure on which analytical operations are performed. A challenge for these systems is how to handle large data sets in a large number of dimensions.
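To make the data cube idea concrete, the following is a minimal serial sketch in Python that sums one measure over every subset of the dimensions, giving 2^n group-bys for n dimensions. It is not the parallel algorithm of the paper; the function name `data_cube`, the toy records, and the field names `product`, `store`, and `units` are assumptions chosen for illustration only.

```python
from collections import defaultdict
from itertools import combinations


def data_cube(rows, dims, measure):
    """Sum `measure` over every subset of `dims` (all 2^n group-bys).

    Naive illustration: each group-by rescans all rows. A real system
    would derive coarser group-bys from finer ones and, when n is large,
    materialize only part of the cube.
    """
    cube = {}
    for k in range(len(dims) + 1):
        for group in combinations(dims, k):
            totals = defaultdict(int)
            for row in rows:
                key = tuple(row[d] for d in group)
                totals[key] += row[measure]
            cube[group] = dict(totals)
    return cube


# Hypothetical toy sales records (not taken from the paper).
rows = [
    {"product": "pen", "store": "east", "units": 3},
    {"product": "pen", "store": "west", "units": 5},
    {"product": "ink", "store": "east", "units": 2},
]

cube = data_cube(rows, ("product", "store"), "units")
print(cube[()])            # grand total over all rows
print(cube[("product",)])  # totals per product
```

Each key of `cube` is one view of the data: the empty tuple is the grand total, `("product",)` is the per-product summary, and `("product", "store")` is the finest group-by. Answering a query from a materialized view then amounts to a dictionary lookup, while an unmaterialized view has to be computed on the fly.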
This paper presents a parallel OLAP infrastructure for multidimensional databases integrated with association rule mining. Scheduling optimisations for parallel computation of complete data cubes are presented. We propose left and right schedules for partial data cubes for m-way mining of association rules. Our implementation on the IBM SP-2, a shared-nothing parallel machine, can handle large data sets and a large number of dimensions by using disk I/O in our algorithms.
Copyright © 1998 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.