
High Performance Multidimensional Analysis of Large Datasets.

Sanjay Goil, Alok N. Choudhary: High Performance Multidimensional Analysis of Large Datasets. DOLAP 1998: 34-39
@inproceedings{DBLP:conf/dolap/GoilC98,
  author    = {Sanjay Goil and
               Alok N. Choudhary},
  title     = {High Performance Multidimensional Analysis of Large Datasets},
  booktitle = {DOLAP '98, ACM First International Workshop on Data Warehousing
               and OLAP, November 7, 1998, Bethesda, Maryland, USA, Proceedings},
  publisher = {ACM},
  year      = {1998},
  pages     = {34-39},
  ee        = {db/conf/dolap/GoilC98.html, http://doi.acm.org/10.1145/294260.294269},
  crossref  = {DBLP:conf/dolap/98},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

Summary information from data in large databases is used to answer queries in On-Line Analytical Processing (OLAP) systems and to build decision support systems over them. The Data Cube is used to calculate and store summary information on a variety of dimensions, which is computed only partially if the number of dimensions is large. Queries posed on such systems are quite complex and require different views of data. These may either be answered from a materialized cube in the data cube or calculated on the fly. Further, data mining for associations can be performed on the data cube. Analytical models need to capture the multidimensionality of the underlying data, a task for which multidimensional databases are well suited. Multidimensional databases store data in a multidimensional structure on which analytical operations are performed. A challenge for these systems is how to handle large data sets in a large number of dimensions.
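
As a rough illustration of the data cube idea described above (not the paper's parallel implementation), the following sketch computes a sum for every subset of a set of dimensions, in the sense of the cube operator of [GBLP96]. The function name data_cube, the sample rows, and the column names are assumptions made for this example only.

```python
from collections import defaultdict
from itertools import combinations


def data_cube(rows, dims, measure):
    """Sum `measure` for every subset of `dims` (one group-by per subset).

    Returns a dict mapping each tuple of grouping dimensions to a dict
    of {key tuple: aggregated sum}. Hypothetical helper for illustration.
    """
    cube = {}
    for k in range(len(dims) + 1):
        for group in combinations(dims, k):
            totals = defaultdict(int)
            for row in rows:
                key = tuple(row[d] for d in group)
                totals[key] += row[measure]
            cube[group] = dict(totals)
    return cube


# Invented sample data: two dimensions and one measure.
sales = [
    {"product": "pen", "region": "east", "units": 3},
    {"product": "pen", "region": "west", "units": 5},
    {"product": "ink", "region": "east", "units": 2},
]

cube = data_cube(sales, ("product", "region"), "units")
```

With d dimensions this naive version scans the data 2^d times, which is one reason the abstract notes that the cube is computed only partially when the number of dimensions is large.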

This paper presents a parallel OLAP infrastructure for multidimensional databases integrated with association rule mining. Scheduling optimisations for parallel computation of complete data cubes are presented. We propose left and right schedules for partial data cubes for m-way mining of association rules. Our implementation on the IBM SP-2, a shared-nothing parallel machine, can handle large data sets and a large number of dimensions by using disk I/O in our algorithms.
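
To make the link between cube aggregates and association rules concrete, here is a minimal sketch that derives the standard support and confidence of a rule "product => region" from count aggregates. It does not reproduce the left and right schedules or the m-way mining proposed in the paper; the records, variable names, and the rule_metrics helper are assumptions for illustration.

```python
from collections import Counter

# Invented sample records: (product, region) pairs.
records = [
    ("pen", "east"),
    ("pen", "east"),
    ("pen", "west"),
    ("ink", "east"),
]

# Count aggregates, analogous to two group-bys of a data cube.
by_both = Counter(records)                   # grouped by (product, region)
by_product = Counter(p for p, _ in records)  # grouped by product only
total = len(records)


def rule_metrics(product, region):
    """Support and confidence of the rule product => region."""
    joint = by_both[(product, region)]
    antecedent = by_product[product]
    support = joint / total
    # Confidence is undefined for an unseen antecedent; report 0.0 here.
    confidence = joint / antecedent if antecedent else 0.0
    return support, confidence


support, confidence = rule_metrics("pen", "east")
```

Both measures are read from precomputed counts, so no further pass over the base records is needed once the relevant group-bys exist.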

Copyright © 1998 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.



Printed Edition

DOLAP '98, ACM First International Workshop on Data Warehousing and OLAP, November 7, 1998, Bethesda, Maryland, USA, Proceedings. ACM 1998


References

[Bha95]
...
[FPSSU95]
Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, Ramasamy Uthurusamy (Eds.): Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press 1996, ISBN 0-262-56097-6
[GBLP96]
Jim Gray, Adam Bosworth, Andrew Layman, Hamid Pirahesh: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Total. ICDE 1996: 152-159
[GC97a]
Sanjay Goil, Alok N. Choudhary: High Performance OLAP and Data Mining on Parallel Computers. Data Min. Knowl. Discov. 1(4): 391-417 (1997)
[GC97b]
...
[HCC93]
Jiawei Han, Yandong Cai, Nick Cercone: Data-Driven Discovery of Quantitative Rules in Relational Databases. IEEE Trans. Knowl. Data Eng. 5(1): 29-40 (1993)
[HF95]
Jiawei Han, Yongjian Fu: Discovery of Multiple-Level Association Rules from Large Databases. VLDB 1995: 420-431
[HRU96]
Venky Harinarayan, Anand Rajaraman, Jeffrey D. Ullman: Implementing Data Cubes Efficiently. SIGMOD Conference 1996: 205-216
[KHC97]
Micheline Kamber, Jiawei Han, Jenny Chiang: Metarule-Guided Mining of Multi-Dimensional Association Rules Using Data Cubes. KDD 1997: 207-210
[LS98]
...
[Sof97]
...
[ZDN97]
Yihong Zhao, Prasad Deshpande, Jeffrey F. Naughton: An Array-Based Algorithm for Simultaneous Multidimensional Aggregates. SIGMOD Conference 1997: 159-170
DOLAP 1998 Proceedings, ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:07:20 2009