ACM SIGMOD Anthology ACM SIGMOD dblp.uni-trier.de

Data Abstraction through Density Estimation by Storage Management.

Kathrin Anne Meier: Data Abstraction through Density Estimation by Storage Management. SSDBM 1997: 39-47
@inproceedings{DBLP:conf/ssdbm/Meier97,
  author    = {Kathrin Anne Meier},
  editor    = {Yannis E. Ioannidis and
               David M. Hansen},
  title     = {Data Abstraction through Density Estimation by Storage Management},
  booktitle = {Ninth International Conference on Scientific and Statistical
               Database Management, Proceedings, August 11-13, 1997, Olympia,
               Washington, USA},
  publisher = {IEEE Computer Society},
  year      = {1997},
  isbn      = {0-8186-7952-2},
  pages     = {39-47},
  ee        = {db/conf/ssdbm/Meier97.html},
  crossref  = {DBLP:conf/ssdbm/97},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX

Abstract

One way to cope with the constantly growing amount of scientific data to be analyzed is to derive data abstractions from the original data. Data abstractions can provide a representation of the data in compressed form where the data's semantic structure is maintained. We have explored data abstractions based on} density estimation. Our method to estimate the density of scientific data sets is based on the directory of a multidimensional data access structure. This data density estimator is called directory estimator. It is based on multidimensional adaptive histograms and is therefore computationally efficient, even for large data sets and many dimensions.

This paper describes the methodology in general and focuses on the estimator's accuracy in particular. The accuracy of the directory estimator depends on the parameters of the access structures used, such as the bucket capacity. We evaluate the choice of bucket capacity theoretically as well as empirically with the ISE (Integrated Squared Error) being the measure of error and using a gridfile as the data access structure.

A useful application of the directory estimator in the field of scientific data is presented with a practical example from astronomy.

Copyright © 1997 by The Institute of Electrical and Electronic Engineers, Inc. (IEEE). Abstract used with permission.


ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 2 Issue 5, SSDBM, DBPL, KRDB, ADBIS, COOPIS, SIGBDP" and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ... BibTeX

Online Edition: IEEE Computer Society DL

Citation Page

Printed Edition

Yannis E. Ioannidis, David M. Hansen (Eds.): Ninth International Conference on Scientific and Statistical Database Management, Proceedings, August 11-13, 1997, Olympia, Washington, USA. IEEE Computer Society 1997, ISBN 0-8186-7952-2
Contents BibTeX

References

[1]
...
[2]
...
[3]
...
[4]
...
[5]
...
[6]
...
[7]
Hans Hinterberger: Multidimensional Data Visualization Design Tradeoffs: Speed vs. Detail. SSDBM 1986: 85-97 BibTeX
[8]
...
[9]
...
[10]
...
[11]
Jürg Nievergelt, Hans Hinterberger, Kenneth C. Sevcik: The Grid File: An Adaptable, Symmetric Multikey File Structure. ACM Trans. Database Syst. 9(1): 38-71(1984) BibTeX
[12]
...
[13]
...
[14]
...
[15]
...

Referenced by

  1. Zuotao Li, Xiaoyang Sean Wang, Menas Kafatos, Ruixin Yang: A Pyramid Data Model for Supporting Content-Based Browsing and Knowledge Discovery. SSDBM 1998: 170-179
BibTeX
ACM SIGMOD Anthology - DBLP: [Home | Search: Author, Title | Conferences | Journals]
SSDBM 1997: Copyright © by IEEE,
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:42:53 2009