Data Management Support for Statistical Data Editing and Subset Selection.
Robert A. Burnett, James J. Thomas:
Data Management Support for Statistical Data Editing and Subset Selection.
SSDBM 1981: 88-102@inproceedings{DBLP:conf/ssdbm/BurnettT81,
author = {Robert A. Burnett and
James J. Thomas},
editor = {Harry K. T. Wong},
title = {Data Management Support for Statistical Data Editing and Subset
Selection},
booktitle = {Proceedings of the First LBL Workshop on Statistical Database
Management, Melno Park, California, USA, December 2-4, 1981},
publisher = {Lawrence Berkeley Laboratory},
year = {1981},
pages = {88-102},
ee = {db/conf/ssdbm/BurnettT81.html},
crossref = {DBLP:conf/ssdbm/81},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX
Abstract
Statistical analysis of large data sets often involves an initial data editing
and preparation phase to check the validity of individual data items, check
for consistency among related data, correct erroneous data, and supply
(impute) values for missing data where possible. During this preparatory
phase of analysis , it is often necessary to partition the data set into a
number of subsets by logical selection and/or random sampling techniques for
purposes of hypothesis testing. This paper examines the data management support required by these editing and subsetting operations in terms of data
descriptions, data manipulation functions, and logical and physical data
structures. The design of a data management system which seeks to meet these
requirements is described in detail. The system, called SDB, is built around
a self-describing transposed file structure and supporting data access
software. SDB representations of some logical data structures which are
commonly encountered in statistical databases are also described. Experiences
with a partial implementation of the system and its application in an
interactive data editor have been encouraging.
CDROM Version: Load the CDROM "Volume 2 Issue 5, SSDBM, DBPL, KRDB, ADBIS, COOPIS, SIGBDP" and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
BibTeX
Printed Edition
Harry K. T. Wong (Ed.):
Proceedings of the First LBL Workshop on Statistical Database Management, Melno Park, California, USA, December 2-4, 1981.
Lawrence Berkeley Laboratory 1982
Contents BibTeX
References
- [1]
- ...
- [2]
- ...
- [3]
- ...
- [4]
- ...
- [5]
- ...
- [6]
- Ryosuke Hotaka, Masaaki Tsubaki:
Self-Descriptive Relational Data Base.
VLDB 1977: 415-426 BibTeX
- [7]
- Don S. Batory:
On Searching Transposed Files.
ACM Trans. Database Syst. 4(4): 531-544(1979) BibTeX
- [8]
- M. J. Turner, R. Hammond, P. Cotton:
A DBMS for Large Statistical Databases.
VLDB 1979: 319-327 BibTeX
- [9]
- ...
- [10]
- Michael Stonebraker:
Operating System Support for Database Management.
Commun. ACM 24(7): 412-418(1981) BibTeX
Referenced by
- James J. Thomas, David L. Hall:
ALDS Project: Motivation, Statistical Database Management Issues, Perspectives, and Directions.
SSDBM 1983: 82-88
- Frank Olken:
How Baroque Should a Statistical Database Management System Be?
SSDBM 1983: 212-219
- Robert A. Burnett, Paula J. Cowley, James J. Thomas:
Management and Display of Data Analysis Environments for Large Data Sets.
SSDBM 1983: 22-31
BibTeX
ACM SIGMOD Anthology - DBLP:
[Home | Search: Author, Title | Conferences | Journals]
SSDBM 1981 Proceedings: Copyright © by Lawrence Berkeley National Laboratory, ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:42:33 2009