
A New Compression Method with Fast Searching on Large Databases.

Jianzhong Li, Doron Rotem, Harry K. T. Wong: A New Compression Method with Fast Searching on Large Databases. VLDB 1987: 311-318
@inproceedings{DBLP:conf/vldb/LiRW87,
  author    = {Jianzhong Li and
               Doron Rotem and
               Harry K. T. Wong},
  editor    = {Peter M. Stocker and
               William Kent and
               Peter Hammersley},
  title     = {A New Compression Method with Fast Searching on Large Databases},
  booktitle = {VLDB'87, Proceedings of 13th International Conference on Very
               Large Data Bases, September 1-4, 1987, Brighton, England},
  publisher = {Morgan Kaufmann},
  year      = {1987},
  isbn      = {0-934613-46-X},
  pages     = {311-318},
  ee        = {db/conf/vldb/LiRW87.html},
  crossref  = {DBLP:conf/vldb/87},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

In this paper, a new compression method for constant removal from very large scientific and statistical databases is presented. The new method combines the best features of several classical constant removal compression methods. The results, both analytical and experimental, show that the method is superior to these popular methods in terms of compression effectiveness and efficient searching on the compressed data. In addition to the development, analysis and validation of this new method, this paper also presents an analysis of several traditional constant removal methods for the purpose of analytic comparison. A large collection of experiments has been designed and run to observe and validate the behavior of the compression methods. Another contribution of the paper is that performance characteristics are identified for different compression methods under different assumptions about data properties. The results can be used as a basis for selecting a compression method by matching the properties of the database at hand to the data properties examined in the paper.

Copyright © 1987 by the VLDB Endowment. Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by the permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.



Printed Edition

Peter M. Stocker, William Kent, Peter Hammersley (Eds.): VLDB'87, Proceedings of 13th International Conference on Very Large Data Bases, September 1-4, 1987, Brighton, England. Morgan Kaufmann 1987, ISBN 0-934613-46-X

References

[1] ...
[2] Susan J. Eggers, Arie Shoshani: Efficient Access of Compressed Data. VLDB 1980: 205-211
[3] Susan J. Eggers, Frank Olken, Arie Shoshani: A Compression Technique for Large Statistical Data-Bases. VLDB 1981: 424-434
[4] Arie Shoshani, Frank Olken, Harry K. T. Wong: Characteristics of Scientific Databases. VLDB 1984: 147-160
[5] Arie Shoshani: Statistical Databases: Characteristics, Problems, and some Solutions. VLDB 1982: 208-222
[6] ...
[7] ...
[8] ...
[9] ...
[10] Bruce Hahn: A New Technique for Compression and Storage of Data. Commun. ACM 17(8): 434-436(1974)
[11] Donald E. Knuth: The Art of Computer Programming, Volume III: Sorting and Searching. Addison-Wesley 1973, ISBN 0-201-03803-X
[12] Robert Endre Tarjan, Andrew Chi-Chih Yao: Storing a Sparse Table. Commun. ACM 22(11): 606-611(1979)
[13] Mostafa A. Bassiouni: Data Compression in Scientific and Statistical Databases. IEEE Trans. Software Eng. 11(10): 1047-1058(1985)
[14] ...
[15] Jukka Teuhola: A Compression Method for Clustered Bit-Vectors. Inf. Process. Lett. 7(6): 308-311(1978)
[16] ...
[17] Harry K. T. Wong, J. Z. Li: Transposition Algorithms on Very Large Compressed Databases. VLDB 1986: 304-311

Referenced by

  1. Wee Keong Ng, Chinya V. Ravishankar: Block-Oriented Compression Techniques for Large Statistical Databases. IEEE Trans. Knowl. Data Eng. 9(2): 314-328(1997)
  2. Flip Korn, H. V. Jagadish, Christos Faloutsos: Efficiently Supporting Ad Hoc Queries in Large Datasets of Time Sequences. SIGMOD Conference 1997: 289-300
  3. Wee Keong Ng, Chinya V. Ravishankar: A Physical Storage for Efficient Statistical Query Processing. SSDBM 1994: 97-106
VLDB Proceedings: Copyright © by VLDB Endowment
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:45:35 2009