
Compression of Concordances in Full-Text Retrieval Systems.

Yaacov Choueka, Aviezri S. Fraenkel, Shmuel T. Klein: Compression of Concordances in Full-Text Retrieval Systems. SIGIR 1988: 597-612
@inproceedings{DBLP:conf/sigir/ChouekaFK88,
  author    = {Yaacov Choueka and
               Aviezri S. Fraenkel and
               Shmuel T. Klein},
  editor    = {Yves Chiaramella},
  title     = {Compression of Concordances in Full-Text Retrieval Systems},
  booktitle = {SIGIR'88, Proceedings of the 11th Annual International ACM SIGIR
               Conference on Research and Development in Information Retrieval,
               Grenoble, France, June 13-15, 1988},
  publisher = {ACM},
  year      = {1988},
  pages     = {597-612},
  ee        = {db/conf/sigir/ChouekaFK88.html},
  crossref  = {DBLP:conf/sigir/88},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

The concordance of a full-text information retrieval system contains, for every distinct word W of the database, a list L(W) of "coordinates", each of which describes the exact location of an occurrence of W in the text. The concordance should be compressed, not only to save storage space but also to reduce the number of I/O operations, since the file is usually kept in secondary memory. Several methods are presented that efficiently compress concordances of large full-text retrieval systems. The methods were tested on the concordance of the Responsa Retrieval Project and yield savings of up to 49% relative to the uncompressed file; this is a relative improvement of about 27% over the currently used prefix-omission compression technique.
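
The abstract names prefix omission as the baseline technique but does not describe it. The following is a minimal sketch of the general idea only, not the paper's method: it assumes that coordinates are hierarchical tuples (a hypothetical document, paragraph, sentence, word layout) stored in sorted order, so that each entry can omit the leading fields it shares with its predecessor. The names `pom_encode`, `pom_decode`, and `Coordinate` are illustrative and do not come from the paper.

```python
from typing import List, Tuple

# Assumption: a coordinate is a hierarchical tuple such as
# (document, paragraph, sentence, word). The real layout may differ.
Coordinate = Tuple[int, ...]
Encoded = Tuple[int, Coordinate]  # (number of shared leading fields, remaining fields)


def pom_encode(coords: List[Coordinate]) -> List[Encoded]:
    """Store each coordinate as the count of leading fields it shares
    with the previous coordinate, followed by the fields that differ."""
    encoded: List[Encoded] = []
    previous: Coordinate = ()
    for coord in coords:
        shared = 0
        limit = min(len(previous), len(coord))
        while shared < limit and previous[shared] == coord[shared]:
            shared += 1
        encoded.append((shared, coord[shared:]))
        previous = coord
    return encoded


def pom_decode(encoded: List[Encoded]) -> List[Coordinate]:
    """Rebuild the full coordinates by re-attaching the omitted prefixes."""
    coords: List[Coordinate] = []
    previous: Coordinate = ()
    for shared, suffix in encoded:
        coord = previous[:shared] + tuple(suffix)
        coords.append(coord)
        previous = coord
    return coords


# A hypothetical list L(W) for one word W, already sorted.
coords = [(3, 1, 2, 5), (3, 1, 2, 9), (3, 1, 4, 1), (7, 2, 1, 3)]
encoded = pom_encode(coords)
```

In this sketch the second entry keeps only its last field, because its first three fields repeat those of the first entry. Any space saving in practice depends on how the shared-field count and the remaining fields are packed into bits, which the sketch leaves out.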

Copyright © 1988 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.



Printed Edition

Yves Chiaramella (Ed.): SIGIR'88, Proceedings of the 11th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Grenoble, France, June 13-15, 1988. ACM 1988

Online Edition: ACM Digital Library


Referenced by

  1. Brian Lowe, Justin Zobel, Ron Sacks-Davis: A Formal Model for Databases of Structured Text. DASFAA 1995: 449-456