ACM SIGMOD Anthology VLDB dblp.uni-trier.de

Fast Incremental Indexing for Full-Text Information Retrieval.

Eric W. Brown, James P. Callan, W. Bruce Croft: Fast Incremental Indexing for Full-Text Information Retrieval. VLDB 1994: 192-202
@inproceedings{DBLP:conf/vldb/BrownCC94,
  author    = {Eric W. Brown and
               James P. Callan and
               W. Bruce Croft},
  editor    = {Jorge B. Bocca and
               Matthias Jarke and
               Carlo Zaniolo},
  title     = {Fast Incremental Indexing for Full-Text Information Retrieval},
  booktitle = {VLDB'94, Proceedings of 20th International Conference on Very
               Large Data Bases, September 12-15, 1994, Santiago de Chile, Chile},
  publisher = {Morgan Kaufmann},
  year      = {1994},
  isbn      = {1-55860-153-8},
  pages     = {192-202},
  ee        = {db/conf/vldb/vldb94-192.html},
  crossref  = {DBLP:conf/vldb/94},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX

Abstract

Full-text information retrieval systems have traditionally been designed for archival environments. They often provide little or no support for adding new documents to an existing document collection, requiring instead that the entire collection be re-indexed. Modern applications, such as information filtering, operate in dynamic environments that require frequent additions to document collections. We provide this ability using a traditional inverted file index built on top of a persistent object store. The data management facilities of the persistent object store are used to produce efficient incremental update of the inverted lists. We describe our system and present experimental results showing superior incremental indexing and competitive query processing performance.

Copyright © 1994 by the VLDB Endowment. Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by the permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.


Online Paper

ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 1 Issue 5, VLDB '89-'97" and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ... BibTeX

Printed Edition

Jorge B. Bocca, Matthias Jarke, Carlo Zaniolo (Eds.): VLDB'94, Proceedings of 20th International Conference on Very Large Data Bases, September 12-15, 1994, Santiago de Chile, Chile. Morgan Kaufmann 1994, ISBN 1-55860-153-8
Contents BibTeX

References

[BCCM94]
Eric W. Brown, James P. Callan, W. Bruce Croft, J. Eliot B. Moss: Supporting Full-Text Information Retrieval with a Persistent Object Store. EDBT 1994: 365-378 BibTeX
[CCH92]
James P. Callan, W. Bruce Croft, Stephen M. Harding: The INQUERY Retrieval System. DEXA 1992: 78-83 BibTeX
[CP90]
Douglas R. Cutting, Jan O. Pedersen: Optimizations for Dynamic Inverted Index Maintenance. SIGIR 1990: 405-411 BibTeX
[Fal85]
Christos Faloutsos: Access Methods for Text. ACM Comput. Surv. 17(1): 49-74(1985) BibTeX
[FJ92a]
Christos Faloutsos, H. V. Jagadish: Hybrid Index Organizations for Text Databases. EDBT 1992: 310-327 BibTeX
[FJ92b]
Christos Faloutsos, H. V. Jagadish: On B-Tree Indices for Skewed Distributions. VLDB 1992: 363-374 BibTeX
[FL91]
...
[Har94]
...
[Hea78]
...
[HFBYL92]
...
[Mos90]
J. Eliot B. Moss: Design of the Mneme Persistent Object Store. ACM Trans. Inf. Syst. 8(2): 103-139(1990) BibTeX
[MZ94a]
...
[MZ94b]
...
[SM83]
Gerard Salton, Michael McGill: Introduction to Modern Information Retrieval. McGraw-Hill Book Company 1984, ISBN 0-07-054484-0
BibTeX
[STGM94]
Kurt A. Shoens, Anthony Tomasic, Hector Garcia-Molina: Synthetic Workload Performance Analysis of Incremental Updates. SIGIR 1994: 329-338 BibTeX
[TC91]
Howard R. Turtle, W. Bruce Croft: Evaluation of an Inference Network-Based Retrieval Model. ACM Trans. Inf. Syst. 9(3): 187-222(1991) BibTeX
[TC92]
Howard R. Turtle, W. Bruce Croft: A Comparison of Text Retrieval Models. Comput. J. 35(3): 279-290(1992) BibTeX
[TGMS94]
Anthony Tomasic, Hector Garcia-Molina, Kurt A. Shoens: Incremental Updates of Inverted Lists for Text Document Retrieval. SIGMOD Conference 1994: 289-300 BibTeX
[Wil84]
...
[WL93]
Wai Yee Peter Wong, Dik Lun Lee: Implementations of Partial Document Ranking Using Inverted Files. Inf. Process. Manage. 29(5): 647-669(1993) BibTeX
[WMB94]
Ian H. Witten, Alistair Moffat, Timothy C. Bell: Managing Gigabytes: Compressing and Indexing Documents and Images. Van Nostrand Reinhold 1994, ISBN 0-442-01863-0
BibTeX
[Wol92a]
Dietmar Wolfram: Applying Informetric Characteristics of Databases to IR System File Design, Part I: Informetric Models. Inf. Process. Manage. 28(1): 121-134(1992) BibTeX
[Wol92b]
Dietmar Wolfram: Applying Informetric Characteristics of Databases to IR System File Design, Part II: Simulation Comparisons. Inf. Process. Manage. 28(1): 135-(1992) BibTeX
[Zip49]
George Kingsley Zipf: Human Behaviour and the Principle of Least Effort: an Introduction to Human Ecology. Addison-Wesley 1949
BibTeX
[ZMSD92]
Justin Zobel, Alistair Moffat, Ron Sacks-Davis: An Efficient Indexing Technique for Full Text Databases. VLDB 1992: 352-362 BibTeX

Referenced by

  1. Michael J. Carey, Donald Kossmann: Reducing the Braking Distance of an SQL Query Engine. VLDB 1998: 158-169
  2. Sibel Adali, Piero A. Bonatti, Maria Luisa Sapino, V. S. Subrahmanian: A Multi-Similarity Algebra. SIGMOD Conference 1998: 402-413
  3. Narayanan Shivakumar, Hector Garcia-Molina: Wave-Indices: Indexing Evolving Databases. SIGMOD Conference 1997: 381-392
  4. Charles L. Viles, James C. French: On the Update of Term Weights in Dynamic Information Retrieval Systems. CIKM 1995: 167-174
  5. Helmut Kaufmann, Moira C. Norrie: Relaxation of Correctness in Database Systems. ADBIS 1995: 90-105
BibTeX
ACM SIGMOD Anthology - DBLP: [Home | Search: Author, Title | Conferences | Journals]
VLDB Proceedings: Copyright © by VLDB Endowment,
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:46:01 2009