An Automatic and Tunable Document Indexing System.

Esen A. Ozkarahan, Fazli Can: An Automatic and Tunable Document Indexing System. SIGIR 1986: 234-243
@inproceedings{DBLP:conf/sigir/OzkarahanC86,
  author    = {Esen A. Ozkarahan and
               Fazli Can},
  title     = {An Automatic and Tunable Document Indexing System},
  booktitle = {SIGIR'86, Proceedings of the 9th Annual International ACM SIGIR
               Conference on Research and Development in Information Retrieval,
               Pisa, Italy, September 8-10, 1986},
  publisher = {ACM},
  year      = {1986},
  pages     = {234-243},
  ee        = {db/conf/sigir/OzkarahanC86.html},
  crossref  = {DBLP:conf/sigir/86},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

In this article we present interactive automatic document indexing software together with various index tuning and optimization strategies. After stems are generated from the raw text, the initial index vocabulary is narrowed down and tuned using the relationships between indexing and clustering theory. The narrowed-down vocabulary is further optimized by including term phrases and virtual terms, which correspond to high- and low-frequency terms respectively. The results of performance experiments, which showed significant improvements from index vocabulary optimization, are presented. The use of the term discrimination value concept in tuning and optimizing the index and the retrieval system is discussed.
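As an illustration of the term discrimination value concept mentioned in the abstract, the following is a minimal sketch of one common textbook formulation: the change in document-space density when a term is removed from the index. It is an assumption that this centroid-based definition matches the one used in the paper, which may compute the value differently. The function names and the toy document vectors are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length term-weight vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def density(docs):
    """Average similarity between each document and the collection centroid."""
    n_terms = len(docs[0])
    centroid = [sum(d[t] for d in docs) / len(docs) for t in range(n_terms)]
    return sum(cosine(d, centroid) for d in docs) / len(docs)


def discrimination_values(docs):
    """For each term k: density without term k, minus density with all terms.

    A positive value means removing the term packs documents closer together,
    so the term helps tell documents apart. A negative value marks a poor
    discriminator (assumed formulation, not taken from the paper).
    """
    base = density(docs)
    values = []
    for k in range(len(docs[0])):
        without_k = [d[:k] + d[k + 1:] for d in docs]
        values.append(density(without_k) - base)
    return values


# Toy collection: term 0 occurs in every document, terms 1-3 in one each.
docs = [
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [1, 0, 0, 1],
]
dv = discrimination_values(docs)
```

In this toy collection the term shared by every document receives a negative value and each document-specific term receives a positive one, which is the kind of signal a tuning step could use to decide which terms to keep, combine into phrases, or group together.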

Copyright © 1986 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.

Printed Edition

SIGIR'86, Proceedings of the 9th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Pisa, Italy, September 8-10, 1986. ACM 1986

Online Edition: ACM Digital Library

Referenced by

  1. Fazli Can, Esen A. Ozkarahan: Concepts and Effectiveness of the Cover-Coefficient-Based Clustering Methodology for Text Databases. ACM Trans. Database Syst. 15(4): 483-517 (1990)
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:38:29 2009