Experiments in Automatic Statistical Thesaurus Construction.
Carolyn J. Crouch, Bokyung Yang:
Experiments in Automatic Statistical Thesaurus Construction.
SIGIR 1992: 77-88@inproceedings{DBLP:conf/sigir/CrouchY92,
author = {Carolyn J. Crouch and
Bokyung Yang},
editor = {Nicholas J. Belkin and
Peter Ingwersen and
Annelise Mark Pejtersen},
title = {Experiments in Automatic Statistical Thesaurus Construction},
booktitle = {Proceedings of the 15th Annual International ACM SIGIR Conference
on Research and Development in Information Retrieval. Copenhagen,
Denmark, June 21-24, 1992},
publisher = {ACM},
year = {1992},
isbn = {0-89791-523-2},
pages = {77-88},
ee = {db/conf/sigir/CrouchY92.html},
crossref = {DBLP:conf/sigir/92},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX
Abstract
A well constructed thesaurus has long been recognized as a valuable tool in the effective operation
of an information retrieval system. This paper reports the results of experiments designed to
determine the validity of an approach to the automatic construction of global thesauri (described
originally by Crouch in [1] and [2] based on a clustering of the document collection. The authors
validate the approach by showing that the use of thesauri generated by this method results in
substantial improvements in retrieval effectiveness in four test collections. The term discrimination
value theory, used in the thesaurus generation algorithm to determine a term's membership in a
particular thesaurus class, is found not to be useful in distinguishing a "good" from an "indifferent" or
"poor" thesaurus class). In conclusion, the authors suggest an alternate approach to automatic
thesaurus construction which greatly simplifies the work of producing viable thesaurus classes.
Experimental results show that the alternate approach described herein in some cases produces
thesauri which are comparable in retrieval effectiveness to those produced by the first method at
much lower cost.
Copyright © 1992 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
CDROM Version: Load the CDROM "Volume 2 Issue 3, SIGIR, DASFAA'97, OODBS'86" and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
BibTeX
Printed Edition
Nicholas J. Belkin, Peter Ingwersen, Annelise Mark Pejtersen (Eds.):
Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Copenhagen, Denmark, June 21-24, 1992.
ACM 1992, ISBN 0-89791-523-2
Contents BibTeX
Citation page
BibTeX
ACM SIGMOD Anthology - DBLP:
[Home | Search: Author, Title | Conferences | Journals]
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:38:40 2009