Constant Interaction-Time Scatter/Gather Browsing of Very Large Document Collections.
Douglas R. Cutting, David R. Karger, Jan O. Pedersen:
Constant Interaction-Time Scatter/Gather Browsing of Very Large Document Collections.
SIGIR 1993: 126-134
@inproceedings{DBLP:conf/sigir/CuttingKP93,
author = {Douglas R. Cutting and
David R. Karger and
Jan O. Pedersen},
editor = {Robert Korfhage and
Edie M. Rasmussen and
Peter Willett},
title = {Constant Interaction-Time Scatter/Gather Browsing of Very Large
Document Collections},
booktitle = {Proceedings of the 16th Annual International ACM-SIGIR Conference
on Research and Development in Information Retrieval. Pittsburgh,
PA, USA, June 27 - July 1, 1993},
publisher = {ACM},
year = {1993},
isbn = {0-89791-605-0},
pages = {126-134},
ee = {db/conf/sigir/CuttingKP93.html},
crossref = {DBLP:conf/sigir/93},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
The Scatter/Gather document browsing method uses fast document clustering to produce
table-of-contents-like outlines of large document collections. Previous work [1] developed
linear-time document clustering algorithms to establish the feasibility of this method over moderately
large collections. However, even linear-time algorithms are too slow to support interactive browsing
of very large collections such as Tipster, the DARPA standard text retrieval evaluation collection. We
present a scheme that supports constant interaction-time Scatter/Gather of arbitrarily large
collections after near-linear time preprocessing. This involves the construction of a cluster hierarchy.
A modification of Scatter/Gather employing this scheme, and an example of its use over the Tipster
collection are presented.
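The abstract states only that a cluster hierarchy is built during preprocessing and that interaction time then becomes constant. As a minimal sketch of why a precomputed hierarchy can help, the Python below assumes the hierarchy is a tree whose nodes stand in for groups of documents, and covers a user's selection with a bounded number of nodes by repeatedly splitting the largest one. The names `Node`, `expand`, and `budget` are hypothetical, and the splitting strategy is an illustrative assumption, not the authors' published algorithm.

```python
import heapq
from dataclasses import dataclass, field


@dataclass
class Node:
    """A node of a precomputed cluster hierarchy (hypothetical structure)."""
    label: str
    size: int                                  # documents under this node
    children: list = field(default_factory=list)


def expand(selected, budget):
    """Cover the selected nodes with roughly `budget` descendant nodes.

    The largest remaining node is replaced by its children until the
    budget is reached or only leaves remain. The work depends on
    `budget` and the branching factor, not on the collection size.
    """
    # Max-heap keyed on size; the running counter breaks ties so that
    # Node objects are never compared directly.
    heap = [(-n.size, i, n) for i, n in enumerate(selected)]
    heapq.heapify(heap)
    counter = len(heap)
    leaves = []
    while heap and len(heap) + len(leaves) < budget:
        _, _, node = heapq.heappop(heap)
        if not node.children:
            leaves.append(node)                # cannot be split further
            continue
        for child in node.children:
            heapq.heappush(heap, (-child.size, counter, child))
            counter += 1
    return leaves + [n for _, _, n in heap]
```

Under this assumption, the nodes returned by `expand` would be treated as the items to re-cluster for the next display, so each browsing step handles a fixed number of items regardless of how many documents sit beneath them.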
Copyright © 1993 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
Printed Edition
Robert Korfhage, Edie M. Rasmussen, Peter Willett (Eds.):
Proceedings of the 16th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. Pittsburgh, PA, USA, June 27 - July 1, 1993.
ACM 1993, ISBN 0-89791-605-0
Referenced by
- Philip S. Yu:
Data Mining and Personalization Technologies.
DASFAA 1999: 6-13
- Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan:
Scalable Feature Selection, Classification and Signature Generation for Organizing Large Text Databases into Hierarchical Topic Taxonomies.
VLDB J. 7(3): 163-178 (1998)
- Wen-Syan Li, K. Selçuk Candan, Kyoji Hirata, Yoshinori Hara:
Facilitating Multimedia Database Exploration through Visual Interfaces and Perpetual Query Reformulations.
VLDB 1997: 538-547
- Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan:
Using Taxonomy, Discriminants, and Signatures for Navigating in Text Databases.
VLDB 1997: 446-455
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:38:42 2009