ACM SIGMOD Anthology ACM SIGMOD dblp.uni-trier.de

The Effectiveness of GlOSS for the Text Database Discovery Problem.

Luis Gravano, Hector Garcia-Molina, Anthony Tomasic: The Effectiveness of GlOSS for the Text Database Discovery Problem. SIGMOD Conference 1994: 126-137
@inproceedings{DBLP:conf/sigmod/GravanoGT94,
  author    = {Luis Gravano and
               Hector Garcia-Molina and
               Anthony Tomasic},
  editor    = {Richard T. Snodgrass and
               Marianne Winslett},
  title     = {The Effectiveness of GlOSS for the Text Database Discovery Problem},
  booktitle = {Proceedings of the 1994 ACM SIGMOD International Conference on
               Management of Data, Minneapolis, Minnesota, May 24-27, 1994},
  publisher = {ACM Press},
  year      = {1994},
  pages     = {126-137},
  ee        = {http://doi.acm.org/10.1145/191839.191869, db/conf/sigmod/GravanoGT94.html},
  crossref  = {DBLP:conf/sigmod/94},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

The popularity of on-line document databases has led to a new problem: finding which text databases (out of many candidate choices) are the most relevant to a user. Identifying the relevant databases for a given query is the text-database discovery problem. The first part of this paper presents a practical solution based on estimating the result size of a query and a database. The method is termed GlOSS - Glossary of Servers Server. The second part of this paper evaluates the effectiveness of GlOSS based on a trace of real-user queries. In addition, we analyze the storage cost of our approach.
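
The abstract names the idea (estimate the result size of a query at each database, then prefer the databases with the largest estimates) but does not spell out the estimator. The sketch below is one possible reading, assuming that each database exports only its document count and per-term document frequencies, and that query terms occur independently of one another. The function names, the summary layout, and the sample numbers are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from math import prod


def estimate_result_size(db_size, doc_freqs, query_terms):
    """Estimate how many documents match ALL query terms.

    Assumption (not stated in the abstract): terms are independent, so the
    expected match count is db_size * product(df(t) / db_size).
    An empty query yields db_size, since every document matches it.
    """
    if db_size <= 0:
        return 0.0
    return db_size * prod(doc_freqs.get(t, 0) / db_size for t in query_terms)


def rank_databases(summaries, query_terms):
    """Return (name, estimate) pairs with a nonzero estimate, largest first.

    `summaries` maps a database name to (document count, {term: doc frequency}).
    """
    estimates = (
        (name, estimate_result_size(size, freqs, query_terms))
        for name, (size, freqs) in summaries.items()
    )
    return sorted(
        (pair for pair in estimates if pair[1] > 0),
        key=lambda pair: pair[1],
        reverse=True,
    )


# Hypothetical summaries for three databases.
summaries = {
    "db_a": (1000, {"text": 100, "database": 50}),
    "db_b": (200, {"text": 40}),
    "db_c": (500, {"text": 250, "database": 100}),
}

ranking = rank_databases(summaries, ["text", "database"])
```

With these made-up summaries, `db_c` ranks ahead of `db_a`, and `db_b` is dropped because it holds no document containing "database". Only the summaries are consulted, so the choice is made without contacting any database, which is also why the storage cost of the summaries matters.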

Copyright © 1994 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.


Printed Edition

Richard T. Snodgrass, Marianne Winslett (Eds.): Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, Minneapolis, Minnesota, May 24-27, 1994. ACM Press 1994, SIGMOD Record 23(2), June 1994

References

[1]
...
[2]
...
[3]
Katia Obraczka, Peter B. Danzig, Shih-Hao Li: Internet Resource Discovery Services. IEEE Computer 26(9): 8-22(1993)
[4]
...
[5]
...
[6]
...
[7]
...
[8]
...
[9]
...
[10]
Michael F. Schwartz: Internet Resource Discovery at the University of Colorado. IEEE Computer 26(9): 25-35(1993)
[11]
...
[12]
Peter B. Danzig, Jong Suk Ahn, John Noll, Katia Obraczka: Distributed Indexing: A Scalable Mechanism for Distributed Information Retrieval. SIGIR 1991: 220-229
[13]
...
[14]
...
[15]
...
[16]
...
[17]
...
[18]
Mark A. Sheldon, Andrzej Duda, Ron Weiss, James O'Toole, David K. Gifford: Content Routing for Distributed Information Servers. EDBT 1994: 109-122
[19]
...
[20]
...
[21]
Gerard Salton, Chris Buckley: Parallel Text Search Methods. Commun. ACM 31(2): 202-215(1988)

Referenced by

  1. Luis Gravano, Hector Garcia-Molina, Anthony Tomasic: GlOSS: Text-Source Discovery over the Internet. ACM Trans. Database Syst. 24(2): 229-264(1999)
  2. Felix Naumann, Ulf Leser, Johann Christoph Freytag: Quality-driven Integration of Heterogenous Information Systems. VLDB 1999: 447-458
  3. James P. Callan, Margaret E. Connell, Aiqun Du: Automatic Discovery of Language Models for Text Databases. SIGMOD Conference 1999: 479-490
  4. Ling Liu: Query Routing in Large-Scale Digital Library Systems. ICDE 1999: 154-163
  5. Yong S. Choi, Suk I. Yoo: Neural Net Agent for Discovering Text Databases on the Web. ADBIS (Short Papers) 1999: 221-231
  6. Shih-Fu Chang, Luis Gravano, Gail E. Kaiser, Kenneth A. Ross, Salvatore J. Stolfo: Database Research at Columbia University. SIGMOD Record 27(3): 75-80(1998)
  7. Luis Gravano, Yannis Papakonstantinou: Mediating and Metasearching on the Internet. IEEE Data Eng. Bull. 21(2): 28-36(1998)
  8. Hector Garcia-Molina, Steven P. Ketchpel, Narayanan Shivakumar: Safeguarding and Charging for Information on the Internet. ICDE 1998: 182-189
  9. Luis Gravano, Kevin Chen-Chuan Chang, Hector Garcia-Molina, Andreas Paepcke: STARTS: Stanford Proposal for Internet Meta-Searching (Experience Paper). SIGMOD Conference 1997: 207-218
  10. Budi Yuwono, Dik Lun Lee: Server Ranking for Distributed Text Retrieval Systems on the Internet. DASFAA 1997: 41-50
  11. Daniel Barbará, Sharad Mehrotra, Padmavathi Vallabhaneni: The Gold Text Indexing Engine. ICDE 1996: 172-179
  12. Luis Gravano, Hector Garcia-Molina: Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies. VLDB 1995: 78-89
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:40:20 2009