ACM SIGMOD Anthology VLDB dblp.uni-trier.de

Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies.

Luis Gravano, Hector Garcia-Molina: Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies. VLDB 1995: 78-89
@inproceedings{DBLP:conf/vldb/GravanoG95,
  author    = {Luis Gravano and
               Hector Garcia-Molina},
  editor    = {Umeshwar Dayal and
               Peter M. D. Gray and
               Shojiro Nishio},
  title     = {Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies},
  booktitle = {VLDB'95, Proceedings of 21th International Conference on Very
               Large Data Bases, September 11-15, 1995, Zurich, Switzerland},
  publisher = {Morgan Kaufmann},
  year      = {1995},
  isbn      = {1-55860-379-4},
  pages     = {78-89},
  ee        = {db/conf/vldb/GravanoG95.html},
  crossref  = {DBLP:conf/vldb/95},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX

Abstract

As large numbers of text databases have become available on the Internet, it is harder to locate the right sources for given queries. In this paper we present gGlOSS, a generalized Glossary-Of-Servers Server, that keeps statistics on the available databases to estimate which databases are the potentially most useful for a given query. gGlOSS extends our previous work [l], which focused on databases using the boolean model of document retrieval, to cover databases using the more sophisticated vector-space retrieval model. We evaluate our new techniques using real-user queries and 53 databases. Finally, we further generalize our approach by showing how to build a hierarchy of gGlOSS brokers. The top level of the hierarchy is so small it could be widely replicated, even at end-user workstations.

Copyright © 1995 by the VLDB Endowment. Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by the permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.


Online Paper

ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 1 Issue 5, VLDB '89-'97" and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ... BibTeX

Printed Edition

Umeshwar Dayal, Peter M. D. Gray, Shojiro Nishio (Eds.): VLDB'95, Proceedings of 21th International Conference on Very Large Data Bases, September 11-15, 1995, Zurich, Switzerland. Morgan Kaufmann 1995, ISBN 1-55860-379-4
Contents BibTeX

References

[1]
Luis Gravano, Hector Garcia-Molina, Anthony Tomasic: The Effectiveness of GlOSS for the Text Database Discovery Problem. SIGMOD Conference 1994: 126-137 BibTeX
[2]
Michael F. Schwartz, Alan Emtage, Brewster Kahle, B. Clifford Neuman: A Comparison of Internet Resource Discovery Approaches. Computing Systems 5(4): 461-493(1992) BibTeX
[3]
Katia Obraczka, Peter B. Danzig, Shih-Hao Li: Internet Resource Discovery Services. IEEE Computer 26(9): 8-22(1993) BibTeX
[4]
Luis Gravano, Hector Garcia-Molina, Anthony Tomasic: Precision and Recall of GlOSS Estimators for Database Discovery. PDIS 1994: 103-106 BibTeX
[5]
Gerard Salton, Michael McGill: Introduction to Modern Information Retrieval. McGraw-Hill Book Company 1984, ISBN 0-07-054484-0
BibTeX
[6]
Gerard Salton: Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley 1989, ISBN 0-201-12227-8
BibTeX
[7]
B. Clifford Neuman: The Prospero File System: A Global File System Based on the Virtual System Model. Computing Systems 5(4): 407-432(1992) BibTeX
[8]
Tim Berners-Lee, Robert Cailliau, Jean-François Groff, Bernd Pollermann: World-Wide Web: The Information Universe. Electronic Networking: Research, Applications and Policy 1(2): 74-82(1992) BibTeX
[9]
...
[10]
James P. Callan, Zhihong Lu, W. Bruce Croft: Searching Distributed Collections with Inference Networks. SIGIR 1995: 21-28 BibTeX
[11]
Mark A. Sheldon, Andrzej Duda, Ron Weiss, James O'Toole, David K. Gifford: Content Routing for Distributed Information Servers. EDBT 1994: 109-122 BibTeX
[12]
Andrzej Duda, Mark A. Sheldon: Content Routing in a Network of WAIS Servers. ICDCS 1994: 124-132 BibTeX
[13]
...
[14]
Anthony Tomasic, Luis Gravano, Calvin Lue, Peter M. Schwarz, Laura M. Haas: Data Structures for Efficient Broker Implementation. ACM Trans. Inf. Syst. 15(3): 223-253(1997) BibTeX
[15]
Tak W. Yan, Hector Garcia-Molina: SIFT - a Tool for Wide-Area Information Dissemination. USENIX Winter 1995: 177-186 BibTeX
[16]
...
[17]
Luis Gravano, Hector Garcia-Molina, Anthony Tomasic: Precision and Recall of GlOSS Estimators for Database Discovery. PDIS 1994: 103-106 BibTeX
[18]
Ellen M. Voorhees, Narendra Kumar Gupta, Ben Johnson-Laird: The Collection Fusion Problem. TREC 1994: 0- BibTeX
[19]
Alistair Moffat, Justin Zobel: Information Retrieval Systems for Large Document Collections. TREC 1994: 0- BibTeX

Referenced by

  1. Luis Gravano, Hector Garcia-Molina, Anthony Tomasic: GlOSS: Text-Source Discovery over the Internet. ACM Trans. Database Syst. 24(2): 229-264(1999)
  2. James P. Callan, Margaret E. Connell, Aiqun Du: Automatic Discovery of Language Models for Text Databases. SIGMOD Conference 1999: 479-490
  3. Weiyi Meng, King-Lup Liu, Clement T. Yu, Wensheng Wu, Naphtali Rishe: Estimating the Usefulness of Search Engines. ICDE 1999: 146-153
  4. Yong S. Choi, Suk I. Yoo: Neural Net Agent for Discovering Text Databases on the Web. ADBIS (Short Papers) 1999: 221-231
  5. Wendy Chang, Gholamhosein Sheikholeslami, Jia Wang, Aidong Zhang: Data Resource Selection in Distributed Visual Information Systems. IEEE Trans. Knowl. Data Eng. 10(6): 926-946(1998)
  6. Shih-Fu Chang, Luis Gravano, Gail E. Kaiser, Kenneth A. Ross, Salvatore J. Stolfo: Database Research at Columbia University. SIGMOD Record 27(3): 75-80(1998)
  7. Luis Gravano, Yannis Papakonstantinou: Mediating and Metasearching on the Internet. IEEE Data Eng. Bull. 21(2): 28-36(1998)
  8. Weiyi Meng, King-Lup Liu, Clement T. Yu, Xiaodong Wang, Yuhsi Chang, Naphtali Rishe: Determining Text Databases to Search in the Internet. VLDB 1998: 14-25
  9. Wendy Chang, Deepak Murthy, Aidong Zhang, Tanveer Fathima Syeda-Mahmood: Global Integration of Visual Databases. ICDE 1998: 542-549
  10. Luis Gravano, Kevin Chen-Chuan Chang, Hector Garcia-Molina, Andreas Paepcke: STARTS: Stanford Proposal for Internet Meta-Searching (Experience Paper). SIGMOD Conference 1997: 207-218
  11. Budi Yuwono, Dik Lun Lee: Server Ranking for Distributed Text Retrieval Systems on the Internet. DASFAA 1997: 41-50
  12. Ron Dolin, Divyakant Agrawal, Amr El Abbadi: Classifying Network Architectures for Locating Information Sources. DASFAA 1997: 31-40
  13. Surajit Chaudhuri, Luis Gravano: Optimizing Queries over Multimedia Repositories. SIGMOD Conference 1996: 91-102
BibTeX
ACM SIGMOD Anthology - DBLP: [Home | Search: Author, Title | Conferences | Journals]
VLDB Proceedings: Copyright © by VLDB Endowment,
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:46:04 2009