Digital Symposium Collection 2000

Storing Semistructured Data with STORED

Alin Deutsch, Mary F. Fernandez, and Dan Suciu


Abstract
Systems for managing and querying semistructured-data sources often store data in proprietary object repositories or in a tagged-text format. We describe a technique that can use relational database management systems to store and manage semistructured data. Our technique relies on a mapping between the semistructured data model and the relational data model, expressed in a query language called STORED. When a semistructured data instance is given, a STORED mapping can be generated automatically using data-mining techniques. We are interested in applying STORED to XML data, which is an instance of semistructured data. We show how a document-type-descriptor (DTD), when present, can be exploited to further improve performance.
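
The following is a minimal illustrative sketch of the general idea in the abstract, not the STORED language or the paper's mining algorithm. It assumes that each semistructured object is a flat Python dictionary, that a simple frequency threshold can stand in for the data-mining step, and that attributes not covered by the relational schema go to a generic side table. The names `frequent_attributes`, `store`, `main_rel`, `overflow`, and `min_support` are all hypothetical and chosen only for this example.

```python
import sqlite3
from collections import Counter


def frequent_attributes(objects, min_support=0.5):
    """Return the attributes present in at least min_support of the objects.

    Assumption: a plain frequency count stands in for a real mining step.
    """
    counts = Counter(attr for obj in objects for attr in obj)
    threshold = min_support * len(objects)
    return sorted(a for a, c in counts.items() if c >= threshold)


def store(objects, min_support=0.5):
    """Load objects into an in-memory SQLite database.

    Frequent attributes become columns of the table main_rel (hypothetical
    name); every other attribute is kept as an (oid, attr, value) row in the
    table overflow, so no data is lost. Attribute names are assumed not to
    contain double quotes.
    """
    cols = frequent_attributes(objects, min_support)
    conn = sqlite3.connect(":memory:")
    defs = ["oid INTEGER PRIMARY KEY"] + [f'"{c}" TEXT' for c in cols]
    conn.execute(f"CREATE TABLE main_rel ({', '.join(defs)})")
    conn.execute("CREATE TABLE overflow (oid INTEGER, attr TEXT, value TEXT)")
    placeholders = ", ".join("?" for _ in range(len(cols) + 1))
    for oid, obj in enumerate(objects):
        # Missing frequent attributes are stored as NULL.
        row = [oid] + [obj.get(c) for c in cols]
        conn.execute(f"INSERT INTO main_rel VALUES ({placeholders})", row)
        for attr, value in obj.items():
            if attr not in cols:
                conn.execute(
                    "INSERT INTO overflow VALUES (?, ?, ?)",
                    (oid, attr, str(value)),
                )
    return conn, cols


# Made-up sample data: three objects with irregular structure.
people = [
    {"name": "Ann", "phone": "555-0100"},
    {"name": "Bob", "phone": "555-0101", "fax": "555-0102"},
    {"name": "Cat", "email": "cat@example.org"},
]
conn, cols = store(people, min_support=0.6)
print(cols)
print(conn.execute("SELECT * FROM main_rel ORDER BY oid").fetchall())
print(conn.execute("SELECT * FROM overflow ORDER BY oid").fetchall())
```

In this sketch, raising `min_support` yields a narrower relational table and pushes more attributes into the side table, while lowering it does the opposite at the cost of more NULL values.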


References


[1]
Serge Abiteboul, Dallan Quass, Jason McHugh, Jennifer Widom, Janet L. Wiener: The Lorel Query Language for Semistructured Data. Int. J. on Digital Libraries 1(1): 68-88 (1997)
[2]
Rakesh Agrawal, Tomasz Imielinski, Arun N. Swami: Mining Association Rules between Sets of Items in Large Databases. SIGMOD Conference 1993: 207-216
[3]
Catriel Beeri, Tova Milo: Schemas for Integration and Translation of Structured and Semi-structured Data. ICDT 1999: 296-313
[4]
Peter Buneman, Susan B. Davidson, Gerd G. Hillebrand, Dan Suciu: A Query Language and Optimization Techniques for Unstructured Data. SIGMOD Conference 1996: 505-516
[5]
Vassilis Christophides, Serge Abiteboul, Sophie Cluet, Michel Scholl: From Structured Documents to Novel Query Facilities. SIGMOD Conference 1994: 313-324
[6]
Mary F. Fernandez, Daniela Florescu, Jaewoo Kang, Alon Y. Levy, Dan Suciu: Catching the Boat with Strudel: Experiences with a Web-Site Management System. SIGMOD Conference 1998: 414-425
[7]
Mary F. Fernandez, Daniela Florescu, Alon Y. Levy, Dan Suciu: A Query Language for a Web-Site Management System. SIGMOD Record 26(3): 4-11 (1997)
[8]
M. R. Garey, David S. Johnson: Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman 1979, ISBN 0-7167-1044-7
[9]
...
[10]
Klemens Böhm, Karl Aberer, Erich J. Neuhold, Xiaoya Yang: Structured Document Storage and Refined Declarative and Navigational Access Mechanisms in HyperStorM. VLDB Journal 6(4): 296-311 (1997)
[11]
Alon Y. Levy, Alberto O. Mendelzon, Yehoshua Sagiv, Divesh Srivastava: Answering Queries Using Views. PODS 1995: 95-104
[12]
Marc Volz, Karl Aberer, Klemens Böhm: Applying a Flexible OODBMS-IRS-Coupling for Structured Document Handling. ICDE 1996: 10-19
[13]
Svetlozar Nestorov, Serge Abiteboul, Rajeev Motwani: Extracting Schema from Semistructured Data. SIGMOD Conference 1998: 295-306
[14]
Oliver M. Duschka, Michael R. Genesereth: Answering Recursive Queries Using Views. PODS 1997: 109-116
[15]
Yannis Papakonstantinou, Serge Abiteboul, Hector Garcia-Molina: Object Fusion in Mediator Systems. VLDB 1996: 413-424
[16]
Yannis Papakonstantinou, Hector Garcia-Molina, Jennifer Widom: Object Exchange Across Heterogeneous Information Sources. ICDE 1995: 251-260
[17]
Dallan Quass, Anand Rajaraman, Yehoshua Sagiv, Jeffrey D. Ullman, Jennifer Widom: Querying Semistructured Heterogeneous Information. DOOD 1995: 319-344
[18]
Dimitri Theodoratos, Timos K. Sellis: Data Warehouse Configuration. VLDB 1997: 126-135
[19]
Odysseas G. Tsatalos, Marvin H. Solomon, Yannis E. Ioannidis: The GMAP: A Versatile Tool for Physical Data Independence. VLDB 1994: 367-378
[20]
Jeffrey D. Ullman: Principles of Database and Knowledge-Base Systems, Volume II. Computer Science Press 1989, ISBN 0-7167-8162-X
[21]
Ke Wang, Huiqing Liu: Discovering Typical Structures of Documents: A Road Map Approach. SIGIR 1998: 146-154
[22]
Tian Zhang, Raghu Ramakrishnan, Miron Livny: BIRCH: An Efficient Data Clustering Method for Very Large Databases. SIGMOD Conference 1996: 103-114

Referenced by

  1. Jayavel Shanmugasundaram, Kristin Tufte, Chun Zhang, Gang He, David J. DeWitt, Jeffrey F. Naughton: Relational Databases for Querying XML Documents: Limitations and Opportunities. VLDB 1999: 302-314

BIBTEX

@inproceedings{DBLP:conf/sigmod/DeutschFS99,
  author    = {Alin Deutsch and
               Mary F. Fernandez and
               Dan Suciu},
  editor    = {Alex Delis and
               Christos Faloutsos and
               Shahram Ghandeharizadeh},
  title     = {Storing Semistructured Data with STORED},
  booktitle = {SIGMOD 1999, Proceedings ACM SIGMOD International Conference
               on Management of Data, June 1-3, 1999, Philadelphia, Pennsylvania,
               USA},
  publisher = {ACM Press},
  year      = {1999},
  isbn      = {1-58113-084-8},
  pages     = {431-442},
  crossref  = {DBLP:conf/sigmod/99},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Copyright (C) 2000 ACM