ACM SIGMOD Anthology ACM SIGMOD dblp.uni-trier.de

Semistructured Data.

Peter Buneman: Semistructured Data. PODS 1997: 117-121
@inproceedings{DBLP:conf/pods/Buneman97,
  author    = {Peter Buneman},
  title     = {Semistructured Data},
  booktitle = {Proceedings of the Sixteenth ACM SIGACT-SIGMOD-SIGART Symposium
               on Principles of Database Systems, May 12-14, 1997, Tucson, Arizona},
  publisher = {ACM Press},
  year      = {1997},
  isbn      = {0-89791-910-6},
  pages     = {117-121},
  ee        = {http://doi.acm.org/10.1145/263661.263675, db/conf/pods/Buneman97.html},
  crossref  = {DBLP:conf/pods/97},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX

Abstract

In semistructured data, the information that is normally associated with a schema is contained within the data, which is sometimes called "self-describing". In some forms of semistructured data there is no separate schema, in others it exists but only places loose constraints on the data. Semistructured data has recently emerged as an important topic of study for a variety of reasons. First, there are data sources such as the Web, which we would like to treat as databases but which cannot be constrained by a schema. Second, it may be desirable to have an extremely flexible format for data exchange between disparate databases. Third, even when dealing with structured data, it may be helpful to view it as semistructured for the purposes of browsing. This tutorial will cover a number of issues surrounding such data: finding a concise formulation, building a sufficiently expressive language for querying and transformation, and optimization problems.

Copyright © 1997 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.


Load The ACM SIGMOD Anthology, CDROM Edition, Volume 1-3, PODS '82-'98. and ... Load The ACM SIGMOD Anthology, Silver Edition, DVD 1, Proceedings. and ... BibTeX

Printed Edition

Proceedings of the Sixteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, May 12-14, 1997, Tucson, Arizona. ACM Press 1997, ISBN 0-89791-910-6
Contents BibTeX

Online Edition: ACM Digital Library

[Index Terms]
[Full Text in PDF Format, 843 KB]

Slides

Slides of the Tutorial

References

[1]
Serge Abiteboul: Querying Semi-Structured Data. ICDT 1997: 1-18 BibTeX
[2]
Serge Abiteboul, Sophie Cluet, Vassilis Christophides, Tova Milo, Guido Moerkotte, Jérôme Siméon: Querying Documents in Object Databases. Int. J. on Digital Libraries 1(1): 5-19(1997) BibTeX
[3]
Serge Abiteboul, Sophie Cluet, Tova Milo: Querying and Updating the File. VLDB 1993: 73-84 BibTeX
[4]
...
[5]
Serge Abiteboul, Dallan Quass, Jason McHugh, Jennifer Widom, Janet L. Wiener: The Lorel Query Language for Semistructured Data. Int. J. on Digital Libraries 1(1): 68-88(1997) BibTeX
[6]
Serge Abiteboul, Victor Vianu: Queries and Computation on the Web. ICDT 1997: 262-275 BibTeX
[7]
...
[8]
Peter Buneman, Susan B. Davidson, Mary F. Fernandez, Dan Suciu: Adding Structure to Unstructured Data. ICDT 1997: 336-350 BibTeX
[9]
Peter Buneman, Susan B. Davidson, Kyle Hart, G. Christian Overton, Limsoon Wong: A Data Transformation System for Biological Data Sources. VLDB 1995: 158-169 BibTeX
[10]
Peter Buneman, Susan B. Davidson, Gerd G. Hillebrand, Dan Suciu: A Query Language and Optimization Techniques for Unstructured Data. SIGMOD Conference 1996: 505-516 BibTeX
[11]
Peter Buneman, Susan B. Davidson, Dan Suciu: Programming Constructs for Unstructured Data. DBPL 1995: 12 BibTeX
[12]
Peter Buneman, Shamim A. Naqvi, Val Tannen, Limsoon Wong: Principles of Programming with Complex Objects and Collection Types. Theor. Comput. Sci. 149(1): 3-48(1995) BibTeX
[13]
R. G. G. Cattell: The Object Database Standard: ODMG-93 (Release 1.2). Morgan Kaufmann 1996
BibTeX
[14]
Sophie Cluet, Claude Delobel: A General Framework for the Optimization of Object-Oriented Queries. SIGMOD Conference 1992: 383-392 BibTeX
[15]
...
[16]
Mariano P. Consens, Alberto O. Mendelzon: Expressing Structural Hypertext Queries in GraphLog. Hypertext 1989: 269-292 BibTeX
[17]
Susan B. Davidson, G. Christian Overton, Val Tannen, Limsoon Wong: BioKleisli: A Digital Library for Biomedical Researchers. Int. J. on Digital Libraries 1(1): 36-53(1997) BibTeX
[18]
Mary F. Fernández, Daniela Florescu, Jaewoo Kang, Alon Y. Levy, Dan Suciu: STRUDEL: A Web-site Management System. SIGMOD Conference 1997: 549-552 BibTeX
[19]
Mary F. Fernandez, Lucian Popa, Dan Suciu: A Structure-Based Approach to Querying Semi-Structured Data. DBPL 1997: 136-159 BibTeX
[20]
...
[21]
Hector Garcia-Molina, Dallan Quass, Yannis Papakonstantinou, Anand Rajaraman, Yehoshua Sagiv, Jeffrey D. Ullman, Jennifer Widom: The TSIMMIS Approach to Mediation: Data Models and Languages. NGITS 1995: 0- BibTeX
[22]
Roy Goldman, Jennifer Widom: DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. VLDB 1997: 436-445 BibTeX
[23]
...
[24]
Michael Kifer, Won Kim, Yehoshua Sagiv: Querying Object-Oriented Databases. SIGMOD Conference 1992: 393-402 BibTeX
[25]
...
[26]
Laks V. S. Lakshmanan, Fereidoon Sadri, Iyer N. Subramanian: A Declarative Language for Querying and Restructuring the WEB. RIDE-NDS 1996: 12-21 BibTeX
[27]
Jason McHugh, Serge Abiteboul, Roy Goldman, Dallan Quass, Jennifer Widom: Lore: A Database Management System for Semistructured Data. SIGMOD Record 26(3): 54-66(1997) BibTeX
[28]
Jason McHugh, Jennifer Widom: Integrating Dynamically-Fetched External Information into a DBMS for Semistructured Data. SIGMOD Record 26(4): 24-31(1997) BibTeX
[29]
Alberto O. Mendelzon, George A. Mihaila, Tova Milo: Querying the World Wide Web. PDIS 1996: 80-91 BibTeX
[30]
Alberto O. Mendelzon, Tova Milo: Formal Models of Web Queries. PODS 1997: 134-143 BibTeX
[31]
Svetlozar Nestorov, Jeffrey D. Ullman, Janet L. Wiener, Sudarshan S. Chawathe: Representative Objects: Concise Representations of Semistructured, Hierarchial Data. ICDE 1997: 79-90 BibTeX
[32]
Yannis Papakonstantinou, Serge Abiteboul, Hector Garcia-Molina: Object Fusion in Mediator Systems. VLDB 1996: 413-424 BibTeX
[33]
Yannis Papakonstantinou, Hector Garcia-Molina, Jennifer Widom: Object Exchange Across Heterogeneous Information Sources. ICDE 1995: 251-260 BibTeX
[34]
Dallan Quass, Anand Rajaraman, Yehoshua Sagiv, Jeffrey D. Ullman, Jennifer Widom: Querying Semistructured Heterogeneous Information. DOOD 1995: 319-344 BibTeX
[35]
Dan Suciu: Query Decomposition and View Maintenance for Query Languages for Unstructured Data. VLDB 1996: 227-238 BibTeX
[36]
...

Referenced by

  1. Holger Meuss, Klaus U. Schulz, François Bry: Towards Aggregated Answers for Semistructured Data. ICDT 2001: 346-360
  2. Gösta Grahne, Alex Thomo: Algebraic Rewritings for Optimizing Regular Path Queries. ICDT 2001: 301-315
  3. Yannis Papakonstantinou, Victor Vianu: DTD Inference for Views of XML Data. PODS 2000: 35-46
  4. Hasan Davulcu, Guizhen Yang, Michael Kifer, I. V. Ramakrishnan: Computational Aspects of Resilient Data Extraction from Semistructured Sources. PODS 2000: 136-144
  5. Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, Moshe Y. Vardi: View-Based Query Processing for Regular Path Queries with Inverse. PODS 2000: 58-66
  6. Qiu Yue Wang, Jeffrey Xu Yu, Kam-Fai Wong: Approximate Graph Schema Extraction for Semi-Structured Data. EDBT 2000: 302-316
  7. Sihem Amer-Yahia, H. V. Jagadish, Laks V. S. Lakshmanan, Divesh Srivastava: On Bounding-Schemas for LDAP Directories. EDBT 2000: 287-301
  8. Serge Abiteboul: On Views and XML. SIGMOD Record 28(4): 30-38(1999)
  9. Alin Deutsch, Mary F. Fernández, Daniela Florescu, Alon Y. Levy, David Maier, Dan Suciu: Querying XML Data. IEEE Data Eng. Bull. 22(3): 10-18(1999)
  10. Jason McHugh, Jennifer Widom: Query Optimization for XML. VLDB 1999: 315-326
  11. Curtis E. Dyreson, Michael H. Böhlen, Christian S. Jensen: Capturing and Querying Multiple Aspects of Semistructured Data. VLDB 1999: 290-301
  12. Yaron Kanza, Werner Nutt, Yehoshua Sagiv: Queries with Incomplete Answers over Semistructured Data. PODS 1999: 227-236
  13. Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, Moshe Y. Vardi: Rewriting of Regular Expressions and Regular Path Queries. PODS 1999: 194-204
  14. Peter Buneman, Wenfei Fan, Scott Weinstein: Interaction between Path and Type Constraints. PODS 1999: 56-67
  15. Serge Abiteboul: On Views and XML. PODS 1999: 1-9
  16. Zoé Lacroix: Object Views through Search Views of Web Datasources. ER 1999: 176-187
  17. Georges Gardarin, Fei Sha, Tuyet-Tram Dang-Ngoc: XML-based Components for Federating Multiple Heterogeneous Data Sources. ER 1999: 506-519
  18. Silvana Castano, Valeria De Antonellis: Building Views over Semistructured Data Sources. ER 1999: 146-160
  19. Daniela Florescu, Alon Y. Levy, Alberto O. Mendelzon: Database Techniques for the World-Wide Web: A Survey. SIGMOD Record 27(3): 59-74(1998)
  20. Mary F. Fernandez, Daniela Florescu, Alon Y. Levy, Dan Suciu: Web-Site Management: The Strudel Approach. IEEE Data Eng. Bull. 21(2): 14-20(1998)
  21. Serge Abiteboul, Jason McHugh, Michael Rys, Vasilis Vassalos, Janet L. Wiener: Incremental Maintenance for Materialized Views over Semistructured Data. VLDB 1998: 38-49
  22. Svetlozar Nestorov, Serge Abiteboul, Rajeev Motwani: Extracting Schema from Semistructured Data. SIGMOD Conference 1998: 295-306
  23. Mary F. Fernández, Daniela Florescu, Jaewoo Kang, Alon Y. Levy, Dan Suciu: Catching the Boat with Strudel: Experiences with a Web-Site Management System. SIGMOD Conference 1998: 414-425
  24. Daniela Florescu, Alon Y. Levy, Dan Suciu: Query Containment for Conjunctive Queries with Regular Expressions. PODS 1998: 139-148
  25. Richard Hull: Managing Semantic Heterogeneity in Databases: A Theoretical Perspective. PODS 1997: 51-61
BibTeX
ACM SIGMOD Anthology - DBLP: [Home | Search: Author, Title | Conferences | Journals]
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:34:17 2009