Integrating IR and RDBMS Using Cooperative Indexing.

Samuel DeFazio, Amjad M. Daoud, Lisa Ann Smith, Jagannathan Srinivasan, W. Bruce Croft, James P. Callan: Integrating IR and RDBMS Using Cooperative Indexing. SIGIR 1995: 84-92
The full integration of information retrieval (IR) features into a database management system (DBMS) has long been recognized as both a significant goal and a challenging undertaking. By full integration we mean:
  1. support for document storage, indexing, retrieval, and update,
  2. transaction semantics, so that all database operations on documents have the ACID properties of atomicity, consistency, isolation, and durability,
  3. concurrent addition, update, and retrieval of documents, and
  4. database query language extensions to provide ranking for document retrieval operations.
It is also necessary for the integrated offering to exhibit scalable performance for document indexing and retrieval processes. To identify the implementation requirements imposed by the desired level of integration, we layered a representative IR application on Oracle Rdb and then conducted a number of database load and document retrieval experiments. The results of these experiments suggest that infrastructural extensions are necessary to obtain both the desired level of IR integration and scalable performance. With the insight gained from our initial experiments, we developed an approach, called cooperative indexing, that provides a framework to achieve both scalability and full integration of IR and RDBMS technology. Prototype implementations of system-level extensions to support cooperative indexing were evaluated with a modified version of Oracle Rdb. Our experimental findings validate the cooperative indexing scheme and suggest alternatives to further improve performance.

Copyright © 1995 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.

