Integrating contents and structure in text retrieval
- 1 March 1996
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMOD Record
- Vol. 25 (1), 67-79
- https://doi.org/10.1145/381854.381890
Abstract
The purpose of a textual database is to store textual documents. These documents have not only textual contents, but also structure. Many traditional text database systems have focused only on querying by contents or by structure. Recently, a number of models integrating both types of queries have appeared. We argue in favor of that integration, and focus our attention on these recent models, covering a representative sampling of the proposals in the field. We pay special attention to the tradeoffs between expressiveness and efficiency, showing the compromises taken by the models. We argue in favor of achieving a good compromise, since being weak in any of these two aspects makes the model useless for many applications.Keywords
This publication has 22 references indexed in Scilit:
- Ordered and Unordered Tree InclusionSIAM Journal on Computing, 1995
- An Algebra for Structured Text Search and a Framework for its ImplementationThe Computer Journal, 1995
- Text databasesACM SIGMOD Record, 1994
- Shortening the OEDACM Transactions on Information Systems, 1992
- An algebra for hierarchically organized text-dominated databasesInformation Processing & Management, 1992
- A Query Language for Retrieving Information from Hierarchic Text StructuresThe Computer Journal, 1991
- Storage and retrieval of structured documentsInformation Processing & Management, 1990
- Query processing in a multimedia document systemACM Transactions on Information Systems, 1988
- Semantic database modeling: survey, applications, and research issuesACM Computing Surveys, 1987
- Document processing in a relational database systemACM Transactions on Information Systems, 1983