Integrating contents and structure in text retrieval

1 March 1996

journal article
Published by Association for Computing Machinery (ACM) in ACM SIGMOD Record

Vol. 25 (1), 67-79
https://doi.org/10.1145/381854.381890

Abstract

The purpose of a textual database is to store textual documents. These documents have not only textual contents, but also structure. Many traditional text database systems have focused only on querying by contents or by structure. Recently, a number of models integrating both types of queries have appeared. We argue in favor of that integration, and focus our attention on these recent models, covering a representative sampling of the proposals in the field. We pay special attention to the tradeoffs between expressiveness and efficiency, showing the compromises taken by the models. We argue in favor of achieving a good compromise, since being weak in any of these two aspects makes the model useless for many applications.

Keywords

This publication has 22 references indexed in Scilit:

Ordered and Unordered Tree Inclusion
SIAM Journal on Computing, 1995
An Algebra for Structured Text Search and a Framework for its Implementation
The Computer Journal, 1995
Text databases
ACM SIGMOD Record, 1994
Shortening the OED
ACM Transactions on Information Systems, 1992
An algebra for hierarchically organized text-dominated databases
Information Processing & Management, 1992
A Query Language for Retrieving Information from Hierarchic Text Structures
The Computer Journal, 1991
Storage and retrieval of structured documents
Information Processing & Management, 1990
Query processing in a multimedia document system
ACM Transactions on Information Systems, 1988
Semantic database modeling: survey, applications, and research issues
ACM Computing Surveys, 1987
Document processing in a relational database system
ACM Transactions on Information Systems, 1983

Cited by 65 articles