Implementing ranking strategies using text signatures
- 1 January 1988
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Information Systems
- Vol. 6 (1), 42-62
- https://doi.org/10.1145/42279.45947
Abstract
Signature files provide an efficient access method for text in documents, but retrieval is usually limited to finding documents that contain a specified Boolean pattern of words. Effective retrieval requires that documents with similar meanings be found through a process of plausible inference. The simplest way of implementing this retrieval process is to rank documents in order of their probability of relevance. In this paper techniques are described for implementing probabilistic ranking strategies with sequential and bit-sliced signature tiles and the limitations of these implementations with regard to their effectiveness are pointed out. A detailed comparison is made between signature-based ranking techniques and ranking using term-based document representatives and inverted files. The comparison shows that term-based representations are at least competitive (in terms of efficiency) with signature files and, in some situations, superior.Keywords
This publication has 17 references indexed in Scilit:
- Description and performance analysis of signature file methods for office filingACM Transactions on Information Systems, 1987
- Parallel free-text search on the connection machine systemCommunications of the ACM, 1986
- A non-classical logic for information retrievalThe Computer Journal, 1986
- A comparison of a network structure and a database system used for document retrievalInformation Systems, 1985
- Signature filesACM Transactions on Information Systems, 1984
- A two level superimposed coding scheme for partial match retrievalInformation Systems, 1983
- Message filesACM Transactions on Information Systems, 1983
- Experiments with automatic text filing and retrieval in the office environmentACM SIGIR Forum, 1982
- Document representation in probabilistic models of information retrievalJournal of the American Society for Information Science, 1981
- THE PROBABILITY RANKING PRINCIPLE IN IRJournal of Documentation, 1977