Extended Boolean information retrieval
- 1 November 1983
- journal article
- Published by Association for Computing Machinery (ACM) in Communications of the ACM
- Vol. 26 (11), 1022-1036
- https://doi.org/10.1145/182.358466
Abstract
In conventional information retrieval, Boolean combinations of index terms are used to formulate the users' information requests. While any document is in principle retrievable by a Boolean query, the amount of output obtainable by Boolean processing is difficult to control, and the retrieved items are not ranked in any presumed order of importance to the user population. In the vector processing model of retrieval, the retrieved items are easily ranked in decreasing order of the query-record similarity, but the queries themselves are unstructured and expressed as simple sets of weighted index terms. A new, extended Boolean information retrieval system is introduced which is intermediate between the Boolean system of query processing and the vector processing model. The query structure inherent in the Boolean system is preserved, while at the same time weighted terms may be incorporated into both queries and stored documents; the retrieved output can also be ranked in strict similarity order with the user queries. A conventional retrieval system can be modified to make use of the extended system. Laboratory tests indicate that the extended system produces better retrieval output than either the Boolean or the vector processing systems.
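The abstract does not state the similarity formulas, so the following is only a minimal sketch of how such an intermediate model might behave. It assumes the p-norm formulation commonly associated with the extended Boolean model, with unweighted query terms and document term weights in [0, 1]. The function names, the parameter `p`, and the sample documents are illustrative assumptions, not taken from this record.

```python
from typing import Sequence


def or_similarity(weights: Sequence[float], p: float = 2.0) -> float:
    """Similarity of a document to the query (t1 OR ... OR tn).

    `weights` holds the document's weights (each in [0, 1]) for the
    query terms; `p` >= 1 is the assumed p-norm parameter.
    """
    if not weights:
        raise ValueError("at least one term weight is required")
    n = len(weights)
    # Normalised distance from the all-zero point (no term present).
    return (sum(w ** p for w in weights) / n) ** (1.0 / p)


def and_similarity(weights: Sequence[float], p: float = 2.0) -> float:
    """Similarity of a document to the query (t1 AND ... AND tn)."""
    if not weights:
        raise ValueError("at least one term weight is required")
    n = len(weights)
    # One minus the normalised distance from the all-one point
    # (every term fully present).
    return 1.0 - (sum((1.0 - w) ** p for w in weights) / n) ** (1.0 / p)


# Hypothetical documents: weights for two query terms (t1, t2).
docs = {
    "d1": [1.0, 0.0],
    "d2": [0.5, 0.5],
    "d3": [0.0, 0.0],
    "d4": [1.0, 1.0],
}

# Rank documents for "t1 AND t2" and for "t1 OR t2" with p = 2.
ranked_and = sorted(docs, key=lambda d: and_similarity(docs[d]), reverse=True)
ranked_or = sorted(docs, key=lambda d: or_similarity(docs[d]), reverse=True)
```

Under these assumptions, p = 1 makes the AND and OR scores coincide (both become the average term weight, as in an unstructured vector-style query), while large p pushes OR toward the maximum weight and AND toward the minimum, which approaches strict Boolean behaviour. Intermediate values such as p = 2 keep the query structure and still yield a graded ranking: the AND query prefers d2 over d1, and the OR query prefers d1 over d2.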
This publication has 12 references indexed in Scilit:
- Threshold values and Boolean retrieval systems. Information Processing & Management, 1981
- A comparison of two systems of weighted Boolean retrieval. Journal of the American Society for Information Science, 1981
- Fuzzy requests: An approach to weighted Boolean searches. Journal of the American Society for Information Science, 1980
- The use of automatic relevance feedback in Boolean retrieval systems. Journal of Documentation, 1980
- Automatic indexing using term discrimination and term precision measurements. Information Processing & Management, 1976
- Relevance weighting of search terms. Journal of the American Society for Information Science, 1976
- Precision Weighting—An Effective Automatic Indexing Method. Journal of the ACM, 1976
- A new comparison between conventional indexing (MEDLARS) and automatic text processing (SMART). Journal of the American Society for Information Science, 1972
- A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 1972
- A comparison between manual and automatic indexing methods. American Documentation, 1969