Algorithms for parsing search queries in systems with inverted file organization
- 1 December 1976
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems
- Vol. 1 (4), 299-316
- https://doi.org/10.1145/320493.320490
Abstract
In an inverted file system a query is in the form of a Boolean expression of index terms. In response to a query the system accesses the inverted lists corresponding to the index terms, merges them, and selects from the merged list those records that satisfy the search logic. Considered in this paper is the problem of determining a Boolean expression which leads to the minimum total merge time among all Boolean expressions that are equivalent to the expression given in the query. This problem is the same as finding an optimal merge tree among all trees that realize the truth function determined by the Boolean expression in the query. Several algorithms are described which generate optimal merge trees when the sizes of overlaps between different lists are small compared with the length of the lists.Keywords
This publication has 5 references indexed in Scilit:
- Probabilistic models of inverted file information retrieval systemsPublished by Association for Computing Machinery (ACM) ,1976
- Analysis and performance of inverted data base structuresCommunications of the ACM, 1975
- A formal system for information retrieval from filesCommunications of the ACM, 1970
- The Influence of Data Base Characteristics and Usage on Direct Access File OrganizationJournal of the ACM, 1968
- Man-computer problem solving with multilistProceedings of the IEEE, 1966