Optimal partial-match retrieval when fields are independently specified
- 1 June 1979
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems
- Vol. 4 (2), 168-179
- https://doi.org/10.1145/320071.320074
Abstract
This paper considers the design of a system to answer partial-match queries from a file containing a collection of records, each record consisting of a sequence of fields. A partial-match query is a specification of values for zero or more fields of a record, and the answer to a query is a listing of all records in the file whose fields match the specified values. A design is considered in which the file is stored in a set of bins. A formula is derived for the optimal number of bits in a bin address to assign to each field, assuming the probability that a given field is specified in a query is independent of what other fields are specified. Implications of the optimality criterion on the size of bins are also discussed.Keywords
This publication has 8 references indexed in Scilit:
- Hashing and trie algorithms for partial match retrievalACM Transactions on Database Systems, 1976
- Newton-like methods for two-point boundary value problemsBIT Numerical Mathematics, 1976
- Partial-Match Retrieval AlgorithmsSIAM Journal on Computing, 1976
- Sorting and Searching in MultisetsSIAM Journal on Computing, 1976
- Efficient string matchingCommunications of the ACM, 1975
- Attribute based file organization in a paged memory environmentCommunications of the ACM, 1974
- Implementation of the substring test by hashingCommunications of the ACM, 1971
- Programming Techniques: Regular expression search algorithmCommunications of the ACM, 1968