On searching transposed files
- 1 December 1979
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems
- Vol. 4 (4), 531-544
- https://doi.org/10.1145/320107.320125
Abstract
A transposed file is a collection of nonsequential files called subfiles. Each subfile contains selected attribute data for all records. It is shown that transposed file performance can be enhanced by using a proper strategy to process queries. Analytic cost expressions for processing conjunctive, disjunctive, and batched queries are developed and an effective heuristic for minimizing query processing costs is presented. Formulations of the problem of optimally processing queries for a particular family of transposed files are shown to be NP-complete. Query processing performance comparisons of multilist, inverted, and nonsequential files with transposed files are also considered.