Classification Space: A Multivariate Procedure For Automatic? Document Indexing And Retrieval
- 1 October 1966
- journal article
- research article
- Published by Taylor & Francis in Multivariate Behavioral Research
- Vol. 1 (4), 479-524
- https://doi.org/10.1207/s15327906mbr0104_6
Abstract
A conceptual approach to linguistic data processing problems is sketched and empirical illustrations are presented of the major software components- indexing, storage, and retrieval-of a document processing system which offers, in principle, the advantages of complete automation, unlimited cross- indexing, effective sequential retrieval, sub-documentary indexing reflecting heterogeneity of subject matter within a document, and a procedure for automatically identifying retrieval requests which would be inadequately handled by the system. The indexing schema, designated as a "Classification Space" consists of a Euclidean model for mapping subject matter similarity within a given subject matter domain. A schema of this kind is empirically derived for certain fields of Engineering and Chemistry. A set of five related empirical studies provide convincing evidence that when appropriate experimental procedures are followed a very stable C-Space for a given content domain can be constructed on a surprisingly smal...Keywords
This publication has 1 reference indexed in Scilit:
- Unrestricted Cluster And Factor Analysis, With Applications To The MMPI And Holzinger-Harman ProblemsMultivariate Behavioral Research, 1966