Top-Down Specialization for Information and Privacy Preservation
- 19 April 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 10636382, p. 205-216
- https://doi.org/10.1109/icde.2005.143
Abstract
Releasing person-specific data in its most specific state poses a threat to individual privacy. This paper presents a practical and efficient algorithm for determining a generalized version of data that masks sensitive information and remains useful for modelling classification. The generalization of data is implemented by specializing or detailing the level of information in a top-down manner until a minimum privacy requirement is violated. This top-down specialization is natural and efficient for handling both categorical and continuous attributes. Our approach exploits the fact that data usually contains redundant structures for classification. While generalization may eliminate some structures, other structures emerge to help. Our results show that quality of classification can be preserved even for highly restrictive privacy requirements. This work has great applicability to both public and private sectors that share information for mutual benefits and productivity.
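The top-down idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a single categorical attribute, a hypothetical taxonomy (`TAXONOMY`), and a privacy requirement modelled as a minimum group size `k`. It also takes the first specialization that keeps the requirement satisfied and ignores classification utility entirely, which the paper's method is designed to preserve.

```python
from collections import Counter

# Hypothetical taxonomy for one categorical attribute: parent -> children.
TAXONOMY = {
    "Any": ["White-collar", "Blue-collar"],
    "White-collar": ["Engineer", "Lawyer"],
    "Blue-collar": ["Welder", "Driver"],
}


def leaves_under(node, taxonomy):
    """Return the set of raw (leaf) values covered by a taxonomy node."""
    children = taxonomy.get(node)
    if not children:
        return {node}
    covered = set()
    for child in children:
        covered |= leaves_under(child, taxonomy)
    return covered


def generalize(values, cut, taxonomy):
    """Replace each raw value with the node of the current cut that covers it."""
    cover = {leaf: node for node in cut for leaf in leaves_under(node, taxonomy)}
    return [cover[v] for v in values]


def satisfies_k(values, cut, taxonomy, k):
    """Assumed privacy requirement: every generalized group has at least k records."""
    counts = Counter(generalize(values, cut, taxonomy))
    return all(n >= k for n in counts.values())


def top_down_specialize(values, taxonomy, root, k):
    """Start from the most general value and specialize while the requirement holds."""
    cut = [root]
    changed = True
    while changed:
        changed = False
        for node in cut:
            children = taxonomy.get(node)
            if not children:
                continue  # already a leaf, nothing to specialize
            candidate = [n for n in cut if n != node] + children
            # Accept the specialization only if it does not violate the requirement.
            if satisfies_k(values, candidate, taxonomy, k):
                cut = candidate
                changed = True
                break
    return cut


# Illustrative data, not taken from the paper.
records = ["Engineer"] * 3 + ["Lawyer"] * 2 + ["Welder"] + ["Driver"] * 3
cut = top_down_specialize(records, TAXONOMY, "Any", k=2)
released = generalize(records, cut, TAXONOMY)
print(sorted(cut))
```

With these records and `k=2`, the white-collar values can be released in full detail, while the blue-collar values stay generalized because the single "Welder" record would otherwise form a group smaller than `k`.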
This publication has 4 references indexed in Scilit:
- Bottom-Up Generalization: A Data Mining Solution to Privacy Protection. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- Transforming data to satisfy privacy constraints. Published by Association for Computing Machinery (ACM), 2002
- Protecting respondents identities in microdata release. IEEE Transactions on Knowledge and Data Engineering, 2001
- Privacy-preserving data mining. Published by Association for Computing Machinery (ACM), 2000