Disclosure limitation of sensitive rules
- 22 January 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 21, 45-52
- https://doi.org/10.1109/kdex.1999.836532
Abstract
Data products (macrodata or tabular data and micro-data or raw data records), are designed to inform public or business policy, and research or public information. Securing these products against unauthorized accesses has been a long-term goal of the database security research community and the government statistical agencies. Solutions to this problem require combining several techniques and mechanisms. Recent advances in data mining and machine learning algorithms have, however, increased the security risks one may incur when releasing data for mining from outside parties. Issues related to data mining and security have been recognized and investigated only recently. This paper deals with the problem of limiting disclosure of sensitive rules. In particular it is attempted to selectively hide some frequent itemsets from large databases with as little as possible impact on other non-sensitive frequent itemsets. Frequent itemsets are sets of items that appear in the database "frequently enough" and identifying them is usually the first step toward association/correlation rule or sequential pattern mining. Experimental results are presented along with some theoretical issues related to this problem.Keywords
This publication has 4 references indexed in Scilit:
- Inference in MLS database systemsIEEE Transactions on Knowledge and Data Engineering, 1996
- Wizard: a database inference analysis and detection systemIEEE Transactions on Knowledge and Data Engineering, 1996
- Mining association rules between sets of items in large databasesPublished by Association for Computing Machinery (ACM) ,1993
- Security-control methods for statistical databases: a comparative studyACM Computing Surveys, 1989