The Center for Education and Research in Information Assurance and Security (CERIAS)

The Center for Education and Research in
Information Assurance and Security (CERIAS)

Using sample size to limit exposure to data mining

Download

Download PDF Document
PDF

Author

Christopher Clifton

Tech report number

CERIAS TR 2001-79

Entry type

article

Abstract

Data mining introduces new problems in database security. The basic problem of using non-sensitive data to infer sensitive data is made more difficult by the “probabilistic” inferences possible with data mining. This paper shows how lower bounds from pattern recognition theory can be used to determine sample sizes where data mining tools cannot obtain reliable results.

Download

PDF

Date

2000 – 11

Journal

Journal of Computer Security

Key alpha

Clifton

Number

4

Pages

281-307

Publisher

IOS Press

Volume

8

Publication Date

2001-11-01

BibTex-formatted data

To refer to this entry, you may select and copy the text below and paste it into your BibTex document. Note that the text may not contain all macros that BibTex supports.