Options
COCOA: A Synthetic Data Generator for Testing Anonymization Techniques
Date Issued
2016-09-16
Date Available
2017-09-13T12:20:22Z
Abstract
Conducting extensive testing of anonymization techniques is critical to assess their robustness and identify the scenarios where they are most suitable. However, the access to real microdata is highly restricted and the one that is publicly-available is usually anonymized or aggregated; hence, reducing its value for testing purposes. In this paper, we present a framework (COCOA) for the generation of realistic synthetic microdata that allows to define multi-attribute relationships in order to preserve the functional dependencies of the data. We prove how COCOA is useful to strengthen the testing of anonymization techniques by broadening the number and diversity of the test scenarios. Results also show how COCOA is practical to generate large datasets.
Sponsorship
Science Foundation Ireland
Type of Material
Conference Publication
Publisher
Springer
Copyright (Published Version)
2017 Springer
Language
English
Status of Item
Peer reviewed
Journal
Domingo-Ferrer, J., Pejić-Bach, M. (eds.). Lecture Notes in Computer Science, volume 9867
Conference Details
UNESCO Chair in Data Privacy, International Conference, PSD 2016, Dubrovnik, Croatia, September 14–16, 2016
This item is made available under a Creative Commons License
File(s)
Loading...
Name
COCOA_PSD2016.pdf
Size
1.19 MB
Format
Adobe PDF
Checksum (MD5)
b1745df491b46907209e9ba6ed86463f
Owning collection
Mapped collections