DAG: A General Model for Privacy-Preserving Data Mining

Please use this identifier to cite or link to this item: https://astaroar.ripplewerkz.co/communities-collections/articles/14249

Title:

DAG: A General Model for Privacy-Preserving Data Mining

Journal Title:

IEEE Transactions on Knowledge and Data Engineering

DOI:

10.1109/TKDE.2018.2880743

OA Status:

Publication URL:

https://doi.org/10.1109/TKDE.2018.2880743

Authors:

Sin Gee Teo, Jianneng Cao, Vincent CS Lee

Keywords:

DAG

Publication Date:

12 November 2018

Citation:

S. G. Teo, J. Cao and V. C. Lee, "DAG: A General Model for Privacy-Preserving Data Mining," in IEEE Transactions on Knowledge and Data Engineering. doi: 10.1109/TKDE.2018.2880743

Abstract:

Secure multi-party computation (SMC) allows parties to jointly compute a function over their inputs, while keeping every input confidential. It has been extensively applied in tasks with privacy requirements, such as privacy-preserving data mining (PPDM), to learn task output and at the same time protect input data privacy. However, existing SMC-based solutions are ad-hoc – they are proposed for specific applications, and thus cannot be applied to other applications directly. To address this issue, we propose a privacy model DAG (Directed Acyclic Graph) that consists of a set of fundamental secure operators (e.g., +, -, ×, /, and power). Our model is general – its operators, if pipelined together, can implement various functions, even complicated ones like Naïve Bayes classifier. It is also extendable – new secure operators can be defined to expand the functions that the model supports. For case study, we have applied our DAG model to two data mining tasks: kernel regression and Naïve Bayes. Experimental results show that DAG generates outputs that are almost the same as those by non-private setting, where multiple parties simply disclose their data. The experimental results also show that our DAG model runs in acceptable time, e.g., in kernel regression, when training data size is 683,093, one prediction in non-private setting takes 5.93 sec, and that by our DAG model takes 12.38 sec.
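
The pipelining idea in the abstract can be illustrated with a minimal plaintext sketch. It is not the authors' implementation: in the paper each operator is a secure multi-party protocol, whereas here ordinary arithmetic stands in for them so that only the graph structure is visible. The names `OPS`, `evaluate`, and the example graph are hypothetical, chosen for illustration, and the graph is assumed to be acyclic.

```python
import operator

# Plaintext stand-ins for the operators named in the abstract (assumption:
# the real model replaces each of these with a secure protocol).
OPS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": operator.truediv,
    "pow": operator.pow,
}


def evaluate(dag, inputs, output):
    """Evaluate node `output` of a DAG given as {node: (op, left, right)}.

    `inputs` maps leaf names to values. Each operator node is computed once
    and cached, so a node shared by several parents is not recomputed.
    The graph is assumed to be acyclic; no cycle detection is performed.
    """
    values = dict(inputs)

    def visit(name):
        if name not in values:
            op, left, right = dag[name]
            values[name] = OPS[op](visit(left), visit(right))
        return values[name]

    return visit(output)


# Hypothetical pipeline: ((a + b) * (a - b)) / c
dag = {
    "s": ("+", "a", "b"),
    "d": ("-", "a", "b"),
    "p": ("*", "s", "d"),
    "q": ("/", "p", "c"),
}
result = evaluate(dag, {"a": 5, "b": 3, "c": 4}, "q")
print(result)
```

Adding an entry to `OPS` mirrors, in a loose sense, the extensibility described in the abstract: a new operator becomes available to every graph without changing the evaluator.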

License type:

PublisherCopyrights

Funding Info:

Description:

URI:

https://astaroar.ripplewerkz.co/communities-collections/articles/14249

ISSN:

1041-4347
1558-2191

Collections:

Institute for Infocomm Research

Files uploaded:

Manuscripts in This Item:

File	Size	Format	Action
There are no attached files.