Privacy Preserving Data Publishing: A Classification Perspective

Abstract

Privacy can be understood as the controlled release of information: it determines what personal information may be released and which people or groups may access and use it. Privacy Preserving Data Publishing (PPDP) allows a data owner to share anonymized data while protecting individuals against identity disclosure. Data anonymization is a PPDP technique that keeps the published data practically useful for processing (mining) while protecting individuals' sensitive information. Most work reported in the literature on privacy preserving data publishing for classification tasks handles numerical data only. However, most real-life data sets contain both numerical and non-numerical attributes. Another shortcoming is the reliance on a distributed model, Secure Multiparty Computation (SMC). This research instead uses a centralized model, in which a single data owner publishes data independently. The key challenge for PPDP is to ensure privacy while keeping the data usable for research. Differential privacy is a technique that offers a strong privacy guarantee for the record owner while still providing accurate information about the data set. The aim of this research is to develop a framework that satisfies differential privacy and maximizes data usability for classification tasks, such as classifying patient data by blood pressure.
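
To make the notion of differential privacy mentioned in the abstract more concrete, the following is a minimal sketch of the standard Laplace mechanism applied to a counting query. It is not the authors' framework, which the abstract does not describe in detail; the function names, the sample patient records, and the choice of epsilon are all hypothetical and used only for illustration.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponential variables with mean `scale`
    # follows a Laplace distribution centred at 0 with that scale.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_count(records, predicate, epsilon, rng=None):
    """Release a count with Laplace noise calibrated to epsilon."""
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    if rng is None:
        rng = random.Random()
    true_count = sum(1 for r in records if predicate(r))
    # A counting query has sensitivity 1: adding or removing one record
    # changes the result by at most 1, so the noise scale is 1 / epsilon.
    return true_count + laplace_noise(1.0 / epsilon, rng)


# Hypothetical records: (age, blood pressure class).
patients = [(34, "normal"), (61, "high"), (47, "high"), (29, "normal"), (55, "high")]

released = noisy_count(patients, lambda r: r[1] == "high", epsilon=0.5)
print(released)  # a noisy value near the true count; it differs on every run
```

A smaller epsilon adds more noise and so gives stronger privacy at the cost of accuracy, which is the privacy and usability trade-off the abstract refers to.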

Authors and Affiliations

A N K Zaman, Charlie Obimbo

  • DOI 10.14569/IJACSA.2014.050919

How To Cite

A N K Zaman, Charlie Obimbo (2014). Privacy Preserving Data Publishing: A Classification Perspective. International Journal of Advanced Computer Science & Applications, 5(9), 129-134. https://europub.co.uk./articles/-A-99977