A Privacy-Preserving Data Mining Through Comprehensive GNIPP Approach in Sensitive Data Sets

Journal Title: International Journal of Experimental Research and Review - Year 2024, Vol 44, Issue 8

Abstract

The quick growth of methods for analyzing data and the availability of easily available datasets have made it possible to build a thorough analytics model that can help with support decision-making. In the meantime, protecting personal privacy is crucial. A popular technique for medical evaluation and prediction, decision trees are easy to comprehend and interpret. However, the decision tree construction procedure may reveal personal information about an individual. By keeping the statistical properties intact and limiting the chance of privacy leaking within a reasonable bound, differential privacy offers a formal mathematical definition of privacy. To construct a boosting random forest that preserves privacy, we propose a Gaussian Noise Integrated Privacy Preservation (GNIPP) in this study. To address the issue of personal information breaches, we have designed a unique Gaussian distribution mechanism in GNIPP that enables the nodes with deeper depth to obtain more privacy during the decision tree construction process. We propose a comprehensive boosting technique based on the decision forest's prediction accuracy for assembling multiple decision trees into a forest. Furthermore, we propose an iterative technique to accelerate the assembly of decision trees. After all, we demonstrate through experimentation that the suggested GNIPP outperforms alternative algorithms on two real-world datasets.

Authors and Affiliations

Shailesh Kumar Vyas, Swapnili Karmore

Keywords

Related Articles

The Impact of Leadership Styles, Cultural Dimensions and Values on Academic Leaders

This study investigates the cultural dimensions, values, and leadership styles of school leaders in Indian K-12 and European schools, specifically focusing on cross-cultural differences. The objective is to explore how l...

Wavelet transformation and predictability of Gold Price Index Series with ARMA model

The U.S. gold futures market has recently attracted significant attention globally in the highly volatile equity and commodity futures markets. This study investigates an efficient algorithm based on ARMA denoising with...

A Computation of Frequent Itemset using Matrix Based Apriori Algorithm

The Apriori Algorithm is a traditional method for determining the frequent itemsets from a lot of data. Association rules can be generated based on frequently occurring item sets. The Apriori algorithm has two bottleneck...

Revitalizing the Forensic Accounting: An Exploratory Study on Mitigating the Financial Risk using Data Analytics

Risk mitigation and fraud prevention in the present times has more focus on digital metadata, wherein forensic accountants are required to make use of robust IT techniques and tools. Data Analytics has got huge implicati...

An IoT-based soil analysis system using optical sensors and multivariate regression

Food is the primary requirement for the survival of any living being on this planet. The rapid increment in the population is a major concern for adequate food production due to the depletion of agricultural land, which...

Download PDF file
  • EP ID EP750721
  • DOI 10.52756/ijerr.2024.v44spl.002
  • Views 36
  • Downloads 0

How To Cite

Shailesh Kumar Vyas, Swapnili Karmore (2024). A Privacy-Preserving Data Mining Through Comprehensive GNIPP Approach in Sensitive Data Sets. International Journal of Experimental Research and Review, 44(8), -. https://europub.co.uk./articles/-A-750721