Is differential privacy a data perturbation technique? Privacy-preserving data publishing (PPDP) techniques based on differential privacy perturb a dataset before it is published, so that sensitive information in the dataset cannot be inferred from the released data. Existing PPDP solutions focus on publishing a single snapshot of the data under the assumption that all data users share the same level of privilege and access the data at a fixed privacy level. Such schemes therefore do not directly support data release when users have different levels of access to the published data. In many real-world scenarios, however, users of a dataset have different privileges and may need to access the same dataset at different privacy/utility levels, which requires multi-level access to the published data.
Can I recommend that you look at k-anonymity by Sweeney? The method is not sufficient, but the intention makes sense: not disclosing anything which could harm the people described in the dataset.
I would also recommend looking at quantization, as used in information theory to represent and transmit data efficiently.
Assume that John's height is 1.8555 m and that he is the only one in the database with this exact value, while 2000 other members have a height of 1.85 m recorded to cm accuracy. Quantizing to cm precision assigns 1.85 m to John as well, so his record no longer stands out.
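A minimal sketch of that idea (the helper name quantize_height and the step size are illustrative assumptions, not part of any standard library):

```python
import math

def quantize_height(height_m: float, step: float = 0.01) -> float:
    """Truncate a height in metres to the nearest lower multiple of `step` (here: cm)."""
    return round(math.floor(height_m / step) * step, 2)

heights = [1.8555] + [1.85] * 2000      # John plus 2000 other members
quantized = [quantize_height(h) for h in heights]
print(quantized[0])                      # 1.85 -- John's value is no longer unique
```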
Renaud Di Francesco, sir, I went through the techniques proposed by Sweeney, such as k-anonymity, l-diversity and t-closeness, but they all have limitations in a big-data environment. Differential privacy, on the other hand, perturbs data probabilistically by adding random noise, which increases the privacy of the data when it is published to the open world.
While recognising the limitations of k-anonymity and its evolutions, I recommend looking at these in mathematical depth:
f: name -> data record
it's a mapping
What Sweeney says is: look at k-surjectivity. Consider the inverse map which associates to every "data record" the set of all "name"s whose image f(name) is that data record; denote it by f^(-1)(data record). Before disclosure (as part of an SDC, Statistical Disclosure Control, methodology), require that the set f^(-1)(data record) has at least k elements.
My point is: yes, that is a first approach, Card(f^(-1)(data record)) >= k,
where Card(X) is the number of elements in the set X.
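A small sketch of that check (the toy dataset, the choice of quasi-identifiers and the function name check_k_anonymity are invented here purely for illustration):

```python
from collections import defaultdict

def check_k_anonymity(f: dict, k: int) -> bool:
    """f maps each name to its disclosed data record.
    Build G = f^(-1) and require Card(f^(-1)(record)) >= k for every record."""
    G = defaultdict(set)                  # record -> set of names mapping to it
    for name, record in f.items():
        G[record].add(name)
    return all(len(names) >= k for names in G.values())

# Toy example where the disclosed record is (age band, postcode prefix)
f = {
    "Alice": ("30-39", "SW1"),
    "Bob":   ("30-39", "SW1"),
    "Carol": ("30-39", "SW1"),
    "Dave":  ("40-49", "N1"),             # only one name maps to this record
}
print(check_k_anonymity(f, k=3))          # False: Card(f^(-1)(Dave's record)) = 1 < 3
```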
Now let us define G = f^(-1) and study its properties.
G is a set-valued map. There is a deep mathematical framework for set-valued maps, developed by Aubin, Cellina and Frankowska.
It includes subdifferentials and similar replacements, for set-valued maps, of the differentials used for point-valued maps (classical f: x -> y, where x and y are elements, unlike G: y -> X, where y is an element and X is a set), and there is a complete construction by Aubin called viability theory. See for instance:
the book Viability Theory: New Directions (Aubin, Bayen and Saint-Pierre).
The above defines a massive research domain, to be explored further, beyond the k-anonymity (but inspired by it) and differential privacy (but informed by it).
In this case, you need to distinguish between the definition of differential privacy, and the various algorithms that may be used to achieve it.
Differential privacy itself is defined as a property that helps to measure the degree of privacy achieved with a randomized function. In that sense, it is not a technique for achieving privacy but measures the degree to which such techniques are successful.
To achieve differential privacy, there are many different algorithms which are all based on some form of data perturbation.
Put differently, differential privacy is not itself a data perturbation technique, but you need such a technique to achieve differential privacy.
The time and space complexity can of course only be evaluated for any specific algorithm for achieving differential privacy, not for differential privacy itself. It might be possible to prove that there cannot be any algorithm for achieving differential privacy with a space or time complexity better than some defined value, but I do not know whether any such work exists.
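To make that distinction concrete: a randomized mechanism M is epsilon-differentially private if, for any two neighbouring datasets D and D' (differing in one record) and any set of outputs S, Pr[M(D) in S] <= exp(epsilon) * Pr[M(D') in S]. The sketch below uses classical randomized response on a single yes/no attribute purely as one example of an algorithm that happens to satisfy this property (the binary attribute and the fair-coin construction are assumptions made for the illustration):

```python
import math
import random

def randomized_response(true_answer: bool) -> bool:
    """Classical randomized response: with probability 1/2 report the truth,
    otherwise report a uniformly random answer. This algorithm satisfies
    epsilon-differential privacy with epsilon = ln(3)."""
    if random.random() < 0.5:
        return true_answer
    return random.random() < 0.5

# The differential-privacy *property* is about output probabilities, not the code:
# for the two neighbouring inputs True / False, each output's probability may
# change by at most a factor exp(epsilon).
p_true_given_true  = 0.5 + 0.5 * 0.5    # P[output True | truth True]  = 0.75
p_true_given_false = 0.5 * 0.5          # P[output True | truth False] = 0.25
epsilon = math.log(p_true_given_true / p_true_given_false)
print(epsilon)                           # 1.0986... = ln(3), the privacy level achieved
```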
Differential privacy is a data perturbation mechanism. Dwork et al. proposed the definition of differential privacy, which addresses two shortcomings of traditional privacy protection models: 1) differential privacy does not depend on the adversary's background knowledge; 2) it rests on a rigorous mathematical foundation and provides a quantitative way to evaluate privacy protection. The core concept of differential privacy spans research from privacy protection to data science (e.g. machine learning, data mining, statistics and learning theory). Differential privacy is usually implemented through the Laplace mechanism or the exponential mechanism, and the perturbation can be applied as input perturbation, objective (target) perturbation, gradient perturbation or output perturbation.
Reference
J. Jia and W. Qiu, "Research on an Ensemble Classification Algorithm Based on Differential Privacy," in IEEE Access, vol. 8, pp. 93499-93513, 2020, doi: 10.1109/ACCESS.2020.2995058.
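As one concrete instance of the Laplace mechanism mentioned above (a sketch only; the counting query, its sensitivity and the epsilon value are illustrative assumptions, not taken from the cited paper):

```python
import numpy as np

def laplace_mechanism(true_value: float, sensitivity: float, epsilon: float) -> float:
    """Output perturbation: add Laplace(0, sensitivity/epsilon) noise to a query result."""
    scale = sensitivity / epsilon
    return true_value + np.random.laplace(loc=0.0, scale=scale)

# Example: a counting query (sensitivity 1, since adding or removing one
# person changes the count by at most 1) released with epsilon = 0.5.
ages = np.array([34, 45, 29, 61, 52, 38, 47])
true_count = np.sum(ages > 40)               # 4
noisy_count = laplace_mechanism(true_count, sensitivity=1.0, epsilon=0.5)
print(noisy_count)                           # e.g. 4.73 -- value varies per run
```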