Differential privacy at the end of the rainbow

Apple’s heavily-marketed but proprietary implementation of differential privacy is no longer secret. Researchers at the University of Southern California, Indiana University, and Tsinghua University have reverse engineered Apple’s MacOS and iOS implementations of differential privacy. An academic article describing the results was published on Sept. 8, and Wired broke the news the following week in an article titled, “How One of Apple’s Key Privacy Safeguards Falls Short.”

As I described in a previous post, differential privacy measures theoretical privacy loss with a parameter called epsilon—smaller epsilon means more private. Theoreticians generally agree that an epsilon less than 0.1 is very safe, and an epsilon less than 1.0 is probably ok. The USC, Indiana, and Tsinghua researchers reveal that Apple’s MacOS implementation uses an epsilon of 6, while iOS 10 uses an epsilon of 14. So what does this tell us about how private Apple’s data collection is? Or to take a concrete example, if the government were to subpoena Apple’s data, what do these epsilons tell us about how likely it is that the government could identify specific individuals in the data? The answer: The epsilons tell us nothing. Nada.

Not. A. Thing.

Why is this? Because differential privacy is only a theoretical worst-case bound on a particular mathematical definition of privacy loss. In other words, the data might be much better protected than the epsilon suggests, but it will never be less protected. A high epsilon doesn’t mean the data is unsafe, only that it might be.

My research group first encountered this limitation three years ago when Alexey Reznichenko implemented and deployed a privacy preserving behavioral advertising system that used differential privacy to collect usage statistics from over 13,000 opted-in users. We set our epsilon to 1.0: the edge of what might be considered private. What struck us the most is that we could have gathered far more data from each user and still been nowhere near able to identify specific users. In other words, differential privacy was unnecessarily limiting our analytics. That is when I decided that differential privacy had a long way to go to being practical.

So we have a bit of a mess.

Researchers are saying that Apple’s epsilon parameters are not meaningful, Wired is obliquely suggesting that Apple has been deceptive, and Apple is rigorously defending its privacy practices. Who is at fault here? While I do think that Apple made a misstep in over-hyping its use of differential privacy and keeping its implementation proprietary, I have no reason to believe that their privacy practices are not quite strong. I am inclined to believe that they are genuine in their commitment to privacy, and that their practices are good.

Where I find the main fault is in the academic research community. In all my 30 years as a researcher, differential privacy is the most over-hyped technology I have ever seen. To listen to differential privacy researchers, you would think that we are now able to nearly perfectly defend against even the most resourceful attacker. “Guaranteed privacy,” “future proof.” The problem is that nobody has figured out how to build a differentially private system that has both a low epsilon and adequate utility. Invariably to get a low epsilon one must simply stop gathering data. This is just not a realistic option.

To quote from the Wired article, “a new study … suggests [Apple] has ratcheted that dial further toward aggressive data-mining than its public promises imply.” The problem is that if Apple ratcheted that dial to a strongly private setting, then their data collection would be useless.

Rather than accusing Apple of being a metaphoric privacy mega-polluter, the academic research community should be cleaning up its own act. Until they can produce a system that is strongly private while at the same time provides good utility, the academic research community should stop implying that differential privacy is a workable technology. It is not. The pot of privacy at the end of the differentially private rainbow is, for now, unreachable.

photo credit: ::ErWin Südtirol 2017 via photopin (license)

Author

Paul Francis Nonmember Contributor

Comments

If you want to comment on this post, you need to login.

Washington Post Editorial Board announces support for APRA

In an op-ed, The Washington Post Editorial Board came out in support of the proposed American Privacy Rights Act, noting that it "is a long-awaited paradigm shift in modern-day privacy policy." Though the board notes that there are "flaws" with the current proposal, "There's plenty of room to addres...

Read More Save This

Biden signs bill reauthorizing FISA Section 702

U.S. President Joe Biden signed legislation reauthorizing Section 702 of the Foreign Intelligence Surveillance Act 20 April, The Associated Press reports. The Senate passed a bill 60-34, renewing Section 702 authority just after the deadline for the program to expire. Meanwhile, Columbia University ...

Read More Save This

Catching up on IAPP GPS 2024 keynote speeches

The IAPP released several full-length keynote presentations from the IAPP Global Privacy Summit 2024. Columbia University Law Professor Anu Bradford discussed the differences between digital regulatory models in China, the EU and the U.S. and their implications for liberal democracy. Other presentat...

Read More Save This

Op-ed: Global Privacy Control restricts consumers' need for tailored ads

In an op-ed for AdExchanger, Neolaw co-founder and Privacy Lawyer Andy Hepburn, CIPP/US, said the Global Privacy Control that allows consumers to opt out of targeted advertisements is too restrictive. Hepburn said a better approach for GPC settings "would enable advertisers to deliver relevant ads t...

Read More Save This

A view from Brussels: Behavioral advertising is an unstoppable current

The European Data Protection Board's opinion on the pay-or-consent models being deployed by large online platforms as a legal construct to support behavioral advertising, is generating unprecedented aggravation and heated opinions, IAPP Managing Director, Europe, Isabelle Roccia, CIPP/E, writes. Det...

Read More Save This

Privacy Tech | Differential privacy at the end of the rainbow Related reading: Washington Post Editorial Board announces support for APRA

Differential privacy at the end of the rainbow

Author

Tags

Comments

Tags

Recent Comments

Author

Tags

Comments

Related Stories

Washington Post Editorial Board announces support for APRA

Biden signs bill reauthorizing FISA Section 702

Catching up on IAPP GPS 2024 keynote speeches

Op-ed: Global Privacy Control restricts consumers' need for tailored ads

A view from Brussels: Behavioral advertising is an unstoppable current

Related Stories

Tags

Recent Comments