Special Offer: FREE Google Mobile-First Indexing Readiness Test

Google PageSpeed

In March 2021 Google began removing all desktop-only sites from its index.

Is your website 100% ready for Mobile-First Indexing? Find Out!

Your report will compare your mobile and desktop pages and show you any discrepancies between SEO signals, content and structured-data markup, and test your site's mobile-friendliness.

Click to get your free test

Google announces clustering algorithm designed to reveal group characteristics while maintaining individual privacy



By
27 October 2021 (Edited 27 October 2021)

'Differentially Private Clustering' could enable organizations to learn insights from group data while protecting individual data privacy


Google research scientists have provided in a post on GoogleBlog an update on several years of work on privacy-safe approaches for handling sensitive user data.

The challenge, as stated by the researchers, has been:

"Given a database containing several attributes about users, how can one create meaningful user groups and understand their characteristics? Importantly, if the database at hand contains sensitive user attributes, how can one reveal these group characteristics without compromising the privacy of individual users?"

In developing a solution, the researchers have created a new "differentially private clustering algorithm" which can privately generate representative data points from a dataset, so as to reveal group characteristics without revealing the private data of the individuals in the dataset.

To test the new algorithm, the researchers ran it on 4 large, publicly-available benchmark databases and compared its performance to that of several publicly-available algorithms.

In the researchers' words:

"We analyze the normalized k-means loss (mean squared distance from data points to the nearest center) while varying the number of target centers (k) for these benchmark datasets. The described algorithm achieves a lower loss than the other private algorithms in three out of the four datasets we consider."

Which in plain English means that the Google clustering algorithm produced more accurate representations of the characteristics of 3 of the 4 data sets on which they ran it, compared to results from the algorithms used for comparison.

The conclusion reached from the research results, in the words of the researchers, is:

"This work proposes a new algorithm for computing representative points (cluster centers) within the framework of differential privacy. With the rise in the amount of datasets collected around the world, we hope that our open source tool will help organizations obtain and share meaningful insights about their datasets, with the mathematical assurance of differential privacy".

This work looks promising, and I'm glad to see that phrase "open source" in there!

Stay tuned for further updates.


If you found this article helpful and would like to see more like it, please share it via the Share This Article link, below.

And if you have questions or comments, you can easily send them to me with the Quick Reply form, below, or send me an e-mail.


David Boggs    - David
David@DavidHBoggs.com
View David Boggs's profile on LinkedIn

Google Certifications - David H Boggs
View my profile on Quora
Share This Article

   
   
Website
Visit Website
Rating
4/5 based on 1 vote.
Show Individual Votes
Tags , , , , , , , ,
Related Listings
External Article: https://ai.googleblog.com/2021/10/practical-differentially-private.html


E-mail:
Quick Reply
Name:
E-mail:
Subscribe to my blog:
Your Comment:


You may use BB Codes in your message.
Spam Prevention:


Members currently reading this thread:

Previous Article | Next Article