Why Pete Warden Should Not Release Profile Data on 215 Million Facebook Users
Why Pete Warden Should Not Release Profile Data on 215 Million Facebook Users: Via Michael Zimmer.org .
Speaking of the research ethics related to automatically harvesting public social networking data, we are confronted this week with the story of Pete Warden, a former Apple engineer who has spent the last six months harvesting and analyzing data from some 215 million public Facebook profile pages.
According to Warden, he exploited a flaw in Facebook’s architecture to access public profiles without needing to be signed in to a Facebook account, effectively avoiding being bound by Facebook’s Terms of Service preventing such automated harvesting of data. As a result, he amassed a database of names, fan pages, and lists of friends for 215 million public Facebook accounts.
Warden has already done some impressive analysis of this data at an aggregate level, and I know researchers would love to get their hands on it. And like the “Tastes, Ties, and Time” Facebook project, Warden wants to release the dataset to the academic community.
But also like the “Tastes, Ties, and Time” project, Warden would be wrong to do so.
First, similar to our discussion of the ethics of collecting public Twitter streams, just because these Facebook users made their profiles publicly available does not mean they are fair game for scraping for research purposes. Yes, I have limited profile information viewable to the public, and I’ve authorized Facebook to make that information available for search engines to crawl. But the purpose of this public availability is to help people — humans, not bots — find me. The presumption is that my public profile data will only be found and viewed if someone actually searches for “Michael Zimmer” on Facebook or a search engine. In reality, my profile is only “public” if a human being takes specific and conscious action to find me.
Warden’s actions, however, violate this implicit understanding for making profiles publicly searchable. Rather than trying to find me, Warden is systematically sought everyone, letting a script to the work of seeking and harvesting my data. There is no genuine desire to find me, to friend me, and so on. He’s just collecting data. His reasons might be honest and beneficial, but that’s not what’s at issue here. The point is whether the 215 million Facebook users who now have some of their information in Warden’s database contemplated such harvesting and aggregating when they built their profile and configured their privacy settings. They almost certainly didn’t, which brings into doubt whether this data has been collected with proper consent.
[...]
Read Original Article:(Via Michael Zimmer.org .)
Recent blog posts
- The Secrecy Double-Standard
- Fully-qualified Nonsense in the SSL Observatory
- Appeals Court Strengthens Warrantless Searches at Border
- Justice Dept. to Congress: Don’t Saddle 4th Amendment on Us
- Feds, RIAA Ask $22,500 in Damages Per Song
- Building a better Certificate Authority (CA) infrastructure
- Where’s EFF? Why EFF Is Sometimes Quiet About Important Cases
- Congressman Wants YouTube Video Covered Up
- Man Creates "Creepy" Stalking App
- Boston College Says Using WiFi Is a Sign of Infringement