<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>

<channel>
	<title>unitstep.net &#187; privacy</title>
	<atom:link href="http://unitstep.net/blog/category/privacy/feed/" rel="self" type="application/rss+xml" />
	<link>http://unitstep.net</link>
	<description>the home of peter chng</description>
	<pubDate>Sun, 30 Nov 2008 23:12:11 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.5</generator>
	<language>en</language>
			<item>
		<title>Facebook Application Development - Pros and Cons</title>
		<link>http://unitstep.net/blog/2007/07/16/facebook-application-development-pros-and-cons/</link>
		<comments>http://unitstep.net/blog/2007/07/16/facebook-application-development-pros-and-cons/#comments</comments>
		<pubDate>Tue, 17 Jul 2007 03:35:23 +0000</pubDate>
		<dc:creator>Peter Chng</dc:creator>
		
		<category><![CDATA[api]]></category>

		<category><![CDATA[facebook]]></category>

		<category><![CDATA[privacy]]></category>

		<category><![CDATA[programming]]></category>

		<category><![CDATA[social networking]]></category>

		<guid isPermaLink="false">http://unitstep.net/blog/2007/07/16/facebook-application-development-pros-and-cons/</guid>
		<description><![CDATA[
It&#8217;s been a while since Facebook introduced the concept of &#8220;applications&#8221; to their API, providing developers a very flexible set of tools to create and add features to Facebook itself.  Applications on Facebook are best described as &#8220;widgets&#8221;, since they offer functionality that&#8217;s often provided by widgets on other sites, so why Facebook chose [...]]]></description>
			<content:encoded><![CDATA[<p class="image align-right"><img src="http://unitstep.net/wordpress/wp-content/uploads/2007/06/facebook-platform.png" alt="Facebook Platform" /></p>
<p>It&#8217;s been a while <a href="/blog/2007/06/03/facebooks-platform-adds-integration-to-applications/">since Facebook introduced</a> the concept of &#8220;applications&#8221; to their API, providing developers a very flexible set of tools to create and add features to Facebook itself.  Applications on Facebook are best described as &#8220;widgets&#8221;, since they offer functionality that&#8217;s often provided by widgets on other sites, so why Facebook chose a different name is somewhat peculiar.</p>
<p>I&#8217;m not going to debate the merits of these applications, or whether they&#8217;re leading Facebook down the road to a MySpace-hell dominated by personalized profiles that are an eyesore (there&#8217;s already <a href="http://www.downloadsquad.com/2007/06/20/facebook-users-fight-back-against-new-facebook-applications/">enough discourse</a> on that subject, both on Facebook and outside of it), but rather I&#8217;ll take a look at this from a developer&#8217;s point of view.  What is the potential for developing an application for Facebook, and at the same time, what are the pitfalls?</p>
<h3>Pro: Relatively easy to develop</h3>
<p><a href="http://developers.facebook.com/resources.php">Facebook&#8217;s API</a> is easy to work with, especially if you have experience with <acronym class="uttInitialism" title="PHP: Hypertext Preprocessor">PHP</acronym>.  One of the official client libraries is written in <acronym class="uttInitialism" title="PHP: Hypertext Preprocessor">PHP</acronym>, and since it&#8217;s intended to be used for web applications, this makes perfect sense.  Using it to develop an application should therefore be a relatively pain-free process.</p>
<p>While the <a href="http://developers.facebook.com/documentation.php">documentation</a> isn&#8217;t always complete, there&#8217;s a <a href="http://wiki.developers.facebook.com/index.php/Main_Page">nice Wiki</a> that covers additional details, and together, these two should allow you to overcome most problems you may run into during development.</p>
<p>Overall, the abilities offered by the API coupled with its ease of use make development relatively straightforward.</p>
<h3>Pro: Access to a large user base</h3>
<p>Facebook&#8217;s current <a href="http://mashable.com/2007/07/10/facebook-users-2/">user base of 30 million</a> is rapidly growing.  It&#8217;s already doubled since the start of this year, and continues to <a href="http://www7.flickr.com/photos/500hats/398658777/in/set-72157594550002847/">grow at close to 3% per week</a>.  If that rate continues, they&#8217;ll double again by the end of this year.  But perhaps the most amazing fact here is the dedication of Facebook&#8217;s user base.  It&#8217;s one thing to have millions and millions of users, with only a small percentage that are actually active.  This isn&#8217;t the case with Facebook, which <a href="http://www7.flickr.com/photos/500hats/398658941/in/set-72157594550002847/">sees over half of its users</a> log in daily.  That&#8217;s an astonishing statistic by any measure, and even more so when you consider Facebook large and fast-growing user base.</p>
<p>Thus, developing an application for Facebook gives it access to a large, and more importantly, active user base.  This gives it a huge potential to grow virally, that is, from person-to-person.  When someone adds your application to their profile or account, it will show up as a News Feed item.  This means that anyone who has this user as a friend will see a message in their News Feed about this user adding your application.  The idea is that they might get interested, click on your application, and then add it to their profile as well, continuing the viral spread cycle.  </p>
<p>They can, of course, also find your application by doing a search - if you&#8217;re lucky enough to get your application listed in Facebook&#8217;s official application directory - more on that later.</p>
<h3>Con: Cost and scalability</h3>
<p>I think Marc Andreessen (of Netscape fame) <a href="http://blog.pmarca.com/2007/06/analyzing_the_f.html">put it best</a> when referring to the popularity of an application on Facebook, namely that <q>success kills</q>.  (The rest of his article, while lengthy, is an excellent read and I suggest you take the time to read it)  Marc gives the example of the <a href="http://ilike.com/">iLike</a> application, one of the first, which experienced an insane growth in the early stages.  During that time, they grew by 300,000 <em>per day</em>, eating up all of their capacity to the point where they had to buy new servers on a daily basis.  </p>
<p>While a company such as iLike, with sources of revenue, may be able to shoulder this kind of burden, it&#8217;s quite unlikely that a lone developer would.  This creates a sort of Catch-22 - you want your application to become popular, so that you could hopefully monetize it.  However, in order for that to happen, you need plenty of resources - which require money.</p>
<p>If all you&#8217;re looking to create is a neat little application for you and your friends, or if your application doesn&#8217;t really create a huge load on your end, you&#8217;ll probably be alright.  But be aware that the same dedication and large user base of Facebook that can be so excellent for spreading an application, can be the same environment that quickly overruns your capacity to support your creation.</p>
<h3>Con: Monetization</h3>
<p>While we&#8217;re talking about monetization, it&#8217;s worthwhile to point out that if this is your main concern, it&#8217;s probably best not to pursue Facebook application development.  Since <a href="http://www.kinggary.com/archives/the-issues-that-ail-facebook-platform/">support costs can be high</a> for Facebook applications, attempting to derive a successful business model on this will most likely be met with limited success, if any.  Some have called for Facebook to <a href="http://www.feld.com/blog/archives/2007/06/the_facebook_pr.html">start a revenue sharing program</a>, but this is unlikely, considering Facebook&#8217;s <a href="http://valleywag.com/tech/mythbusting/facebooks-fake-revenues-278437.php">uncertain financial future</a> - even <em>if</em> the <a href="http://gigaom.com/2007/07/13/facebook-ipo-fact-or-fiction/">IPO rumours</a> turn out to be true.  (Which they likely will be)  </p>
<p>A revenue sharing program, however, would be fair, since by creating a successful application you are adding value to Facebook&#8217;s site, and directly increasing their revenues by nature of increasing the time a user spends on the site.  I would not, however, hold out on this to develop.</p>
<p>This doesn&#8217;t mean that application development is limited to those who just want to create something &#8220;for fun&#8221; or to add something to their portfolio.  On the contrary, I believe the best opportunities the Facebook platform affords is to other businesses based on web services.  By utilizing Facebook, they can create applications that promote their site, or tie into their own API, thus using Facebook&#8217;s large user base as a source for new customers, users, and thus revenue.  Indeed, the first applications developed were made by companies with this very intent.  Other sites could follow this model.</p>
<h3>Other Cons</h3>
<p>Some believe that the viral nature of Facebook apps <a href="http://valleywag.com/tech/hypebusting/facebook-has-thrown-the-entire-startup-world-for-a-loop-273359.php">has been overstated</a>, and that they now don&#8217;t spread as easily.  </p>
<p>Another, more serious aspect, relates to something I <a href="/blog/2007/05/14/facebook-marketplace-debuts/">alluded to</a> when I commented on Facebook&#8217;s introduction of their Marketplace, a classified-listing type service.  It seems that in the TOS of the Facebook Platform, they <a href="http://www.vecosys.com/2007/05/28/working-with-facebook-f8-you-are-not-in-control-of-your-access/">reserve the right to make a similar application</a> to yours, without obligation to you.  I don&#8217;t know if this is standard legalese in a TOS for API usage, but certainly it can be construed to be something more sinister, and would deny you any right you had to something you&#8217;d created.  Of course, if Facebook <em>did</em> do something like this, I&#8217;m sure there would be an uproar from the community, with <a href="http://digg.com">Digg</a> fanning the flames, of course.</p>
<h3>Privacy, as always</h3>
<p>Privacy is also a concern, as it now not only falls on the user to determine what settings to implement, but also whether developers will respect the TOS of the Facebook Platform.  The documentation <a href="http://developers.facebook.com/documentation.php?v=1.0&#038;method=users.getInfo">clearly outlines</a> what user (and user&#8217;s friend) info can be stored, but there is nothing to stop a developer from storing more of that information, either for the purposes of adding another feature to their application, or for malicious intent.  There have <a href="http://valleywag.com/tech/ilike/limitations-and-potential-of-new-facebook-applications-263728.php">already been concerns</a> about this with popular applications.</p>
<p>The issue of privacy here is that when you sign into a Facebook application, it basically has access to all of the friend info that you would - after all, it probably needs this to perform whatever function it does.  Though you may have consented to allow this information to be used, your friends may not have explicitly done this, and may not want their information to be used by 3rd party developers.  </p>
<p>Though users can opt-out of participation in the Facebook Platform altogether (and thus prevent 3rd party developers having access), I&#8217;m guessing that not everyone or even a majority know about this feature.  The privacy settings of Facebook have grown almost as much as the site itself, with settings divided amongst sections and sub-sections, creating somewhat of a convoluted mess.  The best precaution to all of this, of course, is not posting anything on the Internet that you wouldn&#8217;t want to be known in public.  This applies not only to Facebook but other sites as well; I only emphasize this for Facebook because of the notoriety its obtained from such incidents.</p>
<hr/>Copyright &copy; 2008 <strong><a href="http://unitstep.net">unitstep.net</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact <strong><a href="mailto:webmaster@unitstep.net">webmaster@unitstep.net</a></strong> for more information.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span>]]></content:encoded>
			<wfw:commentRss>http://unitstep.net/blog/2007/07/16/facebook-application-development-pros-and-cons/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Google Street View and privacy</title>
		<link>http://unitstep.net/blog/2007/06/12/google-street-view-and-privacy/</link>
		<comments>http://unitstep.net/blog/2007/06/12/google-street-view-and-privacy/#comments</comments>
		<pubDate>Tue, 12 Jun 2007 21:13:29 +0000</pubDate>
		<dc:creator>Peter Chng</dc:creator>
		
		<category><![CDATA[google]]></category>

		<category><![CDATA[maps]]></category>

		<category><![CDATA[privacy]]></category>

		<guid isPermaLink="false">http://unitstep.net/blog/2007/06/12/google-street-view-and-privacy/</guid>
		<description><![CDATA[
Google Street View has been out for some time now, and its proven to be a pretty nifty feature.  Basically, Google has sent out teams of people in vans with cameras mounted on the tops to major cities across the US.  They&#8217;ve been able to capture 360-degree panoramic views of most of the [...]]]></description>
			<content:encoded><![CDATA[<p class="image align-right"><a href="/wordpress/wp-content/uploads/2007/06/google-street-view.jpg" title="Google Street View" rel="lightbox"><img src="/wordpress/wp-content/uploads/2007/06/google-street-view.thumbnail.jpg" alt="Google Street View" /></a></p>
<p><a href="http://maps.google.com/help/maps/streetview/">Google Street View</a> has been out for some time now, and its proven to be a pretty nifty feature.  Basically, Google has sent out teams of people in vans with cameras mounted on the tops to major cities across the US.  They&#8217;ve been able to capture 360-degree panoramic views of most of the streets in these cities, so you can view the surrounding area on almost any point in the street.  This is obviously a great feature for people who find that maps don&#8217;t provide the best experience of &#8220;getting to know a place&#8221;.  With Street View, you get the best experience of seeing what a new place will be like, short of actually visiting it.  It&#8217;ll be great for visitors.  Google hopes to <a href="http://maps.google.com/support/bin/answer.py?answer=68384&#038;topic=11640">extend to feature to more cities</a> as time goes on.  </p>
<p>However, the new feature has obviously raised some privacy concerns, but perhaps not in the strictest legal sense.  Since Google has taken these pictures from public areas, they&#8217;re probably in the clear.  However, they is somewhat of a difference when these public pictures are being integrated into the largest search engine for everyone to see.</p>
<h3>Precedent</h3>
<p>This isn&#8217;t <a href="http://www.readwriteweb.com/archives/google_street_view_maps.php">first service</a> that&#8217;s offered street views.  Microsoft has also offered <a href="http://preview.local.live.com/">their own limited service</a>, and as one blogger <a href="http://www.gearthblog.com/blog/archives/2006/03/street_level_dr.html">pointed out over a year ago</a>, it would be cool if Google did the same.  However, as pointed out in <a href="http://www.readwriteweb.com/archives/google_street_view_maps.php">this article</a>, Google Street Views are far more detailed and interactive.  In addition to being able to rotate the view a full 360 degrees, you can zoom in.  The level of quality in most pictures is quite good, so zooming actually produces good results, rather than just a larger blurry section of an image.  You can easily see way down the street by zooming.  </p>
<h3>Privacy</h3>
<p>However, this has also raised what some have called &#8220;privacy concerns&#8221; - not in the &#8220;NSA wiretapping&#8221; sense, but rather in the &#8220;it&#8217;s freaky&#8221; sense.  A <a href="http://mashable.com/2007/05/31/top-15-google-street-view-sightings">list was quickly formed</a> for the most intriguing Street View sightings, and people also found one of a <a href="http://maps.google.com/maps?f=q&#038;hl=en&#038;q=181+west+san+carlos+st.&#038;sll=37.0625,-95.677068&#038;sspn=48.688845,82.265625&#038;ie=UTF8&#038;ll=37.333433,-121.889663&#038;spn=0.012028,0.020084&#038;z=16&#038;om=1&#038;layer=c&#038;cbll=37.330351,-121.889659&#038;cbp=1,345.096039112741,0.631646838164524,3">guy digging for gold</a>.  In another more <a href="http://mashable.com/2007/06/10/google-street-view-2/">innocent list</a>, we can see Steve Jobs&#8217; Mercedes.  In many of these images, you can easily see the faces of those involved.  Now, while it&#8217;s true that you could have easily seen all of this yourself going for a stroll down the street, it&#8217;s somewhat different when a picture is taken and put onto one of the most popular search engine maps for all to see. </p>
<p>Google has provided <a href="http://www.seroundtable.com/archives/013780.html">some recourse</a>, though.  If there&#8217;s an offending image, you can <a href="http://maps.google.com/support/bin/answer.py?answer=68385&#038;topic=11640">submit a complaint</a> to have it removed.  Indeed, some of the more racier images have been removed; however, while they&#8217;re no longer on Google Maps, they&#8217;ve forever been immortalized on the various blogs and forums that have covered this topic. </p>
<h3>A big deal?</h3>
<p>Now, don&#8217;t get me wrong.  I don&#8217;t think this is a major breach of privacy, but if a government had done this, do you think people would respond differently?  I think it&#8217;s great that Google is trying to improve their search and map functions through this &#8220;information fusion&#8221;.  Once it expands to more cities, it&#8217;ll definitely help people like me who get lost easily.  Being able to see and identify with a place before going there will be a huge aid.  Who knows, once they roll out holographic monitors, I may not even have to save up for that trip around the world&#8230;</p>
<hr/>Copyright &copy; 2008 <strong><a href="http://unitstep.net">unitstep.net</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact <strong><a href="mailto:webmaster@unitstep.net">webmaster@unitstep.net</a></strong> for more information.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span>]]></content:encoded>
			<wfw:commentRss>http://unitstep.net/blog/2007/06/12/google-street-view-and-privacy/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Facebook users vent their rage at &#8220;privacy&#8221; violations</title>
		<link>http://unitstep.net/blog/2006/09/06/facebook-users-vent-their-rage-at-privacy-violations/</link>
		<comments>http://unitstep.net/blog/2006/09/06/facebook-users-vent-their-rage-at-privacy-violations/#comments</comments>
		<pubDate>Thu, 07 Sep 2006 03:55:00 +0000</pubDate>
		<dc:creator>Peter Chng</dc:creator>
		
		<category><![CDATA[facebook]]></category>

		<category><![CDATA[privacy]]></category>

		<category><![CDATA[social networking]]></category>

		<guid isPermaLink="false">http://unitstep.net/blog/2006/09/06/facebook-users-vent-their-rage-at-privacy-violations/</guid>
		<description><![CDATA[It seems that many, if not most, Facebook users do not like the changes made to the system, namely, the addition of the &#8220;News Feed&#8221; and &#8220;Mini News Feed&#8221; that quickly allow one to see all the updates that another Facebook user has done.  In effect, these new additions make it easy to &#8220;track&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p>It seems that many, if not most, <a href="http://www.techcrunch.com/2006/09/06/facebook-users-revolt-facebook-replies/">Facebook users do not like</a> the <a href="http://unitstep.net/blog/2006/09/05/facebook-updates-its-interface-sort-of/">changes made</a> to the system, namely, the addition of the &#8220;News Feed&#8221; and &#8220;Mini News Feed&#8221; that quickly allow one to see all the updates that another Facebook user has done.  In effect, these new additions make it easy to &#8220;track&#8221; what another user has been up to, since the events are tagged with a timestamp and provide all the details.</p>
<p>Since the changes were put into effect, scarcely a day ago, a bevy of groups voicing their opposition to the new features have sprung up on Facebook.  Many of them are united by the fact that the news feeds crowd their starting page now, and cause information overload - something I criticized as well.  However, most of them decry these new features as being huge breaches of privacy and point to how they make stalking much easier.</p>
<h3>Outrage and calls for a boycott</h3>
<p>Many anti-changes-to-Facebook groups have been formed, but the largest one (with an <a href="http://www.petitiononline.com/faceb00k/petition.html">online petition</a>), has almost 500,000 users and is still growing at a fast pace.  Many of the comments talk about how this is turning Facebook into a ripe target for stalkers, and of the potential for privacy violation.  Certainly such a large group of people can&#8217;t be wrong?</p>
<h3>The Internet is not private</h3>
<p>While I admit these changes do help to advertise events in such a way that more people find out about them then some would like to, I don&#8217;t believe it fundamentally affects the privacy of Facebook.  As I mentioned <a href="http://unitstep.net/blog/2006/09/05/facebook-updates-its-interface-sort-of/">in my original post</a>, this will hopefully serve as a reminder that privacy on the Internet, even it is in a seemingly locked-away area such as Facebook, is something that one cannot expect.  If you post information in a publically-accessible area on the Internet, someone will eventually find out about it, either by mistake or through the <a href="http://en.wikipedia.org/wiki/Googlebot">Googlebot</a>.</p>
<p>However, much of these concerns are ill-placed.  As mentioned on the <a href="http://blog.facebook.com/blog.php?post=2208197130">Facebook blog</a>, no extra information that wasn&#8217;t publically available before, has been made available by this update.  All it does is aggregate all of that information about your friends in a way that&#8217;s supposed to make it easier for you to read.  I agree that some options should be provided - such as what information you want to see about your friends (eg. I don&#8217;t care that someone wrote on someone else&#8217;s wall), but if <strong>you</strong> don&#8217;t want your information showing up on other people&#8217;s feed, simple disable those options in your privacy setting in Facebook.</p>
<h3>Of more concern</h3>
<p>What&#8217;s more important to know is that Facebook is not a secretive place and has <strong>never</strong> been.  It&#8217;s <a href="http://arstechnica.com/news.ars/post/20060119-6016.html">already been used</a> to identify and charge people for offences because of photos someone posted of them committing incriminating acts.  It&#8217;s also <a href="http://www.tsl.pomona.edu/index.php?article=1401">reportedly used</a> by employers concerning potential employees.  Don&#8217;t think that someone can access your profile if they&#8217;re not in your network? <a href="http://yro.slashdot.org/article.pl?sid=06/07/11/183219">Patriot Act - Read it.</a> (Shameless <cite>Arrested Development</cite> quotation)  Are your <a href="http://www.privacyrights.org/ar/OnlinePersona.htm">privacy rights being violated</a> by all this information-gathering online? I don&#8217;t know, maybe; but the lesson here is clear.  If you don&#8217;t want to take the chance having something private revealed about yourself, don&#8217;t post it online. </p>
<h3>A rude wakeup</h3>
<p>Any big change to a user interface is bound to cause some uproar, but this has provoked extra outcry because it has outlined the true nature of &#8220;privacy&#8221; on Facebook.  The default privacy options are apparently too loose for most people, but most people haven&#8217;t bothered to change them - until now.  Additionally, many people simply add whomever requests it to their friends list, without thinking about whether they would want that person to be privy to certain bits of information they post on Facebook.  While I do think that some changes need to be made now with the new features, I don&#8217;t think that these new features intentionally infringe on privacy.  The features were meant to allow one to quickly share updates with friends - not with random people you just add as friends.  Perhaps people will exercise more discretion in the future, returning Facebook into a real &#8220;social&#8221; network.</p>
<h3>Update (2006-09-08)</h3>
<p>Facebook has responded by making changes to <a href="http://blog.facebook.com/blog.php?post=2208562130">allow users to choose</a> what updates and information they want to be displayed in the News Feed and their Mini-Feed.  Sounds like a good way to ease in the changes, as people can now decide what they want advertised - they should have done this from the beginning, but it&#8217;s nonetheless good to see them listening to their users.</p>
<hr/>Copyright &copy; 2008 <strong><a href="http://unitstep.net">unitstep.net</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact <strong><a href="mailto:webmaster@unitstep.net">webmaster@unitstep.net</a></strong> for more information.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span>]]></content:encoded>
			<wfw:commentRss>http://unitstep.net/blog/2006/09/06/facebook-users-vent-their-rage-at-privacy-violations/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Facebook updates its interface (sort of)</title>
		<link>http://unitstep.net/blog/2006/09/05/facebook-updates-its-interface-sort-of/</link>
		<comments>http://unitstep.net/blog/2006/09/05/facebook-updates-its-interface-sort-of/#comments</comments>
		<pubDate>Tue, 05 Sep 2006 16:49:30 +0000</pubDate>
		<dc:creator>Peter Chng</dc:creator>
		
		<category><![CDATA[facebook]]></category>

		<category><![CDATA[privacy]]></category>

		<category><![CDATA[social networking]]></category>

		<category><![CDATA[user interface]]></category>

		<guid isPermaLink="false">http://unitstep.net/blog/2006/09/05/facebook-updates-its-interface-sort-of/</guid>
		<description><![CDATA[Logged into Facebook today (who doesn&#8217;t?), and was surprised to see the new interface, added just in time for the new school year and thus just before increased usage of the service will start.  Added were two features - firstly, a &#8220;News Feed&#8221; that displays when you first login, instead of the old notification [...]]]></description>
			<content:encoded><![CDATA[<p>Logged into Facebook today (who doesn&#8217;t?), and was surprised to see the <a href="http://blog.facebook.com/blog.php?post=2207967130">new interface</a>, added just in time for the new school year and thus just before increased usage of the service will start.  Added were two features - firstly, a &#8220;News Feed&#8221; that displays when you first login, instead of the old notification page that most people just clicked through.  Besides displaying the usual notifications (new friend confirmations, upcoming birthdays), which have been relocated to a sidebar, it displays an <acronym class="uttInitialism" title="Really Simple Syndication">RSS</acronym>-style feed of all things that have changed or been updated with your friends. </p>
<p>Things like friends&#8217; status updates, new friends of your friends, new photos of your friends, and groups that you friends have joined/left are listed, along with when these actions occurred.  Basically, most of things that people would browse around for about their friends have been nicely aggregated into one page for easier viewing, making the service easier to use and more relevant.  However, it&#8217;s almost information overload - I don&#8217;t really need to see every status update on what groups my friends have joined or left, do I? </p>
<p>Additionally, it makes it easier to snoop in on what your friends are doing on Facebook and when they&#8217;re online - though it should be noted that all the information posted in the News Feed was always publically available; this just makes it dead simple to find all of it.</p>
<p>The second feature that was added was what they called a &#8220;Mini News Feed&#8221;, which is displayed on every personal user&#8217;s profile.  It&#8217;s basically the same as your News Feed, except it displays only the updates for that user/friend - again making it easier to keep track of your friends and what they&#8217;ve been doing, which only serves to further improve Facebook&#8217;s social networking ability. </p>
<p>I agree with <a href="http://www.techcrunch.com/2006/09/05/new-facebook-redesign-more-than-just-aesthetics/trackback/">TechCrunch&#8217;s assessment</a> that Facebook &#8220;gets it right&#8221; when it comes to social networking - it&#8217;s more about making the end user experience better than just about increasing pageviews, and often the two are conflict.  Other sites just can&#8217;t compare, in my opinion.  My only suggestion for improvement would be to have some sort of settings page for the News Feed so what&#8217;s displayed can be customized. </p>
<h3>Nothing&#8217;s changed with privacy?</h3>
<p>However, I disagree with Facebook&#8217;s assertion that these changes &#8220;<i class="quote">do not give out any information that wasn&#8217;t already visible</i>&#8220;, at least in principle.  While it&#8217;s true that before you could have constantly scanned each and every one of your friend&#8217;s profiles in order to figure out when they had joined certain groups, posted wall messages or otherwise updated their profile, this would have been very tedious and time-consuming, and only the most dedicated stalkers would be able to keep up.  The new features make this pseudo-voyeurism all too easy. (You could have also written a scraper before, but this would probably have been against the TOS)</p>
<p>If there&#8217;s anything good about these changes, it&#8217;s that it&#8217;ll make people think twice about posting <a href="http://arstechnica.com/news.ars/post/20060119-6016.html">incriminating</a> or otherwise <a href="http://yro.slashdot.org/article.pl?sid=06/07/11/183219&#038;from=rss">personal information</a> that they would normally want to remain private. It may also force users to reconsider who they really want to add as a &#8220;friend&#8221;, and how they want their privacy to be set. </p>
<h3>I&#8217;ll still be using it</h3>
<p>Of course, I&#8217;m not going to stop using Facebook.  But, maybe the way I use it will change - and most likely this will be true with many other users, if the newest Facebook group, entitled &#8220;<cite>The New Facebook is Freaky as Sh*t</cite>&#8220;, is in any indication of this.  How did I learn of this group? By noting that a friend of mine had just joined it, as of 11:17 PM EST tonight.</p>
<hr/>Copyright &copy; 2008 <strong><a href="http://unitstep.net">unitstep.net</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact <strong><a href="mailto:webmaster@unitstep.net">webmaster@unitstep.net</a></strong> for more information.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span>]]></content:encoded>
			<wfw:commentRss>http://unitstep.net/blog/2006/09/05/facebook-updates-its-interface-sort-of/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Google moves to add facial recognition to search?</title>
		<link>http://unitstep.net/blog/2006/08/20/google-moves-to-add-facial-recognition-to-search/</link>
		<comments>http://unitstep.net/blog/2006/08/20/google-moves-to-add-facial-recognition-to-search/#comments</comments>
		<pubDate>Sun, 20 Aug 2006 17:42:13 +0000</pubDate>
		<dc:creator>Peter Chng</dc:creator>
		
		<category><![CDATA[google]]></category>

		<category><![CDATA[image processing]]></category>

		<category><![CDATA[privacy]]></category>

		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://unitstep.net/blog/2006/08/20/google-moves-to-add-facial-recognition-to-search/</guid>
		<description><![CDATA[Google recently acquired Neven Vision, a company with lots of technology related to image-processing, specifically, facial recognition.  The initial aim of this IP acquisition is to integrate it with Picasa, Google&#8217;s photo-organizing tool and online-photo sharing site.  According to Google:
&#8220;It could be as simple as detecting whether or not a photo contains a [...]]]></description>
			<content:encoded><![CDATA[<p>Google <a href="http://www.informationweek.com/news/showArticle.jhtml?articleID=192201732&#038;subSection=Breaking+News">recently acquired Neven Vision</a>, a company with lots of technology related to image-processing, specifically, facial recognition.  The initial aim of this IP acquisition is to integrate it with <a href="http://picasa.google.com/">Picasa</a>, Google&#8217;s photo-organizing tool and online-photo sharing site.  According to Google:</p>
<blockquote cite="http://www.informationweek.com/news/showArticle.jhtml?articleID=192201732&#038;subSection=Breaking+News"><p>&#8220;It could be as simple as detecting whether or not a photo contains a person, or, one day, as complex as recognizing people, places, and objects&#8230;&#8221;</p>
</blockquote>
<p>The article goes on to speculate that Google may use this technology to enable searching of people on the web by any photos of them that may be online.  Certainly, this wouldn&#8217;t be out of their league, and <a href="http://gigaom.com/2005/11/16/googles-riya-designs/">they&#8217;ve been interested</a> in this technology for some time.  Indeed, Sergey Brin, one of the co-founders of Google, has said that <a href="http://tailrank.com/posts/562949953800800/Google_Buys_Neven_Vision">image recognition</a> is something they&#8217;d like to do.  Despite the obvious usefulness of such a tool, it does raise privacy questions.</p>
<h3>Add it to the list</h3>
<p>Besides facial recognition, Google is also active in <a href="http://www.micropersuasion.com/2006/06/google_execs_hi.html">voice recognition</a> as well, and have hinted at <a href="http://news.bbc.co.uk/2/hi/technology/5084870.stm">using it serve up web ads</a> based on what&#8217;s playing on your TV.  (Seems far-fetched at present time)  Their OCR technology is also pretty decent, as <a href="http://books.google.com/">Google Book Search</a> is able to search the full text of many books online, that have had their pages scanned in - the service will also hilite the relevant passages, a neat feature.  All of these services are part of Google&#8217;s future plans to catalogue and make available for search, all forms of information, not just those that are presented as text on the web. </p>
<h3>Do no evil</h3>
<p>Of course, while Google has been getting attention from privacy advocates as of late, because of the huge amounts of information they collect from users, they&#8217;re one of the better companies when it comes to keeping your information private.  When the US DOJ tried to get search engine companies to turn over anonymized search queries for their users, all of them complied except for Google, which took the matter to court and won.  Thus, Google is probably one of the companies you should worry about less, as there are many others who have less or no respect for an individual&#8217;s privacy.  However, <a href="http://unitstep.net/blog/2006/08/07/aol-releases-search-queries-for-650000-users-in-blatant-disregard-for-privacy/">recent blunders by AOL</a> have done little to quell the fears of privacy advocates.</p>
<p>Facial recognition is almost certainly used by governments, as it&#8217;s an important tool for them.  So, what&#8217;s the big deal with Google getting it, then, if it&#8217;s already in use?  Well, the problem is that this signals the technology is becoming more available and pervasive, and indeed, as that happens, we will become more used to it.  The fact that anything you put on the Internet can, and will, be viewable by all is only driven home more by the use of this technology - some people still don&#8217;t realize this, though.</p>
<h3>Reasonable expectation of privacy</h3>
<p>While it&#8217;s probably true that when you go out in public, you have little to no expectation of privacy, I doubt that this is a good thing overall.  Facial recognition technology, when fully developed, and combined with a network of security cameras could enable <em>automated</em> tracking of a person&#8217;s whereabouts - no need for them to be carrying a cellphone or other tracking device.  Is this a good thing?  Some would argue it would better enable the tracking of criminals and terrorists, but I believe the potential for abuse is very real and something that needs to be addressed. </p>
<p>While I don&#8217;t think Google would abuse this technology, others certainly would, which is why I think people need to be aware of what&#8217;s going on and for companies like Google to be open about what they&#8217;re planning on doing with it.</p>
<hr/>Copyright &copy; 2008 <strong><a href="http://unitstep.net">unitstep.net</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact <strong><a href="mailto:webmaster@unitstep.net">webmaster@unitstep.net</a></strong> for more information.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span>]]></content:encoded>
			<wfw:commentRss>http://unitstep.net/blog/2006/08/20/google-moves-to-add-facial-recognition-to-search/feed/</wfw:commentRss>
		</item>
		<item>
		<title>AOL search query data release: The aftermath</title>
		<link>http://unitstep.net/blog/2006/08/08/aol-search-query-data-release-the-aftermath/</link>
		<comments>http://unitstep.net/blog/2006/08/08/aol-search-query-data-release-the-aftermath/#comments</comments>
		<pubDate>Wed, 09 Aug 2006 00:44:11 +0000</pubDate>
		<dc:creator>Peter Chng</dc:creator>
		
		<category><![CDATA[aol]]></category>

		<category><![CDATA[privacy]]></category>

		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://unitstep.net/blog/2006/08/08/aol-search-query-data-release-the-aftermath/</guid>
		<description><![CDATA[As expected, there has been much discourse following AOL&#8217;s release of the search query records of 650,000 of their users over a three-month period.  After the publically-released data (which was originally put up for research purposes, with no ill intent), was discovered by a blogger, news subsequently spread throughout the online communities, prompting many [...]]]></description>
			<content:encoded><![CDATA[<p>As expected, there has been much discourse following <a href="http://unitstep.net/blog/2006/08/07/aol-releases-search-queries-for-650000-users-in-blatant-disregard-for-privacy/">AOL&#8217;s release of the search query records</a> of 650,000 of their users over a three-month period.  After the publically-released data (which was originally put up for research purposes, with no ill intent), was discovered by a blogger, news subsequently spread throughout the online communities, prompting many people to download the data and AOL to eventually pull the data from their site.  However, some of those who had already got the data began to offer it up for download, either from their servers or through Bittorrent, providing a very good example of &#8220;letting the cat outta the bag&#8221;. </p>
<h3>No end to the data</h3>
<p>Traditional news began covering the story today, with even <a href="http://www.time.com/time/business/article/0,8599,1224405,00.html">TIME magazine</a> penning a little piece about the incident and its relation with privacy rights, or the lack thereof.  Other people have been more enterprising, with <a href="http://www.techcrunch.com/2006/08/08/aol-data-first-web-interfaces-up/">someone setting up</a> <a href="http://www.aolsearchdatabase.com/">an online, searchable database</a> of the results, thus saving people the time of importing all the data into their own DBMS.  </p>
<p>After it was discovered that users were apparently searching for <a href="http://plentyoffish.wordpress.com/2006/08/07/aol-search-data-shows-users-planning-to-commit-murder/">&#8220;how to kill your wife&#8221;</a>, ValleyWag set up a <a href="http://valleywag.com/tech/aol/find-the-scariest-aol-user-search-record-192602.php">contest to find</a> the scariest search record.  Apparently, the online community&#8217;s appetite for the grotesque is insatiable.</p>
<p>Even more interesting were some of the <a href="http://www.valleywag.com/tech/aol/scariest-search-records-aol-saves-crew-of-oceanic-flight-815-192860.php">results  from the contest</a>: Someone apparently submitting a search query that looked very much like a &#8220;message in a bottle&#8221; of a castaway stranded on an island.  ValleyWag suspects a viral marketing campaign for the TV series <cite>Lost</cite>.  Or, maybe someone is actually stranded on an island in the South Pacific with a laptop and WiFi access - though that wouldn&#8217;t be such a bad life!</p>
<h3>Changes upcoming</h3>
<p>With all the bad publicity AOL has been getting as of late (problems with cancelling customer accounts, workforce reductions, etc.), this leak could not have come at a worse time.  Further complicating the situation is the fact that bloggers spread the information like wildfire, and helped the data get into wide release.  In fact, the previously mentioned <a href="http://www.aolsearchdatabase.com/">AOL Search Database</a> is in violation of the user agreement that came with the search data - it states that the information provided is only to be used for non-commercial (research) purposes.  The search database site is clearly displaying ads and the owner hoping to profit from it.</p>
<p>Companies are going to tighten up their policy on data release, and I wouldn&#8217;t expect any of them to be releasing data to the research community any time soon.  At the very least, the way they deal with search records will be kept even more discreet.</p>
<hr/>Copyright &copy; 2008 <strong><a href="http://unitstep.net">unitstep.net</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact <strong><a href="mailto:webmaster@unitstep.net">webmaster@unitstep.net</a></strong> for more information.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span>]]></content:encoded>
			<wfw:commentRss>http://unitstep.net/blog/2006/08/08/aol-search-query-data-release-the-aftermath/feed/</wfw:commentRss>
		</item>
		<item>
		<title>AOL releases search queries for 650,000 users in blatant disregard for privacy</title>
		<link>http://unitstep.net/blog/2006/08/07/aol-releases-search-queries-for-650000-users-in-blatant-disregard-for-privacy/</link>
		<comments>http://unitstep.net/blog/2006/08/07/aol-releases-search-queries-for-650000-users-in-blatant-disregard-for-privacy/#comments</comments>
		<pubDate>Mon, 07 Aug 2006 17:33:59 +0000</pubDate>
		<dc:creator>Peter Chng</dc:creator>
		
		<category><![CDATA[aol]]></category>

		<category><![CDATA[privacy]]></category>

		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://unitstep.net/blog/2006/08/07/aol-releases-search-queries-for-650000-users-in-blatant-disregard-for-privacy/</guid>
		<description><![CDATA[Well, the &#8220;blogosphere&#8221; (I hate using that word) and various online communities are abuzz with the news that AOL Research just released 20 million search queries of some 650,000 users over a three-month time period from March to May of 2006.  Though the data were released in order to provide &#8220;a real query log&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p>Well, the &#8220;blogosphere&#8221; (I hate using that word) and various online communities are abuzz with the news that AOL Research just <a href="http://www.techcrunch.com/2006/08/06/aol-proudly-releases-massive-amounts-of-user-search-data/">released 20 million search queries</a> of some 650,000 users over a three-month time period from March to May of 2006.  Though the data were released in order to provide &#8220;a real query log&#8221; to aid in search engine research, it constitutes a huge violation of privacy, as though usernames have been removed, they have been replaced by unique identifiers, which can be used to track an individual user&#8217;s searches, allowing information to be collected about them and a profile built.  </p>
<p>The download was taken offline sometime yesterday (August 6th, 2006), but enough people had already downloaded it to ensure that <a href="http://www.gregsadetsky.com/aol-data/">mirrors</a> would quickly be set up, ensuring its continued spread.  While the intent of this release of data was obviously not malicious, it was a poorly thought-out move that AOL is sure to receive more bad PR for - especially in light of their <a href="http://www.courant.com/business/hc-aol0804.artaug04,0,3506664.story?coll=hc-headlines-business">recent troubles</a>.</p>
<h3>Good intent, bad execution</h3>
<p>The dataset was first released sometime on or before August 4th, 2006, on <a href="http://research.aol.com">AOL Research</a>.  Like Google, Yahoo! and Microsoft, AOL maintains a Research site to inform and help members of the academic and developer community of things they&#8217;re up to, and to offer assistance to researchers.  As <a href="http://72.14.207.104/search?q=cache:2Qvd2z9VbuIJ:research.aol.com/pmwiki/pmwiki.php%3Fn%3DResearch.500kUserQueriesSampledOver3Months+&#038;hl=en&#038;gl=us&#038;ct=clnk&#038;cd=1">Google&#8217;s cache</a> of the former download page indicates, the intent of offering this download was to facilitate research into making search engine technologies better, by offering a look into real users&#8217; search queries and behaviours.  </p>
<p>Having a little bit of experience in the research field (I&#8217;m currently working as a summer undergraduate research student), I can say that it&#8217;s not unheard of to see industry researchers helping out the academic community, either through joint research projects or, as in this case, the release of datasets for testing some particular sort of algorithm.</p>
<p>For example, one of the projects of the researchers in my lab is speech quality assessment, which is essentially developing an algorithm or system that can process a speech file and assess its quality as a human would, which has enormous benefits for Telco&#8217;s since they are always seeing what speech codecs are most efficient in terms of size/quality.  In order to do this, one needs a vast quantity of voice files (and their associated Mean Opinion Scores - rated by humans) to test around.  These voice files and their data can only be obtained through costly (both in time and money) trials that cost <strong>lots</strong> of money - we&#8217;re talking six-figures and above here.  Much of the data can thus only be obtained by Telcos who have the money to conduct these trials.  Fortunately, many of them have been gracious enough to release these datasets to our lab (and others) - <strong>but the data is only released after a strict <abbr title="Non-Disclosure Agreement">NDA</abbr> (Non-Disclosure Agreement) is signed</strong>, along with perhaps other agreements.  </p>
<p>Thus AOL&#8217;s intent certainly was not bad, as they merely wanted to help out in the academic circles, and maybe get cited in a paper or two.  However, their execution of this release was poor.  They should not have released this data publically, allowing just anyone to download it, but rather have followed a pattern of <strong>releasing only under an <abbr title="Non-Disclosure Agreement">NDA</abbr></strong> and only to parties that they thought were reasonably going to use it for research purposes.  In many ways, it&#8217;s worse than releasing data they paid for, because much of the data constitutes the private data of regular AOL users, data that <em>perhaps</em> they agreed to protect under AOL&#8217;s privacy policy.  </p>
<h3>Privacy concerns?</h3>
<p>Some of you might be wondering why this matters at all.  After all, it&#8217;s just a few million search queries that AOL users entered - containing no personally identifiable information.  Well, that&#8217;s true - however, since the usernames were merely replaced by unique identifiers (eg. a username is changed to a random, unique number, so &#8220;John Doe&#8221; is always mapped to &#8220;123456&#8243;, for example), profiles of users&#8217; searches can be built - opening so many cans of worms that I can&#8217;t even count that high.</p>
<p>Here&#8217;s a complete breakdown of what information is provided for each query:</p>
<dl>
<dt>AnonID</dt>
<dd>an anonymous user ID number.</dd>
<dt>Query</dt>
<dd>the query issued by the user, case shifted with most punctuation removed.</dd>
<dt>QueryTime</dt>
<dd>the time at which the query was submitted for search.</dd>
<dt>ItemRank</dt>
<dd>if the user clicked on a search result, the rank of the item on which they clicked is listed.</dd>
<dt>ClickURL</dt>
<dd>if the user clicked on a search result, the domain portion of the <acronym class="uttInitialism" title="Uniform Resource Locator">URL</acronym> in the clicked result is listed.</dd>
</dl>
<p>This is an absolute <em>goldmine</em> of information to anyone concerned with search engines, SEO or anything related to that.  Not only do you get a complete profile of a user&#8217;s searches, but you get the exact time they conducted the search, what, if any, results they clicked on (and the rank of that result), along with the domain name of site the clicked-through to.  Running datamining techniques on this information is somewhat trivial, as it can easily be analyzed to determine patterns of most any sort. </p>
<p>Internet marketers would give an arm and a leg for this sort of data, but they now will have their hands on it - <em>for free</em>.  In fact, there&#8217;s <a href="http://plentyoffish.wordpress.com/2006/08/06/aol-releases-googles-most-prized-keyword-list-google-is-gonna-get-mega-spammed/">already been</a> analysis on the dataset, producing valuable results, and claims that Google will get spammed with made-for-adsense sites (a clear violation of their TOS) targetting keywords for popular and profitable markets.  This will further pollute search results, and generate more noise, creating more headaches for search engines like Google, who will have to further adapt to prevent the <a href="http://unitstep.net/blog/2006/07/31/comment-spam-evolution/">signal-to-noise ratio from further deterioration</a>. </p>
<p>However, this dataset probably (or hopefully?) won&#8217;t have a lasting effect for spammers.  As <a href="http://www.thoughtmarket.com/blogarch/2006/08/how_useful_will_1.php">Thought Market points out</a>, since everyone has access to this data, so does Google as well.  And Google has proved themselves not to be slackers on developing data analysis techniques - after all, how did they become arguably the most successful search engine? You can bet they will be analyzing this dataset to see what results <em>they can get</em>, and hypothesizing about how these results might be used.  If they seen anything fishy going on, you can bet they&#8217;ll remove those sites from their index. </p>
<h3>Justice is blind</h3>
<p>But perhaps the most important or unnerving aspect of the release of this dataset is the implications it has on legal matters.  The data released is <em>exactly the sort of data the <a href="http://www.boingboing.net/2006/01/19/_doj_search_requests.html">US DOJ requested</a></em> from the major search engines (Google, Microsoft, Yahoo! and AOL) earlier this year to &#8220;defend its argument that the Child Online Protection Act is constitutionally sound&#8221;.  Microsoft, Yahoo! and AOL complied, while Google resisted the subpoena - and when the matter went to court, the judge ruled <a href="http://googleblog.blogspot.com/2006/03/judge-tells-doj-no-on-search-queries.html">in Google&#8217;s favour</a>.</p>
<p>Taken in this context, things do not look good for AOL - while Google&#8217;s fighting to protect your privacy, AOL&#8217;s actively working in the opposite direction, it seems.  I realize it was not meant to be that way, but with something this serious, the effect is sometimes far more important than the intent.  Looking at the raw search queries of some users reveals things ranging from embarrasing or weird searches, to queries that may <a href="http://plentyoffish.wordpress.com/2006/08/07/aol-search-data-shows-users-planning-to-commit-murder/">indicate they&#8217;re up to no good</a>.</p>
<p>Let&#8217;s be clear here - simply searching for &#8220;how to kill your wife&#8221; is not tantamount to murder - nor is it even enough proof of conspiracy to commit murder.  At least, that&#8217;s how I see things from my &#8220;not a lawyer&#8221; perspective.  However, consider a situation where someone <em>did kill</em> their wife.  If their search results were made available, and this query turned up, how do you think law makers would respond?  Or, consider sample size of 1000 murderers, and their search queries.  If it were statistically established that a significant percentage of murderers search on the Internet for some aspect of murder before the actual crime, how would law makers respond? </p>
<p>These are complex questions, and I&#8217;m not sure what I&#8217;d do with the data.  Statistics is a complex topic, but the important fact is they are often abused to support invalid and over-reaching actions.  This obviously leads to questions and issues like &#8220;thought crime&#8221; and comparisons with <cite>Minority Report</cite>.  I generally don&#8217;t like using slippery-slope arguments, but this is an area where it&#8217;s easy to see how abuse could happen, considering that the US DOJ has already expressed an interest in this sort of data, and with all the <a href="http://www.cnn.com/2005/POLITICS/12/17/bush.nsa/">NSA wiretaps</a> going on.  </p>
<p>There were also many other weird, and often disgusting search queries going on, often by the same set of users.  I won&#8217;t repeat them here, but you can easily <a href="http://plentyoffish.wordpress.com/2006/08/07/aol-search-data-shows-users-planning-to-commit-murder/">see for yourself</a> what some AOL users were searching for.  While these searches represent a very small percentage of all the searches conducted, it doesn&#8217;t paint a good picture of the Internet for lawmakers.  (Keep in mind that these sort of searches are to be expected - after all, even if the incident rate for violent crime is something low like 27 per 100 000, that&#8217;s still 0.027%, and keep in mind this search query dataset represents over 650,000 users.)</p>
<p>Note that I haven&#8217;t even touched on the topic of false positives.  What if someone was writing a paper on the effects of torture, or of any of the other examples of &#8220;man&#8217;s inhumanity to man&#8221;?  They&#8217;d obviously have to search for some pretty disturbing stuff (I&#8217;m glad I&#8217;ve never had such a writing assignment), and then these searches would be tied together by the unique identifier.  There are plenty of other examples, but the idea is that using someone&#8217;s search queries for legal purposes is an easy way to <cite>1984</cite> - while there obviously may be statistical connections between actual crimes, the potential for &#8220;though crime&#8221; is just too large.</p>
<h3>Waves and reverberations</h3>
<p>As far as I can tell, this story (and thus the wide release of the data) only started sometime this weekend, probably on Friday or Saturday.  But, it&#8217;s already making its way around the non-traditional news outlets (blogs, social communities) like fire through a tissue factory.  There&#8217;s been talk of how <a href="http://www.ugcs.caltech.edu/~dangelo/aol-search-query-logs/">personally identifiable information</a> may be available, and of the <a href="http://plentyoffish.wordpress.com/2006/08/07/aol-search-data-shows-users-planning-to-commit-murder/#comments">legal ramifications</a>.  It&#8217;s already in the top five among <a href="http://technorati.com/">Technorati</a> searches, so you can believe that anyone who wants this data will get it.  If this doesn&#8217;t hit the mainstream news soon, I&#8217;ll be very surprised and disappointed - so far there&#8217;s <a href="http://news.google.com/news?hl=en&#038;ned=us&#038;q=aol+data&#038;btnG=Search+News">only one result</a> for it in Google News. </p>
<h3>Update</h3>
<p>AOL has apparently responded to this privacy leak, in a comment <a href="http://plentyoffish.wordpress.com/2006/08/07/aol-search-data-shows-users-planning-to-commit-murder/">available here</a>.  </p>
<p>Nothing really new or interesting - other than the data represented only about 1.5% of all AOL users, and those included were only US users who used AOL&#8217;s client software.  As expected, Mr. Weinstein of AOL said that it was an &#8220;innocent enough attempt to reach out to the academic community with new research tools&#8221;, but as expected, it didn&#8217;t go through official channels for approval.  You can be sure that AOL&#8217;s going to change their policy on this sort of stuff - don&#8217;t expect any more stuff from AOL research for a while.  (Someone or some people probably also got fired.)</p>
<hr/>Copyright &copy; 2008 <strong><a href="http://unitstep.net">unitstep.net</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact <strong><a href="mailto:webmaster@unitstep.net">webmaster@unitstep.net</a></strong> for more information.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span>]]></content:encoded>
			<wfw:commentRss>http://unitstep.net/blog/2006/08/07/aol-releases-search-queries-for-650000-users-in-blatant-disregard-for-privacy/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
