<?xml version="1.0" encoding="utf-8" ?>

<rss version="2.0" 
   xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
   xmlns:admin="http://webns.net/mvcb/"
   xmlns:dc="http://purl.org/dc/elements/1.1/"
   xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
   xmlns:wfw="http://wellformedweb.org/CommentAPI/"
   xmlns:content="http://purl.org/rss/1.0/modules/content/"
   >
<channel>
    <title>The Hacker Factor Blog</title>
    <link>http://www.hackerfactor.com/blog/</link>
    <description>Tools, Techniques, and Tangents</description>
    <dc:language>en</dc:language>

<item>
    <title>Google Abuse</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/484-Google-Abuse.html</link>
            <category>FotoForensics</category>
            <category>Image Analysis</category>
            <category>Network</category>
            <category>Programming</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/484-Google-Abuse.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=484</wfw:comment>

    <slash:comments>6</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=484</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    I watch my web logs closely. I&#039;m basically looking for possible attacks, unexpected errors, and other forms of abuse. If the error is on the server side, I want to fix it as fast as possible. If the error is from the user, then I want to understand and address the problem.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;The Joys of HTML&lt;/H3&gt;&lt;br /&gt;
One of the most common problems that I have been seeing at the &lt;a href=&quot;http://fotoforensics.com/&quot;&gt;FotoForensics&lt;/a&gt; site concerns the upload-by-URL option. Users are supposed to paste in a URL to an image, and the site will retrieve and analyze the picture.&lt;br /&gt;
&lt;br /&gt;
Unfortunately, many users don&#039;t understand the difference between a URL to a picture and a URL to a web page that contains a picture. I&#039;m mostly seeing this problem with URLs from other hosting sites (e.g., flickr or imgur) and from Facebook. These web pages contain one big picture (likely the one the user wanted to analyze) and a bunch of smaller pictures in the header, along the margin, and at the bottom.&lt;br /&gt;
&lt;br /&gt;
I finally came up with a solution. If the user uploads HTML, then the site now displays a small notice about right-clicking on the image to find the URL to the image. I thought it was kind of wordy, but I&#039;ve already seen users upload HTML, pause, and then resend the URL to the big picture on the web page. So this looks like a fast fix via a better error message.&lt;br /&gt;
&lt;br /&gt;
After I get a little more infrastructure implemented on the back-end of the service, I&#039;ll release another solution to this problem which will be much more convenient.  Hopefully it will be released by the end of this month.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Slammed By Google&lt;/H3&gt;&lt;br /&gt;
One of the reasons I looked into the &quot;uploading HTML&quot; issue is that I saw a huge increase in uploads making this mistake. Basically, I had one user who was uploading tons of pictures from imgur, but was specifying the URL to the HTML page and not to the images.&lt;br /&gt;
&lt;br /&gt;
Even after I put up the message telling users how to find the image URL, I still had this one user uploading imgur HTML. The uploads happened every few seconds. So it looks like a bot. And then I noticed the IP address and user-agent: Google.&lt;br /&gt;
&lt;blockquote&gt;66.249.72.152 - - [13/May/2012:06:25:17 -0500] &quot;GET /upload-url.php?url=http%3A%2F%2Fimgur.com%2F3iCJ8 HTTP/1.1&quot; 200 1594 &quot;-&quot; &quot;Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)&quot;&lt;br /&gt;
66.249.72.152 - - [13/May/2012:06:25:38 -0500] &quot;GET /upload-url.php?url=http%3A%2F%2Fimgur.com%2F32yOZ HTTP/1.1&quot; 200 1594 &quot;-&quot; &quot;Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)&quot;&lt;br /&gt;
66.249.72.152 - - [13/May/2012:06:25:45 -0500] &quot;GET /upload-url.php?url=http%3A%2F%2Fimgur.com%2FUamV7 HTTP/1.1&quot; 200 1594 &quot;-&quot; &quot;Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)&quot;&lt;br /&gt;
66.249.72.152 - - [13/May/2012:06:25:59 -0500] &quot;GET /upload-url.php?url=http%3A%2F%2Fimgur.com%2F32GkV HTTP/1.1&quot; 200 1594 &quot;-&quot; &quot;Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)&quot;&lt;br /&gt;
...&lt;/blockquote&gt;&lt;br /&gt;
66.249.72.&lt;i&gt;xx&lt;/i&gt; is Google and the user agent is Googlebot. Anyone who runs a web service knows that Google&#039;s indexing system will attempt to &lt;a href=&quot;http://productforums.google.com/forum/#!category-topic/webmasters/crawling-indexing--ranking/XT0LVDoO5-0&quot;&gt;submit crap&lt;/a&gt; to any form it finds. They want to find local search engines on blogs and web sites and index the results. This way, if someone searches for &quot;hypnophonic&quot;, then Google can direct them to whatever results page my own site&#039;s search would find. If Googlebot finds the search form on your blog, then it will submit all sorts of random words in order to find possible results. And it will do it for days.&lt;br /&gt;
&lt;br /&gt;
In this case, Google seems to know that &lt;a href=&quot;http://fotoforensics.com/&quot;&gt;FotoForensics&lt;/a&gt; analyzes images. And it knows that the non-Google service &lt;a href=&quot;http://imgur.com/&quot;&gt;imgur&lt;/a&gt; contains images. So Google was trying to submit every image at imgur to FotoForensics for analysis. I&#039;m not kidding.&lt;br /&gt;
&lt;br /&gt;
This uploading by Google was failing because they were submitting URLs to HTML web pages and not to the actual images. It started on Sat May 12 22:57:04 2012. I stopped it on Sun May 13 11:15:17 2012. (All times are CDT.) During those 12 hours, Googlebot submitted 1,061 URLs. My logs for them look like:&lt;br /&gt;
&lt;pre&gt;...&lt;br /&gt;
[Sat May 12 23:17:36 2012] [client 66.249.72.152] POST url: http://imgur.com/5C4Ul&lt;br /&gt;
[Sat May 12 23:18:04 2012] [client 66.249.72.152] POST url: http://imgur.com/w3Cwo&lt;br /&gt;
[Sat May 12 23:18:31 2012] [client 66.249.72.152] POST url: http://imgur.com/0KByw&lt;br /&gt;
[Sat May 12 23:18:59 2012] [client 66.249.72.152] POST url: http://imgur.com/XwYEC&lt;br /&gt;
[Sat May 12 23:19:27 2012] [client 66.249.72.152] POST url: http://imgur.com/gDnDG&lt;br /&gt;
[Sat May 12 23:19:55 2012] [client 66.249.72.152] POST url: http://imgur.com/DiBKf&lt;br /&gt;
[Sat May 12 23:20:17 2012] [client 66.249.72.152] POST url: http://imgur.com/EYmG1&lt;br /&gt;
[Sat May 12 23:20:18 2012] [client 66.249.72.152] POST url: http://imgur.com/sMleL&lt;br /&gt;
[Sat May 12 23:20:22 2012] [client 66.249.72.152] POST url: http://imgur.com/zUYB6&lt;br /&gt;
[Sat May 12 23:20:50 2012] [client 66.249.72.152] POST url: http://imgur.com/Xh3oI&lt;br /&gt;
[Sat May 12 23:21:18 2012] [client 66.249.72.152] POST url: http://imgur.com/JrmZb&lt;br /&gt;
[Sat May 12 23:21:45 2012] [client 66.249.72.152] POST url: http://imgur.com/7aHsS&lt;br /&gt;
[Sat May 12 23:22:13 2012] [client 66.249.72.152] POST url: http://imgur.com/vMcRj&lt;br /&gt;
[Sat May 12 23:22:41 2012] [client 66.249.72.152] POST url: http://imgur.com/yXNh1&lt;br /&gt;
[Sat May 12 23:23:09 2012] [client 66.249.72.152] POST url: http://imgur.com/RF4Lj&lt;br /&gt;
[Sat May 12 23:23:36 2012] [client 66.249.72.152] POST url: http://imgur.com/MJwDN&lt;br /&gt;
[Sat May 12 23:24:04 2012] [client 66.249.72.152] POST url: http://imgur.com/xF00H&lt;br /&gt;
[Sat May 12 23:24:32 2012] [client 66.249.72.152] POST url: http://imgur.com/wAXDI&lt;br /&gt;
[Sat May 12 23:25:00 2012] [client 66.249.72.152] POST url: http://imgur.com/B8bon&lt;br /&gt;
[Sat May 12 23:25:27 2012] [client 66.249.72.152] POST url: http://imgur.com/SOVBZ&lt;br /&gt;
...&lt;/pre&gt;&lt;br /&gt;
The pictures follow a theme, so it looks like Googlebot was doing what web bots do well: traverse a set of links.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Do No Evil&lt;/H3&gt;&lt;br /&gt;
The biggest problem with blocking Google is that I want them to index and link to the site. I just don&#039;t want them to submit crap to my service, or try to submit all of the content from some other service to my service. (I view this as a service abuse.) Moreover, there is no reliable method for telling Google to not post content to a web form.&lt;br /&gt;
&lt;br /&gt;
However, I did find a solution. I&#039;m blocking them in my .htaccess file:&lt;br /&gt;
&lt;blockquote&gt;SetEnvIf User-Agent &quot;.*Googlebot.*&quot; search_bot=1&lt;br /&gt;
SetEnvIf Request_URI &quot;^/upload.*&quot; has_upload=1&lt;br /&gt;
&lt;br /&gt;
RewriteEngine on&lt;br /&gt;
&lt;br /&gt;
RewriteCond %{env:search_bot} 1&lt;br /&gt;
RewriteCond %{env:has_upload} 1&lt;br /&gt;
RewriteRule $ - [R=404,L]&lt;/blockquote&gt;&lt;br /&gt;
This sets the environment variable &quot;search_bot&quot; to 1 if the user-agent contains the string &quot;Googlebot&quot;. And if the URL begins with &quot;/upload&quot; (where all of my submission forms go), then set the flag &quot;has_upload&quot;. Finally, I have a rewrite rule: if it is a search bot and it has upload, then block them with a 404 error. This request will never even get to my back-end server script. Googlebot no longer abuses my site and it no longer uses my service to abuse imgur.&lt;br /&gt;
&lt;br /&gt;
They can still index the site. They can still index links to results that other people post to other sites. But Googlebot can no longer post new content to my site.&lt;br /&gt;
&lt;br /&gt;
In the first hour that this was implemented, Googlebot received 38 &quot;404&quot; errors. I wonder how long it will take before Googlebot realizes that all of its uploads will fail...&lt;br /&gt;
&lt;br /&gt;
Truthfully, I don&#039;t think this abuse is coming from the main Google search engine. When the main Googlebot finds a form, it will &lt;a href=&quot;http://googlewebmastercentral.blogspot.com/2008/04/crawling-through-html-forms.html&quot;&gt;slam it with requests&lt;/a&gt;. Thousands upon thousands of them. I really suspect that this is some Google employee trying out a &lt;a href=&quot;http://www.american.com/archive/2009/april-2009/Success-on-the-Side&quot;&gt;side-project&lt;/a&gt;. This explains the low request volume (well, low for Google) and the fact that every request is coming from the same network address. (The main Googlebot uses a variety of network addresses.) Some Google employee probably thought it would be fun to submit every imgur picture through my FotoForensics site. Nevermind that they were submitting bad image URLs, nevermind that they are ignoring return codes and ignoring results, and nevermind that they are abusing imgur by harvesting their content and sending it to another web service.&lt;br /&gt;
&lt;br /&gt;
I wonder how &quot;do no evil&quot; compares against &quot;just be stupid&quot;? 
    </content:encoded>

    <pubDate>Sun, 13 May 2012 10:47:13 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/484-guid.html</guid>
    
</item>
<item>
    <title>A Different Request</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/483-A-Different-Request.html</link>
            <category>FotoForensics</category>
            <category>Image Analysis</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/483-A-Different-Request.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=483</wfw:comment>

    <slash:comments>9</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=483</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    The &lt;a href=&quot;http://fotoforensics.com/&quot;&gt;FotoForensics&lt;/a&gt; web site has been far more successful that I originally expected. Not knowing any better, we allocated enough resources to managed at least 250,000 pictures over a two year period. I got that from the original errorlevelanalysis.com web site -- which processed about 250,000 pictures in two years. Although the first month had relatively little traffic (5038 unique pictures), it has been steadily ramping up. April shot up to 16,421 unique pictures analyzed -- including one day with 2,545 unique uploads. At the current rate, we expect to processed over 100,000 unique pictures by October or November (far faster than the original estimate). For this reason, we are taking the server down for a few hours this Sunday and increasing the storage capacity.&lt;br /&gt;
&lt;br /&gt;
In my &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/472-Use-Cases.html&quot;&gt;previous update&lt;/a&gt; I discussed the &quot;porn problem&quot;. Since then, we&#039;ve designed a multi-stage filtering solution and it is partially implemented; porn dropped from 5% in March to 2% in April. We are getting much better at stopping pornographic uploads.&lt;br /&gt;
&lt;br /&gt;
When people get blocked for uploading porn, we don&#039;t just give them 404 errors. Instead, we show them a simple page that says to read the FAQ. The FAQ has a red highlight telling them to read the section about being banned for uploading porn. About 70% of the people who are banned read the FAQ. The remaining people didn&#039;t return to the site so they never saw the ban.&lt;br /&gt;
&lt;br /&gt;
The FAQ also includes an email address where people can protest being banned. I really didn&#039;t expect anyone to write in. But so far, I&#039;ve had two people contact me. The first person claimed that he didn&#039;t upload porn. After I reminded him about when and what he uploaded, he changed his story to claim that it was an accident. (If you saw the series of pictures he uploaded, you wouldn&#039;t believe it was an accident.) So he is still banned.&lt;br /&gt;
&lt;br /&gt;
The second person wrote in last night. She was irate at being blocked. Her email subject was &quot;A different request&quot;; she requested being banned forever. Between the venomous attacks and harsh wording, she brings up a number of points that I&#039;ve discussed with other people but have not made public. So I&#039;m reprinting my reply here, with her letter included in bold. Since I do believe in privacy, I have redacted personally identifiable information (name, age, etc.) in square brackets [redacted] and given her name the anonymous letter &quot;B&quot;.&lt;br /&gt;
&lt;br /&gt;
ps. You will notice that I bounce between &quot;me&quot; and &quot;we&quot;. I do have a few people helping me with the site. I also corrected a few of my spelling/grammar errors.&lt;br /&gt;
&lt;br /&gt;
&lt;H1&gt;A different request&lt;/H1&gt;&lt;br /&gt;
Dear [B],&lt;br /&gt;
&lt;br /&gt;
Thank you for taking the time to detail your concerns about FotoForensics. My responses to each of your issues are included below.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;On Thu, May 03, 2012 at 09:49:14PM [redacted timezone], [B] wrote:&lt;br /&gt;
Dear fotoforensics.com nerd(s),&lt;br /&gt;
&lt;br /&gt;
I do not want you to reinstate my abilities to use your crappy website, nor do I want any uploaded pictures removed. I am requesting something else, and I&#039;ll get to that right after I elaborate as to why I&#039;m a little pissed off.&lt;br /&gt;
&lt;br /&gt;
1. No one reads the god damn terms of service... ever. I don&#039;t read them so often I can&#039;t even remember if your site had any prior to using it or not.&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
The site has always had the terms of service in the FAQ. According to the web logs, the FAQ is the second most popular page on the site, after the tutorial.&lt;br /&gt;
&lt;br /&gt;
You might want to look at the terms of service from other picture hosting web sites. Such as Google&#039;s Picasa (&lt;a href=&quot;http://picasa.google.com/policy.html&quot;&gt;http://picasa.google.com/policy.html&lt;/a&gt;), yfrog (&lt;a href=&quot;http://yfrog.com/page/tos&quot;&gt;http://yfrog.com/page/tos&lt;/a&gt;), and Twitpic (&lt;a href=&quot;http://twitpic.com/terms.do&quot;&gt;http://twitpic.com/terms.do&lt;/a&gt;). You don&#039;t need to read the whole text -- just search for the word &quot;pornography&quot;. They all say the same basic thing: pornography is not permitted, pornography will be removed, and you may have your account blocked. Why would you assume that my site would be any different? Had you uploaded the same genitalia picture to Picasa, yfrog, Twitpic, Flickr, Photobucket, Zooomer, or most other image sharing sites, you would have had your account blocked as soon as they noticed.&lt;br /&gt;
&lt;br /&gt;
Some sites, like Picasa, appear to use Google&#039;s powerful image search engine and pagerank system to flag potential pornography. Other sites seem to review pictures that suddenly become popular. (Post the link to a porn web site and traffic will increase dramatically.) And still other services have a room full of people who review content, or leave it up to visitors to flag content as inappropriate.&lt;br /&gt;
&lt;br /&gt;
As a research site, we&#039;re actively developing a multi-stage porn filter system. The first two stages are already functional. (This is how your pictures were caught.) The next stage will be coming online this month. The final filter is already built and currently being tested; when deployed, this will permit real-time automated detection. And while these filters are currently being designed for the pornography problem, they are really generic and could easily be modified for other tasks.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;2. I don&#039;t imagine too many people begged you to host these images. It&#039;s probably safe to assume that the majority of your web traffic comes from online daters who want to see if their new girlfriend is enlarging her tits or what not. Therefore the hosting aspect is simply bullshit. I just can&#039;t imagine the sharing factor in this unless you&#039;re trying to encourage people to use your site in combination with facebook/twitter to ridicule people who photoshop their pictures. If so, that&#039;s pretty pathetic and probably not very well received considering how facebook is already taking shit for being a popular tool for high school bullying. And since the news tells me that high school bullying directly causes suicide and psycho rampages... well, all I can say is shame on you, shame, shame.&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
You mention a number of points here.&lt;br /&gt;
&lt;br /&gt;
Although nobody begged me, I was asked politely by a handful of people to fill the hole that errorlevelanalysis.com created when it went offline. For this reason, I chose to start fotoforensics.com as a service for the world-wide community.&lt;br /&gt;
&lt;br /&gt;
The main themes of the uploaded pictures are not sexual in nature. Rather, they fall into three main categories: testing, evaluation, and debunking. Since the purposes of this site are (1) research, and (2) give people an opportunity to try out some real photo forensic methods, I think it has been a smashing success. Remember: this is not a generic file sharing site like yfrog or Twitpic; this is a research-oriented web service.&lt;br /&gt;
&lt;br /&gt;
I do not tell people what to upload or how to use the pictures that they upload. (Although I do specify what not to upload.) And due to legal liability, I cannot even provide any automated interpretation of the results. The site applies a test and leaves the interpretation to the viewer. The tutorial and FAQ are designed to help people understand the results and how to use the site.&lt;br /&gt;
&lt;br /&gt;
While the majority of pictures are not shared, a few have become very popular. For example, the most popular picture in the last week is of a tweet. While I don&#039;t read Arabic and I don&#039;t know the details behind this picture or the controversy, I do know that it is very popular and they even put a site-by-side comparison on a web site:  &lt;a href=&quot;http://mz-mz.net/38282/&quot;&gt;http://mz-mz.net/38282/&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
As of this morning, the site is hosting 30,727 unique images. (That averages to about 10,000 a month, but the average is skewed since the average daily upload rate is still increasing.) Of the 30,727 images, 1,070 have been blocked -- either due to pornographic content or by request. That means that about 97% of the images are non-sexual in nature.&lt;br /&gt;
&lt;br /&gt;
Your final point in issue #2 concerns the distribution of images. As stated in the FAQ, we do not publicly provide the URLs to images that have been uploaded. So far, we only publicly mention links that other people have already posted publicly.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;3. The fact that you even have admins sitting around viewing each picture is very creepy in itself. Since I&#039;ve already determined that no one wants cloud storage from you, for anything constructive anyway, why not just let people view the picture comparisons and let that be the end of it? But hey, it&#039;s your website, you want to ban people go right ahead. There are plenty of sites just like this that do exactly what I&#039;m talking about. I&#039;m just saying if you&#039;d be less of a dick you might get more web traffic and then you wouldn&#039;t have to panhandle for donations.&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Before I even agreed to run this service, before I coded the first line of the server, I was discussing my concerns related to hosting this site with attorneys and law enforcement officers. The main problem is due to the US legal system: some types of pictures are illegal in the United States. The main issues are related to child pornography and harassment.&lt;br /&gt;
&lt;br /&gt;
To make it clear how strict (and screwed up) US laws are related to child porn: there have been court cases related to child porn where the judge, jury, and legal counsels on both sides never saw the pictures. Just showing them the pictures means that they would be in possession and distributing prohibited content. Imagine a court case where nobody can see the evidence -- that&#039;s a child porn lawsuit.&lt;br /&gt;
&lt;br /&gt;
With regards to reviewing content, this goes back to the legal issue. Since it is &quot;my server&quot; and the law will hold me liable for anything on my server, shouldn&#039;t I know what is on my server? Myself and my fellow moderators have better things to do than ogle over every picture uploaded to the site. We rely on various filter techniques to reduce the amount of time we spend on this task (right now, one person can review the filter results for a day of uploads in under two minutes -- and I&#039;m spending hours building filters that will further reduce that time). I am confident that FotoForensics does not host pornography or child pornography.&lt;br /&gt;
&lt;br /&gt;
Finally, although there is a donation button, I don&#039;t think anyone has clicked on it yet. The site has received no donations. And since the donation button is buried in the FAQ, I don&#039;t view it as panhandling. (Not like Wikipedia, which ran a huge banner on every page.)&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;4. What is wrong with nudity anyway? Are you mormon? Are you Mitt Romney? I did not upload &quot;pornography&quot;, I uploaded a nude picture of my [20+] year old SELF. Nudity is not always pornography... in fact we were all born completely bare-ass naked! It&#039;s almost like it&#039;s a... I don&#039;t know... natural thing, maybe? Yeah, wrap your offended little brain around that. And if you must know, the reason I chose that picture is because I dramatically lightened it, as well as cropped it.And rest assured I had no intentions what-so-fucking-ever to distribute any links to that page, I just wanted to test out the tool with a picture that I knew was photoshopped. So just to get things straight, the only people who can see the uploaded photo are site admins, the person who uploaded it, and anyone they share the link with. You say it&#039;s a PUBLIC WEB SERVICE but... only if you have the link, right? Not really PUBLIC in a comprehensive sense of the word, now is it? If you had a home page with something like &quot;latest 100 uploads from everyone&quot; then I could see a point in restricting content.&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
For site moderation, we have a written policy used by the moderators concerning acceptable content. Basically, if the picture could be found on the cover of a supermarket magazine that doesn&#039;t have a black plastic wrapper (for example, Cosmo, Vanity Fair, the Victoria&#039;s Secret catalog, or even Playboy -- that is, Playboy in the US and not Playboy in Brazil since Brazil permits full nudity on the cover), then it is permitted. But if it couldn&#039;t be found on the cover, then it is pornography. Two of your pictures clearly fall into this definition for pornography.&lt;br /&gt;
&lt;br /&gt;
Personally, I don&#039;t care about pornography. But it was made abundantly clear to me before I started building the server that pornography is like a &quot;gateway drug&quot; to child porn. If people are permitted to upload porn, then it will attract the pedophiles. If we don&#039;t permit porn, then the pedophiles will stay away. I can honestly say that, out of 30,000 pictures over three months, a total of 7 pictures -- uploaded by two people -- were classified as child pornography and had to be deleted immediately. (The deletion shows our intent to not be in possession. And I hope that &quot;Mr. Chicago&quot; and the &quot;Poland Pervert&quot; both get arrested.)&lt;br /&gt;
&lt;br /&gt;
You mentioned that the site says it is public yet the content is generally not publicly accessible. If the content was publicly viewable, would it have prevented you from uploading photos of yourself? Or, having admitted to not reading the FAQ, would you be upset about having hundreds of creepy stalkers who saw your photos?&lt;br /&gt;
&lt;br /&gt;
Finally, of all the things you could have photographed, modified, and uploaded for analysis, could you really not have come up with anything better than a topless reflection and a picture of you fingering yourself? I think this says a lot more about you than me. For my own tests, I used a picture of my bookshelf; removing my clothing never even crossed my mind.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;5. You have created something that is obviously going to be used with bad intentions most of the time. Yet you act like that will not be tolerated. And that&#039;s sort of like giving someone a crackpipe then saying &#039;you will not dare smoke crack out of it, just keep it as a nice paperweight&#039;.&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
I fully disagree here. 97% of the time, it appears to be used with good intentions. 3% of the time, it is used by people who don&#039;t care that the site contains public content. And a very tiny fraction (2 out of approximately 48,000 people who have uploaded content) have uploaded photos with &quot;bad intentions&quot;. (And if you want the full statistics, that is 48,000 uploads of 30,000 unique pictures and viewed by at least 73,500 unique visitors in three months.)&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Now then, my request. I request not to be banned for 90 days after my last visit, but to be banned FOREVER. I don&#039;t want to forget about this in the future and then accidentally use your site. I certainly don&#039;t want that to happen and then I go around telling other people to use it. I would also like to request that you suck my ass and get the fuck over yourself.&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Ignoring the hostile tone of your email, you do have valid concerns. Hopefully I have addressed those concerns and explained why the site is run this way.&lt;br /&gt;
&lt;br /&gt;
Per your request we will not remove the ban. But as we state in the FAQ, the ban will be removed after 90 days of inactivity. This prevents someone with a dynamic configuration from inheriting a ban that was caused by someone else.&lt;br /&gt;
&lt;br /&gt;
With your permission, I would like to include your remarks and my responses on my blog this evening (&lt;a href=&quot;http://www.hackerfactor.com/blog/&quot;&gt;http://www.hackerfactor.com/blog/&lt;/a&gt;). I will remove any directly-identifying information such as your name, email, age, and network address. Without your permission, I will summarize your remarks and include my responses. Without a reply, I will assume that you grant permission. And I promise to not publicly post the links to your pictures.&lt;br /&gt;
&lt;br /&gt;
Neal Krawetz, Ph.D.&lt;br /&gt;
Hacker Factor Solutions&lt;br /&gt;
&lt;a href=&quot;http://www.hackerfactor.com/&quot;&gt;http://www.hackerfactor.com/&lt;/a&gt; and &lt;a href=&quot;http://fotoforensics.com/&quot;&gt;http://fotoforensics.com/&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;A Little More Information&lt;/H3&gt;&lt;br /&gt;
I didn&#039;t hear back from her.&lt;br /&gt;
&lt;br /&gt;
Her original email did have a small addendum. She concluded it with this message:&lt;br /&gt;
&lt;blockquote&gt;Thanks!&lt;br /&gt;
[redacted network address]&lt;br /&gt;
( ^ my IP addy, and being the nerd that I know you are, I&#039;m sure you can find&lt;br /&gt;
out my blood type with that... so it&#039;s all you need to fucking know)&lt;/blockquote&gt;&lt;br /&gt;
&lt;br /&gt;
It&#039;s funny that she mentioned blood type. One of my side research projects is really panning out. I gave a rough presentation about this at a little &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/447-Small-Talk.html&quot;&gt;technical meet-up group&lt;/a&gt; I attended last year. Given a photo of a person, you can sometimes tell other things about them. In some rare cases, you can even determine their blood type. (It&#039;s scary-accurate but still working on the confidence interval.) 
    </content:encoded>

    <pubDate>Fri, 04 May 2012 17:52:24 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/483-guid.html</guid>
    
</item>
<item>
    <title>Put A Bird On It</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/482-Put-A-Bird-On-It.html</link>
            <category>Politics</category>
            <category>Privacy</category>
            <category>Security</category>
            <category>Terrorists</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/482-Put-A-Bird-On-It.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=482</wfw:comment>

    <slash:comments>0</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=482</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    Perhaps our lawmakers have always been this slimy and we&#039;ve only begun to notice because of rapid communication systems like the Internet. Or perhaps they have sunk to a new low.&lt;br /&gt;
&lt;br /&gt;
On Thursday (April 26, 2012), the House of Representatives voted on the &lt;a href=&quot;http://intelligence.house.gov/hr-3523-bill-and-amendments&quot;&gt;Cyber Intelligence Sharing and Protection Act of 2011&lt;/a&gt; (H.R.3523 aka CISPA). This passing is the first step in becoming a law. It still needs to pass the Senate and be signed by the President. And the President has already &lt;a href=&quot;http://www.washingtonpost.com/blogs/post-tech/post/white-house-opposes-cybersecurity-bill-because-of-privacy-concerns/2012/04/25/gIQAFEGbhT_blog.html&quot;&gt;vowed to veto it&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Nitty Gritty&lt;/H3&gt;&lt;br /&gt;
There are many things that trouble me about this bill.  The 18 page &lt;a href=&quot;http://docs.house.gov/billsthisweek/20120423/CPRT-112-HPRT-RU00-HR3523.pdf&quot;&gt;PDF&lt;/a&gt; is fairly straight forward, but has many unmentioned implications.&lt;br /&gt;
&lt;br /&gt;
For starters, Section 2(a)(2) makes it easier for government agencies to share information. My concern is with &lt;a href=&quot;http://en.wikipedia.org/wiki/Information_Security_Oversight_Office&quot;&gt;information oversight&lt;/a&gt;. Many government agencies are not permitted to collect or hold information related to US citizens. (It&#039;s fine to spy on others, just not on our own people.) And there are some legitimate reasons to collect information about specific US citizens. For example, if some guy is part of an active criminal investigation then federal law enforcement can gather information about him. Also, if information is publicly available, then it can be collected.&lt;br /&gt;
&lt;br /&gt;
However, a few branches of the government are permitted to collect limited information about US citizens. For example, the Transportation Security Administration (TSA) can (and does) collect information about US citizens. This permits them to compare names against the no-fly list, request your papers at the security checkpoint, and employ full-body scanners on US citizens.&lt;br /&gt;
&lt;br /&gt;
Under this bill, it would be easier for TSA to pass anything they collect to other government agencies. They just have to claim that they have a reasonable belief that it relates to a threat.&lt;br /&gt;
&lt;br /&gt;
Section 2(b) focuses on cybersecurity. In particular, 2(b)(1)(a) states that a &lt;i&gt;cybersecurity provider&lt;/i&gt; may share cybersecurity information with the government. Section 2(h) defines a &quot;cybersecurity provider&quot; as &lt;i&gt;any non-government entity&lt;/i&gt; that provides any kind of computer security. This includes your ISP (they provide firewalls and look for network attacks; the larger ones also give away anti-virus software for free), email systems (e.g., Gmail and Yahoo! employ spam filters for your security), and other online services. If there&#039;s a password involved, then that&#039;s cybersecurity!&lt;br /&gt;
&lt;br /&gt;
Section 2(b)(4) (PDF page 8 for anyone following along) states that any cybersecurity provider -- or any person with access to the provider&#039;s information -- that shares information with the Federal government is immune from prosecution. This immunity is just like the kind proposed by &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/466-SOPA-Survivalist.html&quot;&gt;SOPA&lt;/a&gt;; SOPA granted immunity from all lawsuits. This is actually more lenient that the previous draft of H.R.3523 (&lt;a href=&quot;http://intelligence.house.gov/sites/intelligence.house.gov/files/documents/HR3523.pdf&quot;&gt;PDF&lt;/a&gt;, see page 5), which did not grant immunity from civil prosecution. (Perhaps this is why Facebook proposed &lt;a href=&quot;http://articles.cnn.com/2012-04-20/tech/tech_social-media_facebook-privacy-policy_1_user-data-mark-zuckerberg-facebook?_s=PM:TECH&quot;&gt;changes their use policy&lt;/a&gt; days before the House vote, making it easier for Facebook to not be in violation if they hand over anything to the Government. And Facebook is a &lt;a href=&quot;http://abcnews.go.com/Technology/cispa-cybersecurity-bill-congress-passed/story?id=16230902&quot;&gt;CISPA supporter&lt;/a&gt;.)&lt;br /&gt;
&lt;br /&gt;
Section 2(h) defines terms. Like a &quot;protected entity&quot; is anyone who does business with the cybersecurity provider. It doesn&#039;t say that the provider must provide the &quot;protected entity&quot; with any protections. Google could offer protections to some users and still claim (under this definition) that any Google users -- even those who do not receive protections -- are protected entities. This ties in well with Section 2(b)(1)(B), which describes a &quot;Self-Protected Entity&quot;. So my &lt;a href=&quot;http://fotoforensics.com/&quot;&gt;FotoForensics&lt;/a&gt; web site -- which offers no user accounts to web clients -- is still a cybersecurity provider because it is self-protected; I use SSH to remotely manage the system and SSH is cybersecurity!&lt;br /&gt;
&lt;br /&gt;
Section 2(h)(2) brings back the horrors of SOPA. It mentions &quot;unauthorized access&quot; or &quot;misappropriate&quot; private or government information. This revisits copyright infringement. It places file sharing and blogging (sources of potential copyright infringement) on par with cyber-terrorism. If your web site contains infringing material (&quot;unauthorized access&quot; or &quot;misappropriation&quot;), then it is considered &quot;cyber threat information&quot;.&lt;br /&gt;
&lt;br /&gt;
Section 2(h)(3) defines &quot;cyber threat intelligence&quot;. This is anything that could potentially be used to evaluate a cyber threat. The actual text of the bill says, &quot;directly pertaining to a vulnerability of, or threat to, a system or network of a government or private entity&quot;. Watch those conjunctions! (... or ... or ... or ...) This section grants any service provider permission to read your emails just in case you make a threat against anything. Simply saying &quot;Screw that!&quot; in an email about a cyber security bill could be considered a threat -- since you have voiced an opinion that could jeopardize the actions of a government entity.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Hush and Rush&lt;/H3&gt;&lt;br /&gt;
CISPA is clearly a bad cyber security bill. As Rep. Jared Polis (D-Colo.) &lt;a href=&quot;http://thehill.com/blogs/hillicon-valley/technology/224017-house-advances-cybersecurity-bill&quot;&gt;said&lt;/a&gt;, &quot;This bill in its current form ... is an unprecedented, sweeping piece of legislation that would waive every single privacy law ever enacted in the name of cybersecurity.&quot;&lt;br /&gt;
&lt;br /&gt;
Amazingly, that isn&#039;t what bothers me the most. The thing that really bothers me is how this bill was handled.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Multiple copies. If you search for the text of H.R.3523, you will find many different versions. The variants on the &lt;a href=&quot;http://intelligence.house.gov/hr-3523-bill-and-amendments&quot;&gt;House&lt;/a&gt; web site range from 8 pages to 16 pages. The GOP web site has an &lt;a href=&quot;http://www.gpo.gov/fdsys/pkg/BILLS-112hr3523ih/pdf/BILLS-112hr3523ih.pdf&quot;&gt;11-page version&lt;/a&gt;. The official document that the House voted on is 18 pages, and as far as I can tell, it was only available after they voted on it. Each of these versions have different text and different nuances.&lt;br /&gt;
&lt;br /&gt;
&lt;li&gt;&lt;a href=&quot;http://thehill.com/blogs/hillicon-valley/technology/224017-house-advances-cybersecurity-bill&quot;&gt;Rush to vote&lt;/a&gt;. They were supposed to consider CISPA on Friday, April 27. But at the last minute they moved the consideration vote up to Thursday, April 26 and voted on it.&lt;br /&gt;
&lt;br /&gt;
&lt;li&gt;&lt;a href=&quot;http://en.wikipedia.org/wiki/Cyber_Intelligence_Sharing_and_Protection_Act#House_Voting_Counts&quot;&gt;Partisan Politics&lt;/a&gt;. The &lt;a href=&quot;http://thomas.loc.gov/cgi-bin/bdquery/z?d112:H.R.3523:&quot;&gt;bill passed&lt;/a&gt; with a &lt;a href=&quot;http://clerk.house.gov/evs/2012/roll192.xml&quot;&gt;vote&lt;/a&gt; of 248-168. But the 248-168 is a little misleading. These numbers are almost completely &lt;a href=&quot;http://en.wikipedia.org/wiki/CISPA#House_Voting_Counts&quot;&gt;down party lines&lt;/a&gt;! The Republican votes were 206-28, and the Democrats were 42-140 (non-votes were 7 Republicans and 8 Democrats). Fortunately, the 248-168 split isn&#039;t enough to override a Presidential veto.&lt;br /&gt;
&lt;br /&gt;
&lt;li&gt;&lt;a href=&quot;http://thomas.loc.gov/cgi-bin/bdquery/z?d112:H.R.3523:&quot;&gt;One more thing&lt;/a&gt;. There was a last-minute modification that is currently not found in the PDF. This modification was proposed by the bill&#039;s sponsor, Mike Rogers (R-MI), and directed the clerk to &#039;Insert &quot;deny access to or&quot; before &quot;degrade&quot; in each place it appears.&#039; On the surface, this does not seem major and passed unanimously. Basically, the bill now describes a cyber threat as something that &quot;deny access to, degrade, disrupt, or destroy&quot;. Now doing &lt;a href=&quot;http://help.yahoo.com/kb/index?locale=en_US&amp;page=content&amp;y=PROD_ACCT&amp;id=SLN2069&amp;pir=aBaUcedibUngDOb0c9Qv6_k18Jh_zG6UwFWt3rtGRPpuie_cwd2e7ZP328st46TpZDDmgf8idGKxB04cmrKywodXoNdjhvK9S.HAVj49r6LZZwIu1avlWcOYQEq5N4ifngy5ibmietyg2C3KrJAl0xiU9AcRe_3f5Ig28y4x1ktA38qh42f78PcL2iZxlYjVBUic0UvZRXTSrZXQdovqK6lRRm4QFRIprn4nm9exXtD8KJ3swJxyQxoq9LvVXnPfehWQOoIdqCMToYpkH4K66UYMsWhmE0.tSzTvb.LK297cF0mN5Hm4MNtcuvaoGZVHgVhqIHpEQgMcjEJMyfbkr8a2iJUCsP5cL_Pyenc.FU_567tN14YlXcUOPmba8EX0ViiaJAqV4zV6GMNc7NBPvTEgiz.hC1NCMTn9&quot;&gt;multiple bad login attempts&lt;/a&gt; which would block a Yahoo! account is considered a cyber threat and on par with cyber terrorism.&lt;br /&gt;
&lt;br /&gt;
&lt;li&gt;Other amendments. A bunch of other amendments were added to H.R.3523 but are not found in the PDF. For example, H.AMDT.1022 added an amendment that limits the purpose of information shared with the Government. The list now includes &quot;(1) cybersecurity; (2) investigation and prosecution of cybersecurity crimes; (3) protection of individuals from the danger of death or physical injury; (4) protection of minors from physical or psychological harm; and (5) protection of the national security of the United States.&quot; Now the bill does not just cover cyber security, it also covers death threats, things that may cause mental anguish, and anything inappropriate for children. This &quot;limit&quot; grossly expands this bill&#039;s reach. For example, a web site that caters to adult entertainment, even with a password and validation requirement prior to seeing content, could be considered a cyber threat since the content is not appropriate for children. (And the TV show &lt;i&gt;&lt;a href=&quot;http://www.southparkstudios.com/&quot;&gt;South Park&lt;/a&gt;&lt;/i&gt; would certainly be classified as a cyber threat.)&lt;br /&gt;
&lt;br /&gt;
In contrast, amendment H.AMDT.1023 forbids the government from using library records, firearms sales records, and tax returns that are passed in by cybersecurity providers. So just to be clear, your gun purchase and tax records showing donations to groups that support terrorism are private and cannot be used to detect a threat, even if someone willingly supplies the information to the government. However your grocery store receipt can be collected and used by the government if someone passes it to them. (Is it just me, or is this really backwards? I&#039;m only good with the library restriction.)&lt;br /&gt;
&lt;br /&gt;
H.AMDT.1031 was an amendment that authorized &quot;the Secretary to intercept and deploy countermeasure with regard to system traffic for cybersecurity purposes in effect identification of cybersecurity risks to federal systems.&quot; This would have been like the &lt;a href=&quot;http://en.wikipedia.org/wiki/Great_Firewall_of_China&quot;&gt;Great Firewall of China&lt;/a&gt;. Fortunately, this amendment was withdrawn.&lt;br /&gt;
&lt;br /&gt;
As far as I can tell, the only good amendment is H.AMDT.1027. It states that &quot;those who choose not to participate in the voluntary program authorized by this bill are not subject to new liabilities.&quot; So you can decline to participate in the voluntary part of this bill.&lt;/ul&gt;&lt;br /&gt;
&lt;H3&gt;The Soft Sell&lt;/H3&gt;&lt;br /&gt;
This bill and associated amendments really strike me more as job justification, catering to special interests, and an attempt to expand government reach than any actual attempt at stemming cyber threats. Moreover, the bipartisan rift clearly shows that this is more of a power play along party lines than an actual attempt to pass a useful law. For example, Benjamin Quayle&#039;s (R-AZ) amendment that added in protection of children seems more like an afterthought (it is an amendment after all). And when the President vetos the bill, the Republicans can claim that Obama likes to show pornography to children since he was against a bill that protected minors.&lt;br /&gt;
&lt;br /&gt;
I am becoming convinced that the Republican party has &lt;a href=&quot;http://www.washingtonpost.com/opinions/lets-just-say-it-the-republicans-are-the-problem/2012/04/27/gIQAxCVUlT_story.html&quot;&gt;evolved into a bully&lt;/a&gt; that does not know how to negotiate or make a &lt;a href=&quot;http://gigaom.com/collaboration/hard-selling-vs-soft-selling-which-approach-do-you-use-with-clients/&quot;&gt;soft sell&lt;/a&gt;. Rather than forcing through legislation in an attempt to be relevant, perhaps they should learn from other fields. In the pharmaceutical community, they learned to sugar coat medicines and lower dosages when there are undesirable side effects. And when something does not sell in the hobby community, they &lt;a href=&quot;http://www.putabirdonit.com/&quot;&gt;put a bird on it&lt;/a&gt; because items with birds always &lt;a href=&quot;http://www.etsy.com/search?q=bird&quot;&gt;sell&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Update 2012-04-28&lt;/b&gt;: Microsoft has &lt;a href=&quot;http://news.cnet.com/8301-33062_3-57423580/microsoft-backs-away-from-cispa-support-citing-privacy/&quot;&gt;dropped support&lt;/a&gt; of CISPA. They cited privacy issues, telling CNET that any new law must allow &quot;us to honor the privacy and security promises we make to our customers&quot; and protect &quot;consumer privacy.&quot; 
    </content:encoded>

    <pubDate>Sat, 28 Apr 2012 11:53:00 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/482-guid.html</guid>
    
</item>
<item>
    <title>Gradient Map Test</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/481-Gradient-Map-Test.html</link>
            <category>Forensics</category>
            <category>Image Analysis</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/481-Gradient-Map-Test.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=481</wfw:comment>

    <slash:comments>5</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=481</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    My site admin regularly watches the logs and lets me know when there is a sudden increase in unexpected activity. Recently there has been an increase in people searching for something called a &quot;Gradient Map Test&quot;.&lt;br /&gt;
&lt;br /&gt;
A new photo analysis algorithm? I&#039;m intrigued!&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;The First Clue&lt;/H3&gt;&lt;br /&gt;
Most of the requests were coming from a site called the &lt;a href=&quot;http://cluesforum.info/&quot;&gt;Clues Forum&lt;/a&gt;. As far as I can tell, this site focuses on conspiracies. In some threads they debunk hoaxes, in others threads they promote them.&lt;br /&gt;
&lt;br /&gt;
Frankly, I&#039;m not interested in getting &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/210-Bad-Science-How-Not-To-Do-Image-Analysis.html&quot;&gt;involved&lt;/a&gt; &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/450-Without-a-Crutch.html&quot;&gt;with&lt;/a&gt; &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/399-A-Harsh-Mistress.html&quot;&gt;another&lt;/a&gt; &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/235-Bad-Science-How-Not-To-Do-Image-Analysis-Part-II.html&quot;&gt;conspiracy&lt;/a&gt;. And this blog entry is not about conspiracies around the 9/11 attack.&lt;br /&gt;
&lt;br /&gt;
What got my attention was an &lt;a href=&quot;http://www.septclues.com/SIMCITY/RUBBLE%20RESEARCH%20GROUND%20ZERO/GradientMapTest1.gif&quot;&gt;animated GIF&lt;/a&gt; that used something called a &quot;Gradient Map Test&quot; to highlight &lt;u&gt;alleged&lt;/u&gt; modifications to the image.&lt;br /&gt;
&lt;img src=&quot;http://lh3.googleusercontent.com/-7pj0Npn4kIk/T5bVBtHmRAI/AAAAAAAACRg/GrP2-d8THxQ/s800/GradientMapTest1.gif&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
This GIF looks impressive. The &lt;a href=&quot;http://www.cluesforum.info/viewtopic.php?f=26&amp;t=1071&quot;&gt;posting&lt;/a&gt; claims that it shows firemen added to the photo. I wanted to know how the algorithm works, so I did a literature search. Unfortunately, I found nothing about a &quot;gradient map test&quot; -- nobody else seems to be using this, or at least, not calling it by this name. Usually I can find something about an algorithm.&lt;br /&gt;
&lt;br /&gt;
And then I noticed one more thing in the Clues Forum posting: someone attributed the algorithm &lt;a href=&quot;http://www.cluesforum.info/viewtopic.php?f=26&amp;t=1071#p2359142&quot;&gt;to &lt;i&gt;me&lt;/i&gt;&lt;/a&gt;. WHAT?!? This isn&#039;t one of mine!&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Caution: Steep Gradient&lt;/H3&gt;&lt;br /&gt;
When someone talks about a &lt;i&gt;gradient&lt;/i&gt; related to images, they are usually talking about color differentials. How much are two colors similar or different? Gradients make great edge detection algorithms and can also be used for things like topology conversions.&lt;br /&gt;
&lt;img src=&quot;http://lh4.googleusercontent.com/-GI8-1ug_89A/T5bU-cu0IvI/AAAAAAAACRg/gdLnitB1YsU/s400/4049231458_438ed9e833_o_d-gradient-edge.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
A &quot;gradient map&quot; can refer to the array of gradients across an image. However, that isn&#039;t how the animated GIF is using it. If they used a vector based gradient mapping, then the blues in the middle of the people&#039;s clothing should be muted since blue next to blue has virtually no gradient slope.&lt;br /&gt;
&lt;br /&gt;
Maybe they are using a larger gradient measurement? In the animated GIF, the black box next to the person on the left is a very different color from the black square to the right of the people (in the white box). Yet in the original picture, these black squares are the same shades of black. Both squares are large black objects with similar coloring and similar textures. So regardless of the algorithm, I would have expected them to look similar.&lt;br /&gt;
&lt;table border=&#039;0&#039;&gt;&lt;br /&gt;
&lt;tr&gt;&lt;td&gt;&lt;img src=&quot;http://lh4.googleusercontent.com/-c_NQWMdARfY/T5bU_FkR2QI/AAAAAAAACRg/GBCClqIWw3c/s400/4049231458_438ed9e833_o_d.jpg&quot;&gt;&lt;br /&gt;
&lt;small&gt;Original picture&lt;/small&gt;&lt;/td&gt;&lt;td&gt;&lt;img src=&quot;http://lh5.googleusercontent.com/-yiA-G-ak4n8/T5bU_BMmn9I/AAAAAAAACRg/MF0lMamtMss/s400/GradientMapTest1-frame13.png&quot;&gt;&lt;br /&gt;
&lt;small&gt;Animate GIF frame 13&lt;/small&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;&lt;br /&gt;
&lt;br /&gt;
A gradient map can also refer to a colorization. Basically, you convert the image to a grayscale with 256 possible values. Then you map these 256 values to 256 colorizations. For example, I can convert the &lt;a href=&quot;http://www.flickr.com/photos/edicionesmindcontrol/4049231458/&quot;&gt;original image&lt;/a&gt; to a grayscale and apply a gradient mapping based on a sinusoidal curve.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh4.googleusercontent.com/-o8FOPydwpXk/T5bU-DwxwQI/AAAAAAAACRg/NoMRqj2nN-g/s400/4049231458_438ed9e833_o_d-gray.png&quot;&gt;&amp;times;&lt;img src=&quot;http://lh4.googleusercontent.com/-w853thsOX20/T5bYNICFDyI/AAAAAAAACRw/sKAmggYkvck/s800/sinusoid-1.png&quot;&gt;=&lt;img src=&quot;http://lh5.googleusercontent.com/-2OeiYTl5o4w/T5bU9xuvn0I/AAAAAAAACRg/rep6IKTJcDI/s400/4049231458_438ed9e833_o_d-sinusoid.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
With a gradient map colorization, the people stand out because they are the darkest items in the picture and are basically the only thing that maps to that color.&lt;br /&gt;
&lt;br /&gt;
Gradient map colorizations are commonly used on weather maps and &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/371-Medical-Malpractice.html&quot;&gt;medical imaging&lt;/a&gt; to highlight measurement differences, in astronomy for converting non-visible light to a viewable spectrum, and on &lt;a href=&quot;http://www.syfy.com/ghosthunters/&quot;&gt;Ghost Hunters&lt;/a&gt; to find cold spots. It really can be used in any field that needs to visualize or colorize non-RGB data.&lt;br /&gt;
&lt;br /&gt;
The problem is, the animated GIF did not apply a gradient map colorization. We know this because the people did not change color. If the GIF was showing a gradient map colorization, then the people (skin and clothing) should have been recolored based on the gradient map coloring. &lt;br /&gt;
&lt;br /&gt;
So we don&#039;t know what they are doing in the animated GIF. We only know that they have thrown out a wizbang term and gave a picture. They never said how they created it, and they gave it a false attribution (the algorithm did not come from me).&lt;br /&gt;
&lt;br /&gt;
Using my own tools, both public and proprietary, I cannot detect anything suggesting that the people were added. I can detect a few things in the original picture, but none suggest a hoax:&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;A color profile was applied. It operated in the XYZ color space.&lt;br /&gt;
&lt;li&gt;A minor amount of color scaling was applied in the RGB color space.&lt;br /&gt;
&lt;li&gt;There is a minor amount of edge sharpening, similar to what I would expect from an Adobe application.&lt;br /&gt;
&lt;li&gt;The error level analysis does include a little rainbowing, but not a lot. A lot of rainbowing strongly suggests an Adobe product. A little rainbowing could come from Adobe, Gimp, or a half-dozen other graphic programs.&lt;/ul&gt;&lt;br /&gt;
In contrast, I do not detect splicing. I do not detect blending. I do not detect selective color corrections. I do not detect lighting abnormalities, missing shadows, or inconsistent focal points. Frankly, I&#039;m not seeing anything abnormal or suspicious in this image.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;That&#039;s Odd&lt;/H3&gt;&lt;br /&gt;
The picture is attributed to a FEMA photographer named &lt;a href=&quot;http://en.wikipedia.org/wiki/Kurt_Sonnenfeld&quot;&gt;Kurt Sonnenfeld&lt;/a&gt;. By himself, he is an interesting character: he was given exclusive access to photograph the World Trade Center area after the 9/11 attack, but he refuses to show over some of his 9/11 footage. And he is wanted for murder in Colorado concerning the death of his wife. There are a half dozen conspiracies that involve him.&lt;br /&gt;
&lt;br /&gt;
The Clues Forum and many other conspiracy-oriented forums have repeatedly attacked his 9/11 photos, as well as those by other photographers, as being faked. For this reason, I am hesitant to point out the one abnormality that I did detect. I want to emphasize that this abnormality &lt;i&gt;does not&lt;/i&gt; imply malicious intent and does not support the conspiracy around this photo allegedly having people added.&lt;br /&gt;
&lt;br /&gt;
The oddity comes from the meta data. The source image that I looked at is &lt;a href=&quot;http://www.flickr.com/photos/edicionesmindcontrol/4049231458/&quot;&gt;hosted on Flickr&lt;/a&gt;. Flickr permits you to download the &quot;&lt;a href=&quot;http://www.flickr.com/photos/edicionesmindcontrol/4049231458/sizes/o/in/photostream/&quot;&gt;original&lt;/a&gt;&quot; image. Now for clarity, &quot;original&quot; refers to the picture that was uploaded to Flickr, and not necessarily the camera-original (straight from the camera) photo.&lt;br /&gt;
&lt;br /&gt;
According to the EXIF meta data, which should represent the camera&#039;s data, the picture was taken on 2002-08-21 16:12:39. The problem is, that date is almost a year after the picture was supposed to have been taken. (And about 2 months after he was released from jail after &lt;a href=&quot;http://www.voltairenet.org/9-11-FEMA-videographer-at-Ground&quot;&gt;being detained&lt;/a&gt; relating to his wife&#039;s murder investigation.)&lt;br /&gt;
&lt;br /&gt;
The EXIF data also identifies the camera as an Olympus E-10. But this is inconsistent with the photo. Specifically, the Olympus E-10 uses big-endian byte ordering for the EXIF data, but this file uses little-endian. The E-10 specifies three quantization tables, but this file only has two tables. And there are other differences in the meta data. This file fails a camera ballistics test. We can be certain that this is not a camera-original image.&lt;br /&gt;
&lt;br /&gt;
So what happened to this picture? We know it is not camera original. But that is consistent with the application of a color profile and minor color corrections. Since I know of a half-dozen ways that the meta data could have been altered without malicious intent, I&#039;m not willing to conclude that this denotes a hoax. Moreover, the Flickr account is not attributed to Kurt Sonnenfeld. So I cannot even rule out modifications made by the person who uploaded the photo to Flickr.&lt;br /&gt;
&lt;br /&gt;
As far as I can tell, people in the Clues Forum want there to be a conspiracy so badly, that they will provide a false graphical analysis. Moreover, they have attributed their mysterious algorithm to me in order to give it some kind of credibility.&lt;br /&gt;
&lt;br /&gt;
Finally, I want to make this very clear: I am not interested in conspiracies around 9/11, Kurt Sonnenfeld, empty vaults, missing black boxes, or the death of his wife. I only want to point out that I can find no basis for this &quot;Gradient Map Test&quot; that the Clues Forum attributed to me. 
    </content:encoded>

    <pubDate>Tue, 24 Apr 2012 09:38:41 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/481-guid.html</guid>
    
</item>
<item>
    <title>Jail Bait</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/480-Jail-Bait.html</link>
            <category>Forensics</category>
            <category>Image Analysis</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/480-Jail-Bait.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=480</wfw:comment>

    <slash:comments>1</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=480</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    Earlier today the news was abuzz with an announcement. The FBI announced the arrest of Higinio O. Ochoa III. He was charged with digital trespassing into government computer systems and releasing personal details related to law enforcement officers.&lt;br /&gt;
&lt;br /&gt;
Of course, every news story focused on the fact that there was a picture of a woman&#039;s breasts. The headlines all played on the pun. For example, CBS wrote, &quot;&lt;a href=&quot;http://houston.cbslocal.com/2012/04/13/anonymous-hacker-busted-by-fbi/&quot;&gt;Suspected Anonymous Hacker Busted By FBI -- Thanks To A Racy Photo&lt;/a&gt;. (Get it? &lt;i&gt;Busted&lt;/i&gt;? It&#039;s funny because it&#039;s her bust.) And Gizmodo declared, &quot;&lt;a href=&quot;http://gizmodo.com/5901430/these-breasts-nailed-anonymous-hacker-in-fbi-case&quot;&gt;Body of Evidence: These Breasts Nailed a Hacker For the FBI&lt;/a&gt;&quot;. (You see, it&#039;s her &lt;i&gt;body&lt;/i&gt; and it&#039;s &lt;i&gt;evidence&lt;/i&gt;, and saying &quot;nailed&quot; gives a sexual undertone.) Personally, I like my own title: Jail Bait (nothing but trouble).&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Look Closely&lt;/H3&gt;&lt;br /&gt;
The thing that really got my attention was the fact that Ochoa was caught through digital photo forensics. According to reports, &quot;someone&quot; posted &lt;a href=&quot;http://pastehtml.com/view/bnik8yo1q.html&quot;&gt;some information&lt;/a&gt; to PasteHtml. The posting was credited to w0rmer @cabincr3w. At the bottom of the posting was a picture of a woman&#039;s torso and some text:&lt;br /&gt;
&lt;br /&gt;
&lt;center&gt;&lt;a href=&quot;http://i42.tinypic.com/1d9j9.jpg&quot;&gt;&lt;img src=&quot;http://lh6.googleusercontent.com/-dZcwTYrf-3k/T4jdbjbz0yI/AAAAAAAACPc/A-zf4aWh8vM/s400/1d9j9.jpg&quot;&gt;&lt;/a&gt;&lt;br /&gt;
We Are ALL Anonymous&lt;br /&gt;
&lt;br /&gt;
We NEVER Forgive.&lt;br /&gt;
&lt;br /&gt;
We NEVER Forget.&lt;br /&gt;
&amp;lt;3 @Anonw0rmer&lt;/center&gt;&lt;br /&gt;
&lt;br /&gt;
And that&#039;s where he screwed up. (It&#039;s as if he never read my blog! I&#039;m shocked!)&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Cyber-stalking&lt;/H3&gt;&lt;br /&gt;
According to the &lt;a href=&quot;http://cryptome.org/2012/04/usa-v-ochoa-complaint.pdf&quot;&gt;Criminal Complaint&lt;/a&gt;, someone using the nick &quot;wormer&quot; had repeatedly compromised systems and posted harassing messages directed at law enforcement officers. In effect, he was just begging to be caught.&lt;br /&gt;
&lt;br /&gt;
The FBI had a suspicion that wormer was Ochoa due to a &lt;a href=&quot;http://article.gmane.org/gmane.network.jabber.devel/4935/match=&quot;&gt;previous online posting&lt;/a&gt; that was from an account attributed to a user named &quot;wormer&quot; and signed using Ochoa&#039;s name. However, they couldn&#039;t be certain that this was the same person. Perhaps it was another user using the same name...&lt;br /&gt;
 &lt;br /&gt;
The picture is what sealed the link. Specifically, the picture includes meta data and GPS information. The GPS information identifies the location as 37&amp;deg; 51&#039; 25.20&quot; S, 145&amp;deg; 15&#039; 1.20&quot; E. That&#039;s a suburb of Melbourne, Australia. The FBI linked a Facebook account for Ochoa as having a friend in Australia named Kylie Gardner. Moreover, Ochoa is apparently &quot;in a relationship&quot; with Kylie. (The complaint cites sources and emphasizes that all of this was publicly available information. No subpoena required for this part.)&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh4.googleusercontent.com/-sLW5K7gV388/T4jdbkN1DbI/AAAAAAAACPY/uSO2k_6d4m4/s400/gps.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
The complaint also states that they traced network addresses to Ochoa.&lt;br /&gt;
&lt;br /&gt;
As far as I can tell, there is really only one thing missing from the criminal complaint. The report links the account for wormer to a person named Ochoa (who lives in Texas), and a network address to a location near where Ochoa resides. It links Ochoa in Texas to Gardner in Melbourne, and it links a photo of a woman used by wormer to an address in the Melbourne suburbs. However, the report does &lt;b&gt;not&lt;/b&gt; state that Kylie Gardner is the woman in the photos (plural -- the complaint lists multiple images of this woman and used by wormer), nor does it identify Gardner as residing at the GPS location.&lt;br /&gt;
&lt;br /&gt;
Although the linkage between Ochoa/w0rmer is very strong, it is incomplete. But, it is enough for the FBI to claim &quot;probable cause&quot; and to make an arrest. They have charged Ochoa with at least &lt;a href=&quot;http://www.dailymail.co.uk/news/article-2129257/Higinio-O-Ochoa-III-FBI-led-Anonymous-hacker-girlfriend-posts-picture-breasts-online.html?ITO=1490&quot;&gt;four counts&lt;/a&gt; of unauthorized access to a protected computer. With the arrest, I suspect that they will get subpoenas to search his home and computer systems. And if those systems contain any of the compromised data or links between Ochoa and the wormer accounts used to post the compromised data, then he will be going away for a very long time.&lt;br /&gt;
&lt;br /&gt;
&lt;small&gt;Thanks to Xenon for forwarding me the news of the arrest and JG for the blog title and links. &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/479-Slip-Me-Some-Skin.html#c1781&quot;&gt;Jack&lt;/a&gt;: Hope your eyes feel better.&lt;/small&gt; 
    </content:encoded>

    <pubDate>Fri, 13 Apr 2012 19:15:58 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/480-guid.html</guid>
    
</item>
<item>
    <title>Slip Me Some Skin</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/479-Slip-Me-Some-Skin.html</link>
            <category>FotoForensics</category>
            <category>Image Analysis</category>
            <category>Politics</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/479-Slip-Me-Some-Skin.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=479</wfw:comment>

    <slash:comments>4</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=479</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    Normally when I write about politics, I focus on the United States (because that&#039;s where I&#039;m located). But on occasion I mention other countries -- particularly when they work on laws that I &lt;i&gt;wish&lt;/i&gt; this country would consider. (Like truth in advertising laws regarding digitally modified pictures.) However, this time the UK has proposed an amazingly ignorant law.&lt;br /&gt;
&lt;br /&gt;
Let me preface this with: I&#039;m not a barrister or solicitor (or whatever they call an attorney in the UK), this is not legal advice, and I&#039;m certainly not familiar with the UK legal system (well, anything other than how to break away from their empire after they imposed the &lt;a href=&quot;http://en.wikipedia.org/wiki/Stamp_Act_1765&quot;&gt;Stamp Act of 1765&lt;/a&gt;). Worse: I learned of this topic on &lt;a href=&quot;http://yro.slashdot.org/story/12/04/06/0212210/uk-bill-again-demands-web-pornography-ban&quot;&gt;Slashdot&lt;/a&gt;, so I don&#039;t even know how serious it really is.&lt;br /&gt;
&lt;br /&gt;
It appears that the UK has proposed a bill that &lt;a href=&quot;http://www.pcpro.co.uk/news/373951/lords-bill-demands-isps-and-device-makers-block-porn&quot;&gt;demands ISPs and device makers block porn&lt;/a&gt;. The bill, called the &lt;a href=&quot;http://services.parliament.uk/bills/2010-12/onlinesafety.html&quot;&gt;Online Safety Bill&lt;/a&gt; (&lt;a href=&quot;http://www.publications.parliament.uk/pa/bills/lbill/2010-2012/0137/2012137.pdf&quot;&gt;PDF&lt;/a&gt;) would force service providers to restrict access to pornographic content. ISPs, mobile phone services, and other providers would need to maintain an opt-in list of people who want pornography and only permit them to see pornographic images. (As an aside: if this were in the USA, we would have given the bill a snappier acronym like the &quot;Protect Lasses from Adult Content and Erotica using Network Technologies Act&quot; commonly called PLACENTA.)&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Porn? What? Where!&lt;/H3&gt;&lt;br /&gt;
While I am not an Internet Service Provider or Internet Access Service, I do run an &lt;a href=&quot;http://fotoforensics.com&quot;&gt;online photo analysis service&lt;/a&gt; that has a &lt;a href=&quot;http://fotoforensics.com/faq.php#Is%20this%20site%20broken&quot;&gt;zero-tolerance policy&lt;/a&gt; concerning pornographic content. So any technology that would make this law possible definitely has my interest. I mean, I don&#039;t want to be known as the guy who runs the porn web site. (Other sites may not mind that reputation, but I&#039;m not them.)&lt;br /&gt;
&lt;br /&gt;
Lately I&#039;ve been doing a lot of literature searching for anti-porn filtering technologies. And frankly, there&#039;s no really good solutions right now. Here&#039;s a summary of the basic options that comprise the current state of the art:&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Filter by Domain&lt;/b&gt;&lt;br /&gt;
Some web sites are explicitly known to host adult content. If you see a picture from any of those domains, then assume it is prohibited and filter it. The problem is, if you are not careful then you can end up filtering a lot of content that isn&#039;t porn.&lt;br /&gt;
&lt;br /&gt;
There are plenty of domain blacklists out there. &lt;a href=&quot;http://urlblacklist.com/?sec=download&quot;&gt;URL Blacklist&lt;/a&gt; has their &quot;bigblacklist&quot; of potentially offending (and potentially safe) domains. While the bigblacklist is one of the better lists out there, it has some issues. For example, it includes sites like Flickr and Photobucket in their adult-related categories. While some Flickr and Photobucket users do upload adult content, that certainly isn&#039;t the majority of either site. And frankly, other photo-sharing sites like Zooomr, Facebook, and Google&#039;s Picasa also host some adult images, yet they are not classified as adult-oriented domains.&lt;br /&gt;
&lt;br /&gt;
The worst thing about filtering by domain is that you are likely to filter pictures that are not adult in nature. If the law requires filtering pictures, then filtering by domain fails; it does not filter based on pictures, and misses pictures that are not at those domains.&lt;br /&gt;
&lt;br /&gt;
To give you a real example of this, India passed a law requiring search engines to filter pornographic content. However, this law has had &lt;a href=&quot;http://www.computerworld.com/s/article/9144678/Pornography_filters_in_India_have_little_impact&quot;&gt;little impact&lt;/a&gt; because there are just too many ways around the filters. The filtering errors (Type I, Type II, and Type III) grossly outnumber the actual matches. (For you non-statistical people, a Type I error is a false-positive and a Type II error is a false-negative. A Type-III error is when you get the correct result for the wrong reason, such as getting a non-porn picture because you went to the unfiltered domain for Twitpic -- as if Twitpic didn&#039;t host porn.)&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Filter by Keywords&lt;/b&gt;&lt;br /&gt;
Another common solution filters by keywords. The hope is that filenames or some part of the URL will be descriptive and permit filtering. However, there are plenty of false positives (e.g., &quot;&lt;a href=&quot;http://en.wikipedia.org/wiki/File:Onanie_Bomb_Meets_the_Sex_Pistols.jpg&quot;&gt;Onanie Bomb Meets the Sex Pistols.jpg&lt;/a&gt;&quot; really is workplace safe, even though it contains the word &quot;sex&quot;). Also, this approach does nothing to filter pictures with numerical filenames.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Filter by Image Content&lt;/b&gt;&lt;br /&gt;
In the image analysis world, there have been interesting efforts to identify pornographic content. For example, there are face detection algorithms that look for specific features: eyes, nose, mouth, etc. If they find likely features in the correct relative orientation (two eyes above one nose above one mouth) then they determine that it is a face.&lt;br /&gt;
&lt;br /&gt;
I read one paper about a decade ago that used an eye-filter classifier to identify boobs. (Well, they kind of look like eyes: big, round, dark center...) Since then, people have tried to train classifiers to identify various body parts. And while many of these systems work well in specific circumstances, most seem to fail in the general cases. For example, it is difficult to tell a close-up of a bent elbow or a butt-crack from chest cleavage, and I have yet to see a system that can distinguish &lt;a href=&quot;http://www.marvelousmanboobs.com/&quot;&gt;man-boobs&lt;/a&gt; from their female counterparts.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Filter by Colour&lt;/b&gt;&lt;br /&gt;
Another very common approach tries to identify skin tones. This seems to be the most popular analysis approach.&lt;br /&gt;
&lt;br /&gt;
Basically, skin usually comes in shades of tan. You don&#039;t typically see blue skin except on a Smurf, in the movie Avatar, or on a corpse. (Unless it is spring break and there&#039;s ample body paint.) Same goes for green, yellow, red, and anything else not tan, brown, or beige.&lt;br /&gt;
&lt;br /&gt;
All of these algorithms use the same basic approach: define colour range that denotes &quot;likely skin&quot; and the look for a percentage of match. Some algorithms also try to determine if the patches make up the shape of a person. If there is too much &quot;skin&quot;, then assume it is porn.&lt;br /&gt;
&lt;br /&gt;
However, these algorithms are very primitive and inaccurate; they have lots of limitations. Simply using a percentage approach, a close-up of an arm will easily pass the threshold (while not being pornographic), while a nudist in the background will likely be missed (low percentage of the picture). These algorithms also have trouble with women in bikinis (large percentage of skin) and &lt;a href=&quot;http://zoestrauss.blogspot.com/2007/07/photo-sharing.html&quot;&gt;flesh-coloured shirts&lt;/a&gt; (false-positive on colours) -- these are not the same as nudes and will likely be classified incorrectly. (I read one paper that found a red firetruck classified as a nude because the lighting put it at the edge of the skin colourspace.)&lt;br /&gt;
&lt;br /&gt;
Algorithms usually define skin-tone based on a fixed range in the RGB, HSV, or YUV colourspaces. Each approach has their own limitations. Basically, there are a huge number of false-positives and false-negatives, and they are very dependent on the lighting and tone. A picture that works well with one algorithm will probably not work well with another algorithm. And using a majority rule to combine multiple algorithms typically does not work well since the majority perform poorly.&lt;br /&gt;
&lt;br /&gt;
However, the biggest problem has to do with skin colouring. People with extreme colouring (albinos, very pale, or very dark skin) usually fall outside of these colour ranges. This created a huge problem for Hewlett-Packard a few years ago, when they had a camera that identified faces using skin-colour and then focused on the faces. They were criticized for having &lt;a href=&quot;http://www.nbcbayarea.com/news/tech/Called-Racist-HP-Says-Thanks-for-the-Feedback-79822477.html&quot;&gt;racist cameras&lt;/a&gt; since the devices would not focus on black people. Technically, the skin colour fell outside of the colour swath, so it failed to match. (In HP&#039;s case, they used an algorithm that was optimized for people with typical Asian skin tones because that is where the research originated.) As I understand it, very pale people had the same focus problem, but dark-skinned people had it much worse.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Making Flesh Crawl&lt;/H3&gt;&lt;br /&gt;
To give you an example of how these skin detection algorithms work, here are a few pictures that use the YUV and HSV colourspace definitions from &quot;Detection of Pornographic Digital Images&quot; (J. Marcial-Basilio, et al., &lt;i&gt;International Journal of Computers&lt;/i&gt;, Issue 2, Volume 5, 2011) and the normalized RGB colourspace from &quot;Simple Face-detection Algorithm Based on Minimum Facial Features&quot; (Y. Chen and Y. Lin, &lt;i&gt;The 33rd Annual Conference of the IEEE Industrial Electronics Society (IECON)&lt;/i&gt;, Nov. 5-8, 2007, Taipei, Taiwan). Each picture includes (in order): the original image, normalized RGB skin tones, HSV skin tones, and YUV skin tones. Notice how none match the exact same things.&lt;br /&gt;
&lt;br /&gt;
(As an aside: due to &lt;a href=&quot;http://www.reddit.com/r/programming/comments/hql8b/looks_like_it_for_the_last_few_months_i_have_had/c1xjtxw&quot;&gt;criticism&lt;/a&gt; for continually using pictures of distractingly beautiful women, I&#039;ve decided to use an alternate set of photos for the examples.)&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;Pregnant in Flesh coloured Shirt (Source: &lt;a href=&quot;http://www.flickr.com/photos/27822510@N00/698941220/&quot;&gt;Zoe Strauss&#039;s Photostream&lt;/a&gt;)&lt;/i&gt;&lt;br /&gt;
&lt;img border=&#039;1&#039; src=&quot;http://lh5.googleusercontent.com/-qx4m6FCNxLY/T3-GZ7ISfVI/AAAAAAAACN8/TD2Zhf_4xfA/s400/698941220_06423a1d1c_o_d.jpg&quot; title=&#039;Source&#039;&gt; &lt;img border=&#039;1&#039; src=&quot;http://lh3.googleusercontent.com/-pwfb3opFDGI/T3-GZO8_FpI/AAAAAAAACNk/yMiXsdPqyS0/s400/698941220_06423a1d1c_o_d-rgb.png&quot; title=&#039;Normalized RGB skin tone&#039;&gt; &lt;img border=&#039;1&#039; src=&quot;http://lh4.googleusercontent.com/-J_FccbxVARo/T3-GZF7EFbI/AAAAAAAACNo/R5ib9hGfYug/s400/698941220_06423a1d1c_o_d-hsv.png&quot; title=&#039;HSV skin tone&#039;&gt; &lt;img border=&#039;1&#039; src=&quot;http://lh6.googleusercontent.com/-b051HUTVvq0/T3-GZpqUvdI/AAAAAAAACOI/wSNnjc1JMTM/s400/698941220_06423a1d1c_o_d-yuv.png&quot; title=&#039;YUV skin tone&#039;&gt;&lt;br /&gt;
For the flesh-colored shirt, the RGB filter did a good job identifying actual skin. The HSV filter did a better job identifying flesh tones. In contrast, the YUV filter matched the shirt and not the face or arm.&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;Beer Belly (Source: &lt;a href=&quot;http://www.flickr.com/photos/beer-belly/4646918677/sizes/l/&quot;&gt;BigBeerBelly1&#039;s Photostream&lt;/a&gt;)&lt;/i&gt;&lt;br /&gt;
&lt;img border=&#039;1&#039; src=&quot;http://lh3.googleusercontent.com/-D3wS2rCJ5Cw/T3-GY3kEmRI/AAAAAAAACNs/KUk3PKroAlU/s400/4646918677_fa3f82272b_b.jpg&quot; title=&#039;Source&#039;&gt; &lt;img border=&#039;1&#039; src=&quot;http://lh3.googleusercontent.com/-xcHLcUPpSqw/T3-GYYg6UbI/AAAAAAAACNc/MEx2az4LZXg/s400/4646918677_fa3f82272b_b-rgb.png&quot; title=&#039;Normalized RGB skin tone&#039;&gt; &lt;img border=&#039;1&#039; src=&quot;http://lh6.googleusercontent.com/-UD3-0ZEvDaI/T3-GYGGy0wI/AAAAAAAACNU/o55Mblay5nE/s400/4646918677_fa3f82272b_b-hsv.png&quot; title=&#039;HSV skin tone&#039;&gt; &lt;img border=&#039;1&#039; src=&quot;http://lh6.googleusercontent.com/-SkrOT3Jtb2M/T3-GYdFhNEI/AAAAAAAACNY/Y7LV5eT7sEk/s400/4646918677_fa3f82272b_b-yuv.png&quot; title=&#039;YUV skin tone&#039;&gt;&lt;br /&gt;
In the beer belly picture, RGB was very noisy and missed nearly all of the people in the background. HSV got the background people, but also marked the woman&#039;s yellow top as skin. YUV performed much like HSV, but matched fewer people in the background.&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;&quot;Best. Wedding. Ever.&quot; (Source: &lt;a href=&quot;http://www.reddit.com/r/funny/comments/on6ga/best_wedding_ever/&quot;&gt;Reddit&lt;/a&gt;)&lt;/i&gt;&lt;br /&gt;
&lt;img border=&#039;1&#039; src=&quot;http://lh5.googleusercontent.com/-qWS4NW6YtU8/T3-GdZBE1_I/AAAAAAAACOk/Qf7GPooQ7ow/s400/oZA63.jpg&quot; title=&#039;Source&#039;&gt; &lt;img border=&#039;1&#039; src=&quot;http://lh3.googleusercontent.com/-sqjaHzfJWr4/T3-Gc2IU2hI/AAAAAAAACOc/UseFwwy2rt8/s400/oZA63-rgb.png&quot; title=&#039;Normalized RGB skin tone&#039;&gt; &lt;img border=&#039;1&#039; src=&quot;http://lh4.googleusercontent.com/-u4LoR1sMQbo/T3-GcLvA_KI/AAAAAAAACOU/fPusGAjOus4/s400/oZA63-hsv.png&quot; title=&#039;HSV skin tone&#039;&gt; &lt;img border=&#039;1&#039; src=&quot;http://lh3.googleusercontent.com/-Oea-9LzN4tI/T3-Gc3BcS0I/AAAAAAAACOo/5RDdCGOaj-8/s400/oZA63-yuv.png&quot; title=&#039;YUV skin tone&#039;&gt;&lt;br /&gt;
In the wedding photo (they seriously know how to party...), RGB matched the most skin, but still failed on everyone except the bride. HSV did a great job matching the ground, trees, and shirts (non-skin), but missed most of the actual skin. And YUV, well, it missed every person. (I actually suspect that this YUV filter is the same algorithm used by the HP cameras.)&lt;br /&gt;
&lt;br /&gt;
While these examples show YUV performing the worst, I have plenty of other examples where YUV performs better than the other options. It all depends on the picture&#039;s content, lighting, and colouring.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Smut and Innuendo&lt;/H3&gt;&lt;br /&gt;
I looked up &lt;a href=&quot;http://en.wikipedia.org/wiki/British_humour&quot;&gt;British Humour&lt;/a&gt; on Wikipedia and immediately saw that it was based on smut, innuendo, and satire. &quot;Benny Hill&quot; and &quot;Monty Python&quot; were listed as examples. (I wonder how many Monty Python movies have no nudity...) So perhaps I am reading too much into this proposed bill. Maybe it is British humour -- and I missed the joke. Or maybe some lawmaker really does want to curtail this aspect of British culture?&lt;br /&gt;
&lt;br /&gt;
In any case, it appears that this bill misses the target. It requires network service providers to perform complex content filtering. This is ignoring the fact that access service providers just pass bits between computers and may not have access to the content due to cryptography between the client and server. This bill also requires filtering technologies that do not currently exist with any significant degree of accuracy.&lt;br /&gt;
&lt;br /&gt;
If the UK lawmakers seriously wanted to address this issue, they would (1) require hosting providers, and not service providers, to filter the content that they host, (2) put in a clause that limits filtering efforts to the currently available technologies, and (3) leave the type of filtering up to the people running the filter, rather than explicitly specifying image-based filtering in Section 1. 
    </content:encoded>

    <pubDate>Fri, 06 Apr 2012 19:08:00 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/479-guid.html</guid>
    
</item>
<item>
    <title>Fool Me Once</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/478-Fool-Me-Once.html</link>
            <category>Forensics</category>
            <category>Image Analysis</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/478-Fool-Me-Once.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=478</wfw:comment>

    <slash:comments>26</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=478</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    In my &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/477-We-Have-A-Winner!.html&quot;&gt;previous blog entry&lt;/a&gt;, I wrote about using photo forensics to identify fake pictures of lottery tickets. However, one forgery looked good enough to deserved a closer inspection. This particular picture was &lt;a href=&quot;http://www.reddit.com/r/AskReddit/comments/rm4xo/guys_i_cant_sleep_i_have_640_million_reasons_why/&quot;&gt;posted on Reddit&lt;/a&gt;. (Thanks to &lt;a href=&quot;http://thanland.com/&quot;&gt;Than Tibbetts&lt;/a&gt; for the pointer.)&lt;br /&gt;
&lt;br /&gt;
In the Reddit message, the guy claimed to have a real winning lottery ticket and even took multiple photos -- at the request of other Reddit readers -- to prove that it is real.&lt;br /&gt;
&lt;br /&gt;
&lt;a href=&quot;http://i.imgur.com/IGPZA.jpg&quot;&gt;&lt;img src=&quot;http://lh4.googleusercontent.com/-Ip_GAgD5ycA/T3d7lgHGJKI/AAAAAAAACLM/yPgzK6-sZ64/s400/IGPZA.jpg&quot;&gt;&lt;/a&gt;&lt;br /&gt;
&lt;small&gt;The initial photo shows the winning numbers on line D.&lt;/small&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;a href=&quot;http://i.imgur.com/R4LjO.jpg&quot;&gt;&lt;img src=&quot;http://lh3.googleusercontent.com/-w-t8af0WGSw/T3jA9z4bkPI/AAAAAAAACMU/fuVrlmVF-AY/s400/R4LjO.jpg&quot;&gt;&lt;/a&gt;&lt;br /&gt;
&lt;small&gt;Requested to write &quot;Villa22&quot; next to the ticket -- to prove a current photo. But the winning ticket it only partially in the picture.&lt;/small&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;a href=&quot;http://i.imgur.com/tAgrm.jpg&quot;&gt;&lt;img src=&quot;http://lh4.googleusercontent.com/-dGvJ2jzObcA/T3jA-Pn7qFI/AAAAAAAACMQ/OdoijLgP1PE/s400/tAgrm.jpg&quot;&gt;&lt;/a&gt;&lt;br /&gt;
&lt;small&gt;Requested an uncropped ticket next to &quot;Villa22&quot;.&lt;/small&gt;&lt;br /&gt;
&lt;br /&gt;
The initial problem mentioned by the Reddit crowd is that this was a ticket for Massachusetts, but the three winners were in Illinois, Kansas, and Maryland. However, when this posting first came out, the media had only begun to declare one winner. Later, the media reports were updated to say &quot;&lt;a href=&quot;http://www.reuters.com/article/2012/03/31/usa-lottery-idUSL2E8EV0H620120331&quot;&gt;at least one winner&lt;/a&gt;&quot; and then &lt;a href=&quot;http://www.reuters.com/article/2012/04/01/usa-lottery-idUSL2E8EV0IP20120401&quot;&gt;three winners&lt;/a&gt;. So at the time that this message was posted, we couldn&#039;t be certain that there were not four or more winners... Even though Massachusetts was not mentioned as one of the winning states, we could not be sure that the media was accurate.&lt;br /&gt;
&lt;br /&gt;
A few Reddit readers also noticed that the numbers were not &lt;a href=&quot;http://en.wikipedia.org/wiki/Monotonic_function&quot;&gt;monotonically&lt;/a&gt; increasing. However, I&#039;m not a lottery expert and I don&#039;t know if states can choose to sort the numbers on the ticket.&lt;br /&gt;
&lt;br /&gt;
So rather than depending on speculation about number order and winning states, I rely on photo forensics to identify forgeries.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Basic Tests&lt;/H3&gt;&lt;br /&gt;
I have a small set of basic image analysis tests that I run on every image. I call these my &quot;90% tests&quot; because they catch about 90% of forgeries. And while there is a steep learning curve, if it is a fake, you&#039;ll know it within seconds or minutes. For the forgeries that they do not catch, they will usually show something odd -- not enough to call it a fake, but enough to take a deeper look.&lt;br /&gt;
&lt;br /&gt;
With the first picture, it easily passed most tests. However, I was bothered by the image quality. This is not a camera-original picture, it lacks major Photoshop artifacts, and yet the ELA was high across the entire image. Whatever was applied to this picture was applied across the entire image.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh4.googleusercontent.com/-89pImJKNvTQ/T3jA8uBR80I/AAAAAAAACL0/ozQeYw4d3N4/s400/IGPZA-ela.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
In contrast, the second picture has a low ELA and significant rainbowing (the red/blue splotches typically associated with Adobe products), while the third picture has a high ELA and no major rainbowing artifacts.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh5.googleusercontent.com/-vvAuFyEw61w/T3jFXmKGBOI/AAAAAAAACNE/FjSObED70_Y/s288/R4LjO-ela.png&quot;&gt; &lt;img src=&quot;http://lh4.googleusercontent.com/-GFs1_Ju4a-c/T3jD2WoeHOI/AAAAAAAACM8/Mhpk4y2ayAI/s288/tAgrm-ela.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
Why are the first and third pictures different from the second? Shouldn&#039;t they all have similar ELA results? And why is the ELA from the first and third so high? There&#039;s something odd going on here...&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;PCA&lt;/H3&gt;&lt;br /&gt;
One of my favorite tests is the &lt;a href=&quot;http://en.wikipedia.org/wiki/Principal_component_analysis&quot;&gt;Principal Component Analysis&lt;/a&gt;. PCA identifies an orthogonal coordinate system based on the variance of the data space. (Uh, can you explain that in English?)&lt;br /&gt;
&lt;br /&gt;
All of the RGB colors in the picture can be represented as XYZ coordinates. This yields an RGB cloud. For this picture, the cloud is spear-shaped since it doesn&#039;t have much color.&lt;br /&gt;
&lt;img src=&quot;http://lh5.googleusercontent.com/-luuOLqNV9q0/T3jC4rjWLDI/AAAAAAAACM0/kkzuu0YgJKo/s800/IGPZA-cs.png&quot;&gt;&lt;br /&gt;
The widest path through the cloud defines the first principal component (PC1; in red). PC2 is the second widest path (at a right angle to the first path; in green), and PC3 is whatever is left over (the shortest path relative to PC1 and PC2; in blue).&lt;br /&gt;
&lt;br /&gt;
There&#039;s a bunch of different things that can be done with PCA. One of my common tests is to measure the distance from each color to the PC1 line.  This shows JPEG resave artifacts and can be used to identify splicing or editing based on image qualities. However, with this picture nothing stands out as unusual; the picture has lots of large chunks indicating multiple JPEG resaves, but no obvious splicing or drawing.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh6.googleusercontent.com/-DAjAcE3OqZE/T3jA8V0IsUI/AAAAAAAACLs/Mcrp0CtJCgg/s400/IGPZA-p1l.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
The JPEG quantization tables claim that this was saved at 90%, but this is much less than 90%. The difference could be explained by the fact that it is not camera original. So far, this picture could be real...&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Cross-Product&lt;/H3&gt;&lt;br /&gt;
One of the tests that I usually do not need to use checks the PCA &lt;a href=&quot;http://en.wikipedia.org/wiki/Cross_product&quot;&gt;cross-product&lt;/a&gt;. The cross-product identifies a vector normal to two other vectors. It is usually used to identify the angle between two vectors or the orientation between vectors.&lt;br /&gt;
&lt;br /&gt;
In this case, I have two vectors: the PC1 line and a vector from the center of the color cloud to the specific color. Adjacent-but-different colors in the picture should form a sweeping arc between the cross-product vectors. So if the text is gray on white with a faint pink background (as seen in the first photo), then whatever color results from the cross-product, all text should show the same cross-product result.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh5.googleusercontent.com/-bmdnqcWWi-E/T3jA9MOcGNI/AAAAAAAACL4/vhhG_HMb_2s/s640/IGPZA-p1x.png&quot; onmouseover=&#039;this.src=&quot;http://lh6.googleusercontent.com/-2m8UcHgdCAA/T3jA9RJAukI/AAAAAAAACME/ml49MZnCgxY/s640/IGPZA-p1xl.png&quot;&#039; onmouseout=&#039;this.src=&quot;http://lh5.googleusercontent.com/-bmdnqcWWi-E/T3jA9MOcGNI/AAAAAAAACL4/vhhG_HMb_2s/s640/IGPZA-p1x.png&quot;&#039;&gt;&lt;br /&gt;
&lt;small&gt;Colored based on cross-product with PC1.&lt;/small&gt;&lt;br /&gt;
&lt;br /&gt;
All of the lottery numbers appear as a turquoise on a faint purple background.&lt;br /&gt;
&lt;br /&gt;
And there&#039;s the problem... Don&#039;t see it?  Mouse over the image to see a blend between the cross-product image and the original. The thing that immediately stands out: the &quot;04 02&quot; is the wrong color. The cross-product lacks the turquoise. The blended image shows the &quot;04 02&quot; as much darker than all other numbers. So we know that at least two of the &quot;winning&quot; numbers are fake.&lt;br /&gt;
&lt;br /&gt;
A little more subtle is the shape of Massachusetts in the center of the ticket (the watermark). The corner of the state under the &quot;38&quot; (line D) has the wrong shape. In fact, it appears to have almost the same shape as the &quot;38&quot; found in line E. So we have two winning numbers that are modified, and a third that is likely modified.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Whoops&lt;/H3&gt;&lt;br /&gt;
However, the big screw-up happened when the guy released his third picture. PCA (PC1 measured from the line) shows that all five of the winning numbers are significantly different from all other lottery numbers; they were modified. The only original number was the bonus &quot;23&quot;.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh4.googleusercontent.com/-KKwFwKoq2do/T3jBQ6rU2YI/AAAAAAAACMk/AXD5QagOIQE/s600/tAgrm-p1l.png&quot;&gt;&lt;br /&gt;
&lt;small&gt;PCA identifies that all five of the winning numbers in line D are forgeries.&lt;/small&gt;&lt;br /&gt;
&lt;br /&gt;
Now we know why the second photo only showed the bonus number and why the second photo&#039;s ELA was lower. Since the final &quot;23&quot; is real, he didn&#039;t need to edit the second picture.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;One More Thing&lt;/H3&gt;&lt;br /&gt;
Whenever I show investigators how to do photo forensics, I always emphasize the need to validate findings with multiple algorithms. A single algorithm can trigger false-positive or false-negative results. If something really is tampered with, then it should show up with multiple analysis algorithms. And if something is really real, then it should pass everything. In this case, the first picture fails the PCA cross-product test (and a few other tests, but this blog entry is long enough). And the third picture fails the PCA line test (and PCA cross-product test and a few other tests).&lt;br /&gt;
&lt;br /&gt;
ELA -- by itself -- identifies nothing abnormal with any of these pictures. That would be a false-negative when asking &quot;is it fake?&quot; But combined with (1) the image quality and ballistics, (2) the results from the PCA line test, and (3) the results from the other two pictures, we have a contradiction -- the ELA seems too high. And when tested with PCA&#039;s cross-product, we have a high-confidence result regarding the manipulation.&lt;br /&gt;
&lt;br /&gt;
It is one thing to say &quot;the ticket is fake because the state is wrong&quot;. It is another thing to be able to show the actual modifications. In this case, we can prove that the picture was modified &lt;i&gt;despite&lt;/i&gt; the odd results from ELA. 
    </content:encoded>

    <pubDate>Sun, 01 Apr 2012 15:40:00 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/478-guid.html</guid>
    
</item>
<item>
    <title>We Have A Winner!</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/477-We-Have-A-Winner!.html</link>
            <category>Forensics</category>
            <category>Image Analysis</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/477-We-Have-A-Winner!.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=477</wfw:comment>

    <slash:comments>0</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=477</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    Last night&#039;s Mega Millions Jackpot of $650 MILLON dollars matched &lt;a href=&quot;http://www.nytimes.com/2012/04/01/us/mega-millions-jackpot-split-by-winners-in-3-states.html&quot;&gt;three lottery tickets&lt;/a&gt;. Each ticket is worth over $215 million dollars. (Well, before taxes and expenses like hiring an accountant.) The tickets were reportedly sold in Illinois, Kansas, and Maryland.&lt;br /&gt;
&lt;br /&gt;
Over at &lt;a href=&quot;http://fotoforensics.com/&quot;&gt;FotoForensics&lt;/a&gt;, it wasn&#039;t more than 15 minutes after the drawing that we began to see an influx of fake lottery tickets being uploaded for analysis. Some fakes were much better than others...&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Right Numbers, Wrong State&lt;/H3&gt;&lt;br /&gt;
One of the first ones claimed to be &lt;a href=&quot;http://fotoforensics.com/analysis.php?id=f3f748ef86ca021194b6c920fb0696d67ea70389.82587&quot;&gt;from Louisiana&lt;/a&gt; and was posted over at 4chan. (4chan? Why am I not surprised...) Error Level Analysis (ELA) immediately identified this fake. (As an aside, I don&#039;t mind re-posting the image and direct FotoForensics link since the source originated from a public URL.)&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh3.googleusercontent.com/-1OCpfv4Um0U/T3dT4apu7RI/AAAAAAAACKY/zW_tjfCgGmI/s640/analysis.php%253Fid%253Df3f748ef86ca021194b6c920fb0696d67ea70389.82587%2526fmt%253Dorig.jpg&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh3.googleusercontent.com/-AHPmlmlpMkw/T3dT4RLiZ_I/AAAAAAAACKc/Iaf-OBQem1k/s640/analysis.php%253Fid%253Df3f748ef86ca021194b6c920fb0696d67ea70389.82587%2526fmt%253Dela.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
For Louisiana, ELA clearly identifies that the &quot;4&quot; in &quot;04&quot; and &quot;46&quot;, and both &quot;23&quot; values were altered.&lt;br /&gt;
&lt;br /&gt;
Similarly, the two-ticket picture of &lt;a href=&quot;http://p.twimg.com/ApSNnq8CEAArgHn.jpg:large&quot;&gt;New York lottery tickets&lt;/a&gt; that was tweeted by &lt;a href=&quot;https://twitter.com/#!/MelissaStetten/status/185929197305335808&quot;&gt;@MelissaStetten&lt;/a&gt; shows up as fake &lt;a href=&quot;http://fotoforensics.com/analysis.php?id=2e0cd670d18a8cd7d6e441c74454376d34a700dc.165160&quot;&gt;under ELA&lt;/a&gt;; the &quot;4&quot; in &quot;04&quot;, &quot;3&quot; in &quot;38&quot;, &quot;6&quot; in &quot;46&quot; and both &quot;23&quot; values were modified to look like a winner.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh4.googleusercontent.com/-RQxDVpylXBE/T3dT7Zj7K4I/AAAAAAAACKo/ttDL4jR39LQ/s400/ny.jpg&quot;&gt;&lt;img src=&quot;http://lh4.googleusercontent.com/-oYlPqPuj-WU/T3dU3rQ6ieI/AAAAAAAACLA/_AIg_cAk1bE/s400/ny-ela.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;More than one winner&lt;/H3&gt;&lt;br /&gt;
There are other ways besides ELA to tell that these were fakes. For example, we can check the text alignment. The New York picture is rotated about 22.6 degrees clockwise (measuring a line across the fives in the &quot;C&quot; line). By straightening the picture and adding a grid, we can see that all of the digits fall into straight lines. Well, all except the modified line. In the modification, the &quot;38&quot; and &quot;46&quot; are too low, and the &quot;23&quot; appears a little skewed.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh3.googleusercontent.com/-XzwvqV0ydxU/T3dT7m5hPWI/AAAAAAAACKw/8lQFEz35wZE/s640/ny-rotated-cropped.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
Using the &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/185-Myth-Busting-Boats.html&quot;&gt;copy-move detector&lt;/a&gt;, we can further identify that all three &quot;23&quot; values are identical (cloned -- from line C), and the &quot;04&quot; was copied from line B and not from line A.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Really Winning&lt;/H3&gt;&lt;br /&gt;
Of course, what most people forget (or don&#039;t really care about when fabricating pictures like this) is that the winning number isn&#039;t tracked by the numbers printed digits on the lottery ticket. It&#039;s tracked by the barcode. The encoded number is mapped in the lottery&#039;s database to the winning ticket. Even if you manage to change the printed numbers to look like the winning sequence, the barcode will be used to identify the forgery and validate the winner. (But if you know the barcode, then you could -- theoretically -- create a second winning ticket.)&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Update&lt;/b&gt;: Not every fake lottery ticket shows up under ELA. An example is covered in the next blog entry: &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/478-Fool-Me-Once.html&quot;&gt;Fool Me Once&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;small&gt;Thanks to &lt;a href=&quot;http://thanland.com/notes/error-level-analysis-on-megamillions-jokesters&quot;&gt;Than Tibbetts&lt;/a&gt; for the pointer to the original tweet.&lt;/small&gt; 
    </content:encoded>

    <pubDate>Sat, 31 Mar 2012 12:13:44 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/477-guid.html</guid>
    
</item>
<item>
    <title>It's not all about flattening tummies</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/476-Its-not-all-about-flattening-tummies.html</link>
            <category>Image Analysis</category>
            <category>Mass Media</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/476-Its-not-all-about-flattening-tummies.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=476</wfw:comment>

    <slash:comments>4</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=476</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    We seem to be hearing more and more about photo manipulation in the news. Is it because people are becoming conscious of it, tools are becoming &lt;a href=&quot;http://fotoforensics.com/&quot;&gt;more available&lt;/a&gt; (probably not), or that people are starting to realize how manipulative the media really has become. (I&#039;ll vote for the latter.)&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Dead Cat Bounce&lt;/H3&gt;&lt;br /&gt;
The Shelby 1000 (a &lt;i&gt;car&lt;/i&gt; for you non-autophiles) is getting some serious media coverage this week. It seems that someone photoshopped the press release to show the frontend off the ground. (Implying a lot of torque, or as the &lt;a href=&quot;http://honda-tech.com/showthread.php?t=2907589&quot;&gt;Honda forums&lt;/a&gt; suggest, it could indicate bad motor mounts.)&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh5.googleusercontent.com/-Rfco1RcHzws/T3SAL5r9ZmI/AAAAAAAACJs/gHuPalySRAQ/s800/fakex-wide-community.jpg&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
What really surprises me is that this photo manipulation &lt;a href=&quot;http://content.usatoday.com/communities/driveon/post/2012/03/shelby-1000-carroll-shelby-fake-photos-shots/1&quot;&gt;was identified&lt;/a&gt; by someone at &lt;i&gt;USA Today&lt;/i&gt;! (I typically associate them with manipulation and not detection.)&lt;br /&gt;
&lt;br /&gt;
When caught releasing doctored photos, John Luft, &lt;a href=&quot;http://content.usatoday.com/communities/driveon/post/2012/03/shelby-1000-fake-photos-shelby-american-carroll-shelby/1&quot;&gt;CEO of Carroll Shelby&lt;/a&gt;&#039;s Las Vegas-based aftermarket outfit&lt;/a&gt;, stated that the pictures were edited tongue-in-cheek and released accidentally to the media. He continued on, saying that the photos were never intended to be misleading.&lt;br /&gt;
&lt;br /&gt;
Yeah, right -- an accident and never intended to be misleading? All of their photos have been digitally manipulated. For example, the &lt;a href=&quot;http://www.fotoforensics.com/analysis.php?id=ba0b41fb7125e417f6507781710b9ae26c2a0069.2026076&quot;&gt;picture&lt;/a&gt; of Shelby&#039;s 2012 Shelby Mustang GT 1000 coupe seems innocent enough; the car is sitting in front of a yellow building. Yet, according to &lt;a href=&quot;http://www.sno.phy.queensu.ca/~phil/exiftool/&quot;&gt;ExifTool&lt;/a&gt;, this picture was resaved at least 18 times over a month and a half.&lt;br /&gt;
&lt;br /&gt;
&lt;a href=&quot;http://features.blogs.fortune.cnn.com/2012/03/27/carroll-shelby-mustang/&quot;&gt;&lt;img src=&quot;http://lh3.googleusercontent.com/-5RDW12YAWug/T3R9rwCr6EI/AAAAAAAACJc/yrY-Xax13q8/s640/shelby_mustang.jpg&quot;&gt;&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;blockquote&gt;History Action: saved, saved, saved, saved, saved, saved, saved, converted, derived, saved, saved, saved, converted, derived, saved, saved, saved, saved, saved, saved, saved, converted, derived, saved&lt;br /&gt;
&lt;br /&gt;
History Instance ID: xmp.iid:5DC5EBD225206811A84DDF80EC75E9A6, xmp.iid:5EC5EBD225206811A84DDF80EC75E9A6, xmp.iid:BA44176C26206811A84DDF80EC75E9A6, xmp.iid:BB44176C26206811A84DDF80EC75E9A6, xmp.iid:8F223F0E2B206811A84DDF80EC75E9A6, xmp.iid:90223F0E2B206811A84DDF80EC75E9A6, xmp.iid:887F1A922B206811A84DDF80EC75E9A6, xmp.iid:897F1A922B206811A84DDF80EC75E9A6, xmp.iid:8A7F1A922B206811A84DDF80EC75E9A6, xmp.iid:A3908EE7155CE1118387C874E4F410D9, xmp.iid:A4908EE7155CE1118387C874E4F410D9, xmp.iid:A6908EE7155CE1118387C874E4F410D9, xmp.iid:A7908EE7155CE1118387C874E4F410D9, xmp.iid:31E0C2720872E11199D88B59CD45EE3E, xmp.iid:32E0C2720872E11199D88B59CD45EE3E, xmp.iid:33E0C2720872E11199D88B59CD45EE3E, xmp.iid:D7D478623474E111B678BF5FBD5033F3, xmp.iid:D8D478623474E111B678BF5FBD5033F3&lt;br /&gt;
&lt;br /&gt;
History When: 2012:02:12 14:46:44-08:00, 2012:02:12 14:46:44-08:00, 2012:02:12 14:47:02-08:00, 2012:02:12 14:47:02-08:00, 2012:02:12 15:23:44-08:00, 2012:02:12 15:23:44-08:00, 2012:02:12 15:23:53-08:00, 2012:02:12 15:23:53-08:00, 2012:02:12 15:24:22-08:00, 2012:02:20 15:14:51-08:00, 2012:02:20 15:14:51-08:00, 2012:02:21 07:13:06-08:00, 2012:02:21 07:19:29-08:00, 2012:03:20 07:09:34-07:00, 2012:03:20 07:11:12-07:00, 2012:03:20 07:12:09-07:00, 2012:03:22 09:02:09-07:00, 2012:03:22 09:02:09-07:00&lt;br /&gt;
&lt;br /&gt;
History Software Agent: Adobe Photoshop CS5 Macintosh, Adobe Photoshop CS5 Macintosh, Adobe Photoshop CS5 Macintosh, Adobe Photoshop CS5 Macintosh, Adobe Photoshop CS5 Macintosh, Adobe Photoshop CS5 Macintosh, Adobe Photoshop CS5 Macintosh, Adobe Photoshop CS5 Macintosh, Adobe Photoshop CS5 Macintosh, Adobe Photoshop CS5.1 Windows, Adobe Photoshop CS5.1 Windows, Adobe Photoshop CS5.1 Windows, Adobe Photoshop CS5.1 Windows, Adobe Photoshop CS5.1 Windows, Adobe Photoshop CS5.1 Windows, Adobe Photoshop CS5.1 Windows, Adobe Photoshop CS5.1 Windows, Adobe Photoshop CS5.1 Windows&lt;/blockquote&gt;&lt;br /&gt;
&lt;br /&gt;
Even this seemingly simple picture of a car in front of a building was touched up to the extreme. (By the way, the meta data says it was taken with a Nikon D2X. But the image is 1419x1296 -- a resolution not supported by the camera. So at minimum there was cropping. A deeper analysis detects at least two different color modifications -- one in the YUV/YIQ color space and the second in the XYZ color space, a conversion in the subsampling, and some blemish removals on the yellow wall.) So to say that a modified photo was &quot;accidentally&quot; released seems like an outrageous claim.&lt;br /&gt;
&lt;br /&gt;
Frankly, Shelby would have been better off saying that it was a &lt;a href=&quot;http://www.youtube.com/watch?v=xHjxGb0jJQ8&quot;&gt;tricked out car bouncing&lt;/a&gt;. Instead, they claimed that the manipulation was a joke that was not intended to be released.&lt;br /&gt;
&lt;br /&gt;
And keep in mind: Shelby isn&#039;t the only automobile manufacturer who doctors photos. They all do it. It&#039;s the norm in their industry. And even extreme modifications are commonplace. Otherwise, Ford would have us believe that they have a convertible that can drive really fast &lt;a href=&quot;http://www.ford.com/resources/ford/mustang/2013//gallery/photos/mst13_pg_009_ext_full.jpg&quot;&gt;without messing up the woman&#039;s hair&lt;/a&gt;. (And I don&#039;t know what to say about Honda&#039;s speeding car with tree leaves following it, except &quot;&lt;a href=&quot;http://automobiles.honda.com/images/2012/cr-z/exterior-gallery/gal_lg2.jpg&quot;&gt;very artistic&lt;/a&gt;&quot;.) Shelby&#039;s press release wasn&#039;t an accident. Auto companies have so many people involved in advertisements that they don&#039;t let anything out the door without multiple checks and authoritative approval.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh6.googleusercontent.com/-UaUV8a9wbQk/T3SDUf2QZ7I/AAAAAAAACJ8/QBi-0j9Niow/s400/mst13_pg_009_ext_full.jpg&quot;&gt; &lt;img src=&quot;http://lh4.googleusercontent.com/-7mhpmOt0kmg/T3SDUQk-rlI/AAAAAAAACKE/6ac7-Aubf5k/s400/gal_lg2.jpg&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
For Shelby, caught with a manipulated photo is bad. Caught lying about it is worse. Perhaps this is why their &lt;a href=&quot;http://www.reuters.com/finance/stocks/overview?symbol=CSBI.PK&quot;&gt;stock&lt;/a&gt; is doing the &lt;a href=&quot;http://en.wikipedia.org/wiki/Dead_cat_bounce&quot;&gt;dead-cat-bounce&lt;/a&gt;.&lt;br /&gt;
&lt;img src=&quot;http://lh6.googleusercontent.com/-DOJQ-HIksW8/T3SCm3S-O6I/AAAAAAAACJ0/OW9bJEfCszI/s800/chart_api.asp%253Fwidth%253D458%2526height%253D292%2526symbol%253DCSBI.PK%2526duration%253D5%2526lowers%253Dvolume%2526headertype%253Dnone.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;If Looks Could Kill&lt;/H3&gt;&lt;br /&gt;
I had a friend who was told by a receptionist to not bother applying for a particular job because he&#039;d never get it. Even though the person telling him this wasn&#039;t the decision maker, she had seen enough applicants go through the process that she could already tell the outcome. By the same means, law enforcement officers are trained to evaluate situations. They&#039;re not supposed to detain people who do not appear to have been in the wrong. At least, that&#039;s how the system is &lt;i&gt;supposed to work&lt;/i&gt;.&lt;br /&gt;
&lt;br /&gt;
I&#039;ve heard lots of different -- and conflicting -- stories about the Trayvon Martin shooting. One version says that the shooter (George Zimmerman) fired in self defense. Another says that he shot an innocent person. The police claim that Zimmerman&#039;s actions were justified. The media claims otherwise. Frankly, I&#039;d rather see a trial in a court of law than performed by the media. It isn&#039;t like the family doesn&#039;t have the option to sue the shooter in civil court. But instead, they have taken to the media rather than the judicial system. And the amount of media bias is astounding.&lt;br /&gt;
&lt;br /&gt;
Now keep in mind, I&#039;m not making a decision about who was the victim, whether the shooting was justified, or even about the accuracy of the information provided by the news reports. I&#039;m only focusing on how the &quot;facts&quot; are being reported by the media.&lt;br /&gt;
&lt;br /&gt;
People have &lt;a href=&quot;http://news.yahoo.com/blogs/cutline/trayvon-martin-shooting-debate-over-photos-escalates-155103512.html&quot;&gt;begun to notice&lt;/a&gt; how the media has been presenting this case. Most reports show two photos side-by-side. One picture shows Martin (the dead guy) smiling and fresh faced. Next to him is Zimmerman (the shooter) looking dour.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh5.googleusercontent.com/-T7wJmQy2qHw/T3R_5IJgpHI/AAAAAAAACJk/GVf9vcCdtlE/s800/trayvon-martin-and-george-zimmerman-500x333.jpg&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
These photos actually remind me of the 2007 &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/78-Image-Tweaks-in-the-News.html&quot;&gt;campaign photos&lt;/a&gt; that were enhanced to show a nice looking Obama countered by a shriveled Clinton. (&lt;a href=&quot;http://www.fotoforensics.com/analysis.php?id=b31e18098058e2a63d406a51fc9ef0e016ccb35d.38357&quot;&gt;ELA&lt;/a&gt; shows that the picture was enhanced since Clinton is not at the same error level as Obama.)&lt;br /&gt;
&lt;br /&gt;
&lt;a href=&quot;http://images.usatoday.com/news/_photos/2007/03/02/obamaclinton-large.jpg&quot;&gt;&lt;img src=&quot;http://lh4.googleusercontent.com/-pbATN35VMpI/T3SEv32MlzI/AAAAAAAACKM/dPAP_zwCupU/s800/obamaclinton-large.jpg&quot;&gt;&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
In this case, Martin&#039;s picture is an old photo, showing him younger than 17 years old. They don&#039;t show the picture of him with (alleged) gold teeth (seen in a photo from his now-closed online account). And early reports didn&#039;t mention his &lt;a href=&quot;http://www.washingtonpost.com/blogs/post-partisan/post/sanford-police-throw-pot-at-trayvon-martin/2011/03/04/gIQA4eEgcS_blog.html&quot;&gt;drug-related suspension&lt;/a&gt; from high school a month before the shooting, or his &lt;a href=&quot;http://www.dailymail.co.uk/news/article-2120504/Trayvon-Martin-case-He-suspended-times-caught-burglary-tool.html&quot;&gt;other charges&lt;/a&gt; including vandalism, carrying burglary tools, and being in possession of women&#039;s jewelry (that he claimed did not belong to him).&lt;br /&gt;
&lt;br /&gt;
The Zimmerman&#039;s photo is reportedly from a &lt;a href=&quot;http://www.palmbeachpost.com/news/state/zimmerman-of-sanford-has-past-arrest-charges-2266725.html&quot;&gt;2005/2006 mug shot&lt;/a&gt; for domestic violence, speeding, and &quot;tussling with a police office&quot;. (Seriously? &quot;Tussling&quot;? Was that the official charge? Gotta find the humor in media reports.)&lt;br /&gt;
&lt;br /&gt;
Ironically, Martin&#039;s family says that pulling up Trayvon&#039;s past (from as recent as a month before his death) is a &quot;smear campaign&quot; initiated by the police. While nobody mentions that doing the same thing to Zimmerman (mention his arrests more than 5 years ago) is a smear campaign against him. From my viewpoint, it appears that neither person was an angel. But the media seems to be trying to heavily sway a bias.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;As I See It&lt;/H3&gt;&lt;br /&gt;
Both of these stories from this week&#039;s news drive home the importance of truth in photography and balanced news reporting. Pictures sway opinions. When combined with biased news reports, they can polarize an entire population.&lt;br /&gt;
&lt;br /&gt;
It is clear that the media is not fair and reports are not balanced. It is also clear that photo manipulation -- whether to make a vehicle look more desirable or a person more innocent/guilty -- is commonplace. It is for this reason that we need to watch closely for signs of manipulation. Not just manipulation to make pictures look nicer, but manipulation designed to sway our opinions.&lt;br /&gt;
&lt;br /&gt;
&lt;small&gt;Thanks to &lt;a href=&quot;http://www.fourandsix.com/&quot;&gt;FourAndSix&lt;/a&gt; for the news stories, and Woody for the blog title.&lt;/small&gt; 
    </content:encoded>

    <pubDate>Thu, 29 Mar 2012 12:58:52 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/476-guid.html</guid>
    
</item>
<item>
    <title>You Want My Facebook Password?</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/475-You-Want-My-Facebook-Password.html</link>
            <category>Privacy</category>
            <category>Security</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/475-You-Want-My-Facebook-Password.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=475</wfw:comment>

    <slash:comments>9</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=475</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    According to the &lt;i&gt;Wall Street Journal&lt;/i&gt;, job seekers are being asked during the interview to provide their &lt;a href=&quot;http://online.wsj.com/article/AP35b6fb378cc64062a3bceb87e17e2e03.html&quot;&gt;Facebook passwords&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
Wow. The number one rule that every security evangelist promotes is &quot;Don&#039;t Share Passwords&quot;. This includes giving your Facebook password to a potential employer. Moreover, Facebook is a personal social network with public and private content. What kind of employer wants to judge your worthiness based on private, and not public, content? Does that private message your mom wrote on your wall really reflect your skill set?&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Pick Your Battles Carefully&lt;/H3&gt;&lt;br /&gt;
I always say that people shouldn&#039;t fight TSA at the airport -- unless you don&#039;t mind missing your flight. Instead, file a complaint the instant you get past the security checkpoint. Similarly, if you really want the job, then don&#039;t fight the interviewer on this point -- there are plenty of other options available.&lt;br /&gt;
&lt;br /&gt;
So, what should you do if the job interviewer asks you for this information? My first through is to ask yourself: do you really want to work for a company that is &lt;i&gt;this&lt;/i&gt; intrusive? But then again, I have a job. Other people are desperate for work and in these desperate times, we do desperate things. However, you can still ask them -- with curiosity and not defensiveness -- why they require this information, who will have access to your password, who will see your private information, and how often will they be checking your account? Emphasize that you take security &lt;i&gt;very&lt;/i&gt; seriously and sharing passwords with anyone does not seem like a good idea.&lt;br /&gt;
&lt;br /&gt;
You might also want to mention that sharing your password with anyone is a direct violation of the Facebook Terms of Service. (While their ToS changes regularly, this issue is &lt;a href=&quot;http://www.facebook.com/legal/terms&quot;&gt;currently listed&lt;/a&gt; under Section 4, &quot;Registration and Account Security&quot;, item #8: &quot;You will not share your password, (or in the case of developers, your secret key), let anyone else access your account, or do anything else that might jeopardize the security of your account.&quot;) Are they seriously asking you to breach a service contract in order to successfully complete the interview? Let them know that, just as you won&#039;t breach any contract with their company, you will not breach a contract with Facebook.&lt;br /&gt;
&lt;br /&gt;
The WSJ article said that people who feel uncomfortable with providing their passwords may be asked to login at the human resources desk or to &quot;friend&quot; the HR person so they can monitor your non-public content. Even in these cases, ask them if you can use your own laptop to login because you are concerned about cookie theft and the security of your account. I&#039;m also fine with accepting HR as a &quot;friend&quot; since you can place friends in categories and you can put their HR person in a &lt;a href=&quot;http://www.facebook.com/help/?faq=206571136073851&amp;_fb_noscript=1&quot;&gt;restricted category&lt;/a&gt; that can only see your public information.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;An Ounce of Prevention&lt;/H3&gt;&lt;br /&gt;
If you suspect ahead of time that you might be asked this question, then you can create a new Facebook account before going to the interview. Give it a minimal configuration and a unique (and friendly) password, like &quot;AllWorkNoPlay&quot;. Let their HR person read into this whatever they want. Inform them that you rarely use Facebook.&lt;br /&gt;
&lt;br /&gt;
Then again, your existing profile may include your name, pictures, and more. So even a semi-competent HR minion will see through this ploy. Since you&#039;re giving them your login, be sure to minimize the damage: don&#039;t use that password anywhere else and disconnect your links to &lt;a href=&quot;http://www.facebook.com/note.php?note_id=404086182059&amp;_fb_noscript=1&quot;&gt;Twitter&lt;/a&gt;, &lt;a href=&quot;http://www.facebook.com/Gmail&quot;&gt;Gmail&lt;/a&gt;, and other services. The last thing you want is for HR to gain access to more than Facebook.&lt;br /&gt;
&lt;br /&gt;
And since you suspect that you&#039;ll be giving them access, clean the place up. Remove all swear words, comments that insult anyone, and pictures of you drinking beer or partying.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Oh Those Games We Play&lt;/H3&gt;&lt;br /&gt;
Of course, my brain immediately went to the lottery scenario. I could only &lt;i&gt;wish&lt;/i&gt; that someone would ask me for my Facebook information! This is like winning the lottery! You cannot lose here! Some of the options that just popped into my head:&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;No job? Sue!&lt;/b&gt;&lt;br /&gt;
Regardless of what they give as a reason for not hiring you, file a complaint that you were dismissed after the interviewer took offense at something in your Facebook account. What was there? Your religious preference? Political orientation? Close ties to family or friends? Taste in music? Whatever it was, that is a bias that is unrelated to your ability to perform the job. Sue them.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Smear it!&lt;/b&gt;&lt;br /&gt;
I don&#039;t care whether you got the job or not. When you get home, anonymously post your Facebook password to one of 4chan&#039;s random boards. Let other people into the account -- other people who you know will deface, impersonate, post insults, and worse. Then, &lt;i&gt;blame the company&lt;/i&gt; for leaking your password! Be sure to point out that the compromise didn&#039;t happen until &lt;i&gt;after&lt;/i&gt; you gave them your password during the interview.&lt;br /&gt;
&lt;br /&gt;
If they ask how it was leaked, well, you don&#039;t know and you don&#039;t have to know. Maybe they didn&#039;t have an antivirus or their antivirus was ineffective (accuse them of having malware). Maybe they told someone else who leaked it. Heck, maybe the interviewer leaked it just for fun. You don&#039;t care. You only know that your online profile and reputation has been tarnished -- and you are going to sue... right after you report this in a YouTube rant and to CNN!&lt;br /&gt;
&lt;br /&gt;
Of course, no company will ever let this sit. (Particularly if they are a publicly traded company.) They will want to shut you up as fast as possible. &quot;Please don&#039;t post to YouTube, give us a moment to investigate this.&quot; Apply pressure: don&#039;t wait for more than a day because this is &lt;i&gt;your&lt;/i&gt; reputation. Meanwhile, contact Facebook and ask them to suspend the account.&lt;br /&gt;
&lt;br /&gt;
The employer won&#039;t want the negative press, they won&#039;t want a lawsuit, and they will know that they were in the wrong to require your password. Expect a big payout just to keep you quiet. (And if you got the job and then lose it because of this? Get a second payout for wrongful termination.)&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Change is Good&lt;/H3&gt;&lt;br /&gt;
But let&#039;s say you want to take the easy-out. You don&#039;t want to get rich quick through a legal complaint; you just want a job. And you feel that you must show them your account. When you get home, change your password. Nobody except you should know your password. 
    </content:encoded>

    <pubDate>Tue, 20 Mar 2012 19:04:59 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/475-guid.html</guid>
    
</item>
<item>
    <title>Or Perish</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/474-Or-Perish.html</link>
            <category>Conferences</category>
            <category>Mass Media</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/474-Or-Perish.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=474</wfw:comment>

    <slash:comments>3</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=474</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    One of my coworkers is trying to convince me to submit some of my research to academic journals. Although I reference other people&#039;s published papers, I have only made my work public through conferences, blog entries, and white papers; not through scholarly journals.&lt;br /&gt;
&lt;br /&gt;
It isn&#039;t that I don&#039;t think I could get them published. Or even that I don&#039;t want to publish my work. Rather, during my six years of graduate school, I became bitter and disillusioned by the publication process. Here are my six reasons for not submitting for publication...&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;1. Speed to Publication&lt;/H3&gt;&lt;br /&gt;
The publication process is not speedy. Papers may take over a year just to be reviewed. After the review (assuming no revisions needed), the paper&#039;s publication may still be delayed by a year or more as it enters the queue of items needing to be published.&lt;br /&gt;
&lt;br /&gt;
In the computer security world, speed means everything. A month is the difference between a few compromised systems and widespread havoc. What if a paper describing a new way viruses spread isn&#039;t published for two years? By the time it comes out, the vulnerability will likely be patched (or made obsolete by updates) and the novel approach becomes old news.&lt;br /&gt;
&lt;br /&gt;
In contrast, conferences generally take six months to a year from submission to presentation. I can submit a talk at a security conference and present the material to my peers long before it would ever be printed in a journal. For an even faster disclosure, I have the option to blog, post to a widely-read forum, or otherwise just put it out there.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;2. Peer Review&lt;/H3&gt;&lt;br /&gt;
Some people associate journals with a thorough peer review process. However, I have seen far too many papers that are either incomplete or inaccurate. The incompleteness is usually due to missing steps, unspecified assumptions, or an assumed common background which may not exist.&lt;br /&gt;
&lt;br /&gt;
For example, I was recently reading a paper on saliency. In computer vision, a &lt;i&gt;saliency&lt;/i&gt; algorithm identifies the areas that initially attract the eye. This is the visually important area. As Craig Reynolds wrote in LinkedIn&#039;s Image Processing Interest Group:&lt;br /&gt;
&lt;blockquote&gt;[A]n iconic example of saliency is a photo of a field of green plants with yellow flowers, and one red flower. The saliency map will have a peak at the red flower. It finds areas of the image which differ in hue, brightness, dominant frequency, etc. However ... Imagine a photo of a dozen smiling faces and one frowning face. A human observer would zero in on the frowning face while saliency presumably wouldn&#039;t notice it.&lt;/blockquote&gt;&lt;br /&gt;
&lt;br /&gt;
The paper I was reading is &quot;&lt;a href=&quot;http://www.cranfield.ac.uk/~toby.breckon/publications/papers/katramados11salient.pdf&quot;&gt;Real-time Visual Saliency by Division of Gaussians&lt;/a&gt;&quot; (I. Katramados, T.P. Breckon), &lt;i&gt;In Proc. International Conference on Image Processing&lt;/i&gt;, IEEE, pp. 1741-1744, 2011. This paper describes a very interesting variation to saliency mapping that is supposed to be wickedly fast.  And the paper is well written... until you get to Step 2 on page 2. This step defines a second Gaussian pyramid &#039;D&#039;, but never explains what &#039;D&#039; is, or how &#039;D&#039; differs from &#039;U&#039; (&#039;U&#039; is defined in Step 1 as a Gaussian pyramid). Step 3 references a &#039;MiR&#039; matrix, but never defines this term. I would hope that, if I could find out what MiR means, I could properly evaluate this paper. But there isn&#039;t even a reference to this term (and Google pulls up nothing relevant). In effect, this paper describes the speed of a very fast algorithm, but lacks details needed to understand how the algorithm works.&lt;br /&gt;
&lt;br /&gt;
In contrast, I found &lt;a href=&quot;http://ivrgwww.epfl.ch/supplementary_material/RK_CVPR09/index.html&quot;&gt;the paper&lt;/a&gt; &quot;Frequency-tuned Salient Region Detection&quot; to be very informative (R. Achanta, S. Hemami, F. Estrada and S. Süsstrunk, &quot;Frequency-tuned Salient Region Detection&quot;, &lt;i&gt;IEEE International Conference on Computer Vision and Pattern Recognition&lt;/i&gt; (CVPR), 2009). This paper is easy to read and the diagrams really explain the algorithm well. (I implemented it in 10 minutes and, with a little tweaking, got it working with even better results than they claim.) If it wasn&#039;t for one typographical error in their third figure, I would say that this paper is perfect. (The &#039;if&#039; condition should compare each S&lt;sub&gt;k&lt;/sub&gt; to 2&amp;times;S&lt;sub&gt;&amp;micro;&lt;/sub&gt;, and the scalar 2 should be adjusted based on the number of graph segments.)&lt;br /&gt;
&lt;br /&gt;
So the peer review process failed to identify missing steps, undefined variables, and typographical errors in these papers. And this is just from papers I was reading one night; I see this kind of mistake in nearly every paper. On occasion, I review papers for IEEE. I end up rejecting most of them due to readability, missing steps, errors, and false premises. (I still shake my head about a paper on a vulnerability that started off with the assumption that the system was fully compromised by another exploit from the same attacker. Uh, if the system is already fully compromised, then exploiting another vulnerability doesn&#039;t gain you anything.)&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;3. Readability&lt;/H3&gt;&lt;br /&gt;
I&#039;m dismayed by the number of papers that are nothing more than technical mumbo jumbo. I described most of these problems in a &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/221-Oh-Baby,-Talk-Technical-To-Me.html&quot;&gt;previous blog entry&lt;/a&gt;. I really do believe that the number of mathematical symbols is inversely proportional to the value of the content.&lt;br /&gt;
&lt;br /&gt;
On occasion I see this leak through to source code. If I see a program with variable names like &quot;omega&quot; and &quot;phi&quot;, then this tells me that the programmer doesn&#039;t know what the hell they are doing. They are just copying an algorithm that someone else wrote. If you really understand the algorithm, then you will use intelligent variable names. Rather than writing &quot;lambda=7&quot;, you might write &quot;AdjustmentRate=7&quot;. Frankly, I&#039;d rather see &quot;L=7&quot; than &quot;lambda=7&quot; since at least this tells me that the variable&#039;s name is just a shortcut for something, and not the spelled out shortcut to something that someone wrote on a whiteboard. (Writing out &quot;lambda&quot; is like writing &quot;Ell=7&quot;. Spelling out the phonetic pronunciation of the single-letter variable doesn&#039;t make it any more readable.)&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;4. References&lt;/H3&gt;&lt;br /&gt;
Back in grad school, I had a teacher who assigned a few books and a ton of papers as light reading. Everyone in the class quickly noticed that every single reference was limited to a small group of authors (which included the professor). It isn&#039;t that there were not other authors in this field or that some of those other papers were not as good or better than the ones supplied. This was strictly friends citing friends -- no outsiders allowed.&lt;br /&gt;
&lt;br /&gt;
As it turns out, the problem of cyclical citations is very wide-spread. This is similar to Google&#039;s issue with cyclical links in order to raise your page-rank. A small circle of friends repeatedly citing your paper doesn&#039;t mean you have a good paper. But unlike Google, the journals don&#039;t seem to be actively combating this problem.&lt;br /&gt;
&lt;br /&gt;
On occasion, I have also seen papers that list fake sources, unavailable references, or that ignore the main paper on the topic in order to claim some partial credit for the discovery. But the thing that offends me the most is that most academic papers won&#039;t cite non-academic sources. Conference proceedings, presentations, whitepapers, and online resources are usually ignored. (I&#039;m not saying that everyone should cite Wikipedia, but some Wikipedia entries are well written and make good references. It just depends on the Wikipedia article and the current state of the entry.)&lt;br /&gt;
&lt;br /&gt;
This is different from conferences, whitepapers, blogs, etc. -- these types of disclosures have no problem citing references and/or linking to authoritative sources beyond academic journals. And if you miss a big one or attempt to plagiarize work, your peers &lt;i&gt;will&lt;/i&gt; let you know.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;5. Cost&lt;/H3&gt;&lt;br /&gt;
The price for some journal articles is just staggering. While some papers cost $10 or $20 for people who want to read it, others cost $100 or more. And that&#039;s just for the papers. What if you want the whole journal? Now you&#039;re talking about the &lt;a href=&quot;http://store.elsevier.com/searchResults.jsp?frPag=BS&amp;_requestid=4763&quot;&gt;$100 to $2000&lt;/a&gt; range. And a subscription? Prices can &lt;a href=&quot;http://www.elsevier.com/wps/find/journalpricing.cws_home/subscrippricelistlibr/description&quot;&gt;range&lt;/a&gt; from a few hundred to tens of thousands of dollars per year. (Is the journal &lt;i&gt;Adverse Reactions Titles (Section 38 EMBASE)&lt;/i&gt; really worth $31,298 for 12 issues?)&lt;br /&gt;
&lt;br /&gt;
Conferences may not be cheap. Transportation, food, lodging, and the admission fee could cost for a few thousands dollars. But you still get more out of it than one academic journal offers. And that&#039;s assuming that you attend in person. Many conferences offer streaming videos, CDs of papers/slides, and more. And most of the expensive conferences have more talks than any single journal has articles.&lt;br /&gt;
&lt;br /&gt;
The best thing about conferences is that it is a true peer-review system. You are speaking to your peers. If you are bad or wrong, they &lt;i&gt;will&lt;/i&gt; let you know it. And if your content is good, they will explicitly tell you how good it is.&lt;br /&gt;
&lt;br /&gt;
The blogosphere is another good medium. Content is usually free and peer feedback is immediate (and sometimes brutal).&lt;br /&gt;
&lt;br /&gt;
But what if you want to make money? I mean, it took time and effort to write that whitepaper. Well, self-published ebooks are a good option. If the paper is really good and useful, I&#039;ll pay a few dollars for it.&lt;br /&gt;
&lt;br /&gt;
While academic journals claim to be about making information available, they are price limiting. Only the richest can afford this information. I might be persuaded that they are worth the cost if the entries are complete, accurate, and readable, but they usually are not.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;6. Variations&lt;/H3&gt;&lt;br /&gt;
At most universities, the quantity of publications is a key tenure metric. So rather than describing everything in one paper, they will spread it over a dozen papers. What you end up with are dozens of very similar papers that differ in the minor details. In order to get the full concept, you must track down all of the variants and find out what each one contains.&lt;br /&gt;
&lt;br /&gt;
Academic papers also have a sense of one-upmanship. Every paper with a minor increase in performance, accuracy, or precision is a &#039;scholarly topic&#039;. What you end up with are tons of papers with minor variations or special circumstances that claim to be better than other papers.&lt;br /&gt;
&lt;br /&gt;
Ironically, most journals are open to minor variations but closed to completely radical concepts. In the February issue of &lt;i&gt;Wired&lt;/i&gt; magazine is a story about a brain researcher who &lt;a href=&quot;http://www.wired.com/magazine/2012/02/ff_forgettingpill/all/1&quot;&gt;discovered how to erase memories&lt;/a&gt;. The concept seems sound and the experiments were reported to be repeatable, provable, and definitive. (The brain creates proteins to recall memories. If you stop the proteins from forming, then you forget the memory.) Yet the researcher couldn’t get published anywhere. This is because things that run contrary to the norm are undesirable to journals. (He should have refocused his paper topic on &#039;forgetting faster&#039; or &#039;more accurately forgetting&#039; since one-upmanship is publishable.)&lt;br /&gt;
&lt;br /&gt;
Perhaps the journals are unwilling to risk being proven wrong later. Or maybe they don&#039;t want to contradict things that they previously published. Or maybe some of the reviewers had a personal stake in not being proven wrong. (&quot;Contrary research&quot; and &quot;peer review&quot; means you are proving your peers wrong. And if you are trying to get tenure and your work is shown to be wrong, then this impacts your livelihood. Following this mentality, it is better to have a nice house and a steady income than permit correct and radical work to be published.)&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;The Up Side&lt;/H3&gt;&lt;br /&gt;
There is an up-side to academic publications. These journals are usually the first place people go when doing research. (Well, today they go to Google first and the stacks second...) The content is considered authoritative and the authors are considered experts.&lt;br /&gt;
&lt;br /&gt;
Perhaps I&#039;m just hyper-focusing on the negatives. Are there other benefits from scholarly journals? Are my views too biased? 
    </content:encoded>

    <pubDate>Fri, 16 Mar 2012 13:45:21 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/474-guid.html</guid>
    
</item>
<item>
    <title>Keystone Kops</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/473-Keystone-Kops.html</link>
            <category>Mass Media</category>
            <category>Politics</category>
            <category>Unfiction</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/473-Keystone-Kops.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=473</wfw:comment>

    <slash:comments>1</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=473</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    Ever since Maricopa County Sheriff Joe Arpaio &lt;a href=&quot;http://www.politico.com/news/stories/0312/73537.html&quot;&gt;publicly declared&lt;/a&gt; that Obama&#039;s birth certificate was a forgery, I have been receiving a steady flow of requests to evaluate this claim. Personally, I&#039;ve been trying to stay out of it since it really isn&#039;t new. Arpaio&#039;s entire claim is based on the fact that the PDF document contains &quot;layers&quot; and he interprets that as fraud.&lt;br /&gt;
&lt;br /&gt;
I &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/428-After-Birth.html&quot;&gt;debunked this specific claim of fraud&lt;/a&gt; nearly a year ago. The layers are actually due to a simple optimized setting when the PDF was created. Since then, major media outlets have duplicated the same layer-effect by enabling the optimization, and there are even &lt;a href=&quot;http://www.youtube.com/watch?v=XcWQw2AAIho&quot;&gt;YouTube&lt;/a&gt; &lt;a href=&quot;http://www.youtube.com/watch?v=ZXdtQ2FU4Ww&quot;&gt;videos&lt;/a&gt; that show how to do it yourself. To reiterate: it is not fraud; it is a configuration setting for one type of optimized PDF. The results are repeatable and conclusive. And most importantly: this claim of fraud was debunked nearly a year ago.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Behind the Badge&lt;/H3&gt;&lt;br /&gt;
Normally I would just disregard this as another Birther conspiracy -- one that the extremists want to believe so strongly that they will disregard all evidence to the contrary. Kind of like how they said Obama was a Muslim who follows a radical Christian Pastor (except that Muslims don&#039;t follow Christian Pastors).&lt;br /&gt;
&lt;br /&gt;
However, Joe Arpaio isn&#039;t just your run-of-the-mill conspiracy nut. He&#039;s a sheriff. He&#039;s a law enforcement officer. He has taken an oath to protect and to serve. So when he calls for a &lt;a href=&quot;http://www.politico.com/news/stories/0312/73537.html&quot;&gt;criminal investigation&lt;/a&gt;, one would assume that he has real facts to back up his claim.&lt;br /&gt;
&lt;br /&gt;
Over at the &lt;i&gt;East Valley Tribune&lt;/i&gt; and Politico are &lt;a href=&quot;http://www.eastvalleytribune.com/arizona/article_f171a23a-63f1-11e1-b5a2-001871e3ce6c.html&quot;&gt;detailed&lt;/a&gt; &lt;a href=&quot;http://www.politico.com/news/stories/0312/73537.html&quot;&gt;write-ups&lt;/a&gt; of Sheriff Arpaio&#039;s claim. The facts listed by these media outlets are backed up by the &lt;a href=&quot;http://www.youtube.com/watch?v=Vrg6z489K3Q&quot;&gt;press conference&lt;/a&gt; that Arpaio held; he really did say these things.&lt;br /&gt;
&lt;br /&gt;
Frankly, I am just amazed that someone who is supposed to uphold the law would conduct an investigation this way. According to Arpaio, both he and Mike Zullo -- head of the &quot;Cold Case Posse&quot; -- relied on research conducted by unnamed &#039;volunteers&#039;. He said many things that are provably inaccurate. For example, he said that the layers in the PDF were an indication of tampering, even though there are plenty of videos and write-ups that show step-by-step how to enable the optimization and see the same type of layers yourself. One would think that, after six months of investigation, they would have actually seen that it is trivial to duplicate by enabling the optimization. I cannot help by think that the unnamed volunteers are actually people like &lt;a href=&quot;http://barackryphal.blogspot.com/2009/07/meet-ronald-jay-polland.html&quot;&gt;Ron Polland&lt;/a&gt; -- a person who repeatedly lied about his credentials and used pseudo-science to support his &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/235-Bad-Science-How-Not-To-Do-Image-Analysis-Part-II.html&quot;&gt;baseless claims&lt;/a&gt; of fraud and tampering.&lt;br /&gt;
&lt;br /&gt;
More stunning is this quote from the &lt;i&gt;East Valley Tribune&lt;/i&gt;:&lt;br /&gt;
&lt;br /&gt;
&lt;blockquote&gt;Zullo also said the fact the Honolulu newspaper had a birth announcement at the time proves nothing, saying the newspaper accepted such messages without requiring proof the person actually had been born in the state.&lt;/blockquote&gt;&lt;br /&gt;
&lt;br /&gt;
So... Zullo is implying that the conspiracy to elect Obama dates back over 51 years. I mean, Zullo is implying that the Honolulu newspaper regularly announces non-Hawaiian births for foreigners. This would make sense if the paper had a few million birth announcements for 1961. Otherwise, why would the paper randomly select one alleged foreign birth out of everyone born in the world &lt;i&gt;and&lt;/i&gt; forget to mention Kenya? And what are the odds that the random selection became President nearly a half-century later? (Hey, perhaps this is related to the &lt;a href=&quot;http://en.wikipedia.org/wiki/Bay_of_Pigs_Invasion&quot;&gt;Bay of Pigs invasion&lt;/a&gt; that happened a few months before Obama was born...)&lt;br /&gt;
&lt;br /&gt;
I have some serious concerns here. If this is the type of logic that Zullo and Arpaio use for investigations, then all of their previous work should be suspect.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Then again&lt;/H3&gt;&lt;br /&gt;
Then again, this isn&#039;t the first time Arpaio has gotten in trouble for shoddy investigations. For example, his office failed to investigate the sexual assault of a &lt;a href=&quot;http://en.wikipedia.org/wiki/Maricopa_County_Sheriff%27s_Office_controversies&quot;&gt;14-year-old&lt;/a&gt;. It was later found that his office had failed to investigate &lt;a href=&quot;http://www.nytimes.com/2011/12/10/us/sheriff-joe-arpaio-criticized-over-handling-of-sex-crimes-cases.html&quot;&gt;over 400 cases&lt;/a&gt; of sexual abuse, including crimes against children. &lt;a href=&quot;http://www.timesherald.com/article/20120301/NEWS04/120309985&quot;&gt;According to the Sheriff&#039;s office&lt;/a&gt;, there was a backlog and it was &quot;cleared up after the problem was brought to Arpaio&#039;s attention.&quot; I cannot help but wonder how they closed out these cases since they claim to have accumulated 400 cases in 3 years and resolved the backlog in less than 2 years. That&#039;s a rate of closing more than 2 cases per day (assuming new reports were coming in at the same rate). How thorough could these investigations have really been?&lt;br /&gt;
&lt;br /&gt;
Sheriff Arpaio has also been &lt;a href=&quot;http://www.azcentral.com/news/election/azelections/articles/2010/09/02/20100902joe-arpaio-sued-by-justice-department-brk-02-ON.html&quot;&gt;sued&lt;/a&gt; by the Justice Department for civil rights violations, &lt;a href=&quot;http://www.huffingtonpost.com/2011/12/21/sheriff-joe-arpaio_n_1163490.html&quot;&gt;investigated&lt;/a&gt; for deaths and injuries of people in custody, and found to &lt;a href=&quot;http://www.azcentral.com/news/articles/2008/10/23/20081023ruling1023.html&quot;&gt;violate&lt;/a&gt; the constitutional rights of inmates. (I was surprised to find an entire &lt;a href=&quot;http://en.wikipedia.org/wiki/Maricopa_County_Sheriff&#039;s_Office_controversies&quot;&gt;Wikipedia page&lt;/a&gt; devoted to controversies from his Sheriff&#039;s office.)&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Not Alone, Yet So Alone&lt;/H3&gt;&lt;br /&gt;
Today, most people seem to view the Birthers as nuts who fear a black democratic president -- even though he has already been in office for over 3 years, the world has not ended, and we are &lt;a href=&quot;http://obamaachievements.org/list&quot;&gt;much better off&lt;/a&gt; compared to how Bush left us.&lt;br /&gt;
&lt;br /&gt;
Most of the notable Birthers have gone quiet on the topic. Donald Trump and Roy Blunt have not said much since Obama produced his birth certificate, Newt Gingrich has mainly limited his comments to political gaffs, and even idiot &lt;strike&gt;savant&lt;/strike&gt; Sarah Palin isn&#039;t in the news much anymore.&lt;br /&gt;
&lt;br /&gt;
While researching this story, I only came upon one interesting birther update. Last month Orly Taitz managed to bring her complaint -- that Obama is not eligible to be President -- to a court of law. The defense (Obama&#039;s legal team) decided that the claims were so baseless that they didn&#039;t even show up to court. That&#039;s right, Taitz was allowed to make her claims and present her evidence without any counter argument or challenge from the defense. And &lt;a href=&quot;http://www.dailykos.com/story/2012/02/03/1061562/-Orly-Taitz-loses-birther-case-to-an-empty-table&quot;&gt;she still lost&lt;/a&gt;. The judge found that Taitz offered no credible evidence and had no case.&lt;br /&gt;
&lt;br /&gt;
So I have to wonder: if the courts believe Obama qualifies for the Presidency, all Birther arguments about a fake birth certificate are easily explained with rational arguments and provable demonstrations, and even the State of Hawaii has consistently said that Obama is a US citizen, why is it that Sheriff Arpaio could not uncover any of this in his &quot;six month investigation&quot;? I can only conclude that Arpaio either did not conduct an investigation (fraud) or was so grossly incompetent that all investigation performed by his office should be reviewed for criminal ineptitude. (Arpaio: I hope you like &lt;a href=&quot;http://www.bizjournals.com/phoenix/morning_call/2012/03/courts-may-ban-arpaios-pink-underwear.html&quot;&gt;pink underwear&lt;/a&gt;.) 
    </content:encoded>

    <pubDate>Mon, 12 Mar 2012 17:40:08 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/473-guid.html</guid>
    
</item>
<item>
    <title>Use Cases</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/472-Use-Cases.html</link>
            <category>Forensics</category>
            <category>FotoForensics</category>
            <category>Image Analysis</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/472-Use-Cases.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=472</wfw:comment>

    <slash:comments>10</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=472</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    One of my big concerns regarding any kind of public forensic analysis site is that it would be misused. I worried that people would upload pictures and -- regardless of the results -- claim that a picture was real or fake (or whatever supports their viewpoint). For this reason, I created a basic tutorial and FAQ to help people understand the results. Then again, I was also concerned that people would ignore these documents.&lt;br /&gt;
&lt;br /&gt;
Well, I am no longer worried. I can now honestly say that the tutorial is being used. I&#039;ve been watching the web logs and the vast majority of new visitors either go the the tutorial and then upload pictures, or upload a picture and then go to the tutorial.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;The Naked Truth&lt;/H3&gt;&lt;br /&gt;
We&#039;ve been working on a few behind-the-scenes items. For example, we&#039;re getting better at rapidly &lt;u&gt;banning&lt;/u&gt; perverts who upload porn. As the &lt;a href=&quot;http://fotoforensics.com/faq.php#Is%20this%20site%20broken&quot;&gt;FAQ states&lt;/a&gt;, FotoForensics is not for your personal porn archive. Right now, porn comprises about 3% of the uploaded images but comes from far less than 1% of people who upload images. However, without content filtering the perverts would run rampant over the entire site.&lt;br /&gt;
&lt;br /&gt;
We have a simple policy concerning pornography: block and ban. The content is blocked and the person is banned. And we might even choose to humiliate perverts who complain about being banned without telling the whole truth about why they were banned. Keep in mind: we are only censoring pornography. So if you see someone complain about the site banning users, you should ask them: what kind of perverted content did they upload? (And if you&#039;re curious or bi-curious about &lt;a href=&quot;https://twitter.com/#!/KaraDemon/statuses/170761524623843328&quot;&gt;Ricardo&lt;/a&gt;, he&#039;s an ass-man.)&lt;br /&gt;
&lt;br /&gt;
So far, the most disturbing source of pictures has been from the &lt;a href=&quot;http://www.cca.com/&quot;&gt;Corrections Corporation of America&lt;/a&gt;. CCA runs many of the US prisons. I sent them a notification through their site&#039;s contact form, but haven&#039;t heard a reply. The disturbing part is that I don&#039;t know if it is an inmate or a guard. Then again, a major airline appears to employ a pervert with child-related fantasies and that company didn&#039;t reply either.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;From Abuse To Use&lt;/H3&gt;&lt;br /&gt;
The vast majority of the people who use FotoForensics are not uploading porn. (97% of the pictures and over 99% of the site visitors are basically workplace safe. And 3% of the pictures have already been blocked and removed.)&lt;br /&gt;
&lt;br /&gt;
Today I decided to search and see who was using &lt;a href=&quot;http://fotoforensics.com/&quot;&gt;FotoForensics&lt;/a&gt;. I was thrilled to see that Google has already indexed plenty of people using the system -- and using it correctly! (I guess the tutorial works!)&lt;br /&gt;
&lt;br /&gt;
For example, over at &lt;a href=&quot;http://fstoppers.com/news-canons-5d-mark-iii-first-pics-in-the-wild&quot;&gt;Fstoppers&lt;/a&gt;, someone used FotoForensics to identify that pictures of the new Canon EOS 5D Mark III were actually modified. And they are correct -- the pictures are not camera-original. This particular &lt;a href=&quot;http://fotoforensics.com/analysis.php?id=fe4554f21c0172f3624628f9a22a956c30a17e91.115239&quot;&gt;ELA result&lt;/a&gt; is typically found with artificial sharpening. Someone wanted the picture to look sharper than the original photo appeared. However, this isn&#039;t too grievous of a manipulation and does not appear to have been malicious.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh3.googleusercontent.com/-MLxFeBTi3Hw/T1KYcUkvnBI/AAAAAAAACIo/dKgDzU7em0M/s800/5d01.jpeg&quot;&gt;&lt;img src=&quot;http://lh6.googleusercontent.com/-d1Q9GnitZJE/T1KYcle9mBI/AAAAAAAACJA/f8bVZp_BUrg/s800/5d01-ela.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Red Light District&lt;/H3&gt;&lt;br /&gt;
But my favorite use of FotoForensics so far comes from the UFO community. I like these people because many of them really do want to investigate and not just blindly accept things as fact.&lt;br /&gt;
&lt;br /&gt;
Over at 2012hoax.org is a posting about a &lt;a href=&quot;http://www.2012hoax.org/forum/t-443039/big-red-object-new-mexico&quot;&gt;Big Red Object New Mexico&lt;/a&gt;. This is a continuation of the original posting found at &lt;a href=&quot;http://www.abovetopsecret.com/forum/thread653208/pg1&quot;&gt;Above Top Secret&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
The photo shows the night sky, a bright white object (the moon) and a bright red object near it (the unidentified object). According to the posting, &quot;ALL HE DID WAS EMAIL THE IMAGE FROM HIS PHONE TO HIS COMPUTER THEN POSTED IT ONTO THE WEBSITE&quot; (it was written in ALL CAPS). Here&#039;s the picture:&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh6.googleusercontent.com/-hhHCgY-qBGM/T1KYc4mcDvI/AAAAAAAACIw/Q5ZJbppJomI/s800/5ff95d18e982.jpg&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
There are so many problems with this picture that it makes a great analysis example.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Meta data. The first claim is that this came directly from his cellphone camera. However, it is missing camera meta data; it is &lt;b&gt;not&lt;/b&gt; camera-original. Moreover, it has an embedded JPEG comment: &quot;CREATOR: gd-jpeg v1.0 (using IJG JPEG v62), quality = 80&quot;. So now we know that it used a software image library to last save it at 80% quality.&lt;br /&gt;
&lt;li&gt;Quality. Just looking at the picture, you can see grainy 8x8 blocky artifacts. This comes from multiple resaves.&lt;br /&gt;
&lt;li&gt;ELA. The error level analysis is mostly black, supporting the observed artifacts from multiple resaves. The picture uses 16x16 chrominance subsampling, so I&#039;ve drawn in the 16x16 grid.&lt;br /&gt;
&lt;img src=&quot;http://lh3.googleusercontent.com/-UPbxSivFLh0/T1KYcZ8TrOI/AAAAAAAACIk/svW2qgnqcPA/s800/5ff95d18e982-ela.png&quot;&gt;&lt;br /&gt;
Notice how the red dot impacts the four 16x16 grids that contain it; it is at a much higher quality than the rest of the image. If this were simply an issue of the red dot being bright, then the white/moon should also have a higher error level potential. If this red dot were original, it would look similar to the white light&#039;s faded ELA result.&lt;br /&gt;
&lt;li&gt;LG. The luminance gradient shows us something else.&lt;br /&gt;
&lt;img src=&quot;http://lh3.googleusercontent.com/-kjxA2s5YdN8/T1KZoa8nlpI/AAAAAAAACJM/rX-6CQS-LmU/s800/5ff95d18e982-lg.png&quot;&gt;&lt;br /&gt;
The white light has a very wide gradient -- like a real light. In fact, the spurious noise seen far from the white light actually seem to form a loose ring around the white light. In contrast, the gradient from the red object is strictly limited to the four 16x16 squares; there is no ringing and it cuts off sharply. If it were really that bright, it would radiate out further than 16 pixels from the center.&lt;br /&gt;
&lt;li&gt;PCA. The Principal Component Analysis is sensitive to the image quality and JPEG artifacts. (I&#039;ve applied a colorization just to make the dark picture easier to see.)&lt;br /&gt;
&lt;img src=&quot;http://lh6.googleusercontent.com/-j14BbqrtEY8/T1KYcwnsBJI/AAAAAAAACI0/Jsla1z4tgzU/s800/5ff95d18e982-pca.png&quot;&gt;&lt;br /&gt;
The overall background is a solid purple color -- it is such a low quality that it appears uniformly colored. The white moon has very large artifacts that align with the 16x16 grid. So this is a very low-quality picture. However, the red light has very fine details. Relative to the rest of the picture, this is a high-quality region.&lt;/ul&gt;&lt;br /&gt;
The people at 2012hoax noticed a few other things as well. For example, the photographer described the camera settings (focal length, exposure time, etc.). However, this picture&#039;s quality is inconsistent with those settings.&lt;br /&gt;
&lt;br /&gt;
The net result? The red light is a hoax. 
    </content:encoded>

    <pubDate>Sat, 03 Mar 2012 15:23:26 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/472-guid.html</guid>
    
</item>
<item>
    <title>The Real World</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/471-The-Real-World.html</link>
            <category>Image Analysis</category>
            <category>Mass Media</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/471-The-Real-World.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=471</wfw:comment>

    <slash:comments>4</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=471</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    This has been a crazy month, so I haven&#039;t had much time to cover all of the topics that have been on my mind.&lt;br /&gt;
&lt;br /&gt;
Earlier this month, the &lt;i&gt;Washington Post&lt;/i&gt; featured a cover photo that used &lt;a href=&quot;http://en.wikipedia.org/wiki/High_dynamic_range_imaging&quot;&gt;high dynamic range&lt;/a&gt; (HDR) imaging. HDR is a type of pseudo-color that gives pictures a wider color range. The use of the picture caused some &lt;a href=&quot;http://www.fourandsix.com/blog/2012/2/13/photojournalism-ethics-on-shifting-technological-ground.html&quot;&gt;controversy&lt;/a&gt; since photo-journalism is not supposed to manipulate the image. This controversy peaked when Sean Elliot, the president of the National Press Photographers Association, &lt;a href=&quot;http://www.poynter.org/latest-news/top-stories/159412/washington-post-raises-eyebrows-with-composite-photo-on-front-page/&quot;&gt;declared&lt;/a&gt; that &quot;HDR is not appropriate for documentary photojournalism.&quot;&lt;br /&gt;
&lt;br /&gt;
The net result? Press photographers should not use high dynamic range (HDR) imaging.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Color My World&lt;/H3&gt;&lt;br /&gt;
Elliot&#039;s ruling makes sense from a manipulation standpoint. The application of false-color can alter the meaning of the image. For example, are the false color images &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/389-Out-of-the-Blue.html&quot;&gt;released by BP&lt;/a&gt; (after the gulf oil spill) an artistic rendering, or an attempt to make the brown water look blue and the brown sand look white?&lt;br /&gt;
&lt;br /&gt;
&lt;a href=&quot;http://www.flickr.com/photos/bpamerica/4834228009/sizes/l/&quot;&gt;&lt;img src=&quot;http://lh3.ggpht.com/_Uw91icJn-go/TFBcgFCG17I/AAAAAAAABl4/2ML7OXf1UkQ/s400/4834228009_31e1843cba_o.jpg&quot;&gt;&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
Of course, not everyone agrees with Elliot. For example, John Omvik at Unified Color &lt;a href=&quot;http://www.dpreview.com/news/2012/02/03/HDR_in_photojournalism&quot;&gt;strongly disagreed&lt;/a&gt;. He points out that photos captured by the camera don&#039;t always look like what our eyes see. HDR can make the picture appear more like how the photographer saw it. As he wrote: &quot;Using HDR software and processing tools is the only method a photographer has to deliver precisely what he or she witnessed at the time of an image capture.&quot;&lt;br /&gt;
&lt;br /&gt;
I&#039;ll take a moment to point out that John Omvik is the V.P. of Marketing at &lt;a href=&quot;http://www.unifiedcolor.com/&quot;&gt;Unified Color Technologies&lt;/a&gt;, and UCT creates HDR software. So it in his company&#039;s best interest to try to promote this technology.&lt;br /&gt;
&lt;br /&gt;
The one big thing that caught my attention came from Omvik&#039;s description of HDR&#039;s potential. He &lt;a href=&quot;http://www.dpreview.com/news/2012/02/03/HDR_in_photojournalism/&quot;&gt;wrote&lt;/a&gt;, &quot;When properly used, HDR does the most accurate job of reconstructing the dynamic range of the original scene at the time the photo was taken.&quot;&lt;br /&gt;
&lt;br /&gt;
And that&#039;s the problem: &quot;when properly used&quot;. There is no license or certification for identifying HDR competency. And there&#039;s no deterministic method to identify when a picture&#039;s color has been over-corrected.&lt;br /&gt;
&lt;br /&gt;
There&#039;s also the little problem about color -- we all see things differently. While the artist (or for people who feel slighted, &quot;the photographer&quot;) may try to match the blue in the sky, he may not notice that he&#039;s blown out the blues in the water.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;A Vision to Behold&lt;/H3&gt;&lt;br /&gt;
Personally, I experience this color issue daily. I wear glasses and glasses &lt;a href=&quot;http://micro.magnet.fsu.edu/optics/lightandcolor/refraction.html&quot;&gt;refract&lt;/a&gt; light. I&#039;ve noticed that I have a choice concerning the way I see the world. I can either wear my glasses and see the world with crisp edges but muted colors, or I can remove my classes and see the world as a big blur with vibrant colors. (If you&#039;re nearsighted, try looking at something bright and colorful with and without glasses. Which looks more colorful to you?) So if I adjust the photo to match what I saw, then you&#039;ll either see something blurry but colorful, or you&#039;ll see something crisp but with colors that don&#039;t pop out.&lt;br /&gt;
&lt;br /&gt;
With a camera, there are many components that impact the colors in the captured photo. There is the lens, color filter array, sensor, JPEG quality, and &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/250-Showing-JPEGs-True-Color.html&quot;&gt;JPEG color loss&lt;/a&gt;. All of these add up to colors that approximate the scene but do not duplicate it. However, this approach is deterministic and not an arbitrary skew based on the photographer&#039;s recollection of how bright elements appeared.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Consistent View&lt;/H3&gt;&lt;br /&gt;
Personally, I think Elliot&#039;s opinion is consistent with industry practices. For example, the National Press Photographer&#039;s Association (NPPA) has a published &lt;a href=&quot;http://www.nppa.org/professional_development/business_practices/ethics.html&quot;&gt;Code of Ethics&lt;/a&gt;. It states that &quot;Editing should maintain the integrity of the photographic images&#039; content and context.&quot; It&#039;s about the photo and not the photographer. Or more specifically, it isn&#039;t &quot;as the photographer saw it&quot; -- it is &quot;as the camera recorded it&quot;.&lt;br /&gt;
&lt;br /&gt;
I &lt;a href=&quot;http://www.hackerfactor.com/blog/index.php?/archives/274-Getting-Real.html&quot;&gt;previously wrote&lt;/a&gt; how major photo suppliers, such as Reuters, AP, and Getty Images, all have requirements concerning photo accuracy. Getty goes so far as &lt;a href=&quot;http://contributors.gettyimages.com/img/articles/downloads/Submission_Requirements_v2_0_APRIL_2007_(3).pdf&quot;&gt;specifying&lt;/a&gt; specific cameras and permitted camera functions; nothing else is approved. And for clarity, Getty and the other media outlets have not approved HDR for photojournalism. 
    </content:encoded>

    <pubDate>Sun, 26 Feb 2012 19:29:46 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/471-guid.html</guid>
    
</item>
<item>
    <title>Awesome First Week</title>
    <link>http://www.hackerfactor.com/blog/index.php?/archives/470-Awesome-First-Week.html</link>
            <category>Forensics</category>
            <category>FotoForensics</category>
            <category>Image Analysis</category>
            <category>Network</category>
            <category>Programming</category>
            <category>Security</category>
    
    <comments>http://www.hackerfactor.com/blog/index.php?/archives/470-Awesome-First-Week.html#comments</comments>
    <wfw:comment>http://www.hackerfactor.com/blog/wfwcomment.php?cid=470</wfw:comment>

    <slash:comments>7</slash:comments>
    <wfw:commentRss>http://www.hackerfactor.com/blog/rss.php?version=2.0&amp;type=comments&amp;cid=470</wfw:commentRss>
    

    <author>Dr. Neal Krawetz</author>
    <content:encoded>
    The &lt;a href=&quot;http://fotoforensics.com/&quot;&gt;FotoForensics&lt;/a&gt; site has been up for a week. I&#039;m thrilled -- it&#039;s been years since I needed to create a new, hardened server and I half expected auto-scanning bots to compromise the site through some unpatched vulnerability. I&#039;ve already recorded a few dozen automated attacks; the first happened about 4 hours after I got the web server installed. (For a comparison, the Google and Microsoft search engines indexed the site faster than all of the attack-bots -- and I didn&#039;t even have links on the site when Google found it!)&lt;br /&gt;
&lt;br /&gt;
I can honestly say that installing a &quot;more secure&quot; system is easier today than it was a few years ago. Just four years ago, there were dozens of services and auto-update systems that posed risks and needed to be disabled. Today, well, most of the system was secure right out of the box, and there are some &lt;a href=&quot;http://secure-ubuntu-server.blogspot.com/2009/07/howto-hardening-your-apache-and-php-on_07.html&quot;&gt;great instructions&lt;/a&gt; for quickly securing the rest. I decided to go with an Ubuntu Lucid Lynx (10.04 LTS) since it is the current version with long-term support. The install is the basic server, not the full desktop, and I added in whatever minimal additional tools I need. (Thanks to Bill and Chris for the installation!) I was provided with a basic install and one remote login account. I hardened it from there, installed the web server, scripts, and detection/prevention elements.&lt;br /&gt;
&lt;br /&gt;
I consider this exercise to be a good refresher and learning experience. I&#039;ve been so engrossed in computer forensics that I haven&#039;t had time to keep up my sysadmin skills. (It&#039;s like falling off a bike. You never really forget how, but you can always use the reminder.) And as a learning exercise, I certainly got some great experience.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Lacking Sense&lt;/H3&gt;&lt;br /&gt;
The first big problem to overcome came from Websense. They provide a web filtering system that blocks hostile sites. And a huge number of big companies use their filtering software. The problem is, Websense blocked my site. According to their online &quot;try-before-you-buy&quot; &lt;a href=&quot;http://aceinsight.websense.com/Results.aspx?url=fotoforensics.com#/?url=fotoforensics.com&quot;&gt;ACE Insight&lt;/a&gt; system, my server was classified as &quot;Suspicious content was found or the reputation of the server is low.&quot; Their security categories said that the site contained &quot;Potentially Damaging Content&quot;.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh5.googleusercontent.com/-yK1H4s3RPNY/TzgFzi1-iUI/AAAAAAAACH4/ERJbEI2SeWg/s640/websense-fotoforensics.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh3.googleusercontent.com/-V0kIfvsiv5w/TzgFzdh1XII/AAAAAAAACHw/1Gpb2gH97w4/s640/websense-fotoforensics-category.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
However, their scanner also reported that there were no viruses detected, no links, and they really didn&#039;t know anything about the site.&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh3.googleusercontent.com/-p0vxKk76R3Q/TzgFzIlCw5I/AAAAAAAACHo/6LGNUrcq2p0/s800/websense-fotoforensics-av.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh5.googleusercontent.com/-8PV9xPdJ5AI/TzgFzXSHmuI/AAAAAAAACHs/Se6c2_jB4T4/s800/websense-fotoforensics-details.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
When I clicked on their scanner, I watched the system for their accesses.&lt;br /&gt;
&lt;blockquote&gt;&lt;code&gt;204.15.64.200 - - [09/Feb/2012:11:43:31 -0600] &quot;GET / HTTP/1.0&quot; 200 1021 &quot;-&quot; &quot;Mozilla/5.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; InfoPath.2; SLCC1; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; .NET CLR 2.0.50727)&quot;&lt;/code&gt;&lt;/blockquote&gt;&lt;br /&gt;
They only went to &quot;/&quot;, and that only returned HTML. No Java, no JavaScript, and nothing active. So how can plain HTML be hostile? Even the text in the HTML was passive (&quot;Submit a picture for Error Level Analysis&quot;) -- it isn&#039;t like it said &quot;boobs&quot; or &quot;exploits&quot;. Websense claimed that there were no links, but the site did have hyperlinks to other pages on the same site. Yet, they didn&#039;t even scan the rest of the site; they didn&#039;t download the CSS, any images, or even visit the other pages. Is Websense really afraid of plain HTML?&lt;br /&gt;
&lt;br /&gt;
I wrote to an acquaintance at Websense and asked why they had blocked me. It turns out, the ASN assigned to the hosting subnet also hosted some porn and warez sites. As I understand it, those sites get shutdown if they are reported and found to be conducting illegal activities. However, because they are/were at the same upstream provider, Websense decided that my server lives in a bad neighborhood and blocked it by default. This explains why they blocked access, but not why they claimed that the site contained potentially damaging content -- even after they &quot;scanned&quot; it and found nothing harmful.&lt;br /&gt;
&lt;br /&gt;
After sending my query, Websense reclassified FotoForensics as &quot;Computer Security&quot;, noted that there is &quot;No threat detected at this time&quot;, and removed the block. However, it is still listed as a &quot;Potential Security Risk&quot; because it is new (low reputation).&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh5.googleusercontent.com/-DRSoljyRlkU/TzgIQ3JNEPI/AAAAAAAACIQ/1gzbFCf3HHs/s800/websense-fotoforensics-category2.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;img src=&quot;http://lh3.googleusercontent.com/-m2WIRW_DfVk/TzgIQ7rb3nI/AAAAAAAACIU/LKMzQM8p6iQ/s800/websense-fotoforensics-details2.png&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
Considering that Websense &lt;a href=&quot;http://www.websense.com/content/AboutUs.aspx&quot;&gt;describes themselves&lt;/a&gt; as &quot;a global leader in unified Web, data, and email content security, delivers the best security for modern threats&quot;, one would think that they would actually check for modern threats before claiming that a site poses a risk. Blocking access to a site and classifying it as a source for &quot;potentially damaging content&quot; &lt;b&gt;without checking first&lt;/b&gt; seems along the same lines as slander, libel, and security &lt;a href=&quot;http://en.wikipedia.org/wiki/Snake_oil&quot;&gt;snake-oil&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;Are they real?&lt;/H3&gt;&lt;br /&gt;
The second issue I needed to contend with was porn. As stated in the FotoForensics&#039; &lt;a href=&quot;http://fotoforensics.com/faq.php&quot;&gt;FAQ&lt;/a&gt;, the site administrators may review content. I&#039;ve been keeping watch on the site to make sure it isn&#039;t compromised, everything stays up and running, and the content doesn&#039;t violate US laws concerning pornography and obscenity.&lt;br /&gt;
&lt;br /&gt;
Even though the site has only been up for a week (and less than a week since the formal announcement), I&#039;m already seeing a few patterns. For example, 99.9% of the uploaded pictures seem to be sent once and then not visited again. But the few pictures that are revisited by other people seem to be very popular.&lt;br /&gt;
&lt;br /&gt;
Most of the uploaded pictures seem to be associated with controversy. Basically, someone somewhere said that it looked fake. The second biggest category are pictures from the media. It seems that many people don&#039;t trust what they are shown. (People are skeptical! Awesome! This reestablishes my belief that people are not as stupid as the media wants us to believe.)&lt;br /&gt;
&lt;br /&gt;
Unfortunately, about 5% of the uploads right now are porn. And I&#039;m not taking about the good kind. (Girl-on-girl with spatulas and duct tape is just wrong.) In the first 24 hours, I had to add an enhancement to the admin script. I had a means to quickly view, block, or delete content, but I also needed a way to ban users. You see, regular people seem to just upload one or two pictures at a time. But if there are a bunch of pictures uploaded in a short amount of time, then it is usually the creepy porn. These perverts do not seem interested in photo forensics; they seem interested in uploading their collections. So, I ban them. Right now, about a dozen users are banned. That isn&#039;t much, but it cut off 5% of the uploaded content. (Like the 90-10 rule: a small group of people cause the most problems.) I also updated the FAQ just for the pervs: this site is not for your personal porn collection.&lt;br /&gt;
&lt;br /&gt;
On the positive side, a few people are giving the ELA system a serious workout. Some have even submitted controlled analysis experiments. AWESOME! I&#039;m eager to hear their thoughts if they are interested in sharing.&lt;br /&gt;
&lt;br /&gt;
&lt;H3&gt;What&#039;s next?&lt;/H3&gt;&lt;br /&gt;
Based on the content I&#039;ve seen and a little feedback I&#039;ve received, I&#039;m currently updating the FAQ and training page. In particular, I&#039;m adding descriptions about how to get better results. (I&#039;m seeing a good number of pictures with multiple resaves and scaling, and those deter ELA analysis. For better results, find a picture that is closer to the original and not a resave of a resave.)&lt;br /&gt;
&lt;br /&gt;
I promised myself that I wouldn&#039;t update or expand the system for at least a while -- just to give it time to see if the current system works. However, I do have dreams of it doing more than just ELA. So I guess that&#039;s one question I&#039;ll throw out there: what else would the users of this system like to see? My dream list includes:&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Comments. Something like Disqus or Intense Debate, where people could comment on uploaded pictures. However, this requires a moderator and I don&#039;t have that kind of time. (It would need volunteer moderators.)&lt;br /&gt;
&lt;li&gt;Forum. Perhaps the site could use a forum where people comment on pictures, assist with analysis, or discuss anything related to computer forensics. (Again, would require assistance since I really hate spammers and have no time to moderate.)&lt;br /&gt;
&lt;li&gt;More analyzers. ELA is cute, but other analyzers would be even more entertaining. I&#039;m actually thinking that people could submit their own analyzers for inclusion on the system. (I&#039;d need to document the API and write up requirements -- like must run in real-time and not consume 100% CPU or 100% memory.) Of course, there would need to be a training page for each analyzer that says what it does, how it works, and the limitations... I am also thinking that I might add ExifTool to the analysis process -- many photo-sharing sites already use ExifTool for meta data. But if there are no volunteers, then I&#039;ll just make it do what I want (which is probably not what other people want).&lt;br /&gt;
&lt;li&gt;Detectors. Right now, the site doesn&#039;t generate any interpretations about the data. Even things like &quot;outside photo&quot; vs &quot;night shot&quot; or &quot;in a room&quot; would be awesome. Other detectors, like a people counter (e.g., &quot;there are three people&quot;) or identifying known objects (e.g., &quot;there is a car&quot;) would be very cool. (Can anyone reliably tell if the picture contains pets and distinguish dogs from cats?) Automatically identifying real vs graphics and edits would be a great bonus. This would be along the same lines as the analyzers: if other people with existing (real or experimental) detectors want to help, I&#039;m open to the idea, but if people expect it from me, then you probably shouldn&#039;t hold your breath.&lt;/ul&gt;&lt;br /&gt;
So having said that, FotoForensics has a lot of potential to be more than an error level analyzer. What else would people like to see and would people be willing to contribute?&lt;br /&gt;
&lt;br /&gt;
&lt;small&gt;I still have serious concerns about distributing code that bad guys can use for offline analysis. If they had the tools, they would attempt to exploit existing analysis method -- either to hide their identities or frame other people. So even with an API and contributions, I don&#039;t see this becoming an open source project.&lt;/small&gt; 
    </content:encoded>

    <pubDate>Sun, 12 Feb 2012 12:49:51 -0700</pubDate>
    <guid isPermaLink="false">http://www.hackerfactor.com/blog/index.php?/archives/470-guid.html</guid>
    
</item>

</channel>
</rss>
