<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Die spammers die!</title>
	<atom:link href="http://blog.niceperson.org/2005/01/15/die-spammers-die/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/</link>
	<description>Making it up as I go along.</description>
	<lastBuildDate>Sun, 11 Jul 2010 04:30:43 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>By: Dorothea Salo</title>
		<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/comment-page-1/#comment-480</link>
		<dc:creator>Dorothea Salo</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.niceperson.org/wordpress/2005/01/15/die-spammers-die/#comment-480</guid>
		<description>Try taking out some quotation marks, especially around ^$. I don&#039;t know why, but I have a rule relating to ^$ also, and it just wouldn&#039;t fly if I quote-marked it.</description>
		<content:encoded><![CDATA[<p>Try taking out some quotation marks, especially around ^$. I don't know why, but I have a rule relating to ^$ also, and it just wouldn't fly if I quote-marked it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pjm</title>
		<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/comment-page-1/#comment-481</link>
		<dc:creator>pjm</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.niceperson.org/wordpress/2005/01/15/die-spammers-die/#comment-481</guid>
		<description>I think I&#039;m with Dorothea on that. The ^ and $ are regular expression special characters to anchor the match at the start and end of the string (^ is start, $ is end) so ^$ is, specifically, a null string. I don&#039;t think the quotes are necessary, because there&#039;s nothing to quote.</description>
		<content:encoded><![CDATA[<p>I think I'm with Dorothea on that. The ^ and $ are regular expression special characters to anchor the match at the start and end of the string (^ is start, $ is end) so ^$ is, specifically, a null string. I don't think the quotes are necessary, because there's nothing to quote.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Laurabelle</title>
		<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/comment-page-1/#comment-482</link>
		<dc:creator>Laurabelle</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.niceperson.org/wordpress/2005/01/15/die-spammers-die/#comment-482</guid>
		<description>I&#039;ve taken out the quotation marks, because it shouldn&#039;t hurt.  Still, I think that if that were failing, &lt;em&gt;I&lt;/em&gt; wouldn&#039;t be able to access any file in my MT directory, and I can. The rule is deny-everyone-except-local where local is defined as niceperson.org or a blank referer, so if the blank referer weren&#039;t matching, then I wouldn&#039;t have been able to load mt.cgi like I did a few minutes ago.

I did some Googling after I posted, and I think perhaps Apache just isn&#039;t configured to recognize or use .htaccess.  I&#039;ll drop a line to my Friendly Neighborhood Sysadmin.</description>
		<content:encoded><![CDATA[<p>I've taken out the quotation marks, because it shouldn't hurt.  Still, I think that if that were failing, <em>I</em> wouldn't be able to access any file in my MT directory, and I can. The rule is deny-everyone-except-local where local is defined as niceperson.org or a blank referer, so if the blank referer weren't matching, then I wouldn't have been able to load mt.cgi like I did a few minutes ago.</p>
<p>I did some Googling after I posted, and I think perhaps Apache just isn't configured to recognize or use .htaccess.  I'll drop a line to my Friendly Neighborhood Sysadmin.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Laurabelle</title>
		<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/comment-page-1/#comment-483</link>
		<dc:creator>Laurabelle</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.niceperson.org/wordpress/2005/01/15/die-spammers-die/#comment-483</guid>
		<description>Okay, I know for sure that the rule works on my /images directory, because it blocks images from showing in the preview of my Atom feed on Bloglines.

If anyone cares, the rules for images are the same as what I was trying in my MT directory, except that it applies only to image files:

&lt;FilesMatch &quot;\.(gif&#124;jpe?g&#124;png)$&quot;&gt;
Order deny,allow
Deny from all
Allow from env=local_referal
&lt;FilesMatch&gt;

This has the same effect as the &lt;a href=&quot;http://blog.niceperson.org/2003/03/05/htaccess-tricks/&quot;&gt;ModRewrite rules&lt;/a&gt; I wrote almost two years ago, but it&#039;s much easier on Apache. As I understand it, ModRewrite is powerful but heavy, so it&#039;s generally more efficient to use &lt;q&gt;normal&lt;/q&gt; rules when possible.</description>
		<content:encoded><![CDATA[<p>Okay, I know for sure that the rule works on my /images directory, because it blocks images from showing in the preview of my Atom feed on Bloglines.</p>
<p>If anyone cares, the rules for images are the same as what I was trying in my MT directory, except that it applies only to image files:</p>
<p>&lt;FilesMatch "\.(gif|jpe?g|png)$"&gt;<br />
Order deny,allow<br />
Deny from all<br />
Allow from env=local_referal<br />
&lt;FilesMatch&gt;</p>
<p>This has the same effect as the <a href="http://blog.niceperson.org/2003/03/05/htaccess-tricks/">ModRewrite rules</a> I wrote almost two years ago, but it's much easier on Apache. As I understand it, ModRewrite is powerful but heavy, so it's generally more efficient to use <q>normal</q> rules when possible.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pjm</title>
		<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/comment-page-1/#comment-484</link>
		<dc:creator>pjm</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.niceperson.org/wordpress/2005/01/15/die-spammers-die/#comment-484</guid>
		<description>Renaming mt-comment and mt-tb are both very good and useful steps. I&#039;ve found that adding some caps to the new names is also helpful - if their bots find the new names, apparently they&#039;re not always bright enough to do so in a case-sensitive way. That plus MT-Blacklist has had me spam-free for a few months now.

W.r.t. Ed&#039;s mod_rewrite rule, he left a comment on my entry suggesting that what he was seeing wasn&#039;t specifically referrer spam - it seemed to be attempts to exploit his site as an open proxy, with referrer spam just piggy-backed on. So it&#039;s no wonder his rules don&#039;t always help us. I just left &#039;em out and went with the regex plus user-agent block, which has helped hugely.</description>
		<content:encoded><![CDATA[<p>Renaming mt-comment and mt-tb are both very good and useful steps. I've found that adding some caps to the new names is also helpful - if their bots find the new names, apparently they're not always bright enough to do so in a case-sensitive way. That plus MT-Blacklist has had me spam-free for a few months now.</p>
<p>W.r.t. Ed's mod_rewrite rule, he left a comment on my entry suggesting that what he was seeing wasn't specifically referrer spam - it seemed to be attempts to exploit his site as an open proxy, with referrer spam just piggy-backed on. So it's no wonder his rules don't always help us. I just left 'em out and went with the regex plus user-agent block, which has helped hugely.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Laurabelle</title>
		<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/comment-page-1/#comment-485</link>
		<dc:creator>Laurabelle</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.niceperson.org/wordpress/2005/01/15/die-spammers-die/#comment-485</guid>
		<description>Ohhhh, that makes sense (about Ed&#039;s rule). It also correlates with other things I&#039;ve been reading about referrer-spammers exploiting open &lt;acronym&gt;HTTP&lt;/acronym&gt; proxies in order to mask their true origins.

Good idea about case-sensitivity. I&#039;ll institute that when I&#039;m done posting this comment.

I don&#039;t black-list, but I use MT-Bayesian. It filters spam very effectively; unfortunately it&#039;s apparently going through a phase of marking everything as spam. 90% of my &lt;em&gt;own&lt;/em&gt; comments get marked as spam!</description>
		<content:encoded><![CDATA[<p>Ohhhh, that makes sense (about Ed's rule). It also correlates with other things I've been reading about referrer-spammers exploiting open <acronym>HTTP</acronym> proxies in order to mask their true origins.</p>
<p>Good idea about case-sensitivity. I'll institute that when I'm done posting this comment.</p>
<p>I don't black-list, but I use MT-Bayesian. It filters spam very effectively; unfortunately it's apparently going through a phase of marking everything as spam. 90% of my <em>own</em> comments get marked as spam!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ptt_</title>
		<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/comment-page-1/#comment-486</link>
		<dc:creator>ptt_</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.niceperson.org/wordpress/2005/01/15/die-spammers-die/#comment-486</guid>
		<description>some comments pages use the &quot;type the letters you see in this image&quot; hurdle, to allow only humans to comment</description>
		<content:encoded><![CDATA[<p>some comments pages use the "type the letters you see in this image" hurdle, to allow only humans to comment</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Laurabelle</title>
		<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/comment-page-1/#comment-487</link>
		<dc:creator>Laurabelle</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.niceperson.org/wordpress/2005/01/15/die-spammers-die/#comment-487</guid>
		<description>True, but that&#039;s a different problem from the one I was trying to solve with blocking referers. :-)

What you describe is called a &quot;captcha,&quot; and I don&#039;t find that I need that right now. I&#039;ve actually more or less solved the comment-spam problem. (Maybe I&#039;m just not popular enough for anyone to try to circumvent the measures I currently have in place.)

Trackback spam isn&#039;t susceptible to captchas, of course, but I haven&#039;t had too much of a problem with that either.</description>
		<content:encoded><![CDATA[<p>True, but that's a different problem from the one I was trying to solve with blocking referers. :-)</p>
<p>What you describe is called a "captcha," and I don't find that I need that right now. I've actually more or less solved the comment-spam problem. (Maybe I'm just not popular enough for anyone to try to circumvent the measures I currently have in place.)</p>
<p>Trackback spam isn't susceptible to captchas, of course, but I haven't had too much of a problem with that either.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff</title>
		<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/comment-page-1/#comment-488</link>
		<dc:creator>Jeff</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.niceperson.org/wordpress/2005/01/15/die-spammers-die/#comment-488</guid>
		<description>Captchas can also be defeated in a couple ways.  Advanced OCR techniques can defeat most simple captchas and even some of the more distorted ones, but there is always the ability to get a human to overcome a captcha for the spammer.

One scenario (reported to be in use): Porn site operator and spammer needs yahoo.com accounts, so he writes a script to autofill in the form, captures the captcha, feeds it to his &quot;new user&quot; page, gets some schmuck to sign up, decode the captcha, and when the yahoo.com email account has been verified, the schmuck gets his porn, and he just thinks that the porno page has implemented a captcha itself.

Anyway, between &quot;social engineering&quot; techniques and advanced pattern recognition algorithms, captchas are just another arms race.

Jeff</description>
		<content:encoded><![CDATA[<p>Captchas can also be defeated in a couple ways.  Advanced OCR techniques can defeat most simple captchas and even some of the more distorted ones, but there is always the ability to get a human to overcome a captcha for the spammer.</p>
<p>One scenario (reported to be in use): Porn site operator and spammer needs yahoo.com accounts, so he writes a script to autofill in the form, captures the captcha, feeds it to his "new user" page, gets some schmuck to sign up, decode the captcha, and when the yahoo.com email account has been verified, the schmuck gets his porn, and he just thinks that the porno page has implemented a captcha itself.</p>
<p>Anyway, between "social engineering" techniques and advanced pattern recognition algorithms, captchas are just another arms race.</p>
<p>Jeff</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Laurabelle's Blog</title>
		<link>http://blog.niceperson.org/2005/01/15/die-spammers-die/comment-page-1/#comment-489</link>
		<dc:creator>Laurabelle's Blog</dc:creator>
		<pubDate>Wed, 30 Nov -0001 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.niceperson.org/wordpress/2005/01/15/die-spammers-die/#comment-489</guid>
		<description>&lt;strong&gt;Google and comment spam&lt;/strong&gt;

This weekend I noticed that Googlebot had started indexing the URLs of my newly renamed comment script, even though it...</description>
		<content:encoded><![CDATA[<p><strong>Google and comment spam</strong></p>
<p>This weekend I noticed that Googlebot had started indexing the URLs of my newly renamed comment script, even though it...</p>
]]></content:encoded>
	</item>
</channel>
</rss>
