Scott on Writing

Musings on technical writing...

Comment Spam Script Gone Awry

As I blogged about earlier, I've altered .Text (the blog software that currently runs ScottOnWriting.NET) data model to include a trigger that, when a new item is added to the blog_Content table, checks to see if it contains either over 20 hyperlinks or contains a link to one of the URLs listed in a blog_BannedURLs table.  If such a nefarious comment is found, not only is it not saved, but another table has a counter incremented to let me ascertain just how much comment spam has been stopped.  Since January 24 of this year my trigger technique has caught 2,292 comment spams.  Amazing and depressing at the same time.

The trigger approach looks for offending URLs in both trackbacks and comments, searching both the body of the comment as well as the poster's specified URL.  One item that my trigger does not check, though, is the comment one might leave when rating a blog entry.  There's no reason to check what the user enters into the (optional) comments section when rating a blog entry because the comment appears only in an email that is sent directly to me.

Today I found out that comment spammers don't necessarily check too closely whether or not their spammed content actually appears on the site.  Today I received a little over 50 emails from my comment rater, chalked full of links to a sundry of adult sites purporting to have pictures of Ashley and Mary Kate.  (Why someone would want to look at those anorexics is beyond me...)


Been meaning to blog more as of late, but work's been keeping me down.  My tentative plans for upcoming tasks relating to this blog include:

  • Porting ScottOnWriting.NET from .Text 0.94 to Community Server, although part of me wants to wait until version 1.2, when Rob promises the API will be frozen.  The nice thing about moving to Community Server is that I'll be able to move the blogs for skmMenu and RssFeed from GotDotNet - which appears to be down every other day - to Community Server Forums here on the ScottOnWriting.NET server.
  • Making a blog entry about the webcam software I wrote back in January.  Earlier in this year I picked up a Logitech Webcam and wrote some software that will periodically upload the latest pic to a web server using FTP.  The app's been running in the background on my machine without incident since late January and, seeing as the app's built upon a number of open-source projects, I thought it would be nice to share the source with those who are interested.
  • Blogging about ASP.NET 2.0.  Prior to the latest work crunch (which started, not coincidentally, as the same date as my last blog entry), I had spent some additional time with the 2.0 bits.  With Beta 2 coming out at the end of this month, I thought it would be good to get proactive and start rambling on about v Next.

posted on Sunday, March 06, 2005 9:58 PM

Feedback

# re: Comment Spam Script Gone Awry 3/7/2005 12:54 PM Ben Strackany

Actually I'm surprised you didn't catch even more comment spams ... are you able to do a search on the comments to see how many contain, say, 2-5 hyperlinks?

Looking forward to some asp.net 2 posts. Oh and stop posting those AMK pics. :)

# Comment Spam attacks 3/8/2005 9:45 AM Chris Hammond

Title:  
Name:  
Url:
Protected by Clearscreen.SharpHIPEnter the code you see:
Comments   

Add To Your Reader

My Links

Archives

Post Categories

 

I am a Microsoft MVP for ASP.NET.
I am an ASPInsider.
<May 2008>
SMTWTFS
27282930123
45678910
11121314151617
18192021222324
25262728293031
1234567

Comment Stats

DayTotal% of Total
Sunday 1866.8%
Monday 37913.9%
Tuesday 45316.7%
Wednesday 50418.5%
Thursday 53519.7%
Friday 49418.2%
Saturday 1666.1%
Total 2717100.0%

Hour1Total% of Total
12:00 AM 652.4%
1:00 AM 682.5%
2:00 AM 622.3%
3:00 AM 742.7%
4:00 AM 572.1%
5:00 AM 1033.8%
6:00 AM 1084.0%
7:00 AM 1585.8%
8:00 AM 1716.3%
9:00 AM 1475.4%
10:00 AM 1716.3%
11:00 AM 1816.7%
12:00 PM 1886.9%
1:00 PM 1696.2%
2:00 PM 1605.9%
3:00 PM 1324.9%
4:00 PM 1073.9%
5:00 PM 923.4%
6:00 PM 913.3%
7:00 PM 963.5%
8:00 PM 833.1%
9:00 PM 782.9%
10:00 PM 792.9%
11:00 PM 772.8%
Total 2717100.0%

Comments by Blog Entry Date/Time

Day Entry MadeAvg.Total
Sunday 5.54144
Monday 5.22339
Tuesday 4.28419
Wednesday 7.67637
Thursday 6.90607
Friday 5.48411
Saturday 5.33160
Total 5.842717

Hour1 Entry MadeAvg.Total
12:00 AM 5.0035
1:00 AM 1.002
5:00 AM 0.000
7:00 AM 7.0035
8:00 AM 5.35107
9:00 AM 6.32278
10:00 AM 6.47246
11:00 AM 4.41181
12:00 PM 6.88330
1:00 PM 3.00111
2:00 PM 5.41222
3:00 PM 8.64285
4:00 PM 4.0589
5:00 PM 5.92154
6:00 PM 4.52113
7:00 PM 9.67174
8:00 PM 9.80147
9:00 PM 5.05111
10:00 PM 5.4265
11:00 PM 4.5732
Total 5.842717

Learn More About Comment Stats
1 - All times GMT -8...


Blog Stats

Favorite Web Sites

My Books

My MSDN Articles